Quantization Patents (Class 704/230)
  • Patent number: 8868432
    Abstract: A method for decoding an audio signal having a bandwidth that extends beyond a bandwidth of a CELP excitation signal in an audio decoder including a CELP-based decoder element. The method includes obtaining a second excitation signal having an audio bandwidth extending beyond the audio bandwidth of the CELP excitation signal, obtaining a set of signals by filtering the second excitation signal with a set of bandpass filters, scaling the set of signals using a set of energy-based parameters, and obtaining a composite output signal by combining the scaled set of signals with a signal based on the audio signal decoded by the CELP-based decoder element.
    Type: Grant
    Filed: September 28, 2011
    Date of Patent: October 21, 2014
    Assignee: Motorola Mobility LLC
    Inventors: Jonathan A. Gibbs, James P. Ashley, Udar Mittal
  • Patent number: 8849658
    Abstract: Some embodiments describe methods, programs, and systems for speech encoding. Among other things, a received input signal representing a property of speech is quantized to generate a quantized output signal. Prior to the quantization, a version of the input signal is supplied to a first noise shaping filter having a first set of filter coefficients effective to generate a first filtered signal. Following the quantization, the quantized output signal is supplied to a second noise shaping filter having a second set of filter coefficients, thus generating a second filtered signal. A noise shaping operation is performed to control a frequency spectrum of a noise effect in the quantized output signal caused by the quantization, wherein the noise shaping operation is based on both the first and second filtered signals. Finally, the quantised output signal is transmitted in an encoded signal.
    Type: Grant
    Filed: January 23, 2014
    Date of Patent: September 30, 2014
    Assignee: Skype
    Inventor: Koen Bernard Vos
  • Patent number: 8849655
    Abstract: An encoder whereby the bit efficiency of encoding can be improved, thereby improving the qualities of signals as decoded. In the encoder: a time-frequency converting unit (101) converts signals, which are to be encoded, to frequency domain signals; an adaptive spectrum formation encoding unit (102) determines an effective range in the frequency band of the frequency domain signals; and a pulse vector encoding unit (103) pulse vector encodes only the signal components within the effective range.
    Type: Grant
    Filed: October 29, 2010
    Date of Patent: September 30, 2014
    Assignee: Panasonic Intellectual Property Corporation of America
    Inventors: Zongxian Liu, Kok Seng Chong
  • Patent number: 8838443
    Abstract: There is disclosed an encoder apparatus whereby, when a band expanding technique for encoding, based on the spectral data of a lower frequency portion, the spectral data of a higher frequency portion is applied to a lower layer in a hierarchical encoding/decoding system, an efficient encoding can be performed in an upper layer as well, thereby improving the decoded-signal quality. In an encoder apparatus (101), a second layer decoder unit (207) calculates a spectrum (differential spectrum), which is to be encoded in a third layer encoder unit (210) that is an upper layer of the second layer decoder unit (207), by applying such an ideal gain (first gain parameter a1) that minimizes the energy of the differential spectrum.
    Type: Grant
    Filed: November 11, 2010
    Date of Patent: September 16, 2014
    Assignee: Panasonic Intellectual Property Corporation of America
    Inventors: Tomofumi Yamanashi, Toshiyuki Morii, Hiroyuki Ehara
  • Patent number: 8825476
    Abstract: Provided are a method and apparatus for encoding and decoding a high frequency signal by using a low frequency signal. The high frequency signal can be encoded by extracting a coefficient by linear predicting a high frequency signal, and encoding the coefficient, generating a signal by using the extracted coefficient and a low frequency signal, and encoding the high frequency signal by calculating a ratio between the high frequency signal and an energy value of the generated signal. Also, the high frequency signal can be decoded by decoding a coefficient, which is extracted by linear predicting a high frequency signal, and a low frequency signal, and generating a signal by using the decoded coefficient and the decoded low frequency signal, and adjusting the generated signal by decoding a ratio between the generated signal and an energy value of the high frequency signal.
    Type: Grant
    Filed: April 8, 2013
    Date of Patent: September 2, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Ki-hyun Choo, Lei Miao, Eun-mi Oh
  • Publication number: 20140236591
    Abstract: A method of recognizing speech is provided that includes generating a decoding network that includes a primary sub-network and a classification sub-network. The primary sub-network includes a classification node corresponding to the classification sub-network. The classification sub-network corresponds to a group of uncommon words. A speech input is received and decoded by instantiating a token in the primary sub-network and passing the token through the primary network. When the token reaches the classification node, the method includes transferring the token to the classification sub-network and passing the token through the classification sub-network. When the token reaches an accept node of the classification sub-network, the method includes returning a result of the token passing through the classification sub-network to the primary sub-network. The result includes one or more words in the group of uncommon words. A string corresponding to the speech input is output that includes the one or more words.
    Type: Application
    Filed: April 28, 2014
    Publication date: August 21, 2014
    Applicant: Tencent Technology (Shenzhen) Company Limited
    Inventors: Shuai YUE, Li Lu, Xiang Zhang, Dadong Xie, Bo Chen, Feng Rao
  • Patent number: 8805696
    Abstract: An audio encoder implements multi-channel coding decision, band truncation, multi-channel rematrixing, and header reduction techniques to improve quality and coding efficiency. In the multi-channel coding decision technique, the audio encoder dynamically selects between joint and independent coding of a multi-channel audio signal via an open-loop decision based upon (a) energy separation between the coding channels, and (b) the disparity between excitation patterns of the separate input channels. In the band truncation technique, the audio encoder performs open-loop band truncation at a cut-off frequency based on a target perceptual quality measure. In multi-channel rematrixing technique, the audio encoder suppresses certain coefficients of a difference channel by scaling according to a scale factor, which is based on current average levels of perceptual quality, current rate control buffer fullness, coding mode, and the amount of channel separation in the source.
    Type: Grant
    Filed: October 7, 2013
    Date of Patent: August 12, 2014
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
  • Patent number: 8805680
    Abstract: Provided are a method and an apparatus for encoding and decoding an audio signal. A method for encoding an audio signal includes receiving a transformed audio signal, dividing the transformed audio signal into a plurality of subbands, performing a first sinusoidal pulse coding operation on the subbands, determining a performance region of a second sinusoidal pulse coding operation among the subbands on the basis of coding information of the first sinusoidal pulse coding operation, and performing the second sinusoidal pulse coding operation on the determined performance region, wherein the first sinusoidal pulse coding operation is performed variably according to the coding information. Accordingly, it is possible to further improve the quality of a synthesized signal by considering the sinusoidal pulse coding of a lower layer when encoding or decoding an audio signal in an upper layer by a layered sinusoidal pulse coding scheme.
    Type: Grant
    Filed: May 19, 2010
    Date of Patent: August 12, 2014
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Mi-Suk Lee, Heesik Yang, Hyun-Woo Kim, Jongmo Sung, Hyun-Joo Bae, Byung-Sun Lee
  • Patent number: 8805694
    Abstract: A method and an apparatus for encoding and decoding audio signals using adaptive sinusoidal coding are provided. The audio signal encoding method includes the steps of dividing a synthesized audio signal into a plurality of sub-bands, calculating the energy of each sub-band, selecting a predetermined number of sub-bands having a relatively large amount of energy from the sub-bands, and performing sinusoidal coding with regard to the selected sub-bands. Application of sinusoidal coding based on consideration of the amount of energy of each sub-band of the synthesized signal improves the quality of the synthesized signal more efficiently.
    Type: Grant
    Filed: February 16, 2010
    Date of Patent: August 12, 2014
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Mi-Suk Lee, Hyun-Joo Bae, Byung-Sun Lee
  • Patent number: 8805681
    Abstract: A method and apparatus to search a codebook including pulses that model a predetermined component of a speech signal. The method includes the operations of selecting a predetermined number of paths corresponding to a predetermined number of pulse locations that are most consistent with the predetermined component, from among paths corresponding to pulse locations of a predetermined pulse location set allocated to at least one branch that connects one state of a predetermined Trellis structure to another state, performing the path selecting operation on each of states other than the one state, and selecting a path corresponding to pulse locations that are most consistent with the predetermined component, from among paths including the selected paths, wherein each path corresponds to a union of plural tracks of an Algebraic codebook. Accordingly, the number of calculations required during a codebook search is reduced.
    Type: Grant
    Filed: September 6, 2013
    Date of Patent: August 12, 2014
    Assignee: SAMSUNG Electronics Co., Ltd.
    Inventors: Hosang Sung, Kangeun Lee, Sang-won Kang, Thomas R. Fischer, Ja-kyoung Jun
  • Patent number: 8788264
    Abstract: An audio encoding device (1A) corrects initial gain information calculated for an arbitrary frame, based on gain information of a stored past frame, thereby calculating gain information to be used in the frame. The audio encoding device (1A) encodes the calculated gain information as a difference from the gain information of the past frame. An audio decoding device (3A) receives the differential gain, and calculates the gain of the arbitrary frame based on the gain used in the past frame, thereby generating a decoded audio signal.
    Type: Grant
    Filed: June 25, 2008
    Date of Patent: July 22, 2014
    Assignee: NEC Corporation
    Inventor: Osamu Shimada
  • Patent number: 8781822
    Abstract: Methods and apparatus for audio and speech processing including generating a plurality of frames, each of the frames comprising a plurality of transform coefficients, and allocating bits to the transform coefficients in each of the frames such that at least two of the transform coefficients in the same frame have different bit allocations and the total number of the bits allocated to the transform coefficients in at least two of the frames is equal.
    Type: Grant
    Filed: February 2, 2010
    Date of Patent: July 15, 2014
    Assignee: QUALCOMM Incorporated
    Inventors: Somdeb Majumdar, Amin Fazeldehkordi, Harinath Garudadri
  • Patent number: 8768691
    Abstract: A sound encoder for efficiently encoding stereophonic sound. A prediction parameter analyzer determines a delay difference D and an amplitude ratio g of a first-channel sound signal with respect to a second-channel sound signal as channel-to-channel prediction parameters from a first-channel decoded signal and a second-channel sound signal. A prediction parameter quantizer quantizes the prediction parameters, and a signal predictor predicts a second-channel signal using the first decoded signal and the quantization prediction parameters. The prediction parameter quantizer encodes and quantizes the prediction parameters (the delay difference D and the amplitude ratio g) using a relationship (correlation) between the delay difference D and the amplitude ratio g attributed to a spatial characteristic (e.g., distance) from a sound source of the signal to a receiving point.
    Type: Grant
    Filed: March 23, 2006
    Date of Patent: July 1, 2014
    Assignee: Panasonic Corporation
    Inventor: Koji Yoshida
  • Patent number: 8768694
    Abstract: Facts are extracted from speech and recorded in a document using codings. Each coding represents an extracted fact and includes a code and a datum. The code may represent a type of the extracted fact and the datum may represent a value of the extracted fact. The datum in a coding is rendered based on a specified feature of the coding. For example, the datum may be rendered as boldface text to indicate that the coding has been designated as an “allergy.” In this way, the specified feature of the coding (e.g., “allergy”-ness) is used to modify the manner in which the datum is rendered. A user inspects the rendering and provides, based on the rendering, an indication of whether the coding was accurately designated as having the specified feature. A record of the user's indication may be stored, such as within the coding itself.
    Type: Grant
    Filed: August 26, 2013
    Date of Patent: July 1, 2014
    Assignee: MModal IP LLC
    Inventors: Detlef Koll, Michael Finke
  • Patent number: 8768693
    Abstract: A system and method for assigning one or more tags to an image file. In one aspect, a server computer receives an image file captured by a client device. In one embodiment, the image file includes an audio component embedded therein by the client device, where the audio component was spoken by a user of the client device as a tag of the image file. The server computer determines metadata associated with the image file and identifies a dictionary of potential textual tags from the metadata. The server computer determines a textual tag from the audio component and from the dictionary of potential textual tags. The server computer then associates the textual tag with the image file as additional metadata.
    Type: Grant
    Filed: May 31, 2012
    Date of Patent: July 1, 2014
    Assignee: Yahoo! Inc.
    Inventors: Oren Somekh, Nadav Golbandi, Liran Katzir, Ronny Lempel, Yoelle Maarek
  • Patent number: 8762158
    Abstract: A method and apparatus for generating synthesis audio signals are provided. The method includes decoding a bitstream; splitting the decoded bitstream into n sub-band signals; generating n transformed sub-band signals by transforming the n sub-band signals in a frequency domain; and generating synthesis audio signals by respectively multiplying the n transformed sub-band signals by values corresponding to synthesis filter bank coefficients.
    Type: Grant
    Filed: August 5, 2011
    Date of Patent: June 24, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Hyun-wook Kim, Han-gil Moon, Sang-hoon Lee
  • Patent number: 8762141
    Abstract: This invention relates to indexing an input vector contained in a set of vectors of a plurality of sets of vectors. The indexing comprises performing, in case that the input vector is contained in a set of vectors of a pre-defined group of one or more sets of vectors of the plurality of sets of vectors, a specific processing that is adapted to a characteristic of the sets of vectors in the pre-defined group of sets of vectors and is only applicable in case of input vectors contained in sets of vectors with the characteristic. The indexing further comprises performing, in case that the input vector is not contained in a set of vectors of the pre-defined group of sets of vectors, a general processing. The invention further relates to an according determining of a target vector contained in a set of vectors of a plurality of sets of vectors based on an index associated with said target vector.
    Type: Grant
    Filed: February 15, 2008
    Date of Patent: June 24, 2014
    Assignee: Nokia Corporation
    Inventors: Adriana Vasilache, Lasse Laaksonen, Anssi Rämö, Mikko Tammi
  • Patent number: 8756067
    Abstract: The present invention provides a computationally efficient technique for compression encoding of an audio signal, and further provides a technique to enhance the sound quality of the encoded audio signal. This is accomplished by including more accurate attack detection and a computationally efficient quantization technique. The improved audio coder converts the input audio signal to a digital audio signal. The audio coder then divides the digital audio signal into larger frames having a long-block frame length and partitions each of the frames into multiple short-blocks. The audio coder then computes short-block audio signal characteristics for each of the partitioned short-blocks based on changes in the input audio signal.
    Type: Grant
    Filed: March 21, 2013
    Date of Patent: June 17, 2014
    Assignee: Sasken Communication Technologies Limited
    Inventor: Bishwarup Mondal
  • Patent number: 8756056
    Abstract: For determining a quantizer step size for quantizing a signal including audio or video information, a first quantizer step size as well as an interference threshold are provided. Then, the actual interference introduced by the first quantizer step size is determined and compared with the interference threshold. Despite the fact that the comparison reveals that the actually introduced interference exceeds the threshold, a second, coarser quantizer step size is nevertheless used, which will then be used for quantization if it turns out that the interference introduced by the coarser, second quantizer step size falls below the threshold or falls below the interference introduced by the first quantizer step size. Thus, the quantization interference is reduced while the quantization is coarsened and, thus, the compression gain is increased.
    Type: Grant
    Filed: July 2, 2009
    Date of Patent: June 17, 2014
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.
    Inventors: Bernhard Grill, Michael Schug, Bodo Teichmann, Nikolaus Rettelbach
  • Publication number: 20140163973
    Abstract: A method, system and program for encoding and/or decoding a speech signal. The method comprises: generating a first signal representing a property of an input speech signal; transforming the first signal using a simulated random-noise signal, thus producing a second signal; quantizing the second signal based on a plurality of discrete representation levels, thus generating quantization values for transmission in an encoded speech signal, and also generating a third signal being a quantized version of the second signal; and performing an inverse of the transformation on the third signal, thus generating a quantized output signal, wherein the generation of the first signal is based on feedback of the quantized output signal. The method further comprises controlling the transformation in dependence on a property of the first signal so as to vary the magnitude of a noise effect created by the transformation relative to the representation levels.
    Type: Application
    Filed: February 17, 2014
    Publication date: June 12, 2014
    Inventor: Koen Bernard Vos
  • Patent number: 8751225
    Abstract: Provided is an apparatus and method for encoding a voice and audio signal by expanding a modified discrete cosine transform (MDCT) based CODEC to a wideband and a super-wideband in a communication system. The apparatus for encoding a signal in a communication system, includes a converter configured to convert a time domain signal corresponding to a service to be provided to users to a frequency domain signal, a quantization and normalization unit configured to calculate and quantize gain of each subband in the converted frequency domain signal and normalize a frequency coefficient of the each subband, a search unit configured to search patch information of each subband in the converted frequency domain signal using the normalized frequency coefficient, and a packetizer configured to packetize the quantized gain and the searched patch information and encode gain information of each subband in the frequency domain signal.
    Type: Grant
    Filed: May 12, 2011
    Date of Patent: June 10, 2014
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Mi-Suk Lee, Hong-Kook Kim, Young-Han Lee
  • Publication number: 20140142936
    Abstract: Some embodiments describe methods, programs, and systems for speech encoding. Among other things, a received input signal representing a property of speech is quantized to generate a quantized output signal. Prior to the quantization, a version of the input signal is supplied to a first noise shaping filter having a first set of filter coefficients effective to generate a first filtered signal. Following the quantization, the quantized output signal is supplied to a second noise shaping filter having a second set of filter coefficients, thus generating a second filtered signal. A noise shaping operation is performed to control a frequency spectrum of a noise effect in the quantized output signal caused by the quantization, wherein the noise shaping operation is based on both the first and second filtered signals. Finally, the quantised output signal is transmitted in an encoded signal.
    Type: Application
    Filed: January 23, 2014
    Publication date: May 22, 2014
    Applicant: SKYPE
    Inventor: Koen Bernard Vos
  • Patent number: 8725504
    Abstract: An approach to performing inverse quantization on a quantized integral value is described. This approach involves determining whether a quantized integral value lies within a first range or a second range of possible values. An interpolated inverse quantization value is calculated from the quantized integral value, using a predetermined bit shifting operation, depending on whether the quantized integral value was in the first or the second range.
    Type: Grant
    Filed: June 6, 2007
    Date of Patent: May 13, 2014
    Assignee: Nvidia Corporation
    Inventor: Wei Jia
  • Patent number: 8725500
    Abstract: Apparatus (119) for encoding at least one parameter associated with a signal source for transmission over k frames to a decoder comprises a processor (119) which is configured in operation to assign a predetermined bit pattern to n bits associated with the at least one parameter of a first frame of k frames and set the n bits associated with the at least one parameter of each of k?1 subsequent frames to values, such that the values of the n bits of the k?1 subsequent frames represent the at least one parameter. The predetermined bit pattern indicates a start of the at least one parameter.
    Type: Grant
    Filed: November 19, 2008
    Date of Patent: May 13, 2014
    Assignee: Motorola Mobility LLC
    Inventors: Jonathan A Gibbs, James P Ashley, Holly L Francois, Udar Mittal
  • Publication number: 20140129216
    Abstract: It is inter alia disclosed to determine, for each set of basis code vectors of a plurality of sets of basis code vectors, a potential basis code vector for encoding an input vector, wherein each set of basis code vectors is associated with at least one scale representative of a plurality of scale representatives, and to determine a code vector for encoding the input vector from a subset of code vectors, said subset of code vectors comprising, for each determined potential basis code vector and each scale representative associated with the set of basis code vectors of the respective potential basis code vector, a code vector based on the respective potential basis code vector scaled by the respective scale representative.
    Type: Application
    Filed: July 1, 2011
    Publication date: May 8, 2014
    Applicant: NOKIA CORPORATION
    Inventors: Adriana Vasilache, Lasse Juhani Laaksonen, Anssi Sakari Rämö, Mikko Tapio Tammi
  • Patent number: 8712765
    Abstract: A parameter decoding apparatus includes a prediction residue decoder that finds a quantized prediction residue based on encoded information included in a current frame subject to decoding and a moving-average predictor produces a predicted parameter by multiplying a predictive coefficient with a past quantized prediction residue. An adder decodes a parameter by adding the quantized prediction residue and the predicted parameter, wherein the prediction residue decoder, when the current frame is erased, finds a current-frame quantized prediction residue from a weighted linear sum of a parameter decoded in the past and a future-frame quantized prediction residue.
    Type: Grant
    Filed: May 17, 2013
    Date of Patent: April 29, 2014
    Assignee: Panasonic Corporation
    Inventor: Hiroyuki Ehara
  • Patent number: 8712764
    Abstract: A device and a method for quantizing, in a super-frame including a sequence of frames, LPC filters calculated during the frames of the sequence. The LPC filter quantizing device and method comprises: an absolute quantizer for first quantizing one of the LPC filters using absolute quantization; and at least one quantizer of the other LPC filters using a quantization mode selected from the group consisting of absolute quantization and differential quantization relative to at least one previously quantized filter amongst the LPC filters. For inverse quantizing, at least the first quantized LPC filter is received and an inverse quantizer inverse quantizes the first quantized LPC filter using absolute inverse quantization. If any quantized LPC filter other than the first quantized LPC filter is received, an inverse quantizer inverse quantizes this quantized LPC filter using one of absolute inverse quantization and differential inverse quantization relative to at least one previously received quantized LPC filter.
    Type: Grant
    Filed: July 10, 2009
    Date of Patent: April 29, 2014
    Assignee: Voiceage Corporation
    Inventors: Philippe Gournay, Bruno Bessette, Redwan Salami
  • Patent number: 8706481
    Abstract: A method of multi-path trellis coded quantization (TCQ) usable in a speech coding system, and a quantizer using the method. Specifically the method includes calculating accumulated distortions corresponding to 2N survivor paths, wherein N indicates an integer greater than two, each of the 2N survivor paths is going towards one of nodes at an i th stage of a trellis, and i indicates an integer greater than zero, comparing the accumulated distortions respectively corresponding to the 2N survivor paths to select N paths among the 2N survivor paths, wherein the accumulated distortions corresponding to selected N paths are smaller than the accumulated distortions corresponding to unselected N paths establishing the selected N paths as survivor paths going toward an i+1 th stage, and selecting an optimal path among the 2N survivor paths corresponding to each node of a last stage.
    Type: Grant
    Filed: December 11, 2006
    Date of Patent: April 22, 2014
    Assignee: SAMSUNG Electronics Co., Ltd.
    Inventors: Kang Eun Lee, Eun Mi Oh, Ho Sang Sung, Chang Yong Son
  • Patent number: 8706506
    Abstract: In a waveform compressing apparatus, a trial mode selecting portion selects a trial mode having the highest compression rate from a plurality of candidate modes which have not been selected before as a trial mode for generating a residue code, the selected trial mode comprising a scalar quantization mode or a vector quantization mode. A waveform data compressing portion compresses a given data amount of original waveform data according to the selected trial mode so as to generate the residue code, the data amount being determined in correspondence with the selected trial mode. A waveform data restoring portion generates a restored waveform data from the compressed data using the generated residue code. A determining portion measures an evaluation value of a quantization error contained in the restored waveform data relative to the original waveform data, and determines whether the evaluation value is equal to or smaller than a predetermined allowable value.
    Type: Grant
    Filed: December 27, 2007
    Date of Patent: April 22, 2014
    Assignee: Yamaha Corporation
    Inventor: Masatsugu Okazaki
  • Patent number: 8706488
    Abstract: In one aspect, a method of processing a voice signal to extract information to facilitate training a speech synthesis model is provided. The method comprises acts of detecting a plurality of candidate features in the voice signal, performing at least one comparison between one or more combinations of the plurality of candidate features and the voice signal, and selecting a set of features from the plurality of candidate features based, at least in part, on the at least one comparison. In another aspect, the method is performed by executing a program encoded on a computer readable medium. In another aspect, a speech synthesis model is provided by, at least in part, performing the method.
    Type: Grant
    Filed: February 27, 2013
    Date of Patent: April 22, 2014
    Assignee: Nuance Communications, Inc.
    Inventors: Michael D. Edgington, Laurence Gillick, Jordan R. Cohen
  • Patent number: 8706510
    Abstract: An audio decoder for providing a decoded audio information includes a arithmetic decoder for providing a plurality of decoded spectral values on the basis of an arithmetically-encoded representation of the spectral values and a frequency-domain-to-time-domain converter for providing a time-domain audio representation using the decoded spectral values. The arithmetic decoder is configured to select a mapping rule describing a mapping of a code value onto a symbol code in dependence on a context state. The arithmetic decoder is configured to determine or modify the current context state in dependence on a plurality of previously-decoded spectral values. The arithmetic decoder is configured to detect a group of a plurality of previously-decoded spectral values, which fulfill, individually or taken together, a predetermined condition regarding their magnitudes, and to determine the current context state in dependence on a result of the detection. An audio encoder uses similar principles.
    Type: Grant
    Filed: April 18, 2012
    Date of Patent: April 22, 2014
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.
    Inventors: Guillaume Fuchs, Vignesh Subbaraman, Nikolaus Rettelbach, Markus Multrus, Marc Gayer, Patrick Warmbold, Christian Griebel, Oliver Weiss
  • Patent number: 8694324
    Abstract: The invention relates to a system and method for gathering data for use in a spoken dialog system. An aspect of the invention is generally referred to as an automated hidden human that performs data collection automatically at the beginning of a conversation with a user in a spoken dialog system. The method comprises presenting an initial prompt to a user, recognizing a received user utterance using an automatic speech recognition engine and classifying the recognized user utterance using a spoken language understanding module. If the recognized user utterance is not understood or classifiable to a predetermined acceptance threshold, then the method re-prompts the user. If the recognized user utterance is not classifiable to a predetermined rejection threshold, then the method transfers the user to a human as this may imply a task-specific utterance. The received and classified user utterance is then used for training the spoken dialog system.
    Type: Grant
    Filed: May 21, 2012
    Date of Patent: April 8, 2014
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Giuseppe Di Fabbrizio, Dilek Z. Hakkani-Tur, Mazin G. Rahim, Bernard S. Renger, Gokhan Tur
  • Patent number: 8687829
    Abstract: A parameter transformer generates level parameters, indicating an energy relation between a first and a second audio channel of a multi-channel audio signal associated to a multi-channel loudspeaker configuration. The level parameter are generated based on object parameters for a plurality of audio objects associated to a down-mix channel, which is generated using object audio signals associated to the audio objects. The object parameters have an energy parameter indicating an energy of the object audio signal. To derive the coherence and the level parameters, a parameter generator is used, which combines the energy parameter and object rendering parameters, which depend on a desired rendering configuration.
    Type: Grant
    Filed: October 5, 2007
    Date of Patent: April 1, 2014
    Assignees: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V., Dolby Sweden AB, Koninklijke Philips Electronics N.V.
    Inventors: Johannes Hilpert, Karsten Linzmeier, Juergen Herre, Ralph Sperschneider, Andreas Hoelzer, Lars Villemoes, Jonas Engdegard, Heiko Purnhagen, Kristofer Kjoerling, Dirk Jeroen Breebaart, Werner Oomen
  • Patent number: 8676571
    Abstract: An audio signal processing system including a time-frequency conversion unit which converts an audio signal in time domain into frequency domain in frame units so as to calculate a frequency spectrum of the audio signal, a spectral change calculation unit which calculates an amount of change between a frequency spectrum of a first frame and a frequency spectrum of a second frame before the first frame based on the frequency spectrum of the first frame and the frequency spectrum of the second frame, and a judgment unit which judges the type of the noise which is included in the audio signal of the first frame in accordance with the amount of spectral change.
    Type: Grant
    Filed: December 19, 2011
    Date of Patent: March 18, 2014
    Assignee: Fujitsu Limited
    Inventors: Takeshi Otani, Taro Togawa, Masanao Suzuki, Yasuji Ota
  • Patent number: 8665945
    Abstract: To improve the encoding compressibility of prediction residuals. An encoder performs prediction analysis of input time-series signals to generate prediction residuals expressed by integers, and sets an integer separation parameter that depends on the amplitude of the prediction residuals for each certain time segment. The encoder selects a side information code table corresponding to an index representing the prediction effectiveness of the time-series signals from a set of side information code tables including a side information code table used for variable length coding of side information corresponding to the separation parameter.
    Type: Grant
    Filed: March 5, 2010
    Date of Patent: March 4, 2014
    Assignee: Nippon Telegraph and Telephone Corporation
    Inventors: Takehiro Moriya, Noboru Harada, Yutaka Kamamoto
  • Patent number: 8666752
    Abstract: Provided are an encoding apparatus and a decoding apparatus of a multi-channel signal. The encoding apparatus of the multi-channel signal may process a phase parameter associated with phase information between a plurality of channels constituting the multi-channel signal, based on a characteristic of the multi-channel signal. The encoding apparatus may generate an encoded bitstream with respect to the multi-channel signal using the processed phase parameter and a mono signal extracted from the multi-channel signal.
    Type: Grant
    Filed: March 17, 2010
    Date of Patent: March 4, 2014
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jung-Hoe Kim, Eun Mi Oh
  • Patent number: 8660840
    Abstract: A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.
    Type: Grant
    Filed: August 12, 2008
    Date of Patent: February 25, 2014
    Assignee: QUALCOMM Incorporated
    Inventors: Arasanipalai K. Ananthapadmanabhan, Sarath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. Dejaco
  • Publication number: 20140052440
    Abstract: It is inter alia disclosed to determine a single code vector index based on combining at least two code vector indexes, each code vector index being associated with a code vector of a respective codebook.
    Type: Application
    Filed: January 28, 2011
    Publication date: February 20, 2014
    Applicant: Nokia Corporation
    Inventors: Adriana Vasilache, Lasse Vaaksonen, Anssi Rämö, Mikko Tammi
  • Patent number: 8655669
    Abstract: An audio decoder has an arithmetic decoder for providing decoded spectral values on the basis of an arithmetically-encoded representation and a frequency-domain-to-time-domain converter for providing a time-domain audio representation. The arithmetic decoder selects a mapping rule describing a mapping of a code value onto a symbol code in dependence on a numeric current context value describing a current context state. The arithmetic decoder determines the numeric current context value in dependence on a plurality of previously decoded spectral values. The arithmetic decoder evaluates at least one table using an iterative interval size reduction to determine whether the numeric current context value is identical to a table context value described by an entry of the table or lies within an interval described by entries of the table, and derives a mapping rule index value describing a selected mapping table. An audio encoder also uses an iterative interval table size reduction.
    Type: Grant
    Filed: April 19, 2012
    Date of Patent: February 18, 2014
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.
    Inventors: Guillaume Fuchs, Vignesh Subbaraman, Nikolaus Rettelbach, Markus Multrus, Marc Gayer, Patrick Warmbold, Christian Griebel, Oliver Weiss
  • Patent number: 8655652
    Abstract: An apparatus for encoding an information signal having discrete values includes a quantizer having a quantizer border, wherein the quantizer is adapted so that a discrete value above the quantization border is quantized to a quantization index, which is different from a quantization index obtained by quantizing a discrete value below the quantization border, a controller for modifying the quantization border, wherein the quantizer having a first quantization border setting is adapted to generate a first set of quantization indices for the discrete values, and wherein the quantizer having a second modified quantization border setting is adapted to generate a second set of quantization indices, and an output interface for outputting an encoded information signal which is either based on the first set of quantization indices or the second set of quantization indices dependent on a decision function.
    Type: Grant
    Filed: September 25, 2007
    Date of Patent: February 18, 2014
    Assignee: Dolby International AB
    Inventor: Michael Schug
  • Patent number: 8655653
    Abstract: A method, system and program for encoding and/or decoding a speech signal. The method comprises: generating a first signal representing a property of an input speech signal; transforming the first signal using a simulated random-noise signal, thus producing a second signal; quantizing the second signal based on a plurality of discrete representation levels, thus generating quantization values for transmission in an encoded speech signal, and also generating a third signal being a quantized version of the second signal; and performing an inverse of the transformation on the third signal, thus generating a quantized output signal, wherein the generation of the first signal is based on feedback of the quantized output signal. The method further comprises controlling the transformation in dependence on a property of the first signal so as to vary the magnitude of a noise effect created by the transformation relative to the representation levels.
    Type: Grant
    Filed: June 4, 2009
    Date of Patent: February 18, 2014
    Assignee: Skype
    Inventor: Koen Bernard Vos
  • Patent number: 8645145
    Abstract: An audio decoder includes an arithmetic decoder for providing a plurality of decoded spectral values on the basis of an arithmetically encoded representation of the spectral values, and a frequency-domain-to-time-domain converter for providing a time-domain audio representation using the decoded spectral values. The arithmetic decoder selects a mapping rule describing a mapping of a code value onto a symbol code in dependence on a context state described by a numeric current context value. The arithmetic decoder determines the numeric current context value in dependence on a plurality of previously decoded spectral values. The arithmetic decoder evaluates a hash table, entries of which define both significant state values and boundaries of intervals of numeric context values, in order to select the mapping rule. A mapping rule index value is individually associated to a numeric context value being a significant state value.
    Type: Grant
    Filed: July 12, 2012
    Date of Patent: February 4, 2014
    Assignee: Fraunhoffer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.
    Inventors: Vignesh Subbaraman, Guillaume Fuchs, Markus Multrus, Nikolaus Rettelbach, Marc Gayer, Oliver Weiss, Christian Griebel, Patrick Warmbold
  • Patent number: 8639504
    Abstract: Some embodiments describe methods, programs, and systems for speech encoding. Among other things, a received input signal representing a property of speech is quantized to generate a quantized output signal. Prior to the quantization, a version of the input signal is supplied to a first noise shaping filter having a first set of filter coefficients effective to generate a first filtered signal. Following the quantization, the quantized output signal is supplied to a second noise shaping filter having a second set of filter coefficients, thus generating a second filtered signal. A noise shaping operation is performed to control a frequency spectrum of a noise effect in the quantized output signal caused by the quantization, wherein the noise shaping operation is based on both the first and second filtered signals. Finally, the quantized output signal is transmitted in an encoded signal.
    Type: Grant
    Filed: May 30, 2013
    Date of Patent: January 28, 2014
    Assignee: Skype
    Inventor: Koen Bernard Vos
  • Patent number: 8639503
    Abstract: A method for encoding speech includes processing an input speech signal using an encoder, resulting in a compressed encoder representation of the input speech signal. The method also includes, if a speech recognizer identifies, in the input speech signal, a corresponding dictionary speech element that approximates the input speech signal, determining, with an electronic device, a compressed recognizer representation of the corresponding dictionary speech element, calculating, with the electronic device, one or more differences between the compressed encoder representation and the compressed recognizer representation, and compiling, with the electronic device, compressed speech information that includes representations of the one or more differences. The encoder and the speech recognizer are implemented with the electronic device.
    Type: Grant
    Filed: January 3, 2013
    Date of Patent: January 28, 2014
    Assignee: Marvell International Ltd.
    Inventors: Khosro Darroudi, Brian R. Mears
  • Patent number: 8630849
    Abstract: A method and apparatus to convert a linear predictive coding (LPC) coefficient into a coefficient having order characteristics, such as a line spectrum frequency (LSF), and to vector quantize the coefficient having the order characteristics when a speech signal is encoded. The method and apparatus split the vector of the coefficient having the order characteristics into a plurality of subvectors, select a codebook in which an available bit is variably allocated to each subvector according to distribution of elements of each subvector, and quantize each subvector according to the selected codebook. The method and apparatus use normalized codebooks.
    Type: Grant
    Filed: November 15, 2006
    Date of Patent: January 14, 2014
    Assignee: SAMSUNG Electronics Co., Ltd.
    Inventors: Chang-Yong Son, Eun-Mi Oh, Ho-Sang Sung, Kang-Eun Lee, Ki-Hyun Choo, Jung-Hoe Kim
  • Patent number: 8626504
    Abstract: Signatures that can be used to identify video and audio content are generated from the content by generating measures of dissimilarity between features of corresponding groups of pixels in frames of video content and by generating low-resolution time-frequency representations of audio segments. The signatures are generated by applying a hash function to intermediate values derived from the measures of dissimilarity and to the low-resolution time-frequency representations. The generated signatures may be used in a variety of applications such as restoring synchronization between video and audio content streams and identifying copies of original video and audio content. The generated signatures can provide reliable identifications despite intentional and unintentional modifications to the content.
    Type: Grant
    Filed: August 30, 2012
    Date of Patent: January 7, 2014
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Regunathan Radhakrishnan, Claus Bauer, Kent Bennett Terry, Brian David Link, Hyung-Suk Kim, Eric Gsell
  • Patent number: 8625705
    Abstract: A data modulation method, a data demodulation method, and a method, include obtaining a target frequency band allocated to a transmitting terminal; calculating a precoding codeword index of spectrum shaping according to a codebook corresponding to a current radio scene and the target frequency band, where the codeword index corresponds to a return-to-zero subcarrier sequence, and the return-to-zero subcarrier sequence serves to calculate a precoding vector for modulating data; and sending the precoding codeword index out through broadcast.
    Type: Grant
    Filed: January 15, 2013
    Date of Patent: January 7, 2014
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Guolin Sun, Jaap van de Beek
  • Patent number: 8620673
    Abstract: Embodiments of the present invention disclose an audio decoding method, including: determining that bitstreams to be decoded are monophony coding layer and first stereo enhancement layer bitstreams; decoding the monophony coding layer to obtain a monophony decoded frequency-domain signal; reconstructing left and right channel frequency-domain signals in a first sub-band region by utilizing the monophony decoded frequency-domain signal after an energy adjustment; and reconstructing left and right channel frequency-domain signals in a second sub-band region by utilizing the monophony decoded frequency-domain signal without the energy adjustment.
    Type: Grant
    Filed: November 14, 2011
    Date of Patent: December 31, 2013
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Qi Zhang, Libin Zhang
  • Patent number: 8620645
    Abstract: A decoder arrangement comprising a receiver input for parameters of frame-based coded signals and a decoder arranged to provide frames of decoded audio signals based on the parameters. The receiver input and/or the decoder is arranged to establish a time difference between the occasion when parameters of a first frame is available at the receiver input and the occasion when a decoded audio signal of the first frame is available at an output of the decoder, which time difference corresponds to at least one frame. A postfilter is connected to the output of the decoder and to the receiver input. The postfilter is arranged to provide a filtering of the frames of decoded audio signals into an output signal in response to parameters of a respective subsequent frame.
    Type: Grant
    Filed: December 14, 2007
    Date of Patent: December 31, 2013
    Assignee: Telefonaktiebolaget L M Ericsson (publ)
    Inventor: Stefan Bruhn
  • Patent number: 8620646
    Abstract: A system and method may be configured to analyze audio information derived from an audio signal. The system and method may track sound pitch across the audio signal. The tracking of pitch across the audio signal may take into account change in pitch by determining at individual time sample windows in the signal duration an estimated pitch and a representation of harmonic envelope at the estimated pitch. The estimated pitch and the representation of harmonic envelope may then be implemented to determine an estimated pitch for another time sample window in the signal duration with an enhanced accuracy and/or precision.
    Type: Grant
    Filed: August 8, 2011
    Date of Patent: December 31, 2013
    Assignee: The Intellisis Corporation
    Inventors: David C. Bradley, Rodney Gateau, Daniel S. Goldin, Robert N. Hilton, Nicholas K. Fisher