Vector Quantization Patents (Class 704/222)

VECTOR QUANTIZATION DEVICE, VECTOR INVERSE QUANTIZATION DEVICE, AND METHOD THEREOF

Publication number: 20110004469

Abstract: Disclosed are a vector quantization device and others capable of adaptively adjusting a vector space of a code vector for quantization of a second stage by using a quantization result of a first stage and improving the quantization accuracy.

Type: Application

Filed: October 16, 2007

Publication date: January 6, 2011

Applicant: PANASONIC CORPORATION

Inventors: Kaoru Sato, Toshiyuki Morii, Tomofumi Yamanashi
Erasure of DTMF signal transmitted as speech data

Patent number: 7856096

Abstract: A DTMF signal processing apparatus of the present invention comprises a data divider unit, a DTMF signal component analyzer unit, a weighting processing unit, a buffer, a DTMF signal erasure determination unit, and a DTMF signal erasure processing unit. The data divider unit divides speech data into a plurality of divided speech data, and the DTMF signal component analyzer unit analyzes whether or not the divided speech data has a DTMF signal component. The weighting processing unit applies a weighting value to divided speech data analyzed at this time and stores the resultant speech data in the buffer, and also applies a weighting value to past divided speech data previously stored in the buffer when the result of the analysis indicates that the analyzed divided speech data has the DTMF signal component. The DTMF signal erasure determination unit determines based on the weighting value whether or not to erase the divided speech data stored in the buffer.

Type: Grant

Filed: February 13, 2006

Date of Patent: December 21, 2010

Assignee: NEC Corporation

Inventors: Tatsuya Nakazawa, Kazunori Ozawa
Comparison of data signals using characteristic electronic thumbprints extracted therefrom

Patent number: 7853438

Abstract: A characteristic thumbprint is extracted from a data signal, the thumbprint based on statistics relating to the data signal. The data signal can be compared indirectly by matching this thumbprint against one or more reference thumbprints. The data signal may be any type of signal, including streaming digitized audio or obtained from static files. A database may contain a number of these characteristic thumbprints, and the database can be searched for a particular thumbprint.

Type: Grant

Filed: July 31, 2008

Date of Patent: December 14, 2010

Assignee: Auditude, Inc.

Inventors: Jeffery L. Caruso, Nicholas Seet, William Shawn Yeager
Method for reducing decoder complexity in waveform interpolation speech decoding by converting dimension of vector

Patent number: 7848923

Abstract: Provided is a method for converting a dimension of a vector. The vector dimension conversion method for vector quantization includes the steps of: extracting a specific parameter having a pitch period from an input speech signal and then generating a vector of a dimension that varies according to the pitch period; dividing an entire frequency domain of the generated vector of the variable dimension into at least two frequency domains; and converting the vector of the variable dimension into vectors of mutually different fixed dimensions according to the divided frequency domains. Thereby, not only an error due to the vector dimension conversion is suppressed but codebook memory required for the vector quantization is effectively reduced.

Type: Grant

Filed: April 24, 2006

Date of Patent: December 7, 2010

Assignee: Electronics and Telecommunications Research Institute

Inventors: Kyung Jin Byun, Ik Soo Eo, Hee Bum Jung
Method, apparatus and computer program product for providing voice conversion using temporal dynamic features

Patent number: 7848924

Abstract: An apparatus for providing voice conversion using temporal dynamic features includes a feature extractor and a transformation element. The feature extractor may be configured to extract dynamic feature vectors from source speech. The transformation element may be in communication with the feature extractor and configured to apply a first conversion function to a signal including the extracted dynamic feature vectors to produce converted dynamic feature vectors. The first conversion function may have been trained using at least dynamic feature data associated with training source speech and training target speech. The transformation element may be further configured to produce converted speech based on an output of applying the first conversion function.

Type: Grant

Filed: April 17, 2007

Date of Patent: December 7, 2010

Assignee: Nokia Corporation

Inventors: Jani K. Nurminen, Victor Popa, Jilei Tian
Entropy coding using escape codes to switch between plural code tables

Patent number: 7840403

Abstract: An audio encoder performs entropy encoding of audio data. For example, an audio encoder determines whether a first code table in a group of plural code tables contains a code representing a first vector. If it does, the code is used, and otherwise the escape code from the first code table is used and the first vector is encoded using, at least in part, a second code table. An audio decoder performs corresponding entropy decoding.

Type: Grant

Filed: May 27, 2008

Date of Patent: November 23, 2010

Assignee: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
Adaptive vector Huffman coding and decoding based on a sum of values of audio data symbols

Patent number: 7822601

Abstract: An audio encoder performs entropy encoding of audio data. For example, an audio encoder determines a Huffman code from a Huffman code table to use for encoding a vector of audio data symbols, where the determining is based on a sum of values of the audio data symbols. An audio decoder performs corresponding entropy decoding.

Type: Grant

Filed: May 16, 2008

Date of Patent: October 26, 2010

Assignee: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
Vector transformation apparatus and vector transformation method

Patent number: 7809558

Abstract: There is provided a vector conversion device for converting a reference vector used for quantization of an input vector so as to improve a signal quality including audio. In this vector conversion device, a vector quantization unit (902) acquires a number corresponding to a decoded LPC parameter of a narrow band from all the code vectors stored in a code book (903). A vector dequantization unit (904) references the number of the code vector obtained by the vector quantization unit (902) and selects a code vector from the code book (905). A conversion unit (906) performs calculation by using a sampling-adjusted decoded LPC parameter obtained from an up-sampling unit (901) and a code vector obtained from the vector dequantization unit (904), thereby obtaining a decoded LPC parameter of a broad band.

Type: Grant

Filed: November 1, 2005

Date of Patent: October 5, 2010

Assignee: Panasonic Corporation

Inventor: Toshiyuki Morii
Hidden trajectory modeling with differential cepstra for speech recognition

Patent number: 7805308

Abstract: A novel system for speech recognition uses differential cepstra over time frames as acoustic features, together with the traditional static cepstral features, for hidden trajectory modeling, and provides greater accuracy and performance in automatic speech recognition. According to one illustrative embodiment, an automatic speech recognition method includes receiving a speech input, generating an interpretation of the speech, and providing an output based at least in part on the interpretation of the speech input. The interpretation of the speech uses hidden trajectory modeling with observation vectors that are based on cepstra and on differential cepstra derived from the cepstra. A method is developed that can automatically train the hidden trajectory model's parameters that are corresponding to the components of the differential cepstra in the full acoustic feature vectors.

Type: Grant

Filed: January 19, 2007

Date of Patent: September 28, 2010

Assignee: Microsoft Corporation

Inventors: Li Deng, Dong Yu
System and method for improving robustness of speech recognition using vocal tract length normalization codebooks

Patent number: 7797158

Abstract: Disclosed are systems, methods, and computer readable media for performing speech recognition. The method embodiment comprises selecting a codebook from a plurality of codebooks with a minimal acoustic distance to a received speech sample, the plurality of codebooks generated by a process of (a) computing a vocal tract length for a each of a plurality of speakers, (b) for each of the plurality of speakers, clustering speech vectors, and (c) creating a codebook for each speaker, the codebook containing entries for the respective speaker's vocal tract length, speech vectors, and an optional vector weight for each speech vector, (2) applying the respective vocal tract length associated with the selected codebook to normalize the received speech sample for use in speech recognition, and (3) recognizing the received speech sample based on the respective vocal tract length associated with the selected codebook.

Type: Grant

Filed: June 20, 2007

Date of Patent: September 14, 2010

Assignee: AT&T Intellectual Property II, L.P.

Inventor: Mazin Gilbert
SPEECH CODER AND SPEECH DECODER

Publication number: 20100228544

Abstract: A vector quantization apparatus performs coding of a linear predictive coding coefficient by multi-stage vector quantization. A first codebook and a second codebook store code vectors, and a storing section stores scalars. A first quantizing section extracts a first code vector stored in the first codebook and performs first stage quantization for quantizing a target vector using the first code vector. A second quantizing section extracts a second code vector stored in the second codebook, calculates a third code vector by multiplying the second code vector and one of the scalars stored in the storing section, performs distance calculation using the target vector, the first code vector and the third code vector, and performs second stage quantization for quantizing the target vector using a result of the distance calculation. Each scalar stored in the storing section is associated with at least one of the vectors stored in the first codebook.

Type: Application

Filed: May 20, 2010

Publication date: September 9, 2010

Applicant: PANASONIC CORPORATION

Inventors: Kazutoshi YASUNAGA, Toshiyuki MORII
Method and apparatus for speech coding

Patent number: 7792670

Abstract: A method and apparatus for prediction in a speech-coding system is provided herein. The method of a 1st order long-term predictor (LTP) filter, using a sub-sample resolution delay, is extended to a multi-tap LTP filter, or, viewed from another vantage point, the conventional integer-sample resolution multi-tap LTP filter is extended to use sub-sample resolution delay. This novel formulation of a multi-tap LTP filter offers a number of advantages over the prior-art LTP filter configurations. Particularly, defining the lag with sub-sample resolution makes it possible to explicitly model the delay values that have a fractional component, within the limits of resolution of the over-sampling factor used by the interpolation filter. The coefficients of such a multi-tap LTP filter are thus largely freed from modeling the effect of delays that have a fractional component.

Type: Grant

Filed: October 14, 2004

Date of Patent: September 7, 2010

Assignee: Motorola, Inc.

Inventors: Mark A. Jasiuk, Tenkasi V. Ramabadran, Udar Mittal, James P. Ashley, Michael J. McLaughlin
Encoding device and decoding device

Patent number: 7783496

Abstract: An encoding device (200) includes an MDCT unit (202) that transforms an input signal in a time domain into a frequency spectrum including a lower frequency spectrum, a BWE encoding unit (204) that generates extension data which specifies a higher frequency spectrum at a higher frequency than the lower frequency spectrum, and an encoded data stream generating unit (205) that encodes to output the lower frequency spectrum obtained by the MDCT unit (202) and the extension data obtained by the BWE encoding unit (204). The BWE encoding unit (204) generates as the extension data (i) a first parameter which specifies a lower subband which is to be copied as the higher frequency spectrum from among a plurality of the lower subbands which form the lower frequency spectrum obtained by the MDCT unit (202) and (ii) a second parameter which specifies a gain of the lower subband after being copied.

Type: Grant

Filed: February 12, 2009

Date of Patent: August 24, 2010

Assignee: Panasonic Corporation

Inventors: Mineo Tsushima, Takeshi Norimatsu, Kosuke Nishio, Naoya Tanaka
Method and device for gain quantization in variable bit rate wideband speech coding

Patent number: 7778827

Abstract: The present invention relates to a gain quantization method and device for implementation in a technique for coding a sampled sound signal processed, during coding, by successive frames of L samples, wherein each frame is divided into a number of subframes and each subframe comprises a number N of samples, where N<L. In the gain quantization method and device, an initial pitch gain is calculated based on a number f of subframes, a portion of a gain quantization codebook is selected in relation to the initial pitch gain, and pitch and fixed-codebook gains are jointly quantized. This joint quantization of the pitch and fixed-codebook gains comprises, for the number f of subframes, searching the gain quantization codebook in relation to a search criterion. The codebook search is restricted to the selected portion of the gain quantization codebook and an index of the selected portion of the gain quantization codebook best meeting the search criterion is found.

Type: Grant

Filed: January 19, 2005

Date of Patent: August 17, 2010

Assignee: Nokia Corporation

Inventors: Milan Jelinek, Redwan Salami
Method of coding a signal using vector quantization

Patent number: 7769581

Abstract: The present invention relates to a method of coding a signal, in particular an audio or speech signal, wherein a codebook comprising k code vectors is provided for vector quantization of a signal vector representing a set of signal values of said signal(s), and wherein an optimal code vector of said codebook is determined by performing a codebook search. Parallelism is employed to accelerate the coding procedure. In particular, the codebook search is highly parallelised.

Type: Grant

Filed: July 11, 2003

Date of Patent: August 3, 2010

Assignee: Alcatel

Inventor: Christian Georg Gerlach
Quantizing feature vectors in decision-making applications

Patent number: 7769583

Abstract: A system, method and computer program product for classification of an analog electrical signal using statistical models of training data. A technique is described to quantize the analog electrical signal in a manner which maximizes the compression of the signal while simultaneously minimizing the diminution in the ability to classify the compressed signal. These goals are achieved by utilizing a quantizer designed to minimize the loss in a power of the log-likelihood ratio. A further technique is described to enhance the quantization process by optimally allocating a number of bits for each dimension of the quantized feature vector subject to a maximum number of bits available across all dimensions.

Type: Grant

Filed: May 13, 2006

Date of Patent: August 3, 2010

Assignee: International Business Machines Corporation

Inventors: Upendra V. Chaudhari, Hsin I. Tseng, Deepak S. Turaga, Olivier Verscheure
Encoder, decoder, encoding method, and decoding method

Patent number: 7769584

Abstract: An encoder, decoder, encoding method, and decoding method enabling acquisition of high-quality decoded signal in scalable encoding of an original signal in first and second layers even if the second or upper layer section performs low bit-rate encoding. In the encoder, a spectrum residue shape codebook (305) stores candidates of spectrum residue shape vectors, a spectrum residue gain codebook (307) stores candidates of spectrum residue gains, and a spectrum residue shape vector and a spectrum residue gain are sequentially outputted from the candidates according to the instruction from a search section (306). A multiplier (308) multiplies a candidate of the spectrum residue shape vector by a candidate of the spectrum residue gain and outputs the result to a filtering section (303).

Type: Grant

Filed: November 2, 2005

Date of Patent: August 3, 2010

Assignee: Panasonic Corporation

Inventors: Masahiro Oshikiri, Hiroyuki Ehara, Koji Yoshida
ADAPTIVE SOUND SOURCE VECTOR QUANTIZING DEVICE AND ADAPTIVE SOUND SOURCE VECTOR QUANTIZING METHOD

Publication number: 20100185442

Abstract: It is an object to disclose an adaptive sound source vector quantizing device, etc. that can be configured to improve quantizing accuracy in adaptive sound source vector quantization to be carried out for every sub-frame.

Type: Application

Filed: June 20, 2008

Publication date: July 22, 2010

Applicant: PANASONIC CORPORATION

Inventors: Kaoru Sato, Toshiyuki Morii
Flexible frequency and time partitioning in perceptual transform coding of audio

Patent number: 7761290

Abstract: An audio encoder/decoder performs band partitioning for vector quantization encoding of spectral holes and missing high frequencies that result from quantization when encoding at low bit rates. The encoder/decoder determines a band structure for spectral holes based on two threshold parameters: a minimum hole size threshold and a maximum band size threshold. Spectral holes wider than the minimum hole size threshold are partitioned evenly into bands not exceeding the maximum band size threshold in size. Such hole filling bands are configured up to a preset number of hole filling bands. The bands for missing high frequencies are then configured by dividing the high frequency region into bands having binary-increasing, linearly-increasing or arbitrarily-configured band sizes up to a maximum overall number of bands.

Type: Grant

Filed: June 15, 2007

Date of Patent: July 20, 2010

Assignee: Microsoft Corporation

Inventors: Kazuhito Koishida, Sanjeev Mehrotra, Wei-Ge Chen
METHOD AND APPARATUS FOR VECTOR QUANTIZATION CODEBOOK SEARCH

Publication number: 20100174539

Abstract: A vector quantization codebook search method and apparatus use support vector machines (“SVMs”) to compute a hyperplane, where the hyperplane is used to separate codebook elements into a plurality of bins. During execution, a controller determines which of the plurality of bins contains a desired codebook element, and then searches the determined bin. Codebook search complexity is reduced and an exhaustive codebook search is selectively avoided.

Type: Application

Filed: January 6, 2009

Publication date: July 8, 2010

Applicant: QUALCOMM Incorporated

Inventors: Rama Muralidhara Reddy Nandhimandalam, Pengjun Huang
Method and device for low bit rate speech coding

Patent number: 7752039

Abstract: A method for coding speech or other generic signals includes dividing a speech signal into a plurality of frames, and dividing at least one of the plurality of frames into at least two subframe units. A search for a fixed codebook contribution and an adaptive codebook contribution for subframe units is conducted. At least one subframe unit is selected to be coded without the fixed codebook contribution. The encoder may iteratively arrange and encode subframes differently for the same frame, and select for transmission that arrangement that minimizes an error measure across the frame. Various embodiments are shown, as are embodied computer programs, a decoder, and a communication system.

Type: Grant

Filed: November 1, 2005

Date of Patent: July 6, 2010

Assignee: Nokia Corporation

Inventor: Bruno Bessette
Method and apparatus for speech decoding based on a parameter of the adaptive code vector

Patent number: 7747441

Abstract: A high quality speech is reproduced with a small data amount in speech coding and decoding for performing compression coding and decoding of a speech signal to a digital signal. In speech coding method according to a code-excited linear prediction (CELP) speech coding, a noise level of a speech in a concerning coding period is evaluated by using a code or coding result of at least one of spectrum information, power information, and pitch information, and various excitation codebooks are used based on an evaluation result.

Type: Grant

Filed: January 16, 2007

Date of Patent: June 29, 2010

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventor: Tadashi Yamaura
Audio encoding

Patent number: 7725310

Abstract: Coding of an audio signal (x) represented by a respective set of sampled signal values (x(t)) for each of a plurality of sequential time segments is disclosed. The sampled signal values are analyzed to determine one or more sinusoidal components for each of the plurality of sequential segments. The sinusoidal components are linked across a plurality of sequential segments to provide sinusoidal tracks, where each track comprises a number of frames. An encoded signal (AS) is generated, including sinusoidal codes (Cs) comprising a representation level (r) for each frame or including sinusoidal codes (Cs) where some of these codes comprise a phase (?), a frequency (?) and a quantization table (Q) for a given frame when the given frame is designated as a random-access frame. The invention allows random access in a track while avoiding long adaptation of the quantization accuracy in a quantizer and/or the need for a large bit stream while still maintaining improved audio quality.

Type: Grant

Filed: October 4, 2004

Date of Patent: May 25, 2010

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Albertus Cornelis Den Brinker, Andreas Johannes Gerrits
Method and apparatus for encoding/decoding speech signal

Publication number: 20100114566

Abstract: An apparatus and method for encoding/decoding a speech signal which determines a variable bit rate based on reserved bits obtained from a target bit rate, is provided. The variable bit rate is determined based on a source feature of the speech signal and the reserved bits is obtained based on the target bit rate. The apparatus for encoding the speech signal may include a linear predictive (LP) analysis unit/quantization unit to determine an immittance spectral frequencies (ISF) index, a closed loop pitch search unit, a fixed codebook search unit, a gain vector quantization (VQ) unit to determine a gain vector quantization (VQ) index, and a bit rate control unit to control at least two indexes of the ISF index, the pitch index, the code index, and the gain VQ index to be encoded to be variable bit rates based on a source feature of a speech signal and the reserved bits.

Type: Application

Filed: July 28, 2009

Publication date: May 6, 2010

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Ho Sang Sung, Eun Mi Oh
Sub-sampled excitation waveform codebooks

Patent number: 7698132

Abstract: Methods and apparatus are presented for reducing the number of bits needed to represent an excitation waveform. An acoustic signal in an analysis frame is analyzed to determine whether it is a band-limited signal. A sub-sampled sparse codebook is used to generate the excitation waveform if the acoustic signal is a band-limited signal. The sub-sampled sparse codebook is generated by decimating permissible pulse locations from the codebook track in accordance with the frequency characteristic of the acoustic signal.

Type: Grant

Filed: December 17, 2002

Date of Patent: April 13, 2010

Assignee: QUALCOMM Incorporated

Inventors: Ananthapadamanabhan A. Kandhadai, Sharath Manjunath, Khaled El-Maleh
Voice/musical sound encoding device and voice/musical sound encoding method

Patent number: 7693707

Abstract: A voice and musical tone coding apparatus is provided that can perform high-quality coding by executing vector quantization taking the characteristics of human hearing into consideration. In this voice and musical tone coding apparatus, a quadrature transformation processing section (201) converts a voice and musical tone signal from time components to frequency components. An auditory masking characteristic value calculation section (203) finds an auditory masking characteristic value from a voice and musical tone signal. A vector quantization section (202) performs vector quantization changing a calculation method of a distance between a code vector found from a preset codebook and a frequency component based on an auditory masking characteristic value.

Type: Grant

Filed: December 20, 2004

Date of Patent: April 6, 2010

Assignee: Pansonic Corporation

Inventors: Tomofumi Yamanashi, Kaoru Sato, Toshiyuki Morii
Dimensional vector and variable resolution quantization

Patent number: 7680670

Abstract: The invention relates to compression coding and/or decoding of digital signals, in particular by vector variable-rate quantisation defining a variable resolution. For this purpose an impulsion dictionary comprises: for a given dimension, increasing resolution dictionaries imbricated into each other and, for a given dimension, a union of: a totality (D?i<N>) of code-vectors produced, by inserting elements taken in a final set (A) into smaller dimension code-vectors according to a final set of predetermined insertion rules (F1) and a second totality of code-vectors (Y?) which are not obtainable by insertion into the smaller dimension code vectors according to said set of the insertion rules.

Type: Grant

Filed: January 30, 2004

Date of Patent: March 16, 2010

Assignee: France Telecom

Inventors: Claude Lamblin, David Virette, Balazs Kovesi, Dominique Massaloux
ADAPTIVE SOUND SOURCE VECTOR QUANTIZATION DEVICE AND ADAPTIVE SOUND SOURCE VECTOR QUANTIZATION METHOD

Publication number: 20100063804

Abstract: Provided is an adaptive sound source vector quantization device which can always perform a pitch cycle search with a resolution appropriate for any section of the pitch cycle search range of a second sub-frame when a pitch cycle search range of the second sub-frame changes in accordance with a pitch cycle of a first sub-frame. The device includes a first pitch cycle instruction unit (111), a search range calculation unit (112), and a second pitch cycle instruction unit (113). The first pitch cycle instruction unit (111) successively instructs pitch cycle search candidates in a predetermined search range having a search resolution which transits over a predetermined pitch cycle candidate for the first sub-frame. The search range calculation unit (112) calculates a predetermined range before and after the pitch cycle of the first sub-frame as the pitch cycle search range for the second sub-frame, if the predetermined range includes the predetermined pitch cycle search candidate.

Type: Application

Filed: February 29, 2008

Publication date: March 11, 2010

Applicant: PANASONIC CORPORATION

Inventors: Kaoru Sato, Toshiyuki Morii
MULTICODEBOOK SOURCE-DEPENDENT CODING AND DECODING

Publication number: 20100057448

Abstract: A method for coding data, includes: grouping data into frames; classifying the frames into classes; for each class, transforming the frames belonging to the class into filter parameter vectors, which are extracted from the frames by applying a first mathematical transformation; for each class, computing a filter codebook based on the filter parameter vectors belonging to the class; segmenting each frame into subframes; for each class, transforming the subframes belonging to the class into source parameter vectors, which are extracted from the subframes by applying a second mathematical transformation based on the filter codebook computed for the corresponding class; for each class, computing a source codebook based on the source parameter vectors belonging to the class; and coding the data based on the computed filter and source codebooks.

Type: Application

Filed: November 29, 2006

Publication date: March 4, 2010

Applicant: Loquenda S.p.A.

Inventors: Paolo Massimino, Paolo Coppo, Marco Vecchetti
ENCODING DEVICE AND ENCODING METHOD

Publication number: 20100057446

Abstract: Provided is an encoding device which can obtain a sound quality preferable for auditory sense even if the number of information bits is small. The encoding device includes a shape quantization unit (111) having: a section search unit (121) which searches for a pulse for each of bands into which a predetermined search section is divided; and a whole search unit (122) which performs search for a pulse over the entire search section. The shape of an input spectrum is quantized by a small number of pulse positions and polarities. A gain quantization unit (112) calculates a gain of the pulse searched by the shape quantization unit (111) and quantizes the gain for each of the bands.

Type: Application

Filed: February 29, 2008

Publication date: March 4, 2010

Applicant: PANASONIC CORPORATION

Inventors: Toshiyuki Morii, Masahiro Oshikiri, Tomofumi Yamanashi
AUDIO ENCODING DEVICE AND AUDIO DECODING DEVICE

Publication number: 20100049509

Abstract: Disclosed are an audio encoding device and an audio decoding device which reduce degradation of subjective quality of a decoding signal caused by power mismatch of a decoding signal which is generated by a concealing process upon disappearance of a frame. When a frame is lost, a past encoding parameter is used to obtain a concealed LPC of the current frame and a concealed sound source parameter. A normal CELP decoding is performed from the obtained concealed sound source parameter. Correction is performed by using a conceal parameter on the obtained concealed LPC and the concealed sound source signal. The power of the corrected concealed sound source signal is adjusted to match a reference sound source power. A filter gain of the synthesis filter is adjusted so as to adjust the power of a decoded sound signal to the power of a decoded sound signal during an error-free state.

Type: Application

Filed: February 29, 2008

Publication date: February 25, 2010

Applicant: PANASONIC CORPORATION

Inventors: Takuya Kawashima, Hiroyuki Ehara, Koji Yoshida
AUDIO ENCODING DEVICE AND AUDIO ENCODING METHOD

Publication number: 20100049508

Abstract: Provided is an audio encoding device which performs a closed loop search of a gain and a sound source vector without significantly increasing the calculation amount as compared to an open loop search. In the audio encoding device, firstly, a first parameter decision unit (121) performs a sound source search by an adaptive sound source codebook and then a second parameter decision unit (122) simultaneously performs by a closed loop, the sound source and the gain search by using a fixed sound source codebook. More specifically, for a combination of a fixed sound source vector and gain, the sum of a value obtained by multiplying a candidate fixed sound source vector by a candidate gain and a value obtained by multiplying an adaptive sound source vector by a candidate gain is subjected to a combination filter formed by a filter coefficient based on a quantization linear prediction coefficient so as to generate a combined signal.

Type: Application

Filed: December 14, 2007

Publication date: February 25, 2010

Applicant: PANASONIC CORPORATION

Inventor: Toshiyuki Morii
Methods for selecting an initial quantization step size in audio encoders and systems using the same

Patent number: 7668715

Abstract: A method of performing quantization in an audio encoder includes determining a number of bits available in a frame of encoded audio data. Determinations are also made for the maximum transform coefficient value and a distribution of transform coefficient values across the transform coefficient spectrum being encoded. A an estimate for an initial quantization step value is determined from the number of available bits in the frame, the maximum transform coefficient value, and the distribution of coefficient values across the coefficient spectrum.

Type: Grant

Filed: November 30, 2004

Date of Patent: February 23, 2010

Assignee: Cirrus Logic, Inc.

Inventors: Ravindra Ramkrishna Chaugule, Sachin P. Ghanekar
Audio coding via creation of sinusoidal tracks and phase determination

Patent number: 7664633

Abstract: Coding of an audio signal represented by a respective set of sampled signal values for each of a plurality of sequential segments is disclosed. The sampled signal values are analyzed (40) to determine one or more sinusoidal components for each of the plurality of sequential segments. The sinusoidal components are linked (42) across a plurality of sequential segments to provide sinusoidal tracks. For each sinusoidal track, a phase comprising a generally monotonically changing value is determined and an encoded audio stream including sinusoidal codes (r) representing said phase is generated (46).

Type: Grant

Filed: November 6, 2003

Date of Patent: February 16, 2010

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Albertus Cornelis Den Brinker, Andreas Johannes Gerrits, Robert Johannes Sluijter
Speech compression and decompression apparatuses and methods providing scalable bandwidth structure

Publication number: 20100036658

Abstract: A speech compression apparatus including: a first band-transform unit transforming a wideband speech signal to a narrowband low-band speech signal; a narrowband speech compressor compressing the narrowband low-band speech signal and outputting a result of the compressing as a low-band speech packet; a decompression unit decompressing the low-band speech packet and obtaining a decompressed wideband low-band speech signal; an error detection unit detecting an error signal that corresponds to a difference between the wideband speech signal and the decompressed wideband low-band speech signal; and a high-band speech compression unit compressing the error signal and a high-band speech signal of the wideband speech signal and outputting the result of the compressing as a high-band speech packet.

Type: Application

Filed: October 13, 2009

Publication date: February 11, 2010

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Chang-yong Son, Ho-chong Park, Yong-beom Lee, Woo-suk Lee
METHOD AND APPARATUS FOR FAST NEAREST NEIGHBOR SEARCH FOR VECTOR QUANTIZERS

Publication number: 20100014577

Abstract: A method comprises identifying a component k of a codevector from a codebook C having one or more codevectors, the component k introducing highest variance for an input vector; allowing ordering of codevectors in the codebook C; and searching for a best match vector for the input vector using ordered codevectors.

Type: Application

Filed: July 15, 2009

Publication date: January 21, 2010

Applicant: Nokia Corporation

Inventors: Adriana Vasilache, Lasse Juhani Laaksonen, Mikko Tapio Tammi, Anssi Sakari Ramo
ENCODING DEVICE AND ENCODING METHOD

Publication number: 20100017204

Abstract: Provided is a voice encoding device which can accurately encode a spectrum shape of a signal having a strong tonality such as a vowel. The device includes: a sub-band constituting unit (151) which divides a first layer error conversion coefficient to be encoded into M sub-bands so as to generate M sub-band conversion coefficients; a shape vector encoding unit (152) which performs encoding on each of the M sub-band conversion coefficient so as to obtain M shape encoded information and calculates a target gain of each of the M sub-band conversion coefficients; a gain vector forming unit (153) which forms one gain vector by using M target gains; a gain vector encoding unit (154) which encodes the gain vector so as to obtain gain encoded information; and a multiplexing section unit (155) which multiplexes the shape encoded information with the gain encoded information.

Type: Application

Filed: February 29, 2008

Publication date: January 21, 2010

Applicant: PANASONIC CORPORATION

Inventors: Masahiro Oshikiri, Toshiyuki Morii, Tomofumi Yamanashi
Apparatus and methods for encoding digital audio data with a reduced bit rate

Patent number: 7647222

Abstract: A method and an apparatus for encoding digital audio data with reduced bit rates, the apparatus having a provider of psycho-acoustically quantized digital audio data with a bit rate being higher than the reduced bit rate. The apparatus further has an identifier for identifying a frequency band according to a selection criterion, the selection criterion being such that an impact on the quality of the digital audio data when the data in the identified frequency band is replaced by generated noise is smaller than the impact on the quality of the digital audio data, which would arise when the data in a different frequency band is replaced by generated noise. The apparatus further has a replacer for replacing data in the identified frequency band of the digital audio data by a noise synthesis parameter.

Type: Grant

Filed: April 24, 2007

Date of Patent: January 12, 2010

Assignee: Nero AG

Inventors: Ivan Dimkovic, Gian Carlo Pascutto
Reducing Computational Complexity in Determining the Distance from Each of a Set of Input Points to Each of a Set of Fixed Points

Publication number: 20090326932

Abstract: An aspect of the present invention takes advantage of the fact that the coordinates of fixed points do not change, and thus the energy (sum of squares of the coordinates defining the vector) of each fixed point is computed and stored. The energy of each variable input point may also be computed. The distance between each pair of fixed and input points is computed based on the respective energies and the dot product.

Type: Application

Filed: September 8, 2009

Publication date: December 31, 2009

Applicant: TEXAS INSTRUMENTS INCORPORATED

Inventor: Chanaveeragouda V. Goudar
Apparatus and methods for digital audio coding using codebook application ranges

Patent number: 7630902

Abstract: A low bit rate digital audio coding system includes an encoder which assigns codebooks to groups of quantization indexes based on their local properties resulting in codebook application ranges that are independent of block quantization boundaries. The invention also incorporates a resolution filter bank, or a tri-mode resolution filter bank, which is selectively switchable between high and low frequency resolution modes or high, low and intermediate modes such as when detecting transient in a frame. The result is a multichannel audio signal having a significantly lower bit rate for efficient transmission or storage. The decoder is essentially an inverse of the structure and methods of the encoder, and results in a reproduced audio signal that cannot be audibly distinguished from the original signal.

Type: Grant

Filed: January 4, 2005

Date of Patent: December 8, 2009

Assignee: Digital Rise Technology Co., Ltd.

Inventor: Yuli You
Block-constrained TCQ method, and method and apparatus for quantizing LSF parameter employing the same in speech coding system

Patent number: 7630890

Abstract: A block-constrained Trellis coded quantization (TCQ) method and a method and apparatus for quantizing line spectral frequency (LSF) parameters employing the same in a speech coding system wherein the LSF coefficient quantizing method includes: removing the direct current (DC) component in an input LSF coefficient vector; generating a first prediction error vector by performing inter-frame and intra-frame prediction for the LSF coefficient vector, in which the DC component is removed, quantizing the first prediction error vector by using the BC-TCQ algorithm, and by performing intra-frame and inter-frame prediction compensation, generating a quantized first LSF coefficient vector; generating a second prediction error vector by performing intra-frame prediction for the LSF coefficient vector, in which the DC component is removed, quantizing the second prediction error vector by using the BC-TCQ algorithm, and then, by performing intra-frame prediction compensation, generating a quantized second LSF coefficient

Type: Grant

Filed: February 19, 2004

Date of Patent: December 8, 2009

Assignee: Samsung Electronics Co., Ltd.

Inventors: Chang-yong Son, Sang-won Kang, Yong-won Shin, Thomas R. Fischer
Hochwald construction of unitary matrix codebooks via eigen coordinate transformations

Patent number: 7630886

Abstract: A method of deriving a codebook including deriving a first codeword Pl in Euclidean coordinates, transforming the first codeword into eigen-coordinates, applying a Hochwald construction to the first codeword in eigen-coordinates to derive a plurality of codewords in eigen-coordinates and transforming the plurality of codewords in eigen-coordinates into a plurality of codewords in Euclidean coordinates to form the codebook.

Type: Grant

Filed: April 29, 2005

Date of Patent: December 8, 2009

Assignee: Nokia Corporation

Inventors: Jianzhong Zhang, Anthony Reid
VECTOR QUANTIZING DEVICE, VECTOR DEQUANTIZING DEVICE, VECTOR QUANTIZING METHOD, AND VECTOR DEQUANTIZING METHOD

Publication number: 20090299738

Abstract: A vector quantizing device for dividing a sequence of vectors and quantizing them with an enhanced performance of vector quantization by using information on the correlation between the high and low order that the vector sequence has. The vector quantizing device (100) creates a predicted vector by prediction using a first quantization divided vector, creates the difference between the divided vector and the predicted vector as a predicted residual vector, and determines a second code by converting the predicted residual vector into a quantized vector. A vector dequantizing device (150) creates a predated vector by prediction using a first quantization divided vector, creates a second quantization divided vector by adding the predicted vector and the predicted residual vector, and creates a quantized vector by connecting the first and second quantization divided vectors.

Type: Application

Filed: March 29, 2007

Publication date: December 3, 2009

Applicant: MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD.

Inventors: Kaoru Sato, Toshiyuki Morii, Tomofumi Yamanashi
Speech compression and decompression apparatuses and methods providing scalable bandwidth structure

Patent number: 7624022

Abstract: A speech compression apparatus including: a first band-transform unit transforming a wideband speech signal to a narrowband low-band speech signal; a narrowband speech compressor compressing the narrowband low-band speech signal and outputting a result of the compressing as a low-band speech packet; a decompression unit decompressing the low-band speech packet and obtaining a decompressed wideband low-band speech signal; an error detection unit detecting an error signal that corresponds to a difference between the wideband speech signal and the decompressed wideband low-band speech signal; and a high-band speech compression unit compressing the error signal and a high-band speech signal of the wideband speech signal and outputting the result of the compressing as a high-band speech packet.

Type: Grant

Filed: July 2, 2004

Date of Patent: November 24, 2009

Assignee: Samsung Electronics Co., Ltd.

Inventors: Chang-yong Son, Ho-chong Park, Yong-beom Lee, Woo-suk Lee
Robust quantization and inverse quantization using illegal space

Patent number: 7617096

Abstract: A quantizer for quantization of a vector comprises a codevector generator that generates a set of candidate codevectors and a memory for storing an illegal space definition representing illegal vectors. The quantizer also includes a legal status tester that determines legal candidate codevectors among the set of candidate codevectors using the illegal space definition, and a codevector selector that determines a best legal candidate codevector among the one or more legal candidate codevectors. The vector includes parameters relating to a speech and/or audio signal, such as Line Spectral Frequencies (LSFs).

Type: Grant

Filed: June 7, 2002

Date of Patent: November 10, 2009

Assignee: Broadcom Corporation

Inventor: Jes Thyssen
Audio signal encoding apparatus and method

Patent number: 7613605

Abstract: An audio signal encoding apparatus includes a frame dividing unit (1), an auditory psychological arithmetic unit (2), a filter bank unit (3), a scale factor calculation unit (4) which weights the spectra in the respective frequency bands by an arithmetic result of the auditory psychological arithmetic unit (2), a quantization step determination unit (7) which determines a quantization step of the entire frame prior to spectrum quantization by subtracting an information size of all quantized spectra from an auditory information size of all the weighted spectra before quantization, and multiplying the difference by a coefficient obtained from a step width of a quantization coarseness, a spectrum quantization unit (8), and a bit shaping unit (9) which outputs a bitstream obtained by shaping quantized spectra. The quantization step determination unit predicts the information size of all the quantized spectra based on a bit size assigned to a frame to be encoded.

Type: Grant

Filed: May 16, 2007

Date of Patent: November 3, 2009

Assignee: Canon Kabushiki Kaisha

Inventor: Masanobu Funakoshi
METHOD AND APPARATUS FOR SEARCHING FIXED CODEBOOK

Publication number: 20090240493

Abstract: A method and apparatus for searching fixed codebook are provided. The method includes: obtaining a basic codebook which comprises position information of N pulses on M tracks, wherein N and M are positive integers; choosing n pulses as search pulses, wherein the n pulses are parts of the N pulses and n is a positive integer smaller than N; and replacing position information of the n search pulses respectively with other position information on the tracks to obtain a searched codebook; executing the search process for K times, wherein K is a positive integer larger than or equal to 2, at least two or more search pulses are chosen in one of the K search processes , and the chosen search pulses vary in each of the K search processes; and obtaining an optimal codebook from the basic codebook and the searched codebook according to a preset criterion.

Type: Application

Filed: June 4, 2009

Publication date: September 24, 2009

Inventors: Dejun ZHANG, Lixiong Li
Speech coder using an orthogonal search and an orthogonal search method

Patent number: 7590527

Abstract: A code excited linear prediction speech decoder includes an adaptive codebook configured to generate an adaptive code vector. The decoder also includes a random codebook configured to generate a random code vector. The decoder also includes a synthesis filter that receives a signal based on said adaptive code vector and said random code vector, and that is configured to perform linear prediction coefficient synthesis on said signal. The random codebook includes a pulse vector providing system configured to provide a pulse vector having a signed unit pulse. The random codebook also includes a comparing system configured to compare a value of adaptive codebook gain with a preset threshold value. The random codebook further includes a selecting system configured to select a dispersion pattern from a plurality of dispersion patterns stored in a memory in accordance with a result of said comparison.

Type: Grant

Filed: May 10, 2005

Date of Patent: September 15, 2009

Assignee: Panasonic Corporation

Inventors: Kazutoshi Yasunaga, Toshiyuki Morii
Single-codebook vector quantization for multiple-rate applications

Patent number: 7587314

Abstract: This invention relates to a method, a device and a software application product for N-level quantization of vectors, wherein N is selectable prior to said quantization from a set of at least two pre-defined values that are smaller than or equal to a pre-defined maximum number of levels M. A reproduction vector for each vector is selected from an N-level codebook of N reproduction vectors that are, for each N in said set of at least two pre-defined values, represented by the first N reproduction vectors of the same joint codebook of M reproduction vectors. The invention further relates to a method, a device and a software application product for retrieving reproduction vectors for vectors that have been N-level quantized, to a system for transferring representations of vectors, to a method, a device and a software application product for determining a joint codebook, and to such a joint codebook itself.

Type: Grant

Filed: August 29, 2005

Date of Patent: September 8, 2009

Assignee: Nokia Corporation

Inventors: Adriana Vasilache, Anssi Rämö
REW parametric vector quantization and dual-predictive SEW vector quantization for waveform interpolative coding

Patent number: 7584095

Abstract: An enhanced analysis-by-synthesis waveform interpolative speech coder able to operate at 2.8 kbps. Novel features include dual-predictive analysis-by-synthesis quantization of the slowly-evolving waveform, efficient parametrization of the rapidly-evolving waveform magnitude, and analysis-by-synthesis vector quantization of the rapidly evolving waveform parameter. Subjective quality tests indicate that it exceeds G.723.1 at 5.3 kbps, and of G.723.1 at 6.3 kbps.

Type: Grant

Filed: September 23, 2005

Date of Patent: September 1, 2009

Assignee: The Regents of the University of California

Inventors: Oded Gottesman, Allen Gersho

prev … 2 3 4 5 6 7 8 9 10 … next