Method of storing reflection coeffients in a vector quantizer for a speech coder to provide reduced storage requirements
An input speech signal is encoded as one or more reflection coefficients. To reduce storage requirements, the reflection coefficients are scalar quantized by storing an N-bit code rather than the entire reflection coefficient. An exemplary value for N is 8. A table is provided having 2.sup.N reflection coefficient values. The N-bit code is used to look up reflection coefficient values from the table. To reduce spectral distortion due to scalar quantization, the reflection coefficient values in the table are non-linearly scaled.
Latest Motorola, Inc. Patents:
- Communication system and method for securely communicating a message between correspondents through an intermediary terminal
- LINK LAYER ASSISTED ROBUST HEADER COMPRESSION CONTEXT UPDATE MANAGEMENT
- RF TRANSMITTER AND METHOD OF OPERATION
- Substrate with embedded patterned capacitance
- Methods for Associating Objects on a Touch Screen Using Input Gestures
Claims
1. A speech coding method comprising the steps of:
- (a) constructing an excitation codebook of 2.sup.M codevectors using M basis vectors;
- (b) receiving input speech;
- (c) in response to the input speech, computing reflection coefficient values corresponding to speech parameters representative of the input speech;
- (d) storing in a table 2.sup.N reflection coefficient values, each reflection coefficient value addressable with an N-bit code;
- (e) processing codevectors to produce synthesized speech;
- (f) selecting a codevector from the excitation codebook which minimizes an error criterion for the synthesized speech relative to the input speech, including
- (f1) when reflection coefficient values are required for processing, providing corresponding N-bit codes to the table to look up the reflection coefficient values,
- (f2) otherwise storing only the N-bit codes during processing, thereby minimizing storage requirement for the reflection coefficient values.
2. A method of storing reflection coefficient vectors in a vector quantizer for a speech coder in accordance with claim 1 wherein the reflection coefficient values are non-linearly scaled.
3. A method of storing reflection coefficient vectors in a vector quantizer for a speech coder in accordance with claim 1 wherein the reflection coefficient values are arcsine scaled between the values of -1 and +1.
4. A method of storing reflection coefficient vectors in a vector quantizer for a speech coder in accordance with claim 1 where N equals 8.
5. A speech coder comprising:
- a codebook generator which generates an excitation codebook having 2.sup.M codevectors formed using M basis vectors;
- input means for receiving an input speech signal and producing a data vector;
- coding means coupled to the input means for generating reflection coefficients corresponding to speech parameters representative of the input speech signal, the coding means processing the codevectors to produce synthesized speech;
- a vector quantizer for quantizing the reflection coefficients, the vector quantizer including a vector quantizer memory configured to store 2.sup.N reflection coefficient values, the vector quantizer memory having a N-bit input and an output, the vector quantizer memory providing one of the 2.sup.N reflection coefficient values at the output in response to an N-bit address received at the N-bit input; and
- a codebook search controller coupled to the codebook generator which selects a codevector from the excitation codebook to minimize an error criterion between the synthesized speech and the data vector, the codebook search controller being coupled to the vector quantizer and providing a corresponding N-bit code to the vector quantizer to look up a reflection coefficient value for processing, the codebook search controller otherwise storing only the N-bit code to thereby minimize storage requirements.
6. A speech coder as recited in claim 5 wherein each reflection coefficient value is related to an associated N-bit address by an arcsine scaling function.
4544919 | October 1, 1985 | Gerson |
4896361 | January 23, 1990 | Gerson |
4965789 | October 23, 1990 | Bottau et al. |
4975956 | December 4, 1990 | Liu et al. |
5012518 | April 30, 1991 | Liu et al. |
5038377 | August 6, 1991 | Kihara et al. |
5295224 | March 15, 1994 | Makamura et al. |
5307460 | April 26, 1994 | Garten |
5351338 | September 1994 | Wigren |
- Furui, Sadaoki, Digital Speech Processing, Synthesis, and Recognition, 1989, New York, NY, pp. 118-119. Rabiner and Schafer, Digital Processing of Speech Signals, 1978, by Bell Laboratories, Inc., USA, p. 452. R. Viswanathan et al., "Quantization Properties of Transmission Parameters in Linear Predictive Systems," IEEE Trans. Acoustics, Speech and Signal Processing, vol. ASSP-23, Jun. 1975, pp. 309-321. Y. Linde et al., "An Algorithm for Vector Quantizer Design", IEEE Transactions on Communications, vol. Com-28, No. 1, Jan. 1980, pp. 84-95. Motorola, Inc., "Vector Sum Excited Linear Prediction (VSELP) 7950 Bit per Second Voice Coding Algorithm--Technical Description", Nov. 14, 1989, pp. 9-11. Y. Shoham, "Cascaded Likelihood Vector Coding of the LPC Information", Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, 1989, pp. 160-163. K. Paliwal et al., "Efficient Vector Quantization of LPC Parameters at 24 Bits/Frame", Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 1991, pp. 661-664. B. Battacharya et al., "Tree Searched Multi-Stage Vector Quanitzation of LPC Parameters for 4kb/s Speech Coding", Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Mar. 1992, pp. I-105 to I-108. T. Parsons, Voice and Speech Processing, 1987, pp. 159-161.
Type: Grant
Filed: Feb 29, 1996
Date of Patent: Oct 20, 1998
Assignee: Motorola, Inc. (Schaumburg, IL)
Inventors: Ira A. Gerson (Schaumburg, IL), Mark A. Jasiuk (Chicago, IL), Matthew A. Hartman (Schaumburg, IL)
Primary Examiner: David D. Knepper
Attorney: John G. Rauch
Application Number: 8/609,027
International Classification: G10L 914; G10L 908;