Method of storing reflection coeffients in a vector quantizer for a speech coder to provide reduced storage requirements

- Motorola, Inc.

An input speech signal is encoded as one or more reflection coefficients. To reduce storage requirements, the reflection coefficients are scalar quantized by storing an N-bit code rather than the entire reflection coefficient. An exemplary value for N is 8. A table is provided having 2.sup.N reflection coefficient values. The N-bit code is used to look up reflection coefficient values from the table. To reduce spectral distortion due to scalar quantization, the reflection coefficient values in the table are non-linearly scaled.

Skip to:  ·  Claims  ·  References Cited  · Patent History  ·  Patent History

Claims

1. A speech coding method comprising the steps of:

(a) constructing an excitation codebook of 2.sup.M codevectors using M basis vectors;
(b) receiving input speech;
(c) in response to the input speech, computing reflection coefficient values corresponding to speech parameters representative of the input speech;
(d) storing in a table 2.sup.N reflection coefficient values, each reflection coefficient value addressable with an N-bit code;
(e) processing codevectors to produce synthesized speech;
(f) selecting a codevector from the excitation codebook which minimizes an error criterion for the synthesized speech relative to the input speech, including
(f1) when reflection coefficient values are required for processing, providing corresponding N-bit codes to the table to look up the reflection coefficient values,
(f2) otherwise storing only the N-bit codes during processing, thereby minimizing storage requirement for the reflection coefficient values.

2. A method of storing reflection coefficient vectors in a vector quantizer for a speech coder in accordance with claim 1 wherein the reflection coefficient values are non-linearly scaled.

3. A method of storing reflection coefficient vectors in a vector quantizer for a speech coder in accordance with claim 1 wherein the reflection coefficient values are arcsine scaled between the values of -1 and +1.

4. A method of storing reflection coefficient vectors in a vector quantizer for a speech coder in accordance with claim 1 where N equals 8.

5. A speech coder comprising:

a codebook generator which generates an excitation codebook having 2.sup.M codevectors formed using M basis vectors;
input means for receiving an input speech signal and producing a data vector;
coding means coupled to the input means for generating reflection coefficients corresponding to speech parameters representative of the input speech signal, the coding means processing the codevectors to produce synthesized speech;
a vector quantizer for quantizing the reflection coefficients, the vector quantizer including a vector quantizer memory configured to store 2.sup.N reflection coefficient values, the vector quantizer memory having a N-bit input and an output, the vector quantizer memory providing one of the 2.sup.N reflection coefficient values at the output in response to an N-bit address received at the N-bit input; and
a codebook search controller coupled to the codebook generator which selects a codevector from the excitation codebook to minimize an error criterion between the synthesized speech and the data vector, the codebook search controller being coupled to the vector quantizer and providing a corresponding N-bit code to the vector quantizer to look up a reflection coefficient value for processing, the codebook search controller otherwise storing only the N-bit code to thereby minimize storage requirements.

6. A speech coder as recited in claim 5 wherein each reflection coefficient value is related to an associated N-bit address by an arcsine scaling function.

Referenced Cited
U.S. Patent Documents
4544919 October 1, 1985 Gerson
4896361 January 23, 1990 Gerson
4965789 October 23, 1990 Bottau et al.
4975956 December 4, 1990 Liu et al.
5012518 April 30, 1991 Liu et al.
5038377 August 6, 1991 Kihara et al.
5295224 March 15, 1994 Makamura et al.
5307460 April 26, 1994 Garten
5351338 September 1994 Wigren
Other references
  • Furui, Sadaoki, Digital Speech Processing, Synthesis, and Recognition, 1989, New York, NY, pp. 118-119. Rabiner and Schafer, Digital Processing of Speech Signals, 1978, by Bell Laboratories, Inc., USA, p. 452. R. Viswanathan et al., "Quantization Properties of Transmission Parameters in Linear Predictive Systems," IEEE Trans. Acoustics, Speech and Signal Processing, vol. ASSP-23, Jun. 1975, pp. 309-321. Y. Linde et al., "An Algorithm for Vector Quantizer Design", IEEE Transactions on Communications, vol. Com-28, No. 1, Jan. 1980, pp. 84-95. Motorola, Inc., "Vector Sum Excited Linear Prediction (VSELP) 7950 Bit per Second Voice Coding Algorithm--Technical Description", Nov. 14, 1989, pp. 9-11. Y. Shoham, "Cascaded Likelihood Vector Coding of the LPC Information", Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, 1989, pp. 160-163. K. Paliwal et al., "Efficient Vector Quantization of LPC Parameters at 24 Bits/Frame", Proceedings of the International Conference on Acoustics, Speech and Signal Processing, 1991, pp. 661-664. B. Battacharya et al., "Tree Searched Multi-Stage Vector Quanitzation of LPC Parameters for 4kb/s Speech Coding", Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Mar. 1992, pp. I-105 to I-108. T. Parsons, Voice and Speech Processing, 1987, pp. 159-161.
Patent History
Patent number: 5826224
Type: Grant
Filed: Feb 29, 1996
Date of Patent: Oct 20, 1998
Assignee: Motorola, Inc. (Schaumburg, IL)
Inventors: Ira A. Gerson (Schaumburg, IL), Mark A. Jasiuk (Chicago, IL), Matthew A. Hartman (Schaumburg, IL)
Primary Examiner: David D. Knepper
Attorney: John G. Rauch
Application Number: 8/609,027
Classifications
Current U.S. Class: Vector Quantization (704/222); Autocorrelation (704/217); Linear Prediction (704/219)
International Classification: G10L 914; G10L 908;