Pattern Matching Vocoders Patents (Class 704/221)

Vector quantization (Class 704/222)

Excitation patterns (Class 704/223)

Method for controlling the point of time when bypass mode operation is begun, based on the format of PCM data

Patent number: 6034994

Abstract: A method for controlling the point of time when a bypass mode operation is begun, based on the format of pulse code modulation (PCM) data. A switching unit, which is coupled between outgoing-end and incoming-end mobile stations, receives signals output from vocoders respectively associated with the outgoing-end and incoming-end mobile stations, thereby checking respective operation modes of the vocoders. Based on the result of the checking, the switching unit controls the vocoders so that communications between the outgoing-end and incoming-end mobile stations can be enabled when both the mobile stations operate in a bypass mode. Accordingly, it is possible to achieve smooth communications without a degradation in speech quality, as compared to conventional communications methods which do not take into consideration operation modes of outgoing-end and incoming-end mobile stations.

Type: Grant

Filed: December 23, 1997

Date of Patent: March 7, 2000

Assignee: Hyundai Electronics Industries Co., Ltd.

Inventor: Joon Sang Yoon
Apparatus for transmitting synthesized speech with high quality at a low bit rate

Patent number: 6006177

Abstract: The invention provides a speech coding apparatus wherein a perceptual weighting filter is realized with a comparatively small amount of calculation. The speech coding apparatus includes a weighting circuit which in turn includes a coefficient code book in which weighting coefficients are stored, a coefficient determination section which selects and outputs one of the weighting coefficients which corresponds to a short-term prediction code, and a weighting section for performing weighting calculation of a speech signal with the selected weighting coefficient.

Type: Grant

Filed: April 18, 1996

Date of Patent: December 21, 1999

Assignee: NEC Corporation

Inventor: Keiichi Funaki
Multiple impulse excitation speech encoder and decoder

Patent number: 6006174

Abstract: The generation of multipulse excitation codes by digitizing an original speech, partitioning the digitized signal into a number of samples, pre-emphasizing the samples, producing linear predictive reflection coefficients from said samples, quantizing these reflection coefficients, converting the quantized reflection coefficients to spectral coefficients and subjecting the spectral coefficients to pitch analysis to obtain a spectral residual signal.

Type: Grant

Filed: October 15, 1997

Date of Patent: December 21, 1999

Assignee: InterDigital Technology Coporation

Inventors: Daniel Lin, Brian M. McCarthy
Detection and bypass of tandem vocoding using detection codes

Patent number: 5956673

Abstract: A first remote vocoder receives analog voice and produces packetized vocoder data which is transmitted over a wireless link. A first local vocoder receives the packetized vocoder data from the wireless link. The first local vocoder converts the packetized data to a multibit PCM output. The first local vocoder also adds a detection code to one of the least significant bits (LSB) of the PCM output. The first local vocoder passes the PCM signal to the PSTN from the second end user. The first local vocoder also receives PCM input over the PSTN. The first local vocoder constantly monitors the least significant bit of the PCM input for a detection code indicating that a second local vocoder is connected at the receiving end. If the first local vocoder detects the detection code from the second local vocoder, it begins to substitute packetized data and a redundancy check for a second one of the LSB's of the outgoing PCM. The first local vocoder also begins to monitor the second one of the LSB's of the incoming PCM.

Type: Grant

Filed: January 25, 1995

Date of Patent: September 21, 1999

Inventors: Lindsay A. Weaver, Jr., S. Katherine Lam, William Gardner, Paul Jacobs
Gain estimation scheme for LPC vocoders with a shape index based on signal envelopes

Patent number: 5953697

Abstract: A gain estimation method for an LPC vocoder which utilizes shape indexes. The gain is estimated based on the envelope of the speech waveform. The gain is estimated such that the maximum amplitude of the synthetic speech just reaches the speech waveform envelope. The gain during voiced subframes is estimated as the minimum of the absolute value of ratio of the envelope and the impulse response of the LPC filter. The gain during unvoiced subframes is estimated as the minimum of the absolute value of the ratio of the envelope and the noise response of the LPC filter. The method results in a fast technique for estimating the gain.

Type: Grant

Filed: May 5, 1997

Date of Patent: September 14, 1999

Assignee: Holtek Semiconductor, Inc.

Inventors: Chin-Teng Lin, Hsin-An Lin
Speech encoding at variable bit rate

Patent number: 5933803

Abstract: The invention is related digital speech encoding. In a speech codec according to the invention, for modeling a speech signal (301) both prediction parameters (321, 322, 331) modeling a speech signal in a short term and prediction parameters (341, 342, 351) modeling a speech signal in a long term are used. Each prediction parameter (321, 322, 331, 341, 342, 351) is presented using a certain accuracy, in a digital system with a certain number of bits. In speech encoding according to the invention the number of bits used for presenting prediction parameters (321, 322, 331, 341, 342, 351) is adjusted based upon information parameters (321, 322, 331, 341, 342, 351) obtained from a short-term LPC-analysis (32) and from a long-term LTP-analysis (31, 34, 35). The invention is particularly suitable for use at low data transfer speeds, because it offers a speech encoding method of even quality and low average bit rate.

Type: Grant

Filed: December 5, 1997

Date of Patent: August 3, 1999

Assignee: Nokia Mobile Phones Limited

Inventor: Pasi Ojala
Echo canceller with talk state determination to control speech processor functional elements in a digital telephone system

Patent number: 5920834

Abstract: A method and apparatus for controlling various functional elements in a digital telephone system using state determination from an echo canceller. An echo canceller is used to evaluate which one of five talk states two speakers are engaged in during a telephone conversation. This state determination information is used to control a tone detector function, a noise suppressor function, an adaptive equalizer function, a transmission mute function, and a vocoder encoder function within a vocoder. During the talk state where the far-end speaker is active and the near-end speaker is inactive, the echo canceller provides a signal which disables background noise estimates from being performed in the noise suppressor and the vocoder encoder. The same signal is used to disable the tone detector and to enable the transmission mute function during this talk state.

Type: Grant

Filed: January 31, 1997

Date of Patent: July 6, 1999

Assignee: Qualcomm Incorporated

Inventors: Gilbert C. Sih, Anthony P. Mauro
Apparatus and method for determining articulatory-orperation speech parameters

Patent number: 5913188

Abstract: In an apparatus for extracting information from an input speech signal, a preprocessor, a buffer, a segmenter, an acoustic classifier and a feature extractor are provided. The preprocessor generates formant related information for consecutive time frames of the input speech signal. This formant related information is fed into the buffer, which can store signals representative of a plurality of frames. The segmenter monitors the signals representative of the incoming frames and identifies segments in the input speech signal during which variations in the formant related information remain within prespecified limits. The acoustic classifier then determines classification information for each segment identified by the segmenter, based on acoustic classes found in training data. The feature estimator then determines, for each segment, the information required, based on the input speech signal during that segment, training data and the classification information determined by the acoustic classifier.

Type: Grant

Filed: September 11, 1995

Date of Patent: June 15, 1999

Assignee: Canon Kabushiki Kaisha

Inventor: Eli Tzirkel-Hancock
Method and apparatus for performing speech frame encoding mode selection in a variable rate encoding system

Patent number: 5911128

Abstract: A method and apparatus for the selection of an encoding mode for speech frames in a variable rate encoding system. For each speech frame, the method and apparatus selects the encoding mode which provides for rate efficient coding. A mode measurement element receives a speech signal and a signal derived from the same speech signal, and generates a set of parameters which are ideally suited for operational mode selection. Rate determination logic receives the set of parameters and selects an encoding rate using predetermined selection rules. The selection rules further distinguish between unvoiced speech and temporally masked speech, which are encoded at the same rate but with different encoding strategies.

Type: Grant

Filed: March 11, 1997

Date of Patent: June 8, 1999

Inventor: Andrew P. DeJaco
Speech processing coder, decoder and command recognizer

Patent number: 5909662

Abstract: The present invention relates to a speech processing device equipped with both a speech coding/decoding function and a speech recognition function, and is aimed at providing a speech processing device equipped with both a speech coding/decoding function and a speech recognition function by using a small amount of memory.

Type: Grant

Filed: March 11, 1997

Date of Patent: June 1, 1999

Assignee: Fujitsu Limited

Inventors: Yasushi Yamazaki, Tomohiko Taniguchi, Tomonori Sato, Hitoshi Matsuzawa, Chiharu Kawai
Loss tolerant speech decoder for telecommunications

Patent number: 5907822

Abstract: A method and device for extrapolating past signal-history data for insertion into missing data segments in order to conceal digital speech frame errors. The extrapolation method uses past-signal history that is stored in a buffer. The method is implemented with a device that utilizes a finite-impulse response (FIR) multi-layer feed-forward artificial neural network that is trained by back-propagation for one-step extrapolation of speech compression algorithm (SCA) parameters. Once a speech connection has been established, the speech compression algorithm device begins sending encoded speech frames. As the speech frames are received, they are decoded and converted back into speech signal voltages. During the normal decoding process, pre-processing of the required SCA parameters will occur and the results stored in the past-history buffer.

Type: Grant

Filed: April 4, 1997

Date of Patent: May 25, 1999

Assignee: Lincom Corporation

Inventor: Jaime L. Prieto, Jr.
Method and apparatus for detection of tandem vocoding to modify vocoder filtering

Patent number: 5903862

Abstract: When one vocoding system is coupled to another vocoding system, a tandem arrangement results. The tandem configuration results in voice quality degradation as speech is encoded and decoded, then encoded and decoded again. One reason for the degradation is that postfiltering performed at the output of the speech decoding process introduces distortions in the spectral content of the reconstructed speech as compared to the original speech. The present invention prevents the degradation due to the use of postfilters by modifying the postfiltering within the vocoders where a tandem configuration exists. A detection code is embedded within the data signal to indicate the existence of a tandem configuration. If the detection code is received at a vocoder, modified vocoding is established within the vocoders to prevent the degradation due to the postfiltering.

Type: Grant

Filed: January 11, 1996

Date of Patent: May 11, 1999

Inventors: Lindsay A. Weaver, Jr., S. Katherine Lam, William R. Gardner, Paul E. Jacobs, Andrew P. DeJaco, Gilbert C. Sih
Voice analysis-synthesis method using noise having diffusion which varies with frequency band to modify predicted phases of transmitted pitch data blocks

Patent number: 5878388

Abstract: A high efficiency encoding method for encoding data on frequency axis obtained by dividing an input audio signal on block-by-block basis and converting the signal onto the frequency axis, wherein V bands are searched for a band B.sub.VH with the highest center frequency if it is decided that there are one or more shift points of voiced (V)/unvoiced (UV) decision data of all bands on the frequency axis, and wherein the number of V bands N.sub.V up to the band B.sub.VH is found, so as to decide whether proportion of the V bands is equal to or higher than a predetermined threshold N.sub.th, thereby deciding one V/UV boundary point. Thus, it is possible to replace the V/UV decision data for each band by information on one demarcation in all bands, thereby to reduce data volume and to reduce bit rate. Also, by using two-stage hierarchical vector quantization in quantizing the data on the frequency axis, operation volume for codebook search and memory capacity of the codebook are reduced.

Type: Grant

Filed: June 9, 1997

Date of Patent: March 2, 1999

Assignee: Sony Corporation

Inventors: Masayuki Nishiguchi, Jun Matsumoto, Shinobu Ono
Coding apparatus having adaptive coding at different bit rates and pitch emphasis

Patent number: 5878387

Abstract: The coding apparatus comprises an adaptive codebook storing excitation signals as vectors, a synthesis filter for forming a synthesis signal, referring to the vectors stored in the adaptive codebook, a similarity computation circuit for computing a similarity between the synthesis signal obtained by the synthesis filter and a target signal, and a coding scheme determining circuit for deciding one coding scheme from a plurality of coding schemes respectively having coding bit rates different from each other, on the basis of the similarity obtained by the similarity computation circuit.

Type: Grant

Filed: September 29, 1995

Date of Patent: March 2, 1999

Assignee: Kabushiki Kaisha Toshiba

Inventors: Masahiro Oshikiri, Kimio Miseki, Masami Akamine, Tadashi Amada
Method for selecting noise codebook vectors in a variable rate speech coder and decoder

Patent number: 5875423

Abstract: In a variable rate speech coding method for a CELP speech coding system, an adaptive sound source vector and a first noise source vector are selected from a sound source code book and a noise source code book so that a first synthesized speech signal is obtained which has a minimum distortion relative to an input speech signal. A virtual reference speech signal is generated using a sound source signal which is produced using the adaptive sound source vector. A second noise source vector corresponding to the adaptive sound source vector is selected so that a second synthesized speech signal is obtained which has a minimum distortion relative to the virtual reference speech signal. The sending of a noise source code book index corresponding to the first noise source vector is suspended according to the quality of the second synthesized speech signal.

Type: Grant

Filed: October 17, 1997

Date of Patent: February 23, 1999

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventor: Bunkei Matsuoka
Signal coder for wide-band signals

Patent number: 5873060

Abstract: Wide-band speech signals and also music signals are coded with relatively less computational efforts and less sound quality deterioration even at low bit rates. A spectral parameter calculator obtains a spectral parameter from sub-frames of an input signal from a sub-frame divider, and quantizes the obtained spectral parameter. A divider divides the difference result from a subcontractor into a plurality of sub-bands. Adaptive codebook circuits obtain a pitch prediction signal by obtaining pitch data in at least one of the sub-bands. Judging circuits execute pitch prediction judgment by using the pitch data in at least one of the sub-bands. A synthesizer synthesizes a pitch prediction signal. A subtractor subtracts the pitch prediction signal from the difference result obtained from a subtractor and thus obtains an excitation signal. An excitation quantizer quantizes the excitation signal with reference to an excitation codebook.

Type: Grant

Filed: May 27, 1997

Date of Patent: February 16, 1999

Assignee: NEC Corporation

Inventor: Kazunori Ozawa
Speech coder that utilizes correlation maximization to achieve fast excitation coding, and associated coding method

Patent number: 5867814

Abstract: A speech coder, formed with a digital speech encoder and a digital speech decoder, utilizes fast excitation coding to reduce the computation power needed for compressing digital samples of an input speech signal to produce a compressed digital speech datastream that is subsequently decompressed to synthesize digital output speech samples. Much of the fast excitation coding is furnished by an excitation search unit in the encoder. The search unit determines excitation information that defines a non-periodic group of excitation pulses The optimal location of each pulse in the non-periodic pulse group is chosen from a corresponding set of pulse positions stored in the encoder. The search unit ascertains the optimal pulse positions by maximizing the correlation between (a) a target group of filtered versions of digital input speech samples provided to the encoder for compression and (b) a corresponding group of synthesized digital speech samples.

Type: Grant

Filed: November 17, 1995

Date of Patent: February 2, 1999

Assignee: National Semiconductor Corporation

Inventor: Mei Yong
Method and system for pattern recognition based on tree organized probability densities

Patent number: 5857169

Abstract: A time-sequential input pattern (20), which is derived from a continual physical quantity, such as speech is recognized. The system includes input means (30), which accesses the physical quantity and therefrom generates a sequence of input observation vectors. The input observation vectors represent the input pattern. A reference pattern database (40) is used for storing reference patterns, which consist of a sequence of reference units. Each reference unit is represented by associated reference probability densities. A tree builder (60) represents for each reference unit the set of associated reference probability densities as a tree structure. Each leaf node of the tree corresponds to a reference probability density. Each non-leaf node corresponds to a cluster probability density, which is derived from all reference probability densities corresponding to leaf nodes in branches below the non-leaf node.

Type: Grant

Filed: August 28, 1996

Date of Patent: January 5, 1999

Assignee: U.S. Philips Corporation

Inventor: Frank Seide
Phoneme recognition and difference signal for speech coding/decoding

Patent number: 5832425

Abstract: An analog-to-digital converter (20) forms a digital signal based upon an analog speech signal. A phoneme parser (22) parses the digital signal into at least one phoneme. A phoneme recognizer (24) assigns a symbolic code to each phoneme based upon recognition of the phonemes from a predetermined set. A read-only memory (34) contains a standard waveform representation of each phoneme from the predetermined set. A difference processor (32) forms a difference signal between a user-spoken phoneme waveform and a corresponding waveform from the read-only memory (34). The difference signal is stored in a storage device (40). A multiplexer (30) provides a bit stream signal based upon the symbolic code and the difference signal. A synchronizer (70) extracts the symbolic code and the difference signal from the bit stream. A phoneme generator (76) forms the speech signal based upon the symbolic code and the difference signal.

Type: Grant

Filed: April 10, 1997

Date of Patent: November 3, 1998

Assignee: Hughes Electronics Corporation

Inventor: Donald C. Mead
Method for generating random code book of code-excited linear predictive coding

Patent number: 5826223

Abstract: A method for generating a random code book having a characteristic similar to a periodic component of voice in code-excited linear predictive (CELP) coding. The method includes generating an adaptive code book that removes the periodic component of a current subframe of a speech signal. An adaptive code book array is generated with respect to the current subframe on the basis of an optimal delay and gain obtained in generating the adaptive code book. A number of code word arrays are generated from the adaptive code book array and the excited signal of the immediately previous subframe. A code word that has the maximum value is selected from each code word array generated in the code word array generating step. Each code word array is normalized using the selected code word. The normalized maximum value in each code word array is selected and scaled by the power of the most previous frame. A random code book including a set of the scaled selected maximum values is generated.

Type: Grant

Filed: November 27, 1996

Date of Patent: October 20, 1998

Assignee: Samsung Electronics XCo., Ltd.

Inventors: Hong-kook Kim, Kee-eun Oh, Moo-young Kim
Voice encoding method and apparatus using modified discrete cosine transform

Patent number: 5819212

Abstract: A method and apparatus for encoding an input signal, such as a broad-range speech signal, in which a number of decoding operations with different bit rates are enabled for assuring a high encoding bit rate and for minimizing deterioration of the reproduced sound even with a low bit rate. The signal encoding method includes a band-splitting step for splitting an input signal into a number of bands and a step of encoding signals of the bands in a different manner depending on signal characteristics of the bands. Specifically, a low-range side signal is taken out by a low-pass filter from an input signal entering a terminal, and analyzed for Linear Predictive coding by an Linear Predictive coding analysis quantization unit. After finding the Linear Predictive coding residuals, as short-term prediction residuals by an Linear Predictive coding inverted filter, the pitch is found by a pitch analysis circuit. Then, pitch residuals are found by long-term prediction by a pitch inverted filter.

Type: Grant

Filed: October 24, 1996

Date of Patent: October 6, 1998

Assignee: Sony Corporation

Inventors: Jun Matsumoto, Shiro Omori, Masayuki Nishiguchi, Kazuyuki Iijima
Vocoder apparatus using the link margin

Patent number: 5812968

Abstract: An apparatus for improving the link margin of a communication link includes a variable rate vocoder which decreases the output bit stream rate it produces so as to reduce the amount of information having to be transmit in the communication link. In one embodiment, the variable rate vocoder includes a plurality of vocoder portions, each of which produces a different bit stream rate. The selector is used for selecting among the output bit streams produced by each vocoder. In another embodiment, a logic device is coupled to the output of the vocoder. The logic device, upon receipt of a control signal, truncates the less important bits.The method for improving link margin includes reducing the vocoder output rate thereby reducing the amount of data being transmit in an communication link. The method also includes using increased error correction coding and transmitting at increased per bit power levels to increase link margin.

Type: Grant

Filed: August 28, 1996

Date of Patent: September 22, 1998

Assignee: Ericsson, Inc.

Inventors: Amer A. Hassan, Peter D. Karabinis, Nils Rutger Rydbeck
Coding of a speech or music signal with quantization of harmonics components specifically and then residue components

Patent number: 5806024

Abstract: Harmonics coefficients are estimated in primary coefficients of an orthogonal transform of a speech or a music input signal by using a pitch frequency extracted from the input signal and are quantized into a harmonics code vector. Residue coefficients are calculated by removing the harmonics coefficients from the primary coefficients and quantized into residue code vectors and gain code vectors. It is possible to search harmonics excitation pulses at the harmonics locations for harmonics quantization into the harmonics code vector. On the other hand, it is possible to estimate the harmonics coefficients or excitation pulses by using quantized LSP parameters and to calculate secondary coefficients for use in weighting the harmonics quantization and residue quantization and, if applicable, in excitation pulse search.

Type: Grant

Filed: December 23, 1996

Date of Patent: September 8, 1998

Assignee: NEC Corporation

Inventor: Kazunori Ozawa
Speech coding by code-edited linear prediction

Patent number: 5787391

Abstract: In a speech coding method of the present invention, initially, a plurality of samples of speech data are analyzed by a linear prediction analysis and thereby prediction coefficients are calculated. Then, the prediction coefficients are quantized, and the quantized prediction coefficients are set in a synthesis filter. Moreover, a pitch period vector is selected from an adaptive codebook in which a plurality of pitch period vectors are stored, and the selected pitch period vector is multiplied by a first gain which is obtained, at the same time, with a second gain. In addition, a noise waveform vector is selected from a random codebook in which a plurality of the noise waveform vectors are stored, and is multiplied by a predicted gain and the second gain. Then, the speech vector is synthesized by exciting the synthesis filter with the pitch period vector multiplied by the first gain, and with the noise waveform vector multiplied by the predicted gain and the second gain.

Type: Grant

Filed: June 5, 1996

Date of Patent: July 28, 1998

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Takehiro Moriya, Akitoshi Kataoka, Kazunori Mano, Satoshi Miki, Hitoshi Omuro, Shinji Hayashi
Very low bit rate voice messaging system using asymmetric voice compression processing

Patent number: 5781882

Abstract: An apparatus and method for processing a voice message to provide low bit rate speech transmission processes the voice message to generate speech parameters which are arranged into a two dimensional parameter matrix (502) including a sequence of parameter frames. The two dimensional parameter matrix (502) is transformed using a predetermined two dimensional matrix transformation function (414) to obtain a two dimensional transform matrix (506). Distance values representing distances between templates of a set of predetermined templates and the two dimensional transform matrix (506) are then derived. The distance values derived are identified by indexes identifying the templates of the set of predetermined templates. The distance values derived are compared, and an index corresponding to a template of the set of predetermined templates having a shortest distance is selected and then transmitted.

Type: Grant

Filed: September 14, 1995

Date of Patent: July 14, 1998

Assignee: Motorola, Inc.

Inventors: Walter Lee Davis, Jian-Cheng Huang, Leon Jasinski
Variable rate vocoder

Patent number: 5778338

Abstract: An apparatus and method for performing speech signal compression, by variable rate coding of frames of digitized speech samples. The level of speech activity for each frame of digitized speech samples is determined and an output data packet rate is selected from a set of rates based upon the determined level of frame speech activity. A lowest rate of the set of rates corresponds to a detected minimum level of speech activity, such as background noise or pauses in speech, while a highest rate corresponds to a detected maximum level of speech activity, such as active vocalization. Each frame is then coded according to a predetermined coding format for the selected rate wherein each rate has a corresponding number of bits representative of the coded frame. A data packet is provided for each coded frame with each output data packet of a bit rate corresponding to the selected rate.

Type: Grant

Filed: January 23, 1997

Date of Patent: July 7, 1998

Assignee: QualComm Incorporated

Inventors: Paul E. Jacobs, William R. Gardner, Chong U. Lee, Klein S. Gilhousen, S. Katherine Lam, Ming-Chang Tsai
High efficiency encoding method

Patent number: 5765127

Abstract: A high efficiency encoding method for encoding data on frequency axis obtained by dividing an input audio signal on block-by-block basis and converting the signal onto the frequency axis, wherein V bands are searched for a band B.sub.VH with the highest center frequency if it is decided that there are one or more shift points of voiced (V)/unvoiced (UV) decision data of all bands on the frequency axis, and wherein the number of V bands N.sub.V up to the band B.sub.VH is found, so as to decide whether proportion of the V bands is equal to or higher than a predetermined threshold N.sub.th, thereby deciding one V/UV boundary point. Thus, it is possible to replace the V/UV decision data for each band by information on one demarcation in all bands, thereby to reduce data volume and to reduce bit rate. Also, by using two-stage hierarchical vector quantization in quantizing the data on the frequency axis, operation volume for codebook search and memory capacity of the codebook are reduced.

Type: Grant

Filed: December 6, 1993

Date of Patent: June 9, 1998

Inventors: Masayuki Nishiguchi, Jun Matsumoto, Shinobu Ono

prev … 5 6 7 8 9