Patents Examined by Michelle Doerrler

Speech recognizer having a speech coder for an acoustic match based on context-dependent speech-transition acoustic models

Patent number: 5333236

Abstract: A speech coding apparatus compares the closeness of the feature value of a feature vector signal of an utterance to the parameter values of prototype vector signals to obtain prototype match scores for the feature vector signal and each prototype vector signal. The speech coding apparatus stores a plurality of speech transition models representing speech transitions. At least one speech transition is represented by a plurality of different models. Each speech transition model has a plurality of model outputs, each comprising a prototype match score for a prototype vector signal. Each model output has an output probability. A model match score for a first feature vector signal and each speech transition model comprises the output probability for at least one prototype match score for the first feature vector signal and a prototype vector signal.

Type: Grant

Filed: September 10, 1992

Date of Patent: July 26, 1994

Assignee: International Business Machines Corporation

Inventors: Lalit R. Bahl, Peter V. De Souza, Ponani S. Gopalakrishnan, Michael A. Picheny
Recognition apparatus with function of displaying plural recognition candidates

Patent number: 5329609

Abstract: A dictionary order sorter resorts character strings of recognition candidates stored in a high-ranking candidate memory in the order of distance into a dictionary order (character code order). Upon receipt of a sort termination signal a display controller displays the character strings of recognition candidates stored in the high-ranking candidate memory in the dictionary order and their ranking numbers in order on a display. Where an attribute-dependent sorter is provided in place of the dictionary order sorter, the character strings of recognition candidates stored in the distance order in the high-ranking candidate memory are sorted (grouped) according to attribute information of categories, such as parts of speech, concepts, etc., which are stored in a template memory for each of the recognition candidates and then displayed classified into groups. The categories of attribute information can arbitrarily be specified by the user.

Type: Grant

Filed: July 30, 1991

Date of Patent: July 12, 1994

Assignee: Fujitsu Limited

Inventors: Toru Sanada, Shinta Kimura, Kyung-Ho Loken-Kin
Pulse pattern excited linear prediction voice coder

Patent number: 5327519

Abstract: Speech coding of the code excited linear predictive type is implemented by providing an excitation vector which comprises a set of a pre-determined number of pulse patterns from a codebook of P pulse patterns, which have a selected orientation and a pre-determined delay with respect to the starting point of the excitation vector. This requires modest computational power and a small memory space, which allows it to be implemented in one signal processor.

Type: Grant

Filed: May 19, 1992

Date of Patent: July 5, 1994

Assignee: Nokia Mobile Phones Ltd.

Inventors: Jari Haggvist, Kari Jarvinen, Kari-Pekka Estola, Jukka Ranta
Method of use of voice message coder/decoder

Patent number: 5327520

Abstract: A code excited linear predictive coder and decoder well suited to speech recording, transmission and reproduction, especially in voice messaging systems, provides backward adaptive gain control of stored codevectors to be applied to a synthesis filter prior to being compared with sequences of input speech signals. Simplified linear predictive parameter quantization using efficient table lookup procedures, efficient codevector storage and search all contribute in an illustrative embodiment to high quality coding and decoding with reduced computational complexity.

Type: Grant

Filed: June 4, 1992

Date of Patent: July 5, 1994

Assignee: AT&T Bell Laboratories

Inventor: Juin-Hwey Chen
Audio analysis/synthesis system

Patent number: 5327518

Abstract: A method and apparatus for the automatic analysis, synthesis and modification of audio signals, based on an overlap-add sinusoidal model, is disclosed. Automatic analysis of amplitude, frequency and phase parameters of the model is achieved using an analysis-by-synthesis procedure which incorporates successive approximation, yielding synthetic waveforms which are very good approximations to the original waveforms and are perceptually identical to the original sounds. A generalized overlap-add sinusoidal model is introduced which can modify audio signals without objectionable artifacts. In addition, a new approach to pitch-scale modification allows for the use of arbitrary spectral envelope estimates and addresses the problems of high-frequency loss and noise amplification encountered with prior art methods.

Type: Grant

Filed: August 22, 1991

Date of Patent: July 5, 1994

Assignee: Georgia Tech Research Corporation

Inventors: E. Bryan George, Mark J. T. Smith
Speech signal coding and decoding system transmitting allowance range information

Patent number: 5325461

Abstract: A speech signal coding apparatus inputs a pitch period generated by coding a speech signal, and outputs information on a range for said pitch period, together with the pitch period. A speech signal decoding apparatus inputs the pitch period and the above information on the range, and determines whether or not the pitch period is within the range. When the pitch period is determined to be within the range, the speech signal decoding apparatus outputs the above pitch period. When the pitch period is determined not to be within the range, the speech signal decoding apparatus outputs as a pitch period a predetermined value within the range.

Type: Grant

Filed: February 20, 1992

Date of Patent: June 28, 1994

Assignee: Fujitsu Limited

Inventors: Yoshinori Tanaka, Yoshihiro Sakai, Yasuko Shirai, Tomohiko Taniguchi, Hideaki Kurihara
Speech coding system having codebook storing differential vectors between each two adjoining code vectors

Patent number: 5323486

Abstract: A speech coding system is provided where input speech is coded by finding via an evaluation computation a code vector giving a minimum error between reproduced signals obtained by linear prediction analysis filter processing, simulating speech path characteristics, on code vectors successively read out from a noise codebook storing a plurality of noise trains as code vectors and an input speech signal and by using a code specifying the code vector. In the speech coding system, the noise codebook includes a delta vector codebook which stores an initial vector and a plurality of delta vectors having difference vectors between adjoining code vectors. In addition, provision is made in the computing unit for the evaluation computation of a cyclic adding unit for cumulatively adding the delta vectors to virtually reproduce the code vectors.

Type: Grant

Filed: May 14, 1992

Date of Patent: June 21, 1994

Assignee: Fujitsu Limited

Inventors: Tomohiko Taniguchi, Mark Johnson, Yasuji Ohta, Hideaki Kurihara, Yoshinori Tanaka, Yoshihiro Sakai
Voice synthesizing apparatus and method and apparatus and method used as part of a voice synthesizing apparatus and method

Patent number: 5321794

Abstract: A voice synthesizing apparatus is arranged to synthesize a voice from text data composed of either character codes or a series of symbols by generating a sound source based on a series of sound-source parameters and synthesizing the sound source on the basis of a series of synthesis parameters. The voice synthesizing apparatus is provided with a sound-source generating circuit for generating the aforesaid sound source from a signal obtained from an instrumental sound generated with a musical instrument. This arrangement serves to easily synthesize voices which convey language information and yet which simulate the sounds of musical instruments such as a guitar, a violin, a harmonica, a musical synthesizer and the like.

Type: Grant

Filed: June 25, 1992

Date of Patent: June 14, 1994

Assignee: Canon Kabushiki Kaisha

Inventor: Junichi Tamura
Variable bit rate speech encoder

Patent number: 5317672

Abstract: A method and apparatus for allocating transmission bits for use in transmitting samples of a digital signal. An aggregate allowable quantization distortion value is selected representing an allowable quantization distortion error for a frame of samples of the digital signal. A set of samples are selected from the frame of samples such that a plurality of the selected samples are greater than a noise threshold. For each sample of the set, a sample quantization distortion value is computed which represents an allowable quantization distortion error for the sample. The sum of all sample quantization distortion values is approximately equal to the aggregate allowable quantization distortion value. For each sample of the set, a quantization step size is selected which yields a quantization distortion error approximately equal to the sample's corresponding quantization distortion value. Each sample is then quantized using its quantization step size.

Type: Grant

Filed: March 4, 1992

Date of Patent: May 31, 1994

Assignee: Picturetel Corporation

Inventors: Antony H. Crossman, Edmund S. Thompson
System for recognizing or counting spoken itemized expressions

Patent number: 5315688

Abstract: A speech categorization system includes first and second timers which generate first and second measured durations indicative of duration of selected higher and lower amplitude segments included in a voice message. A higher amplitude segment is classified in a first category when the first and second measured durations corresponding to the higher amplitude segment and an adjacent lower amplitude segment satisfy a classification test, and a counter counts the number of the higher amplitude segments classified in the first category Accented syllables in the higher amplitude segment are recognized to aid classification.

Type: Grant

Filed: January 18, 1991

Date of Patent: May 24, 1994

Inventor: Peter F. Theis
Speech recognition system having word-based and phoneme-based recognition means

Patent number: 5315689

Abstract: A speech recognition system includes a parameter extracting section for extracting a speech parameter of input speech, a first recognizing section for performing recognition processing by word-based matching, and a second recognizing section for performing word recognition by matching in units of word constituent elements. The first word recognizing section segments the speech parameter in units of words to extract a word speech pattern and performs word recognition by matching the word speech pattern with a predetermined word reference pattern. The second word recognizing section performs recognition in units of word constituent elements by using the extracted speech parameter and performs word recognition on the basis of candidates of an obtained word constituent element series.

Type: Grant

Filed: December 21, 1992

Date of Patent: May 24, 1994

Assignee: Kabushiki Kaisha Toshiba

Inventors: Hiroshi Kanazawa, Yoichi Takebayashi
Side fed superlattice for the production of linear predictor and filter coefficients

Patent number: 5315687

Abstract: Signal processing structures for providing direct prediction coefficients and direct Least Square-Finite Impulse Response (LS-FIR) filter coefficients. The structures include one or more processors, and a storage and retrieval structure for selectively storing predictor and filter coefficients and intermediate variables, to thereby allow the one or more real processors to emulate a plurality of virtual processors, which take the form of a side fed superlattice structure, in the case of linear prediction, and a side-fed superlattice-superladder structure, in the case of direct LS-FIR filtering.

Type: Grant

Filed: September 24, 1992

Date of Patent: May 24, 1994

Assignee: Adler Research Associates

Inventors: George Carayannis, Christos Halkias, Dimitris Manolakis, Elias Koukoutsis
Apparatus for quantizing an input group of data samples into one of N quantized groups of data via a process on less than N/2 reference groups of data samples

Patent number: 5313552

Abstract: A quantizer, which converts an input group of data samples into one of N quantized groups of data samples, does so by performing several dot products and compares. Each dot product is between the input group of data samples and a reference group of data samples u.sub.x, and each compare is with the dot product result and a constant k.sub.y. All of the reference groups of data samples and all of the constants are stored in a memory. This memory holds only N-1 constants and less then N/2 reference groups of data samples. A memory saving of several thousand percent is achieved by limiting the memory to hold no more then 2r reference groups of data samples where r is the number of samples in the input group, or no more then 2 log.sub.2 N reference groups of data samples.

Type: Grant

Filed: March 11, 1993

Date of Patent: May 17, 1994

Assignee: Unisys Corporation

Inventor: Robert A. Lindsay
Lombard voice recognition method and apparatus for recognizing voices in noisy circumstance

Patent number: 5313555

Abstract: A Lombard voice recognition method for recognizing a voice input in a noisy background includes a step of matching and a step of warping. The step of matching matches a frequency spectrum of a feature pattern derived from the input voice to a frequency spectrum of a standard pattern. The step of warping warps a lower frequency spectrum of the feature pattern than a lower frequency side with respect to a frequency axis in case that a noise level of the background in higher than a predetermined noise level.

Type: Grant

Filed: February 7, 1992

Date of Patent: May 17, 1994

Assignee: Sharp Kabushiki Kaisha

Inventor: Shin Kamiya
Backward gain adaptation method in code excited linear prediction coders

Patent number: 5313554

Abstract: An exemplary CELP coder where gain adaptation is performed using previous gain values in conjunction with an entry in a table comprising the logarithms of the root-mean-squared values of the codebook vectors, to predict the next gain value. Not only is this method less complex because the table entries are determined off-line, but in addition the use of a table at both the encoder and the decoder allows fixed-point/floating-point interoperability requirements to be met.

Type: Grant

Filed: June 16, 1992

Date of Patent: May 17, 1994

Assignee: AT&T Bell Laboratories

Inventor: Richard H. Ketchum
Method and apparatus for speaker individuality conversion

Patent number: 5307442

Abstract: Input speech of a reference speaker, who wants to convert his/her voice quality, and speech of a target speaker are converted into a digital signal by an analog to digital (A/D) converter. The digital signal is then subjected to speech analysis by a linear predictive coding (LPC) analyzer. Speech data of the reference speaker is processed into speech segments by a speech segmentation unit. A speech segment correspondence unit makes a dynamic programming (DP) based correspondence between the obtained speech segments and training speech data of the target speaker, thereby making a speech segment correspondence table. A speaker individuality conversion is made on the basis of the speech segment correspondence table by a speech individuality conversion and synthesis unit.

Type: Grant

Filed: September 17, 1991

Date of Patent: April 26, 1994

Assignee: ATR Interpreting Telephony Research Laboratories

Inventors: Masanobu Abe, Shigeki Sagayama
Method and apparatus for determining the excitation signal in VSELP coders

Patent number: 5307460

Abstract: A new basis vector search process that directly results in an optimal linear weighting for a VSELP (Vector Sum Excited Linear Prediction) coder, thus avoiding the need to perform an extensive search. In the present invention, the conventional search process is replaced by a direct formula, thus avoiding the time consuming searching procedure. Using a simple mathematical relationship, the process of filtering the basis signals with an impulse response filter h(n) every subframe is avoided. A simple theorem has been developed to reduce the computation involved in carrying out the filtering of the basis signals with h(n), and is referred to as the switching convolution theorem. As a result, the computation time necessary to produce the optimal weighting is reduced by a factor of from 3 to 4, while maintaining the output quality of the coder. The new apparatus and method are based upon a set of equations that includes several experimentally justified assumptions.

Type: Grant

Filed: February 14, 1992

Date of Patent: April 26, 1994

Assignee: Hughes Aircraft Company

Inventor: Haim Garten
Wear-toll quality 4.8 kbps speech codec

Patent number: 5307441

Abstract: A speech codec operating at low data rates uses an iterative method to jointly optimize pitch and gain parameter sets. A 26-bit spectrum filter coding scheme may be used, involving successive subtractions and quantizations. The codec may preferably use a decomposed multipulse excitation model, wherein the multipulse vectors used as the excitation signal are decomposed into position and amplitude codewords. Multipulse vectors are coded by comparing each vector to a reference multipulse vector and quantizing the resulting difference vector. An expanded multipulse excitation codebook and associated fast search method, optionally with a dynamically-weighted distortion measure, allow selection of the best excitation vector without memory or computational overload. In a dynamic bit allocation technique, the number of bits allocated to the pitch and excitation signals depend on whether the signals are "significant" or "insignificant".

Type: Grant

Filed: November 29, 1989

Date of Patent: April 26, 1994

Assignee: Comsat Corporation

Inventor: Forrest F.-T. Tzeng
Method and apparatus for hearing assistance with speech speed control function

Patent number: 5305420

Abstract: A method and an apparatus for hearing assistance, capable of compensating the lowering of the speech recognition ability related to the deterioration of the auditory sense center. The input speech is divided into voiced speech sections, unvoiced speech sections, and silent sections, of which the voiced speech sections and the silent sections are appropriately extended/contracted while the unvoiced speech sections are left unchanged, and then these sections are combined in an identical order as in the input speech, so as to obtain output speech which is easier to listen for a listener with a handicapped hearing ability.

Type: Grant

Filed: September 22, 1992

Date of Patent: April 19, 1994

Assignee: Nippon Hoso Kyokai

Inventors: Akira Nakamura, Ryou Ikezawa, Nobumasa Seiyama, Tohru Takagi, Eiichi Miyasaka
Low bit rate speech coding system and compression

Patent number: 5305421

Abstract: A speech coder apparatus operates to compress speech signals to a low bit rate. The apparatus includes a continuous speech recognizer (CSR) which has a memory for storing templates. Input speech is processed by the CSR where information in the speech is compared against the templates to provide an output digital signal indicative of recognized words, which signal is transmitted along a first path. There is further included a front end processor which is also responsive to the input speech signal for providing output digitized speech samples during a given frame interval. A side information encoder circuit responds to the output from the front end processor to provide at the output of the encoder a parameter signal indicative of the value of the pitch and word duration for each word as recognized by the CSR unit. The output of the encoder is transmitted as a second signal.

Type: Grant

Filed: August 28, 1991

Date of Patent: April 19, 1994

Assignee: ITT Corporation

Inventor: Kung-Pu Li

prev 1 2 3 4 5 6 7 8 9 … next