Patents by Inventor Lawrence R. Rabiner

Lawrence R. Rabiner has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speech recognition employing key word modeling and non-key word modeling

Patent number: 5509104

Abstract: Speaker independent recognition of small vocabularies, spoken over the long distance telephone network, is achieved using two types of models, one type for defined vocabulary words (e.g., collect, calling-card, person, third-number and operator), and one type for extraneous input which ranges from non-speech sounds to groups of non-vocabulary words (e.g., "I want to make a collect call please"). For this type of key word spotting, modifications are made to a connected word speech recognition algorithm based on state-transitional (hidden Markov) models which allow it to recognize words from a pre-defined vocabulary list spoken in an unconstrained fashion. Statistical models of both the actual vocabulary words and the extraneous speech and background noises are created. A syntax-driven connected word recognition system is then used to find the best sequence of extraneous input and vocabulary word models for matching the actual input speech.

Type: Grant

Filed: October 6, 1993

Date of Patent: April 16, 1996

Assignee: AT&T Corp.

Inventors: Chin H. Lee, Lawrence R. Rabiner, Jay G. Wilpon

Speech recognition arrangement with preselection

Patent number: 4860358

Abstract: An arrangement and a method for recognizing a speech pattern employ quick initial processing, based upon subpatterns of speech features from which time-sequence information has been removed, followed by more conventional processing of a plurality of the best candidates. The initial processing, or pre-selection, eliminates large numbers of unlikely candidates, the best candidates being retained by setting an empirically determined threshold for similarity, as measured by a suitable distance measurement. To facilitate retrieving the complete speech feature patterns including time-sequence information for the subsequent processing, the code books of the subpatterns also contain associated indices pointing to the corresponding complete patterns.

Type: Grant

Filed: December 18, 1987

Date of Patent: August 22, 1989

Assignee: American Telephone and Telegraph Company, AT&T Bell Laboratories

Inventor: Lawrence R. Rabiner
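
The pre-selection idea in the abstract of patent 4860358 can be illustrated with a minimal Python sketch. It assumes each pattern is a sequence of code book indices, and it removes time-sequence information by reducing a pattern to a normalized histogram of those indices. The histogram representation, the L1 distance, the function names, and the threshold value are all illustrative assumptions, not details taken from the patent.

```python
from collections import Counter


def order_free_signature(frames):
    """Drop time order: normalized count of each code book index (assumed representation)."""
    counts = Counter(frames)
    total = len(frames)
    return {code: n / total for code, n in counts.items()}


def signature_distance(a, b):
    """L1 distance between two normalized histograms (a hypothetical distance choice)."""
    codes = set(a) | set(b)
    return sum(abs(a.get(c, 0.0) - b.get(c, 0.0)) for c in codes)


def preselect(input_frames, references, threshold):
    """Return the names of references whose order-free distance is within the threshold,
    closest first. The names act as indices back to the complete, time-ordered patterns."""
    query = order_free_signature(input_frames)
    scored = []
    for name, ref_frames in references.items():
        d = signature_distance(query, order_free_signature(ref_frames))
        if d <= threshold:
            scored.append((d, name))
    return [name for d, name in sorted(scored)]


# Toy reference patterns as sequences of code book indices (made-up data).
references = {
    "one": [3, 3, 7, 7, 1],
    "two": [5, 5, 2, 2, 2],
    "won": [3, 7, 7, 3, 1],
}
candidates = preselect([3, 7, 3, 7, 1], references, threshold=0.5)
print(candidates)
```

Only the surviving candidates would then go through the slower, time-aligned comparison; in this toy data "one" and "won" survive because they use the same indices in a different order, which is exactly the ambiguity the second stage has to resolve.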
Endpoint detector

Patent number: 4821325

Abstract: An arrangement for endpoint detection improves speech recognition accuracy where the input signal includes nonstationary noise. Energy pulses are found by looking for local energy level peaks, then analyzing surrounding energy levels to determine pulse boundaries. Energy pulses are combined according to predetermined criteria to form longer pulses corresponding to words or phrases in the input signal.

Type: Grant

Filed: November 8, 1984

Date of Patent: April 11, 1989

Assignee: American Telephone and Telegraph Company, AT&T Bell Laboratories

Inventors: Thomas B. Martin, Lawrence R. Rabiner, Jay G. Wilpon
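
The peak-based pulse search summarized in the abstract of patent 4821325 can be sketched roughly as follows. The sketch works on a list of per-frame energy values (for example in dB). The parameters `peak_min`, `drop`, and `max_gap`, and the rule that a pulse extends while energy stays within `drop` of its peak, are hypothetical stand-ins for the "predetermined criteria" that the abstract does not spell out.

```python
def find_pulses(energy, peak_min, drop):
    """Return (start, end) frame index pairs around local energy peaks.

    A frame is a peak if it reaches peak_min and is a local maximum.
    The pulse extends outward while energy stays above (peak - drop).
    """
    pulses = []
    n = len(energy)
    i = 0
    while i < n:
        left = energy[i - 1] if i > 0 else float("-inf")
        right = energy[i + 1] if i + 1 < n else float("-inf")
        if energy[i] >= peak_min and energy[i] >= left and energy[i] > right:
            floor = energy[i] - drop
            start = i
            while start > 0 and energy[start - 1] > floor:
                start -= 1
            end = i
            while end + 1 < n and energy[end + 1] > floor:
                end += 1
            pulses.append((start, end))
            i = end + 1  # continue the search after this pulse
        else:
            i += 1
    return pulses


def merge_pulses(pulses, max_gap):
    """Combine pulses separated by at most max_gap frames into longer pulses."""
    merged = []
    for start, end in pulses:
        if merged and start - merged[-1][1] - 1 <= max_gap:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Made-up energy contour: two nearby bursts and one isolated click.
energy = [10, 12, 40, 55, 42, 11, 10, 38, 50, 36, 10, 10, 10, 10, 45, 10]
pulses = find_pulses(energy, peak_min=30, drop=20)
words = merge_pulses(pulses, max_gap=3)
print(pulses, words)
```

Measuring boundaries relative to each local peak, rather than against one global threshold, is what lets this kind of search tolerate a noise floor that changes over time.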
Hidden Markov model speech recognition arrangement

Patent number: 4783804

Abstract: Markov model speech pattern templates are formed for speech analysis systems by analyzing identified speech patterns to generate frame sequences of acoustic feature signals representative thereof. The speech pattern template is produced by iteratively generating succeeding Markov model signal sets starting with an initial Markov model signal set. Each iteration includes forming a set of signals representative of the current iteration Markov model of the identified speech pattern responsive to said frame sequences of acoustic feature signals and one of the previous Markov model signal sets and comparing the current iteration Markov model signal set with said previous Markov model signal set to generate a signal corresponding to the similarity therebetween. The iterations are terminated when said similarity signal is equal to or smaller than a predetermined value and the last formed Markov model signal set is selected as a reference template for said identified speech pattern.

Type: Grant

Filed: March 21, 1985

Date of Patent: November 8, 1988

Assignee: American Telephone and Telegraph Company, AT&T Bell Laboratories

Inventors: Biing-Hwang Juang, Stephen E. Levinson, Lawrence R. Rabiner, Man M. Sondhi

LPC Word recognizer utilizing energy features

Patent number: 4519094

Abstract: In a speech recognition arrangement, a speech pattern is recognized as one of a plurality of reference patterns for which acoustic feature signal templates are stored. Each template includes a time frame (e.g., 10 millisecond) sequence of spectral features (e.g., LPC) and nonspectral features (e.g., acoustic energy (E) normalized to the peak energy over an utterance interval). LPC and normalized energy feature signal sequences are produced for an unknown speech pattern. For each time frame, the correspondence between the LPC features of the speech pattern and each reference is measured as well as the correspondence between the energy (E) features. In comparing the unknown speech features to those of the reference templates, the dynamic time warp distance $D_T = D_{LPC} + \alpha D_E$ is used, where $\alpha$ is a weighting factor selected to minimize the probability of erroneous recognition.

Type: Grant

Filed: August 26, 1982

Date of Patent: May 21, 1985

Assignee: AT&T Bell Laboratories

Inventors: Michael K. Brown, Lawrence R. Rabiner
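
The weighted combination of spectral and energy distances in the abstract of patent 4519094 can be sketched in Python as below. The sketch applies the weighting per frame inside a plain dynamic time warp. The squared Euclidean spectral distance is a stand-in (an assumption; LPC recognizers typically use a likelihood-ratio measure), and the frame layout, function names, and toy values are hypothetical.

```python
def frame_distance(a, b, alpha):
    """Per-frame cost: spectral distance plus alpha times energy distance.

    Each frame is (spectral_vector, normalized_energy).
    """
    spec_a, e_a = a
    spec_b, e_b = b
    d_spec = sum((x - y) ** 2 for x, y in zip(spec_a, spec_b))  # stand-in for an LPC distance
    d_energy = abs(e_a - e_b)
    return d_spec + alpha * d_energy


def dtw_distance(test, ref, alpha):
    """Accumulate the combined frame cost along the best warping path."""
    inf = float("inf")
    n, m = len(test), len(ref)
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = frame_distance(test[i - 1], ref[j - 1], alpha)
            acc[i][j] = cost + min(acc[i - 1][j], acc[i][j - 1], acc[i - 1][j - 1])
    return acc[n][m]


def recognize(test, templates, alpha):
    """Pick the reference word with the smallest combined distance."""
    return min(templates, key=lambda word: dtw_distance(test, templates[word], alpha))


# Two templates with identical spectra but different energy contours (made-up values).
templates = {
    "a": [((1.0, 0.0), 0.0), ((1.0, 0.0), -10.0)],
    "b": [((1.0, 0.0), 0.0), ((1.0, 0.0), -1.0)],
}
test = [((1.0, 0.0), 0.0), ((1.0, 0.0), -1.5)]
print(recognize(test, templates, alpha=1.0))
```

With `alpha=0.0` the two templates are indistinguishable in this toy case; the energy term is what separates them, which is the motivation for tuning the weighting factor against recognition errors.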
Dynamic time warping arrangement

Patent number: 4488243

Abstract: In a speech recognition system, dynamic time warp (DTW) calculations are reduced by use of a search strategy for an optimal non-linear search warp function path: signals predictive of the similarity are generated responsive to the preceding and current correspondence signals of the input word to reference warped words.

Type: Grant

Filed: May 3, 1982

Date of Patent: December 11, 1984

Assignee: AT&T Bell Laboratories

Inventors: Michael K. Brown, Lawrence R. Rabiner

Method and apparatus for generating speech pattern templates

Patent number: 4454586

Abstract: A system for generating speech pattern templates for use with either speech recognition or speech synthesis. Reference demisyllable templates are first generated from a reference first speaker using both manual and automatic analysis. The analysis for a second speaker is simplified and automated by comparing with the first speaker's templates. The second speaker speaks the same words at a rate time-warped to match the first speaker's rate and template. We define a demisyllable as each of the two halves of a syllable, assuming a syllable starts and ends with a noisy consonant, and the syllable is split at its vowel center, thereby simplifying concatenation and comparison. Key features of the invention include generating a set of signals representative of the time alignment between the first and second speaker's templates, and the time-of-occurrence boundaries of each syllable in a word.

Type: Grant

Filed: November 19, 1981

Date of Patent: June 12, 1984

Assignee: AT&T Bell Laboratories

Inventors: Frank C. Pirz, Lawrence R. Rabiner, Jay G. Wilpon

Continuous speech pattern recognizer

Patent number: 4400788

Abstract: This speech recognizer concatenates a string of reference isolated-words for comparison with the unknown string of connected-words. The invention includes a level-building (LB) algorithm, "level" implying a location in a sequence of words. A constrained endpoint dynamic-time-warp algorithm, in which the slope of the warping function is restricted between 1/2 and 2, is used to find the best alignment between an unknown continuous-word test pattern, and a concatenated sequence of L reference patterns. Properties of the LB algorithm include: modification of the references; back-track decision logic; heuristic selection of multiple candidates, and syntax constraints. As a result, the processing required is less than two-level dynamic-program-matching and sampling algorithms.

Type: Grant

Filed: March 27, 1981

Date of Patent: August 23, 1983

Assignee: Bell Telephone Laboratories, Incorporated

Inventors: Cory S. Myers, Frank C. Pirz, Lawrence R. Rabiner
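
The slope restriction mentioned in the abstract of patent 4400788 (warping slope between 1/2 and 2, with constrained endpoints) can be illustrated with a small Python sketch. It uses one common set of local path steps that enforces that slope range; this particular step set, the scalar features, and the function name are assumptions, and the sketch does not implement the level-building algorithm itself.

```python
def constrained_dtw(test, ref):
    """DTW with fixed endpoints and local steps that keep the warp slope in [1/2, 2].

    Allowed moves into (i, j): from (i-1, j-1), from (i-2, j-1) via (i-1, j),
    and from (i-1, j-2) via (i, j-1). Features are scalars for simplicity.
    Returns infinity when no path satisfies the constraints.
    """
    inf = float("inf")
    n, m = len(test), len(ref)

    def d(i, j):
        return abs(test[i] - ref[j])

    acc = [[inf] * m for _ in range(n)]
    acc[0][0] = d(0, 0)  # path must start at the first frames of both patterns
    for i in range(1, n):
        for j in range(1, m):
            best = acc[i - 1][j - 1]
            if i >= 2:
                best = min(best, acc[i - 2][j - 1] + d(i - 1, j))
            if j >= 2:
                best = min(best, acc[i - 1][j - 2] + d(i, j - 1))
            acc[i][j] = best + d(i, j)
    return acc[n - 1][m - 1]  # path must end at the last frames of both patterns


same = constrained_dtw([1, 2, 3], [1, 2, 3])
stretched = constrained_dtw([1, 1, 2, 2, 3], [1, 2, 3])
too_long = constrained_dtw([1, 1, 1, 2, 2, 3, 3], [1, 2, 3])
print(same, stretched, too_long)
```

A test pattern more than about twice as long as the reference cannot be aligned at all under these steps, so the constraint also prunes implausible matches before any distance comparison takes place.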
Endpoint detector

Patent number: 4370521

Abstract: An arrangement for endpoint detection improves speech recognition accuracy and lowers rejection rates by developing an ordered list of endpoint candidates. A triple thresholding technique defines energy signal pulses. The energy pulses are combined according to predetermined criteria to form the endpoint candidates.

Type: Grant

Filed: December 19, 1980

Date of Patent: January 25, 1983

Assignee: Bell Telephone Laboratories, Incorporated

Inventors: James D. Johnston, Lori F. Lamel, Lawrence R. Rabiner, Aaron E. Rosenberg, Jay G. Wilpon

Continuous speech recognition system

Patent number: 4349700

Abstract: Recognition of continuous speech by comparison with prestored isolated words may be confused by the merging together of spoken adjacent words (coarticulation). Improved recognition is attained by generating overlap-words, e.g., words whose first phoneme is the end phoneme of the preceding word in a string of words. The reference candidate series of overlap-words is transformed under dynamic time warping so as to time-match the utterance series of overlap-words.

Type: Grant

Filed: April 8, 1980

Date of Patent: September 14, 1982

Assignee: Bell Telephone Laboratories, Incorporated

Inventors: Frank C. Pirz, Lawrence R. Rabiner

Spoken word controlled automatic dialer

Patent number: 4348550

Abstract: A speech controlled dialing circuit identifies input utterances which may be a command word (mode select), repertory word (dialing name or number), or non-recognized ("Other"). Responsive to the identification of each occurring input utterance, a set of predetermined templates are selected to identify the next occurring utterance. A programmed microprocessor system is described to implement the main controller function.

Type: Grant

Filed: June 9, 1980

Date of Patent: September 7, 1982

Assignee: Bell Telephone Laboratories, Incorporated

Inventors: Frank C. Pirz, Lawrence R. Rabiner, Aaron E. Rosenberg, Jay G. Wilpon

Multiple template speech recognition system

Patent number: 4181821

Abstract: A speech analyzer for recognizing an unknown utterance as one of a set of reference words is adapted to generate a feature signal set for each utterance of every reference word. At least one template signal is produced for each reference word which template signal is representative of a group of feature signal sets. Responsive to a feature signal set formed from the unknown utterance and each reference word template signal, a signal representative of the similarity between the unknown utterance and the template signal is generated. A plurality of similarity signals for each reference word is selected and a signal corresponding to the average of said selected similarity signals is formed. The average similarity signals are compared to identify the unknown utterance as the most similar reference word.

Type: Grant

Filed: October 31, 1978

Date of Patent: January 1, 1980

Assignee: Bell Telephone Laboratories, Incorporated

Inventors: Frank C. Pirz, Lawrence R. Rabiner
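
The decision rule in the abstract of patent 4181821 (several templates per reference word, a plurality of similarity signals selected per word, and their average compared across words) can be sketched as follows. The sketch assumes similarity is expressed as a distance where smaller means more similar, and that the selected signals are the `k` smallest distances; both points, along with the function name and data, are assumptions rather than details stated in the abstract.

```python
def recognize(distances_by_word, k=2):
    """Average the k smallest template distances per word and pick the best word.

    distances_by_word maps each reference word to a non-empty list of distances,
    one per stored template of that word.
    """
    averages = {}
    for word, distances in distances_by_word.items():
        best = sorted(distances)[:k]
        averages[word] = sum(best) / len(best)
    return min(averages, key=averages.get), averages


# Made-up distances between one unknown utterance and three templates per word.
distances_by_word = {
    "yes": [0.9, 0.2, 0.4],
    "no": [0.1, 0.8, 0.9],
}
word, averages = recognize(distances_by_word, k=2)
print(word, averages)
```

In this toy case the single closest template belongs to "no", but averaging the two best distances favors "yes", which shows how averaging can guard against one atypical template dominating the decision.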
Multiple template speech recognition system

Patent number: RE31188

Abstract: A speech analyzer for recognizing an unknown utterance as one of a set of reference words is adapted to generate a feature signal set for each utterance of every reference word. At least one template signal is produced for each reference word which template signal is representative of a group of feature signal sets. Responsive to a feature signal set formed from the unknown utterance and each reference word template signal, a signal representative of the similarity between the unknown utterance and the template signal is generated. A plurality of similarity signals for each reference word is selected and a signal corresponding to the average of said selected similarity signals is formed. The average similarity signals are compared to identify the unknown utterance as the most similar reference word.

Type: Grant

Filed: December 31, 1981

Date of Patent: March 22, 1983

Assignee: Bell Telephone Laboratories, Incorporated

Inventors: Frank C. Pirz, Lawrence R. Rabiner

Spoken word controlled automatic dialer

Patent number: RE32012

Abstract: A speech controlled dialing circuit identifies input utterances which may be a command word (mode select), repertory word (dialing name or number), or nonrecognized ("Other"). Responsive to the identification of each occurring input utterance, a set of predetermined templates are selected to identify the next occurring utterance. A programmed microprocessor system is described to implement the main controller function.

Type: Grant

Filed: September 7, 1984

Date of Patent: October 22, 1985

Assignee: AT&T Bell Laboratories

Inventors: Frank C. Pirz, Lawrence R. Rabiner, Aaron E. Rosenberg, Jay G. Wilpon

Endpoint detector

Patent number: RE32172

Abstract: An arrangement for endpoint detection improves speech recognition accuracy and lowers rejection rates by developing an ordered list of endpoint candidates. A triple thresholding technique defines energy signal pulses. The energy pulses are combined according to predetermined criteria to form the endpoint candidates.

Type: Grant

Filed: January 25, 1985

Date of Patent: June 3, 1986

Assignee: AT&T Bell Laboratories

Inventors: James D. Johnston, Lori F. Lamel, Lawrence R. Rabiner, Aaron E. Rosenberg, Jay G. Wilpon

Hidden Markov model speech recognition arrangement

Patent number: RE33597

Abstract: A speech recognizer includes a plurality of stored constrained hidden Markov model reference templates and a set of stored signals representative of prescribed acoustic features of the said plurality of reference patterns. The Markov model template includes a set of N state signals. The number of states is preselected to be independent of the reference pattern acoustic features and preferably substantially smaller than the number of acoustic feature frames of the reference patterns. An input utterance is analyzed to form a sequence of said prescribed feature signals representative of the utterance. The utterance representative prescribed feature signal sequence is combined with the N state constrained hidden Markov model template signals to form a signal representative of the probability of the utterance being each reference pattern. The input speech pattern is identified as one of the reference patterns responsive to the probability representative signals.

Type: Grant

Filed: May 5, 1988

Date of Patent: May 28, 1991

Inventors: Stephen E. Levinson, Lawrence R. Rabiner, Man M. Sondhi
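
The scoring step in the abstract of patent RE33597 (combining an utterance's feature sequence with each N-state hidden Markov model template to obtain a probability, then choosing among reference patterns) can be illustrated with the standard forward algorithm. The sketch assumes discrete observation symbols and interprets "constrained" as a left-to-right model with no backward transitions; that interpretation, the model parameters, and the function names are illustrative assumptions.

```python
def forward_probability(obs, start, trans, emit):
    """Probability of an observation sequence under a discrete HMM (forward algorithm).

    start[s]: initial probability of state s
    trans[p][s]: probability of moving from state p to state s
    emit[s][o]: probability of state s emitting symbol o
    """
    n = len(start)
    alpha = [start[s] * emit[s][obs[0]] for s in range(n)]
    for symbol in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n)) * emit[s][symbol]
            for s in range(n)
        ]
    return sum(alpha)


def recognize(obs, models):
    """Pick the reference pattern whose model gives the utterance the highest probability."""
    scores = {name: forward_probability(obs, *params) for name, params in models.items()}
    return max(scores, key=scores.get)


# Two-state left-to-right models over symbols {0, 1} (made-up parameters).
start = [1.0, 0.0]
trans = [[0.5, 0.5], [0.0, 1.0]]  # no transition back from state 1 to state 0
models = {
    "A": (start, trans, [[0.9, 0.1], [0.1, 0.9]]),  # symbol 0 first, then symbol 1
    "B": (start, trans, [[0.1, 0.9], [0.9, 0.1]]),  # symbol 1 first, then symbol 0
}
best = recognize([0, 0, 1, 1], models)
print(best)
```

The number of states here (two) is smaller than the number of observation frames (four), in line with the abstract's point that the state count is preselected rather than tied to the length of the reference pattern.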