Patents Examined by Thomas J. Onka

Speech recognition training using bio-signals

Patent number: 5559926

Abstract: A bio-signal is monitored while a speech recognition system is trained to recognize a word or utterance. An utterance is identified for retraining when the bio-signal is above an upper threshold or below a lower threshold while the recognition system is being trained to recognize the utterance.

Type: Grant

Filed: December 22, 1993

Date of Patent: September 24, 1996

Assignee: Lucent Technologies Inc.

Inventor: Joseph DeSimone
Speech recognition using bio-signals

Patent number: 5539860

Abstract: The recognition rate of a speech recognition system is improved by compensating for changes in the user's speech that result from factors such as emotion, anxiety or fatigue. A speech signal derived from a user's utterance, and a bio-signal, which is indicative of the user's emotional state, are provided to a speech recognition system. The bio-signal is used to provide a reference frequency that changes when the user's emotional state changes. An utterance is identified by examining the relative magnitudes of its frequency components and the position of the frequency components relative to the reference frequency.

Type: Grant

Filed: December 22, 1993

Date of Patent: July 23, 1996

Assignee: AT&T Corp.

Inventors: Joseph DeSimone, Jian-Tu Hsieh
Voice coding communication system and apparatus

Patent number: 5539858

Abstract: A voice coding communication system in case of applying an adaptive differential PCM system in which a discontinuous transmitter is provided at the transmitting side to reduce its power consumption, transmission is suspended during a silent duration, and a pseudo-noise is generated at the receiving side and is used as the reproduced output during the silent duration. Discomfort is avoided in hearing the reproduced voice which is caused by the difference between the level of the pseudo-noise and the level of a background noise at the transmitting side.

Type: Grant

Filed: June 17, 1994

Date of Patent: July 23, 1996

Assignee: Kokusai Electric Co. Ltd.

Inventors: Seishi Sasaki, Masayasu Miyake
Speech recognition using bio-signals

Patent number: 5539861

Abstract: The recognition rate of a speech recognition system is improved by compensating for changes in the user's speech that result from factors such as emotion, anxiety or fatigue. A speech signal derived from a user's utterance is modified by a preprocessor and provided to a speech recognition system to improve the recognition rate. The speech signal is modified based on a bio-signal which is indicative of the user's emotional state.

Type: Grant

Filed: December 22, 1993

Date of Patent: July 23, 1996

Assignee: AT&T Corp.

Inventor: Joseph DeSimone
Sub-partitioned vector quantization of probability density functions

Patent number: 5535305

Abstract: A speech recognition memory compression method and apparatus subpartitions probability density function (pdf) space along the hidden Markov model (HMM) index into packets of typically 4 to 8 log-pdf values. Vector quantization techniques are applied using a logarithmic distance metric and a probability weighted logarithmic probability space for the splitting of clusters. Experimental results indicate a significant reduction in memory can be obtained with little increase in overall speech recognition error.

Type: Grant

Filed: December 31, 1992

Date of Patent: July 9, 1996

Assignee: Apple Computer, Inc.

Inventors: Alejandro Acero, Yen-Lu Chow, Kai-Fu Lee
Method of accommodating for carbon/electret telephone set variability in automatic speaker verification

Patent number: 5528731

Abstract: In a speaker verification system, a method of compensating for differences in speech samples obtained during registration and those obtained during verification due to the use of different types of microphones is provided by filtering at least one of the samples such that the similarities of the two samples are increased. The filtered sample is used within the speaker verification matching process. A two-way comparison is disclosed in which both a verification speech sample and a reference sample are filtered with nonlinear microphone characteristics such as carbon microphone characteristics. A four-way comparison is also disclosed in which patterns produced from unfiltered verification and reference samples and patterns produced from the filtered verification and reference samples are compared to identify a match. A score is determined for each comparison. The comparison having the best score is used to determine if a match has occurred.

Type: Grant

Filed: November 19, 1993

Date of Patent: June 18, 1996

Assignee: AT&T Corp.

Inventors: Richard M. Sachs, Max S. Schoeffler
Speech recognition apparatus

Patent number: 5526466

Abstract: At the time of training reference speech, the relationship between durations of each recognition unit is obtained by a duration training circuit and, at the time of recognizing speech, a beginning and an end of input speech is detected by a speech period sensing circuit, and then by using the relationship and the input speech period length, the durations of the recognition units in the input speech are estimated. Next, the reference speech and the input speech are matched by the matching means by using the calculated estimation values in such a manner that the recognition units have a duration close to that of the estimated values.

Type: Grant

Filed: April 11, 1994

Date of Patent: June 11, 1996

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventor: Yumi Takizawa
Method and system for location-specific speech recognition

Patent number: 5524169

Abstract: A method and system for reducing perplexity in a speech recognition system based upon determined geographic location. In a mobile speech recognition system which processes input frames of speech against stored templates representing speech, a core library of speech templates is created and stored representing a basic vocabulary of speech. Multiple location-specific libraries of speech templates are also created and stored, each library containing speech templates representing a specialized vocabulary for a specific geographic location. The geographic location of the mobile speech recognition system is then periodically determined utilizing a cellular telephone system, a geopositioning satellite system or other similar systems and a particular one of the location-specific libraries of speech templates is identified for the current location of the system.

Type: Grant

Filed: December 30, 1993

Date of Patent: June 4, 1996

Assignee: International Business Machines Incorporated

Inventors: Paul S. Cohen, John M. Lucassen, Roger M. Miller, Elton B. Sherwin, Jr.
Speech coding apparatus and method using classification rules

Patent number: 5522011

Abstract: A speech coding apparatus and method uses classification rules to code an utterance while consuming fewer computing resources. The value of at least one feature of an utterance is measured during each of a series of successive time intervals to produce a series of feature vector signals representing the feature values. The classification rules comprise at least first and second sets of classification rules. The first set of classification rules map each feature vector signal from a set of all possible feature vector signals to exactly one of at least two disjoint subsets of feature vector signals. The second set of classification rules map each feature vector signal in a subset of feature vector signals to exactly one of at least two different classes of prototype vector signals. Each class contains a plurality of prototype vector signals. According to the classification rules, a first feature vector signal is mapped to a first class of prototype vector signals.

Type: Grant

Filed: September 27, 1993

Date of Patent: May 28, 1996

Assignee: International Business Machines Corporation

Inventors: Mark E. Epstein, Ponani S. Gopalakrishnan, David Nahamoo, Michael A. Picheny, Jan Sedivy
Quantization process for a predictor filter for vocoder of very low bit rate

Patent number: 5522009

Abstract: A quantization process proposes a low data rate for predictor filters of a vocoder with a speech signal broken down into packets having a predetermined number L of frames of constant duration and a weight allocated to each frame according to the average strength of the speech signal in the respective frame. The process involves allocating a predictor filter for each frame and determining the possible configurations for predictor filters having the same number of coefficients and the possible configuration for which the coefficients of a current frame predictor filter are interpolated from the predictor filter coefficients from neighboring frames. Subsequently, a deterministic error is calculated by measuring the distances between the filters in order to form a first stack with a predetermined number of configurations which give the lowest errors.

Type: Grant

Filed: October 7, 1992

Date of Patent: May 28, 1996

Assignee: Thomson-CSF

Inventor: Pierre-Andre Laurent
Method of training neural networks used for speech recognition

Patent number: 5509103

Abstract: A speech-recognition system for recognizing isolated words includes pre-processing circuitry for performing analog-to-digital conversion and cepstral analysis, and a plurality of neural networks which compute discriminant functions based on polynomial expansions. The system may be implemented using either hardware or software or a combination thereof. The speech wave-form of a spoken word is analyzed and converted into a sequence of data frames. The sequence of frames is partitioned into data blocks, and the data blocks are then broadcast to a plurality of neural networks. Using the data blocks, the neural networks compute polynomial expansions. The output of the neural networks is used to determine the identity of the spoken word. The neural networks utilize a matrix-inversion or alternatively a least-squares estimation training algorithm which does not require repetitive training and which yields a global minimum to each given set of training examples.

Type: Grant

Filed: June 3, 1994

Date of Patent: April 16, 1996

Assignee: Motorola, Inc.

Inventor: Shay-Ping T. Wang
Reduction of phase information in coding of speech

Patent number: 5504832

Abstract: In an encoding device (100) operable in response to an input speech signal by means of an adaptive transform coding to produce an output encoded speech signal, the input speech signal is partitioned into data blocks by a partition circuit (113). Each of data blocks is decomposed into a plurality of frequency components by a Fourier transformer (114). A spectral envelope calculator (120) estimates intensity of a spectral envelope of the input speech signal. In cooperation with a scalar spectral calculator (115) and a bit assignment determiner (121), a quantizer (116) quantizes or encodes the frequency components with phase information selectively removed from a part of the frequency components on the basis of the intensity of the spectral envelope. In a decoding device, a phase information assignor assigns pseudo-phase information to each of the frequency components from which the phase information is selectively removed.

Type: Grant

Filed: December 23, 1992

Date of Patent: April 2, 1996

Assignee: NEC Corporation

Inventor: Tetsu Taguchi
Speech recognition method and system using triphones, diphones, and phonemes

Patent number: 5502790

Abstract: A speech recognition system starts by training hidden Markov models for all triphones, diphones, and phonemes occurring in a small training vocabulary. Hidden Markov models of a target vocabulary are created by concatenating the triphone, diphone, and phoneme models, using triphone models if available, diphone HMMs when triphone models are not available, and phoneme models when neither triphone nor diphone models are available. Utterances from the target vocabulary are recognized by choosing a model with maximum probability of reproducing quantized utterance features.

Type: Grant

Filed: December 21, 1992

Date of Patent: March 26, 1996

Assignee: Oki Electric Industry Co., Ltd.

Inventor: Jie Yi
Extended task space control for robotic manipulators

Patent number: 5499320

Abstract: The invention is a method of operating a robot in successive sampling intervals to perform a task, the robot having joints and joint actuators with actuator control loops, by decomposing the task into behavior forces, accelerations, velocities and positions of plural behaviors to be exhibited by the robot simultaneously, computing actuator accelerations of the joint actuators for the current sampling interval from both behavior forces, accelerations velocities and positions of the current sampling interval and actuator velocities and positions of the previous sampling interval, computing actuator velocities and positions of the joint actuators for the current sampling interval from the actuator velocities and positions of the previous sampling interval, and, finally, controlling the actuators in accordance with the actuator accelerations, velocities and positions of the current sampling interval.

Type: Grant

Filed: March 24, 1993

Date of Patent: March 12, 1996

Assignee: The United States of America as represented by the Administrator of the National Aeronautics and Space Administration

Inventors: Paul G. Backes, Mark K. Long
Waveform blending technique for text-to-speech system

Patent number: 5490234

Abstract: A concatenator for a first digital frame with a second digital frame, such as the ending and beginning of adjacent diphone strings being concatenated to form speech is based on determining an optimum blend point for the first and second digital frames in response to the magnitudes of samples in the first and second digital frames. The frames are then blended to generate a digital sequence representing a concatenation of the first and second frames with reference to the optimum blend point. The system operates by first computing an extended frame in response to the first digital frame, and then finding a subset of the extended frame with matches the second digital frame using a minimum average magnitude difference function over the samples in the subset. The blend point is the first sample of the matching subset.

Type: Grant

Filed: January 21, 1993

Date of Patent: February 6, 1996

Assignee: Apple Computer, Inc.

Inventor: Shankar Narayan
Formant detecting device and speech processing apparatus

Patent number: 5479560

Abstract: A speech processing apparatus for obtaining a processed speech which is natural and comfortable for a listener, by refining a gain value assigned for each frequency band in enhancing formants in a power spectrum. The power spectrum, calculated in a frequency analyzing unit, is subject to contrast enhancement in a contrast enhancing unit, and judged as to whether it is a format or not in each frequency band. In a gain value assigning unit, a gain value of 1 is assigned to a formant, and a gain value smaller than 1 is to a frequency other than formant. A threshold value for each frequency band is determined by a threshold value determining unit in accordance with power spectrum of input speech signal, to eliminate the effect of variation in speech level.

Type: Grant

Filed: October 27, 1993

Date of Patent: December 26, 1995

Assignee: Technology Research Association of Medical and Welfare Apparatus

Inventor: Tsuyoshi Mekata
Method of compressing an audio signal using adaptive bit allocation taking account of temporal masking

Patent number: 5475789

Abstract: An audio signal is coded by converting the audio signal into a signal in a frequency domain and effecting a big allocation on the converted audio signal. A masking threshold level for reducing aurally recognized noise due to a masking effect is determined using a signal spectral distribution in a present frame and a signal spectral distribution in a past frame. If the difference between the determined masking threshold level and a masking threshold level in the past frame is equal to or greater than a predetermined level, then a level limited to the difference corresponding to the predetermined level is regarded as a masking threshold level in the present frame.

Type: Grant

Filed: March 4, 1993

Date of Patent: December 12, 1995

Assignee: Sony Corporation

Inventor: Masayuki Nishiguchi
Speech recognition system with improved rejection of words and sounds not in the system vocabulary

Patent number: 5465317

Abstract: A speech recognizer that selects a command model for a current sound if the best match score for the current sound exceeds its corresponding threshold score. The threshold score is assigned a confidence score based on the best match score and recognition threshold of a prior sound.

Type: Grant

Filed: May 18, 1993

Date of Patent: November 7, 1995

Assignee: International Business Machines Corporation

Inventor: Edward A. Epstein
Voice input system for data retrieval

Patent number: 5454063

Abstract: A computer input system employing an automatic speech recognizer. The system is used for finding names in a data-base. The system reduces the number of letters a speaker needs to enter in order to find a name uniquely in the data-base. The system enables the speaker to enter inputs that identify which words in the name the speaker's letter inputs correspond to. The system builds a set of search parameters incorporating both the user's word identifier inputs and letter inputs. This set can be called an abbreviation because it usually represents a small fraction of the total number of letters in the name the user seeks to find.

Type: Grant

Filed: November 29, 1993

Date of Patent: September 26, 1995

Inventor: Michael T. Rossides

prev 1 2