Normalizing Patents (Class 704/234)

Process and device for blind equalization of the effects of a transmission channel on a digital speech signal

Patent number: 6157909

Abstract: A process and device for blind equalization of the effects of a transmission channel on a speech signal. The speech signal is transformed into cepstral vectors which are representative of the speech signal over a given horizon. A reference cepstrum consisting of a constant cepstrum signal representative of the long-term cepstrum of the speech signal is calculated for each cepstral vector. Each of the cepstral vectors is subjected to adaptive filtering by LMS on the basis of the reference cepstrum so as to generate a set of equalized cepstral vectors on the basis of the calculation of an error signal between the reference cepstrum and equalized cepstral vectors. The error signal is expressed as the difference between the reference cepstrum component of a given rank and the component of the same rank of the equalized cepstral vector.

Type: Grant

Filed: July 20, 1998

Date of Patent: December 5, 2000

Assignee: France Telecom

Inventors: Laurent Mauuary, Jean Monne
Source normalization training for HMM modeling of speech

Patent number: 6151573

Abstract: A maximum likelihood (ML) linear regression (LR) solution to environment normalization is provided where the environment is modeled as a hidden (non-observable) variable. By application of an expectation maximization algorithm and extension of Baum-Welch forward and backward variables (Steps 23a-23d) a source normalization is achieved such that it is not necessary to label a database in terms of environment such as speaker identity, channel, microphone and noise type.

Type: Grant

Filed: August 15, 1998

Date of Patent: November 21, 2000

Assignee: Texas Instruments Incorporated

Inventor: Yifan Gong
Speech recognition

Patent number: 6138095

Abstract: Speech recognition in which the log probabilities of the null and alternative hypothesis are computed for an input speech sample by comparison with specific stored speech vocabularies/grammars and with general speech characteristics. The difference in probabilities is normalized by the magnitude of the null hypothesis to derive a likelihood factor which is compared with a rejection threshold that is utterance-length dependent. Advantageously, a high-order polynomial representation of the rejection threshold length dependency may be simplified by a series of piece-wise constants which are stored as rejection thresholds to be selected in accordance with the length of the input speech sample.

Type: Grant

Filed: September 3, 1998

Date of Patent: October 24, 2000

Assignee: Lucent Technologies Inc.

Inventors: Sunil K. Gupta, Frank Kao-Ping Soong
Method and apparatus for providing an improved feature set in speech recognition by performing noise cancellation and background masking

Patent number: 6098040

Abstract: The invention relates to a method and apparatus for generating noise-attenuated feature vectors for use in recognizing speech, more particularly to a system and method providing a feature set for speech recognition that is robust to adverse noise conditions. This is done by receiving, through an input, a set of signal frames, at least some containing speech sounds, and then classifying the frames in the set of signal frames into classification groups on the basis of their energy levels. Each classification group is characterized by a mean energy value. In a specific example of implementation, the invention makes use of channel energy values to condition the frames in the set of signal frames. The frames in the set of signal frames are attenuated or noise reduced by altering the energy of the frames on the basis of the frames containing non-speech sounds.

Type: Grant

Filed: November 7, 1997

Date of Patent: August 1, 2000

Assignee: Nortel Networks Corporation

Inventors: Marco Petroni, Steven Douglas Peters
Communication network for transmitting speech signals

Patent number: 6038530

Abstract: In a speech communication network in which a transmitter (1) transmits speech signals via a network (4) to a receiver (8), it can happen that more traffic is offered to the network (4) than it can handle. In order to reduce the network load, at least one node (24) is arranged to perform bitrate reduction to delete some of the prediction parameters representing the speech signal. It can also be the case that a receiver comprises a speech decoder having insufficient computational power available for decoding the encoded speech signal. In such case, the speech decoder is arranged for using only a part of the prediction parameters available. This results in a lower complexity of the synthesis filter (60).

Type: Grant

Filed: February 9, 1998

Date of Patent: March 14, 2000

Assignee: U.S. Philips Corporation

Inventors: Rakesh Taori, Andreas J. Gerrits
Apparatus and method for correcting the difference in frequency characteristics between microphones for analyzing speech and for creating a recognition dictionary

Patent number: 6032115

Abstract: In sound recognition apparatus of the present invention, user's utterance or a sound provided by an output section using previously stored sound waveforms is simultaneously inputted through a basic microphone of known frequency characteristics and an input microphone of unknown frequency characteristics. An analysis section respectively analyzes the frequency of the input speech through the basic microphone and the input microphone. A frequency characteristics calculation section calculates first difference data between the frequencies of the input speech of the basic microphone and the input microphone, and calculates frequency characteristics of the input microphone according to the first difference data and the frequency characteristics of the basic microphone.

Type: Grant

Filed: September 26, 1997

Date of Patent: February 29, 2000

Assignee: Kabushiki Kaisha Toshiba

Inventors: Hiroshi Kanazawa, Takehiko Isaka, Yoshifumi Nagata, Hiroyuki Tsuboi
Method for establishing handset-dependent normalizing models for speaker recognition

Patent number: 5950157

Abstract: Adverse effects of type mismatch between acoustic input devices used during testing and during training in machine-based recognition of the source of acoustic phenomena are minimized. A normalizing model is matched to a source model based, or dependent, upon an acoustic input device whose transfer characteristics color acoustic characteristics of a source as represented in the source model. An application of the present invention is to speaker recognition, i.e., recognition of the identity of a speaker by the speaker's voice.

Type: Grant

Filed: April 18, 1997

Date of Patent: September 7, 1999

Assignee: SRI International

Inventors: Larry P. Heck, Mitchel Weintraub
Speaker independent speech recognition system and method

Patent number: 5946653

Abstract: An improved method of training a SISRS uses less processing and memory resources by operating on vectors instead of matrices which represent spoken commands. Memory requirements are linearly proportional to the number of spoken commands for storing each command model. A spoken command is identified from the set of spoken commands by a command recognition procedure (200). The command recognition procedure (200) includes sampling the speaker's speech, deriving cepstral coefficients and delta-cepstral coefficients, and performing a polynomial expansion on cepstral coefficients. The identified spoken command is selected using the dot product of the command model data and the average command structure representing the unidentified spoken command.

Type: Grant

Filed: October 1, 1997

Date of Patent: August 31, 1999

Assignee: Motorola, Inc.

Inventors: William Michael Campbell, John Eric Kleider, Charles Conway Broun, Carl Steven Gifford, Khaled Assaleh
System and method for classifying a speech signal

Patent number: 5924066

Abstract: A system and method for classifying a speech signal within a likely speech signal class of a plurality of speech signal classes are provided. Stochastic models include a plurality of states having state transitions and output probabilities to generate state sequences which model evolutionary characteristics and durational variability of a speech signal. The method includes extracting a frame sequence, and determining a state sequence for each stochastic model with each state sequence having full state segmentation. Representative frames are determined to provide speech signal time normalization. A likely speech signal class is determined from a neural network having a plurality of inputs receiving the representative frames and a plurality of outputs corresponding to the plurality of speech signal classes. An output signal is generated based on the likely stochastic model.

Type: Grant

Filed: September 26, 1997

Date of Patent: July 13, 1999

Assignees: U S WEST, Inc., MediaOne, Inc.

Inventor: Amlan Kundu
Adaptive equalizer preprocessor for mobile telephone speech coder to modify nonideal frequency response of acoustic transducer

Patent number: 5915235

Abstract: The present invention teaches an equalizer preprocessor for a mobile telephone speech coder that adapts to the characteristics of its input transducer. The equalizer determines the frequency response of the input transducer by measuring the long term characteristics of the input signal and estimating the spectral envelope of that signal. The equalizer then adapts so that the output signal has a spectral response closer to a perceptually ideal response in accordance with the calculated spectral envelope. In a first embodiment of the present invention, the adaptive equalizer is implemented using digital filtering techniques. The equalizer determines a set of long term autocorrelation coefficient values and from these values generates a set of filter taps which serve to whiten or flatten the spectral response of the input signal. This whitened signal is then passed through a target filter which impresses upon the whitened signal the target spectral response.

Type: Grant

Filed: October 17, 1997

Date of Patent: June 22, 1999

Inventors: Andrew P. DeJaco, John A. Miller
Speech adaptation system and speech recognizer

Patent number: 5890113

Abstract: An analyzing unit 1 converts an input speech into a feature vector time series. A reference pattern storing unit 3 stores the feature vector time series obtained by the same manner as in the analyzing unit. A matching unit 2 correlates for time axis the input speech feature vector time series and the reference patterns to one another. An environmental adapting unit 4 performs the environmental adaptation between the input speech feature vector time series and the reference patterns according to the result of matching in the matching unit 2. A speaker adapting unit 6 performs the adaptation concerning the speaker between the environmentally adapted reference patterns from the environmental adapting unit 4 and the input speech feature vector time series.

Type: Grant

Filed: December 13, 1996

Date of Patent: March 30, 1999

Assignee: NEC Corporation

Inventor: Keizaburo Takagi
Speech recognition using recursive time-domain high-pass filtering of spectral feature vectors

Patent number: 5878392

Abstract: A circuit arrangement for speech recognition carries out an analysis of a speech signal, extracting characteristic features. The extracted features are represented by spectral feature vectors which are compared with reference feature vectors stored for the speech signal to be recognized. The reference feature vectors are determined during a training phase in which a speech signal is recorded several times. A recognition result essentially depends on a quality of the spectral feature vectors and reference feature vectors. A recognition result essentially depends on a quality of the spectral feature vectors and reference feature vectors. A recursive high-pass filtering is performed in the time domain on the spectral feature vectors. Influences of noise signals on the recognition result are reduced by this and a high degree of speaker independence of the recognition is achieved.

Type: Grant

Filed: May 27, 1997

Date of Patent: March 2, 1999

Assignee: U.S. Philips Corporation

Inventors: Peter Meyer, Hans-Wilhelm Ruhl
Decision-directed frame-synchronous adaptive equalization filtering of a speech signal by implementing a hidden markov model

Patent number: 5864806

Abstract: For equalizing a speech signal constituted by an observed sequence of successive input sound frames, which speech signal is liable to be affected by disturbances, the speech signal is modelled by means of a hidden Markov model and, at each instant t: equalization filters are constituted in association with the paths in the Markov sense at instant t; at least a plurality of the equalization filters are applied to the frames to obtain, at instant t, a plurality of filtered sound frame sequences and an utterance probability for each of the paths respectively associated with the equalization filters applied; the equalization filter corresponding to the most probable path in the Markov sense is selected; and the filtered frame supplied by the selected equalization filter is selected as the equalized frame.

Type: Grant

Filed: May 5, 1997

Date of Patent: January 26, 1999

Assignee: France Telecom

Inventors: Chafic Mokbel, Denis Jouvet, Jean Monne
Method and recognizer for recognizing a sampled sound signal in noise

Patent number: 5842162

Abstract: A sound recognizer uses a feature value normalization process to substantially increase the accuracy of recognizing acoustic signals in noise. The sound recognizer includes a feature vector device which determines a number of feature values for a number of analysis frames, a min/max device which determines a minimum and maximum feature value for each of a number of frequency bands, a normalizer which normalizes each of the feature values with the minimum and maximum feature values resulting in normalized feature vectors, and a comparator which compares the normalized feature vectors with template feature vectors to identify one of the template feature vectors that most resembles the normalized feature vectors.

Type: Grant

Filed: September 23, 1997

Date of Patent: November 24, 1998

Assignee: Motorola, Inc.

Inventor: Adam B. Fineberg
Speaker verification system using decision fusion logic

Patent number: 5839103

Abstract: The present invention relates to a pattern recognition system which uses data fusion to combine data from a plurality of extracted features and a plurality of classifiers. Speaker patterns can be accurately verified with the combination of discriminant based and distortion based classifiers. A novel approach using a training set of a "leave one out" data can be used for training the system with a reduced data set. Extracted features can be improved with a pole filtered method for reducing channel effects and an affine transformation for improving the correlation between training and testing data.

Type: Grant

Filed: June 7, 1995

Date of Patent: November 17, 1998

Assignee: Rutgers, The State University of New Jersey

Inventors: Richard J. Mammone, Kevin Farrell, Manish Sharma, Devang Naik, Xiaoyu Zhang, Khaled Assaleh, Han-Seng Liou
Adaptive decision directed speech recognition bias equalization method and apparatus

Patent number: 5812972

Abstract: The present invention provides a speech recognizer that creates and updates the equalization vector as input speech is provided to the recognizer. The present invention includes a speech analyzer which transforms an input speech signal into a series of feature vectors or observation sequence. Each feature vector is then provided to a speech recognizer which modifies the feature vector by subtracting a previously determined equalization vector therefrom. The recognizer then performs segmentation and matches the modified feature vector to a stored model vector which is defined as the segmentation vector. The recognizer then, from time to time, determines a new equalization vector, the new equalization vector being defined based on the difference between one or more input feature vectors and their respective segmentation vectors.

Type: Grant

Filed: December 30, 1994

Date of Patent: September 22, 1998

Assignee: Lucent Technologies Inc.

Inventors: Biing-Hwang Juang, David Mansour, Jay Gordon Wilpon
System and method for determining pitch contours

Patent number: 5790978

Abstract: A system and method are provided for automatically computing local pitch contours from textual input to produce pitch contours that closely mimic those found in natural speech. The methodology of the invention incorporates parameterized equations whose parameters can be estimated directly from natural speech recordings. That methodology incorporates a model based on the premise that pitch contours instantiating a particular pitch contour class can be described as distortions in the temporal and frequency domains of a single, underlying contour. After the nature of the pitch contour for different pitch contour classes has been established, a pitch contour can be predicted that closely models a natural speech contour for a synthetic speech utterance by adding the individual contours of the different intonational classes and adjusting the boundaries of these to match the boundaries of the adjacent intonation curves.

Type: Grant

Filed: September 15, 1995

Date of Patent: August 4, 1998

Assignee: Lucent Technologies, Inc.

Inventors: Joseph Philip Olive, Jan Pieter VanSanten

prev 1 2 3 4 5