Patents by Inventor James G. Droppo

James G. Droppo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for constructing a speech filter using estimates of clean speech and noise

Patent number: 7725314

Abstract: A method and apparatus identify a clean speech signal from a noisy speech signal. To do this, a clean speech value and a noise value are estimated from the noisy speech signal. The clean speech value and the noise value are then used to define a gain on a filter. The noisy speech signal is applied to the filter to produce the clean speech signal. Under some embodiments, the noise value and the clean speech value are used in both the numerator and the denominator of the filter gain, with the numerator being guaranteed to be positive.

Type: Grant

Filed: February 16, 2004

Date of Patent: May 25, 2010

Assignee: Microsoft Corporation

Inventors: Jian Wu, James G. Droppo, Li Deng, Alejandro Acero
Multi-sensory speech enhancement using a speech-state model

Patent number: 7680656

Abstract: A method and apparatus determine a likelihood of a speech state based on an alternative sensor signal and an air conduction microphone signal. The likelihood of the speech state is used, together with the alternative sensor signal and the air conduction microphone signal, to estimate a clean speech value for a clean speech signal.

Type: Grant

Filed: June 28, 2005

Date of Patent: March 16, 2010

Assignee: Microsoft Corporation

Inventors: Zhengyou Zhang, Zicheng Liu, Alejandro Acero, Amarnag Subramanya, James G. Droppo
LOUDSPEAKER ARRAY DESIGN

Publication number: 20090304211

Abstract: Sound signals to be output from a loudspeaker array are modified by a plurality of filters designed according to an unconstrained optimization procedure to improve overall performance (e.g., power, directivity) of the loudspeaker array. More particularly, respective filters are configured to receive a signal to be output to a plurality of loudspeakers. Upon receiving the signal, the respective filters individually modify the received signal according to the results of the unconstrained optimization procedure and then output the individually modified signals to respective loudspeakers. The unconstrained optimization procedure takes into account manufacturing tolerances and individually enhances the signal output to each of a plurality of individual loudspeakers within an array to achieve an overall improvement in performance.

Type: Application

Filed: June 4, 2008

Publication date: December 10, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Ivan J. Tashev, James G. Droppo, Michael L. Seltzer, Alejandro Acero
Method of noise reduction based on dynamic aspects of speech

Patent number: 7617098

Abstract: A system and method are provided that reduce noise in pattern recognition signals. To do this, embodiments of the present invention utilize a prior model of dynamic aspects of clean speech together with one or both of a prior model of static aspects of clean speech, and an acoustic model that indicates the relationship between clean speech, noisy speech and noise. In one embodiment, components of a noise-reduced feature vector are produced by forming a weighted sum of predicted values from the prior model of dynamic aspects of clean speech, the prior model of static aspects of clean speech and the acoustic-environmental model.

Type: Grant

Filed: May 12, 2006

Date of Patent: November 10, 2009

Assignee: Microsoft Corporation

Inventors: Li Deng, James G. Droppo, Alejandro Acero
Method and apparatus for reducing noise corruption from an alternative sensor signal during multi-sensory speech enhancement

Patent number: 7590529

Abstract: A method and apparatus classify a portion of an alternative sensor signal as either containing noise or not containing noise. The portions of the alternative sensor signal that are classified as containing noise are not used to estimate a portion of a clean speech signal and the channel response associated with the alternative sensor. The portions of the alternative sensor signal that are classified as not containing noise are used to estimate a portion of a clean speech signal and the channel response associated with the alternative sensor.

Type: Grant

Filed: February 4, 2005

Date of Patent: September 15, 2009

Assignee: Microsoft Corporation

Inventors: Zhengyou Zhang, Amarnag Subramanya, James G. Droppo, Zicheng Liu
Method and apparatus for multi-sensory speech enhancement

Patent number: 7574008

Abstract: A method and apparatus determine a channel response for an alternative sensor using an alternative sensor signal and an air conduction microphone signal. The channel response is then used to estimate a clean speech value using at least a portion of the alternative sensor signal.

Type: Grant

Filed: September 17, 2004

Date of Patent: August 11, 2009

Assignee: Microsoft Corporation

Inventors: Zhengyou Zhang, Alejandro Acero, James G. Droppo, Xuedong David Huang, Zicheng Liu
SPEECH RECOGNITION WITH NON-LINEAR NOISE REDUCTION ON MEL-FREQUENCY CEPTRA

Publication number: 20090177468

Abstract: In an automatic speech recognition system, a feature extractor extracts features from a speech signal, and speech is recognized by the automatic speech recognition system based on the extracted features. Noise reduction as part of the feature extractor is provided by feature enhancement in which feature-domain noise reduction in the form of Mel-frequency cepstra is provided based on the minimum means square error criterion. Specifically, the devised method takes into account the random phase between the clean speech and the mixing noise. The feature-domain noise reduction is performed in a dimension-wise fashion to the individual dimensions of the feature vectors input to the automatic speech recognition system, in order to perform environment-robust speech recognition.

Type: Application

Filed: January 8, 2008

Publication date: July 9, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Dong Yu, Alejandro Acero, James G. Droppo, Li Deng
Noise reduction using correction vectors based on dynamic aspects of speech and noise normalization

Patent number: 7542900

Abstract: A method and apparatus are provided for reducing noise in a signal. Under one aspect of the invention, a correction vector is selected based on a noisy feature vector that represents a noisy signal. The selected correction vector incorporates dynamic aspects of pattern signals. The selected correction vector is then added to the noisy feature vector to produce a cleaned feature vector. In other aspects of the invention, a noise value is produced from an estimate of the noise in a noisy signal. The noise value is subtracted from a value representing a portion of the noisy signal to produce a noise-normalized value. The noise-normalized value is used to select a correction value that is added to the noise-normalized value to produce a cleaned noise-normalized value. The noise value is then added to the cleaned noise-normalized value to produce a cleaned value representing a portion of a cleaned signal.

Type: Grant

Filed: May 5, 2006

Date of Patent: June 2, 2009

Assignee: Microsoft Corporation

Inventors: James G. Droppo, Li Deng, Alejandro Acero
Method of pattern recognition using noise reduction uncertainty

Patent number: 7460992

Abstract: A method and apparatus are provided for using the uncertainty of a noise-removal process during pattern recognition. In particular, noise is removed from a representation of a portion of a noisy signal to produce a representation of a cleaned signal. In the meantime, an uncertainty associated with the noise removal is computed and is used with the representation of the cleaned signal to modify a probability for a phonetic state in the recognition system. In particular embodiments, the uncertainty is used to modify a probability distribution, by increasing the variance in each Gaussian distribution by the amount equal to the estimated variance of the cleaned signal, which is used in decoding the phonetic state sequence in a pattern recognition task.

Type: Grant

Filed: May 16, 2006

Date of Patent: December 2, 2008

Assignee: Microsoft Corporation

Inventors: James G. Droppo, Alejandro Acero, Li Deng
METHOD OF PATTERN RECOGNITION USING NOISE REDUCTION UNCERTAINTY

Publication number: 20080281591

Abstract: A method and apparatus are provided for using the uncertainty of a noise-removal process during pattern recognition. In particular, noise is removed from a representation of a portion of a noisy signal to produce a representation of a cleaned signal. In the meantime, an uncertainty associated with the noise removal is computed and is used with the representation of the cleaned signal to modify a probability for a phonetic state in the recognition system. In particular embodiments, the uncertainty is used to modify a probability distribution, by increasing the variance in each Gaussian distribution by the amount equal to the estimated variance of the cleaned signal, which is used in decoding the phonetic state sequence in a pattern recognition task.

Type: Application

Filed: July 25, 2008

Publication date: November 13, 2008

Applicant: MICROSOFT CORPORATION

Inventors: James G. Droppo, Alejandro Acero, Li Deng
Method and apparatus for multi-sensory speech enhancement

Patent number: 7447630

Abstract: A method and system use an alternative sensor signal received from a sensor other than an air conduction microphone to estimate a clean speech value. The estimation uses either the alternative sensor signal alone, or in conjunction with the air conduction microphone signal. The clean speech value is estimated without using a model trained from noisy training data collected from an air conduction microphone. Under one embodiment, correction vectors are added to a vector formed from the alternative sensor signal in order to form a filter, which is applied to the air conductive microphone signal to produce the clean speech estimate. In other embodiments, the pitch of a speech signal is determined from the alternative sensor signal and is used to decompose an air conduction microphone signal. The decomposed signal is then used to determine a clean signal estimate.

Type: Grant

Filed: November 26, 2003

Date of Patent: November 4, 2008

Assignee: Microsoft Corporation

Inventors: Zicheng Liu, Michael J. Sinclair, Alejandro Acero, Xuedong D. Huang, James G. Droppo, Li Deng, Zhengyou Zhang, Yanli Zheng
Pitch model for noise estimation

Publication number: 20080215321

Abstract: Pitch is tracked for individual samples, which are taken much more frequently than an analysis frame. Speech is identified based on the tracked pitch and the speech components of the signal are removed with a time-varying filter, leaving only an estimate of a time-varying speech signal. This estimate is then used to generate a time-varying noise model which, in turn, can be used to enhance speech related systems.

Type: Application

Filed: April 19, 2007

Publication date: September 4, 2008

Applicant: Microsoft Corporation

Inventors: James G. Droppo, Alejandro Acero, Luis Buera
DISCRIMINATIVE TRAINING FOR SPEECH RECOGNITION

Publication number: 20080114596

Abstract: Parameters for a feature extractor and acoustic model of a speech recognition module are trained. An objective function is utilized to determine values for the feature extractor parameters and the acoustic model parameters.

Type: Application

Filed: November 15, 2006

Publication date: May 15, 2008

Applicant: Microsoft Corporation

Inventors: Alejandro Acero, James G. Droppo, Milind V. Mahajan
NOISE SUPPRESSOR FOR SPEECH RECOGNITION

Publication number: 20080114593

Abstract: A noise suppressor for altering a speech signal is trained based on a speech recognition system. An objective function can be utilized to adjust parameters of the noise suppressor. The noise suppressor can be used to alter speech signals for the speech recognition system.

Type: Application

Filed: November 15, 2006

Publication date: May 15, 2008

Applicant: Microsoft Corporation

Inventors: Ivan J. Tashev, Alejandro Acero, James G. Droppo
Method of noise reduction using instantaneous signal-to-noise ratio as the principal quantity for optimal estimation

Patent number: 7363221

Abstract: A system and method are provided that accurately estimate noise and that reduce noise in pattern recognition signals. The method and system define a mapping random variable as a function of at least a clean signal random variable and a noise random variable. A model parameter that describes at least one aspect of a distribution of values for the mapping random variable is then determined. Based on the model parameter, an estimate for the clean signal random variable is determined. Under many aspects of the present invention, the mapping random variable is a signal-to-noise ratio variable and the method and system estimate a value for the signal-to-noise ratio variable from the model parameter.

Type: Grant

Filed: August 19, 2003

Date of Patent: April 22, 2008

Assignee: Microsoft Corporation

Inventors: James G. Droppo, Li Deng, Alejandro Acero
Method of determining uncertainty associated with acoustic distortion-based noise reduction

Patent number: 7289955

Abstract: A method and apparatus are provided for determining uncertainty in noise reduction based on a parametric model of speech distortion. The method is first used to reduce noise in a noisy signal. In particular, noise is reduced from a representation of a portion of a noisy signal to produce a representation of a cleaned signal by utilizing an acoustic environment model. The uncertainty associated with the noise reduction process is then computed. In one embodiment, the uncertainty of the noise reduction process is used, in conjunction with the noise-reduced signal, to decode a pattern state.

Type: Grant

Filed: December 20, 2006

Date of Patent: October 30, 2007

Assignee: Microsoft Corporation

Inventors: Li Deng, Alejandro Acero, James G. Droppo
Method and apparatus for identifying noise environments from noisy signals

Patent number: 7266494

Abstract: A method and apparatus are provided for identifying a noise environment for a frame of an input signal based on at least one feature for that frame. To identify the noise environment, a probability for a noise environment is determined by applying the noisy input feature vector to a distribution of noisy training feature vectors. In one embodiment, each noisy training feature vector in the distribution is formed by modifying a set of clean training feature vectors. In one embodiment, the probabilities of the noise environments for past frames are included in the identification of an environment for a current frame. In one embodiment, a correction vector is then selected based on the identified noise environment.

Type: Grant

Filed: November 10, 2004

Date of Patent: September 4, 2007

Assignee: Microsoft Corporation

Inventors: James G. Droppo, Alejandro Acero, Li Deng
Method of reducing index sizes used to represent spectral content vectors

Patent number: 7200557

Abstract: A method identifies a codeword to represent a vector derived from an audio signal by applying the vector to first and second decision trees. The first decision tree is associated with a first type of audio sound and produces a first codeword. The second decision tree is associated with a second type of audio sound and produces a second codeword. One of the first and second codewords is then selected as the codeword for the vector. In further embodiments, the vector describes the spectral content of the audio signal and a linear prediction value is generated for the vector. The difference between the linear prediction value and the vector is used to identify the codeword.

Type: Grant

Filed: November 27, 2002

Date of Patent: April 3, 2007

Assignee: Microsoft Corporation

Inventors: James G. Droppo, Alejandro Acero, Constantinos Boulis
Noise reduction using correction vectors based on dynamic aspects of speech and noise normalization

Patent number: 7181390

Abstract: A method and apparatus are provided for reducing noise in a signal. Under one aspect of the invention, a correction vector is selected based on a noisy feature vector that represents a noisy signal. The selected correction vector incorporates dynamic aspects of pattern signals. The selected correction vector is then added to the noisy feature vector to produce a cleaned feature vector. In other aspects of the invention, a noise value is produced from an estimate of the noise in a noisy signal. The noise value is subtracted from a value representing a portion of the noisy signal to produce a noise-normalized value. The noise-normalized value is used to select a correction value that is added to the noise-normalized value to produce a cleaned noise-normalized value. The noise value is then added to the cleaned noise-normalized value to produce a cleaned value representing a portion of a cleaned signal.

Type: Grant

Filed: July 26, 2005

Date of Patent: February 20, 2007

Assignee: Microsoft Corporation

Inventors: James G. Droppo, Li Deng, Alejandro Acero
Method of determining uncertainty associated with acoustic distortion-based noise reduction

Patent number: 7174292

Abstract: A method and apparatus are provided for determining uncertainty in noise reduction based on a parametric model of speech distortion. The method is first used to reduce noise in a noisy signal. In particular, noise is reduced from a representation of a portion of a noisy signal to produce a representation of a cleaned signal by utilizing an acoustic environment model. The uncertainty associated with the noise reduction process is then computed. In one embodiment, the uncertainty of the noise reduction process is used, in conjunction with the noise-reduced signal, to decode a pattern state.

Type: Grant

Filed: September 5, 2002

Date of Patent: February 6, 2007

Assignee: Microsoft Corporation

Inventors: Li Deng, Alejandro Acero, James G. Droppo

prev 1 2 3 4 next