Patents by Inventor Tetsuya Takiguchi

Tetsuya Takiguchi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speech recognition device, speech recognition method, computer-executable program for causing computer to execute recognition method, and storage medium

Patent number: 8024184

Abstract: A speech recognition device and method configured to include a computer, for recognizing speech, including: a storage location for storing a feature quantity acquired from a speech signal for each frame; storage portions for storing acoustic model data and language model data; an echo speech component for generating echo speech model data from a speech signal acquired prior to a speech signal to be processed at the current time point and using the echo speech model data to generate adapted acoustic model data; and a processing component for utilizing the feature quantity, the adapted acoustic model data, and the language model data to provide a speech recognition result of the speech signal.

Type: Grant

Filed: June 2, 2009

Date of Patent: September 20, 2011

Assignee: Nuance Communications, Inc.

Inventors: Tetsuya Takiguchi, Masafumi Nishimura
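
The adaptation step in the abstract above can be illustrated with a minimal sketch. The abstract does not state how the echo speech model is combined with the acoustic model, so the rule used here is an assumption: the echo is modeled as a scaled copy of the previous frame's power spectrum, added to the clean model mean in the linear power domain. The names `adapt_means` and `echo_gain` are hypothetical.

```python
import math

def adapt_means(clean_log_means, previous_frame_power, echo_gain):
    """Shift clean log-power means to account for echo from the previous frame.

    Assumption (not from the patent text): echo power is echo_gain times the
    previous frame's power and adds to the clean speech power in each bin.
    """
    return [
        math.log(math.exp(mean) + echo_gain * prev)
        for mean, prev in zip(clean_log_means, previous_frame_power)
    ]

# Toy values: three frequency bins of one acoustic-model state.
clean = [math.log(4.0), math.log(2.0), math.log(1.0)]
previous = [3.0, 1.0, 0.5]
adapted = adapt_means(clean, previous, echo_gain=0.4)
```

With `echo_gain` set to zero the adapted means reduce to the clean means, which is a quick sanity check on the combination rule.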

Signal enhancement via noise reduction for speech recognition

Patent number: 7895038

Abstract: Speech enhancement techniques for extemporaneous noise without a noise interval and unknown extemporaneous noise are provided with a method of signal enhancement including subtracting a given reference signal from an input signal containing a target signal and a noise signal by spectral subtraction; applying an adaptive filter to the reference signal; and controlling a filter coefficient of the adaptive filter in order to reduce components of the noise signal in the input signal. In signal enhancement, a database of a signal model concerning the target signal expressing a given feature by a given statistical model is provided, and the filter coefficient is controlled based on the likelihood of the signal model with respect to an output signal from the spectral subtraction means.

Type: Grant

Filed: May 26, 2008

Date of Patent: February 22, 2011

Assignee: International Business Machines Corporation

Inventors: Masafumi Nishimura, Tetsuya Takiguchi
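
A minimal sketch of the likelihood-controlled spectral subtraction described in the abstract above. Several simplifications are assumptions rather than details from the patent: the adaptive filter is reduced to a single scalar coefficient, the signal model is a single diagonal Gaussian over log-power spectra, and the coefficient is chosen by a grid search. All function names are hypothetical.

```python
import math

def spectral_subtract(input_power, reference_power, coeff, floor=0.01):
    """Subtract the scaled reference power from the input power, bin by bin."""
    return [max(x - coeff * r, floor * x) for x, r in zip(input_power, reference_power)]

def gaussian_log_likelihood(log_power, means, variances):
    """Log-likelihood of a log-power spectrum under a diagonal Gaussian model."""
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (y - m) ** 2 / v)
        for y, m, v in zip(log_power, means, variances)
    )

def choose_coefficient(input_power, reference_power, means, variances, candidates):
    """Pick the candidate coefficient whose output best fits the target model."""
    def score(coeff):
        out = spectral_subtract(input_power, reference_power, coeff)
        return gaussian_log_likelihood([math.log(p) for p in out], means, variances)
    return max(candidates, key=score)

# Toy data: the observed power is the target plus 0.5 times the reference.
target = [4.0, 2.0, 1.0, 3.0]
reference = [2.0, 6.0, 4.0, 2.0]
observed = [t + 0.5 * r for t, r in zip(target, reference)]
model_means = [math.log(t) for t in target]
model_vars = [0.1] * len(target)
grid = [i / 10 for i in range(0, 11)]
best = choose_coefficient(observed, reference, model_means, model_vars, grid)
```

In this toy setup the search recovers the coefficient 0.5 that was used to build `observed`, because that value makes the subtracted output match the model means exactly.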

Speech recognition apparatus, speech recognition apparatus and program thereof

Patent number: 7720679

Abstract: Provided is a method for canceling background noise of a sound source other than a target direction sound source in order to realize highly accurate speech recognition, and a system using the same. In terms of directional characteristics of a microphone array, due to a capability of approximating a power distribution of each angle of each of possible various sound source directions by use of a sum of coefficient multiples of a base form angle power distribution of a target sound source measured beforehand by base form angle by using a base form sound, and power distribution of a non-directional background sound by base form, only a component of the target sound source direction is extracted at a noise suppression part. In addition, when the target sound source direction is unknown, at a sound source localization part, a distribution for minimizing the approximate residual is selected from base form angle power distributions of various sound source directions to assume a target sound source direction.

Type: Grant

Filed: September 24, 2008

Date of Patent: May 18, 2010

Assignee: Nuance Communications, Inc.

Inventors: Osamu Ichikawa, Tetsuya Takiguchi, Masafumi Nishimura
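
The decomposition and localization steps in the abstract above can be sketched as a small least-squares fit. This is an illustration under assumptions: each candidate direction has one base-form angle power distribution, the observed distribution is fitted as a weighted sum of that base form and a non-directional background, and the weights are left unconstrained (a real system would likely keep them non-negative). The names and toy data are hypothetical.

```python
def fit_two_components(observed, directional, background):
    """Least-squares fit observed ~ a*directional + b*background.

    Returns (a, b, residual), solving the 2x2 normal equations directly.
    """
    dd = sum(d * d for d in directional)
    bb = sum(g * g for g in background)
    db = sum(d * g for d, g in zip(directional, background))
    od = sum(o * d for o, d in zip(observed, directional))
    ob = sum(o * g for o, g in zip(observed, background))
    det = dd * bb - db * db
    a = (od * bb - ob * db) / det
    b = (ob * dd - od * db) / det
    residual = sum(
        (o - a * d - b * g) ** 2
        for o, d, g in zip(observed, directional, background)
    )
    return a, b, residual

def locate_source(observed, base_forms, background):
    """Select the direction with the smallest residual and extract its component."""
    fits = {
        angle: fit_two_components(observed, shape, background)
        for angle, shape in base_forms.items()
    }
    best_angle = min(fits, key=lambda angle: fits[angle][2])
    weight = fits[best_angle][0]
    return best_angle, [weight * v for v in base_forms[best_angle]]

# Toy base forms: power measured over five scan angles for three directions.
base_forms = {
    -30: [1.0, 0.6, 0.2, 0.1, 0.1],
    0: [0.2, 0.6, 1.0, 0.6, 0.2],
    30: [0.1, 0.1, 0.2, 0.6, 1.0],
}
background = [0.3] * 5
observed = [2.0 * d + 1.5 * g for d, g in zip(base_forms[0], background)]
angle, target_component = locate_source(observed, base_forms, background)
```

Only the directional term is returned as the extracted component, which mirrors the abstract's statement that the background contribution is discarded at the noise suppression part.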

Speech recognition system and program thereof

Patent number: 7660717

Abstract: Speech recognition is performed by matching between a characteristic quantity of an inputted speech and a composite HMM obtained by synthesizing a speech HMM (hidden Markov model) and a noise HMM for each speech frame of the inputted speech by use of the composite HMM.

Type: Grant

Filed: January 9, 2008

Date of Patent: February 9, 2010

Assignee: Nuance Communications, Inc.

Inventors: Tetsuya Takiguchi, Masafumi Nishimura
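
A minimal sketch of how a speech HMM and a noise HMM might be synthesized into a composite HMM. The abstract does not give the synthesis rule, so this uses a common construction as an assumption: the two chains evolve independently, so composite states are (speech state, noise state) pairs, and speech and noise powers add in the linear power domain. The function names are hypothetical.

```python
def compose_transitions(speech_trans, noise_trans):
    """Transition matrix over (speech, noise) state pairs for independent chains."""
    ns, nn = len(speech_trans), len(noise_trans)
    size = ns * nn
    composite = [[0.0] * size for _ in range(size)]
    for i in range(ns):
        for k in range(nn):
            for j in range(ns):
                for m in range(nn):
                    # Pair (i, k) maps to row i*nn + k; pair (j, m) to column j*nn + m.
                    composite[i * nn + k][j * nn + m] = speech_trans[i][j] * noise_trans[k][m]
    return composite

def compose_means(speech_means, noise_means):
    """Mean power per composite state, assuming speech and noise powers add."""
    return [s + n for s in speech_means for n in noise_means]

# Toy models: a two-state left-to-right speech HMM and a two-state noise HMM.
speech_trans = [[0.9, 0.1], [0.0, 1.0]]
noise_trans = [[0.8, 0.2], [0.3, 0.7]]
composite_trans = compose_transitions(speech_trans, noise_trans)
composite_means = compose_means([4.0, 1.0], [0.5, 2.0])
```

Each row of the composite matrix still sums to one, so the result is a valid transition matrix that a standard decoder could match against the input features.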

Speech recognition device, speech recognition method, computer-executable program for causing computer to execute recognition method, and storage medium

Publication number: 20090306977

Abstract: A speech recognition device and method configured to include a computer, for recognizing speech, including: a storage location for storing a feature quantity acquired from a speech signal for each frame; storage portions for storing acoustic model data and language model data; an echo speech component for generating echo speech model data from a speech signal acquired prior to a speech signal to be processed at the current time point and using the echo speech model data to generate adapted acoustic model data; and a processing component for utilizing the feature quantity, the adapted acoustic model data, and the language model data to provide a speech recognition result of the speech signal.

Type: Application

Filed: June 2, 2009

Publication date: December 10, 2009

Applicant: Nuance Communications, Inc.

Inventors: Tetsuya Takiguchi, Masafumi Nishimura

Voice recording system, recording device, voice analysis device, voice recording method and program

Patent number: 7599836

Abstract: To provide a method of specifying each of speakers of individual voices, based on recorded voices made by a plurality of speakers, with a simple system configuration, and to provide a system using the method. The system includes: microphones individually provided for each of the speakers; a voice processing unit which gives a unique characteristic to each pair of two-channel voice signals recorded with each of the microphones, by executing different kinds of voice processing on the respective pairs of voice signals, and which mixes the voice signals for each channel; and an analysis unit which performs an analysis according to the unique characteristics, given to the voice signals concerning the respective microphones through the processing by the voice processing unit, and which specifies the speaker for each speech segment of the voice signals.

Type: Grant

Filed: May 25, 2005

Date of Patent: October 6, 2009

Assignee: Nuance Communications, Inc.

Inventors: Osamu Ichikawa, Masafumi Nishimura, Tetsuya Takiguchi
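
The tagging idea in the abstract above can be sketched with two microphones. The abstract does not say which kinds of voice processing are applied, so the choice here is an assumption: one microphone is recorded in phase on both channels, the other with the right channel inverted, and the analysis reads the sign of the left-right correlation in each segment. The sketch also assumes a single active speaker per segment. `GAINS`, `mix`, and `label_segments` are hypothetical names.

```python
# Hypothetical (left, right) gains that give each microphone a unique signature.
GAINS = {0: (1.0, 1.0), 1: (1.0, -1.0)}

def mix(mic_signals):
    """Apply each microphone's channel gains, then mix per channel."""
    length = len(next(iter(mic_signals.values())))
    left, right = [0.0] * length, [0.0] * length
    for mic, samples in mic_signals.items():
        gain_left, gain_right = GAINS[mic]
        for t, sample in enumerate(samples):
            left[t] += gain_left * sample
            right[t] += gain_right * sample
    return left, right

def label_segments(left, right, segment_len):
    """Label each segment by the sign of the left-right correlation."""
    labels = []
    for start in range(0, len(left), segment_len):
        corr = sum(
            l * r
            for l, r in zip(left[start:start + segment_len], right[start:start + segment_len])
        )
        labels.append(0 if corr > 0 else 1 if corr < 0 else None)
    return labels

# Toy recording: speaker 0 talks in the first segment, speaker 1 in the second.
mic_signals = {
    0: [0.5, -0.4, 0.3, -0.2, 0.0, 0.0, 0.0, 0.0],
    1: [0.0, 0.0, 0.0, 0.0, 0.6, -0.5, 0.4, -0.3],
}
left, right = mix(mic_signals)
labels = label_segments(left, right, segment_len=4)
```

A silent segment yields a correlation of zero and is labeled `None`, which keeps the sketch from guessing a speaker where nobody is talking.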

Signal enhancement via noise reduction for speech recognition

Patent number: 7533015

Abstract: Provides speech enhancement techniques for extemporaneous noise without a noise interval and unknown extemporaneous noise. Signal enhancement includes: subtracting a given reference signal from an input signal containing a target signal and a noise signal by spectral subtraction; applying an adaptive filter to the reference signal; and controlling a filter coefficient of the adaptive filter in order to reduce components of the noise signal in the input signal. In signal enhancement, a database of a signal model concerning the target signal expressing a given feature by a given statistical model is provided, and the filter coefficient is controlled based on the likelihood of the signal model with respect to an output signal from the spectral subtraction means.

Type: Grant

Filed: February 28, 2005

Date of Patent: May 12, 2009

Assignee: International Business Machines Corporation

Inventors: Tetsuya Takiguchi, Masafumi Nishimura

Speech recognition apparatus, speech recognition apparatus and program thereof

Patent number: 7478041

Abstract: Provided is a method for canceling background noise of a sound source other than a target direction sound source in order to realize highly accurate speech recognition, and a system using the same. In terms of directional characteristics of a microphone array, due to a capability of approximating a power distribution of each angle of each of possible various sound source directions by use of a sum of coefficient multiples of a base form angle power distribution of a target sound source measured beforehand by base form angle by using a base form sound, and power distribution of a non-directional background sound by base form, only a component of the target sound source direction is extracted at a noise suppression part. In addition, when the target sound source direction is unknown, at a sound source localization part, a distribution for minimizing the approximate residual is selected from base form angle power distributions of various sound source directions to assume a target sound source direction.

Type: Grant

Filed: March 12, 2003

Date of Patent: January 13, 2009

Assignee: International Business Machines Corporation

Inventors: Osamu Ichikawa, Tetsuya Takiguchi, Masafumi Nishimura

Signal enhancement and speech recognition

Publication number: 20080294432

Abstract: Provides speech enhancement techniques which are effective even for extemporaneous noise without a noise interval and unknown extemporaneous noise. An example of a signal enhancement device includes: spectral subtraction means for subtracting a given reference signal from an input signal containing a target signal and a noise signal by spectral subtraction; an adaptive filter applied to the reference signal; and coefficient control means for controlling a filter coefficient of the adaptive filter in order to reduce components of the noise signal in the input signal. In the signal enhancement device, a database of a signal model concerning the target signal expressing a given feature by means of a given statistical model is provided, and the filter coefficient is controlled based on the likelihood of the signal model with respect to an output signal from the spectral subtraction means.

Type: Application

Filed: May 26, 2008

Publication date: November 27, 2008

Inventors: Tetsuya Takiguchi, Masafumi Nishimura

Speech recognition system and program thereof

Patent number: 7403896

Abstract: Speech recognition is performed by matching between a characteristic quantity of an inputted speech and a composite HMM obtained by synthesizing a speech HMM (hidden Markov model) and a noise HMM for each speech frame of the inputted speech by use of the composite HMM.

Type: Grant

Filed: March 14, 2003

Date of Patent: July 22, 2008

Assignee: International Business Machines Corporation

Inventors: Tetsuya Takiguchi, Masafumi Nishimura

Signal enhancement and speech recognition

Publication number: 20060122832

Abstract: Provides speech enhancement techniques which are effective even for extemporaneous noise without a noise interval and unknown extemporaneous noise. An example of a signal enhancement device includes: spectral subtraction means for subtracting a given reference signal from an input signal containing a target signal and a noise signal by spectral subtraction; an adaptive filter applied to the reference signal; and coefficient control means for controlling a filter coefficient of the adaptive filter in order to reduce components of the noise signal in the input signal. In the signal enhancement device, a database of a signal model concerning the target signal expressing a given feature by means of a given statistical model is provided, and the filter coefficient is controlled based on the likelihood of the signal model with respect to an output signal from the spectral subtraction means.

Type: Application

Filed: February 28, 2005

Publication date: June 8, 2006

Applicant: International Business Machines Corporation

Inventors: Tetsuya Takiguchi, Masafumi Nishimura

Voice recording system, recording device, voice analysis device, voice recording method and program

Publication number: 20050267762

Abstract: To provide a method of specifying each of speakers of individual voices, based on recorded voices made by a plurality of speakers, with a simple system configuration, and to provide a system using the method. The system includes: microphones individually provided for each of the speakers; a voice processing unit which gives a unique characteristic to each pair of two-channel voice signals recorded with each of the microphones, by executing different kinds of voice processing on the respective pairs of voice signals, and which mixes the voice signals for each channel; and an analysis unit which performs an analysis according to the unique characteristics, given to the voice signals concerning the respective microphones through the processing by the voice processing unit, and which specifies the speaker for each speech segment of the voice signals.

Type: Application

Filed: May 25, 2005

Publication date: December 1, 2005

Applicant: International Business Machines Corporation

Inventors: Osamu Ichikawa, Masafumi Nishimura, Tetsuya Takiguchi

Speech recognition device, speech recognition method, computer-executable program for causing computer to execute recognition method, and storage medium

Publication number: 20050010410

Abstract: A speech recognition device and method configured to include a computer, for recognizing speech, including: a storage location for storing a feature quantity acquired from a speech signal for each frame; storage portions for storing acoustic model data and language model data; an echo speech component for generating echo speech model data from a speech signal acquired prior to a speech signal to be processed at the current time point and using the echo speech model data to generate adapted acoustic model data; and a processing component for utilizing the feature quantity, the adapted acoustic model data, and the language model data to provide a speech recognition result of the speech signal.

Type: Application

Filed: May 20, 2004

Publication date: January 13, 2005

Applicant: International Business Machines Corporation

Inventors: Tetsuya Takiguchi, Masafumi Nishimura

Speech recognition system and program thereof

Publication number: 20030225581

Abstract: Speech recognition is performed by matching between a characteristic quantity of an inputted speech and a composite HMM obtained by synthesizing a speech HMM (hidden Markov model) and a noise HMM for each speech frame of the inputted speech by use of the composite HMM.

Type: Application

Filed: March 14, 2003

Publication date: December 4, 2003

Applicant: International Business Machines Corporation

Inventors: Tetsuya Takiguchi, Masafumi Nishimura

Voice recognition apparatus, voice recognition apparatus and program thereof

Publication number: 20030177006

Abstract: Provided is a method for canceling background noise of a sound source other than a target direction sound source in order to realize highly accurate voice recognition, and a system using the same. In terms of directional characteristics of a microphone array, due to a capability of approximating a power distribution of each angle of each of possible various sound source directions by use of a sum of coefficient multiples of a base form angle power distribution of a target sound source measured beforehand by base form angle by using a base form sound, and power distribution of a non-directional background sound by base form, only a component of the target sound source direction is extracted at a noise suppression part. In addition, when the target sound source direction is unknown, at a sound source localization part, a distribution for minimizing the approximate residual is selected from base form angle power distributions of various sound source directions to assume a target sound source direction.

Type: Application

Filed: March 12, 2003

Publication date: September 18, 2003

Inventors: Osamu Ichikawa, Tetsuya Takiguchi, Masafumi Nishimura