Patents by Inventor Yoshihisa Nakatoh

Yoshihisa Nakatoh has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Sound source localization device

Publication number: 20090285409

Abstract: Provided is a sound source localization device which can detect a source location of an extraction sound, including at least two microphones; an analysis unit (103) which (i) analyzes frequencies of the mixed sound including the noise and received by each microphone, and (ii) generates frequency signals; and an extraction unit (105) which, for each source location candidate, (a) adjusts time axes of the frequency signals corresponding to the microphones, so that there is no time difference between when the mixed sound reaches one microphone from the source location candidate and when the mixed sound reaches another microphone from the source location candidate, and (b) determines frequency signals having a difference distance equal to or smaller than a threshold value, from among the frequency signals corresponding to the microphones with the time axis having been adjusted, the difference distance representing a degree of a difference in the frequency signals between the microphones, and (c) extracts the sourc
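As an illustration only, steps (a) and (b) of this abstract can be sketched for a single frequency bin and two microphones. The sketch assumes that adjusting the time axis by a candidate delay can be modeled as a phase rotation of the second microphone's frequency signal, and that the difference distance is the magnitude of the complex difference; neither choice is stated in the abstract. The function names, the 1 kHz bin, the threshold, and the candidate delays are hypothetical, and the truncated step (c) is not reproduced.

```python
import cmath
import math


def matching_frames(x1, x2, freq_hz, delay_s, threshold):
    """Count frames where the aligned frequency signals nearly agree.

    x1, x2: complex frequency signals (one value per frame) from two microphones.
    delay_s: candidate arrival-time difference (microphone 2 lags microphone 1).
    """
    # Assumption: a time shift of delay_s is a phase rotation at this frequency.
    rotation = cmath.exp(2j * math.pi * freq_hz * delay_s)
    return sum(1 for a, b in zip(x1, x2) if abs(a - b * rotation) <= threshold)


def score_candidates(x1, x2, freq_hz, candidate_delays, threshold):
    """Return the matching-frame count for every candidate delay."""
    return [matching_frames(x1, x2, freq_hz, d, threshold) for d in candidate_delays]


# Synthetic example: a 1 kHz component reaching microphone 2 0.2 ms late.
FREQ_HZ = 1000.0
TRUE_DELAY_S = 0.0002
x1 = [cmath.exp(1j * 0.7 * k) for k in range(8)]
x2 = [v * cmath.exp(-2j * math.pi * FREQ_HZ * TRUE_DELAY_S) for v in x1]

candidates = [0.0, 0.0001, 0.0002, 0.0003]
counts = score_candidates(x1, x2, FREQ_HZ, candidates, threshold=0.1)
print(dict(zip(candidates, counts)))
```

In this synthetic case only the candidate equal to the true delay keeps all eight frames under the threshold. With a single frequency bin, candidates that differ by a whole period of that bin (1 ms here) cannot be told apart.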

Type: Application

Filed: November 6, 2007

Publication date: November 19, 2009

Inventors: Shinichi Yoshizawa, Yoshihisa Nakatoh

Audio restoration apparatus and audio restoration method

Patent number: 7536303

Abstract: An audio restoration apparatus is provided which restores an audio to be restored having a missing audio part and being included in a mixed audio. The audio restoration apparatus includes: a mixed audio separation unit which extracts the audio to be restored included in the mixed audio; an audio structure analysis unit which generates at least one of a phoneme sequence, a character sequence and a musical note sequence of the missing audio part; an unchanged audio characteristic domain analysis unit which segments the extracted audio to be restored into time domains in each of which an audio characteristic remains unchanged; an audio characteristic extraction unit which identifies a time domain where the missing audio part is located, and extracts audio characteristics of the identified time domain in the audio to be restored; and an audio restoration unit which restores the missing audio part in the audio to be restored.

Type: Grant

Filed: April 11, 2006

Date of Patent: May 19, 2009

Assignee: Panasonic Corporation

Inventors: Shinichi Yoshizawa, Tetsu Suzuki, Yoshihisa Nakatoh

Mixed audio separation apparatus

Publication number: 20090067647

Abstract: A mixed audio separation system (100) which separates a specific audio from among a mixed audio (S100) includes a local frequency information generation unit (105) which obtains pieces of local frequency information (S103) corresponding to local reference waveforms (S102), based on the local reference waveforms (S102) and an analysis waveform which is the waveform of the mixed audio (S100). Each of the local reference waveforms (S102) (i) constitutes a part of a reference waveform for analyzing a predetermined frequency, (ii) has a predetermined temporal/spatial resolution and (iii) includes at least one of an amplification spectrum and a phase spectrum in the predetermined frequency.

Type: Application

Filed: April 11, 2006

Publication date: March 12, 2009

Inventors: Shinichi Yoshizawa, Tetsu Suzuki, Yoshihisa Nakatoh

Sound identification apparatus

Patent number: 7473838

Abstract: A sound identification apparatus which reduces the chance of a drop in the identification rate, including: a frame sound feature extraction unit which extracts a sound feature per frame of an inputted audio signal; a frame likelihood calculation unit which calculates a frame likelihood of the sound feature in each frame, for each of a plurality of sound models; a confidence measure judgment unit which judges a confidence measure based on the frame likelihood; a cumulative likelihood output unit time determination unit which determines a cumulative likelihood output unit time based on the confidence measure; a cumulative likelihood calculation unit which calculates a cumulative likelihood in which the frame likelihoods of the frames included in the cumulative likelihood output unit time are cumulated, for each sound model; a sound type candidate judgment unit which determines, for each cumulative likelihood output unit time, a sound type corresponding to the sound model that has a maximum cumulative likelihood
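The cumulative-likelihood step described in this abstract can be sketched roughly as follows. This is a minimal illustration, assuming per-frame log-likelihoods are already available for each sound model and that the cumulative likelihood output unit time is given as a fixed number of frames; the confidence-based determination of that unit time is not modeled. The function name, the model names, and the values are hypothetical.

```python
def classify_units(frame_loglik, unit_len):
    """Pick, for each unit of frames, the model with the largest summed log-likelihood.

    frame_loglik: list of dicts mapping model name -> log-likelihood, one dict per frame.
    unit_len: number of frames accumulated per output unit (assumed fixed here).
    """
    results = []
    for start in range(0, len(frame_loglik), unit_len):
        totals = {}
        for frame in frame_loglik[start:start + unit_len]:
            for model, loglik in frame.items():
                totals[model] = totals.get(model, 0.0) + loglik
        # The sound type candidate is the model with the maximum cumulative value.
        results.append(max(totals, key=totals.get))
    return results


frames = [
    {"speech": -1.0, "music": -2.0},
    {"speech": -3.0, "music": -2.5},
    {"speech": -4.0, "music": -1.0},
    {"speech": -2.0, "music": -1.5},
]
labels = classify_units(frames, unit_len=2)
print(labels)
```

In the second frame the "music" model scores higher on its own, yet the first unit is still labeled "speech" because the decision uses the sum over the whole unit rather than a single frame.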

Type: Grant

Filed: April 9, 2007

Date of Patent: January 6, 2009

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Tetsu Suzuki, Yoshihisa Nakatoh, Shinichi Yoshizawa

Target sound analysis apparatus, target sound analysis method and target sound analysis program

Publication number: 20080304672

Abstract: A target sound analysis apparatus capable of distinguishing between a sound having the same fundamental period as a target sound but which differs therefrom and the target sound and analyzing whether or not the target sound is contained in an evaluation sound is a target sound analysis apparatus that analyzes whether or not a target sound is included in an evaluation sound, and includes: a target sound preparation unit that prepares a target sound that is an analysis waveform to be used for analyzing a fundamental period; an evaluation sound preparation unit that prepares an evaluation sound that is an analyzed waveform in which its fundamental period will be analyzed; and an analysis unit that temporally shifts the target sound with respect to the evaluation sound to sequentially calculate differential values of the evaluation sound and the target sound at corresponding points in time, calculate an iterative interval between the points in time where the differential value is equal to or lower than a predete

Type: Application

Filed: September 25, 2007

Publication date: December 11, 2008

Inventors: Shinichi Yoshizawa, Yoshihisa Nakatoh, Tetsu Suzuki

Speech recognition apparatus and speech recognition method

Patent number: 7310601

Abstract: The present invention provides a speech recognition apparatus which appropriately performs speech recognition by generating, in real time, language models adapted to a new topic even in the case where topics are changed.

Type: Grant

Filed: December 8, 2005

Date of Patent: December 18, 2007

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Makoto Nishizaki, Yoshihisa Nakatoh, Maki Yamada, Shinichi Yoshizawa

Sound identification apparatus

Publication number: 20070192099

Abstract: A sound identification apparatus which reduces the chance of a drop in the identification rate, including: a frame sound feature extraction unit which extracts a sound feature per frame of an inputted audio signal; a frame likelihood calculation unit which calculates a frame likelihood of the sound feature in each frame, for each of a plurality of sound models; a confidence measure judgment unit which judges a confidence measure based on the frame likelihood; a cumulative likelihood output unit time determination unit which determines a cumulative likelihood output unit time based on the confidence measure; a cumulative likelihood calculation unit which calculates a cumulative likelihood in which the frame likelihoods of the frames included in the cumulative likelihood output unit time are cumulated, for each sound model; a sound type candidate judgment unit which determines, for each cumulative likelihood output unit time, a sound type corresponding to the sound model that has a maximum cumulative likelihood

Type: Application

Filed: April 9, 2007

Publication date: August 16, 2007

Inventors: Tetsu Suzuki, Yoshihisa Nakatoh, Shinichi Yoshizawa

Multistage inverse quantization having a plurality of frequency bands

Patent number: 7243061

Abstract: With respect to audio signal coding and decoding apparatuses, there is provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even though it does not use all of the data from the coding apparatus, and a decoding apparatus corresponding to the coding apparatus. A quantization unit constituting a coding apparatus includes a first sub-quantization unit comprising sub-quantization units for low-band, intermediate-band, and high-band; a second sub-quantization unit for quantizing quantization errors from the first sub-quantization unit; and a third sub-quantization unit for quantizing quantization errors which have been processed by the first sub-quantization unit and the second sub-quantization unit.
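The staged arrangement in this abstract, where each later stage quantizes the error left by the earlier ones, can be illustrated with a small sketch. It uses uniform scalar quantizers and three equal-width bands purely for simplicity; the step sizes, the band split, and the function names are assumptions made for the example and are not taken from the patent.

```python
FIRST_STAGE_STEPS = (0.25, 0.5, 1.0)   # low, intermediate, high band (assumed)
LATER_STAGE_STEPS = (0.125, 0.03125)   # second and third stage (assumed)


def stage_steps(n):
    """Per-coefficient step sizes for each of the three stages."""
    low_end, mid_end = n // 3, 2 * n // 3
    # Stage 1 uses a different step per band; later stages use one step each.
    first = [FIRST_STAGE_STEPS[(i >= low_end) + (i >= mid_end)] for i in range(n)]
    return [first] + [[s] * n for s in LATER_STAGE_STEPS]


def encode(spectrum):
    """Quantize the spectrum, then quantize the remaining error stage by stage."""
    residual = list(spectrum)
    stages = []
    for steps in stage_steps(len(spectrum)):
        indices = [round(r / s) for r, s in zip(residual, steps)]
        stages.append(indices)
        # The error left over becomes the input of the next stage.
        residual = [r - i * s for r, i, s in zip(residual, indices, steps)]
    return stages


def decode(stages, n_stages=None):
    """Rebuild the spectrum from the first n_stages stages (all if None)."""
    n = len(stages[0])
    out = [0.0] * n
    for indices, steps in zip(stages[:n_stages], stage_steps(n)):
        out = [o + i * s for o, i, s in zip(out, indices, steps)]
    return out


spectrum = [3.7, -1.2, 0.45, 2.9, -0.33, 1.05]
stages = encode(spectrum)
coarse = decode(stages, n_stages=1)   # decoder that uses only part of the data
full = decode(stages)                 # decoder that uses all three stages
print(coarse)
print(full)
```

Decoding with `n_stages=1` still yields a usable, coarser approximation, which mirrors the idea that a decoder need not use all of the coded data; each additional stage tightens the reconstruction error.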

Type: Grant

Filed: October 1, 2004

Date of Patent: July 10, 2007

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Takeshi Norimatsu, Shuji Miyasaka, Yoshihisa Nakatoh, Mineo Tsushima, Tomokazu Ishikawa

Audio restoration apparatus and audio restoration method

Publication number: 20060193671

Abstract: An audio restoration apparatus which restores an audio to be restored having a missing audio part and being included in a mixed audio.

Type: Application

Filed: April 11, 2006

Publication date: August 31, 2006

Inventors: Shinichi Yoshizawa, Tetsu Suzuki, Yoshihisa Nakatoh

Speech recognition apparatus and speech recognition method

Publication number: 20060100876

Abstract: To provide a speech recognition apparatus which appropriately performs speech recognition by generating, in real time, language models adapted to a new topic even in the case where topics are changed.

Type: Application

Filed: December 8, 2005

Publication date: May 11, 2006

Inventors: Makoto Nishizaki, Yoshihisa Nakatoh, Maki Yamada, Shinichi Yoshizawa

Speech recognition device and speech recognition method

Publication number: 20050256712

Abstract: The speech recognition apparatus (1) is equipped with the garbage acoustic model storage unit (110) storing the garbage acoustic model which learned the collection of the unnecessary words; the feature value calculation unit (101) which calculates the feature parameter necessary for recognition by acoustically analyzing the unidentified input speech including the non-language speech per frame which is a unit for speech analysis; the garbage acoustic score calculation unit (111) which calculates the garbage acoustic score by comparing the feature parameter and the garbage acoustic model; the garbage acoustic score correction unit (113) which corrects the garbage acoustic score calculated by the garbage acoustic score calculation unit (111) so as to raise it in the frame where the non-language speech is inputted; and the recognition result output unit (105) which outputs, as the recognition result of the unidentified input speech, the word string with the highest cumulative score of the language score, the word

Type: Application

Filed: February 4, 2004

Publication date: November 17, 2005

Inventors: Maki Yamada, Makoto Nishizaki, Yoshihisa Nakatoh, Shinichi Yoshizawa

Multistage inverse quantization having the plurality of frequency bands

Patent number: 6904404

Abstract: With respect to audio signal coding and decoding apparatuses, there is provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even though it does not use all of the data from the coding apparatus, and a decoding apparatus corresponding to the coding apparatus. A quantization unit constituting a coding apparatus includes a first sub-quantization unit comprising sub-quantization units for low-band, intermediate-band, and high-band; a second sub-quantization unit for quantizing quantization errors from the first sub-quantization unit; and a third sub-quantization unit for quantizing quantization errors which have been processed by the first sub-quantization unit and the second sub-quantization unit.

Type: Grant

Filed: January 8, 1999

Date of Patent: June 7, 2005

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Takeshi Norimatsu, Shuji Miyasaka, Yoshihisa Nakatoh, Mineo Tsushima, Tomokazu Ishikawa

Multistage inverse quantization having the plurality of frequency bands

Publication number: 20050060147

Abstract: With respect to audio signal coding and decoding apparatuses, there is provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even though it does not use all of the data from the coding apparatus, and a decoding apparatus corresponding to the coding apparatus. A quantization unit constituting a coding apparatus includes a first sub-quantization unit comprising sub-quantization units for low-band, intermediate-band, and high-band; a second sub-quantization unit for quantizing quantization errors from the first sub-quantization unit; and a third sub-quantization unit for quantizing quantization errors which have been processed by the first sub-quantization unit and the second sub-quantization unit.

Type: Application

Filed: October 1, 2004

Publication date: March 17, 2005

Inventors: Takeshi Norimatsu, Shuji Miyasaka, Yoshihisa Nakatoh, Mineo Tsushima, Tomokazu Ishikawa

Method of speaker normalization for speech recognition using frequency conversion and speech recognition apparatus applying the preceding method

Publication number: 20040117181

Abstract: An input speech utterance is segmented into frames of a predetermined time length, and an acoustic feature parameter is extracted from each frame. The acoustic feature parameter is frequency-converted by using plural frequency conversion coefficients previously defined. Using all combinations of the plural post-conversion feature parameters obtained by the frequency conversion and at least one standard phonemic model, plural similarities or distances between the post-conversion feature parameters of each of the frames and the standard phonemic model are computed. A frequency converting condition for normalizing the input utterance is decided by using the plural similarities or distances. By using the frequency converting condition, the input utterance is normalized. With this method, even when the speaker making a speech utterance changes, individual differences in the input utterance can be corrected, thereby improving the performance of speech recognition.

Type: Application

Filed: September 24, 2003

Publication date: June 17, 2004

Inventors: Keiko Morii, Yoshihisa Nakatoh, Hiroyasu Kuwano

Audio signal compression method, audio signal compression apparatus, speech signal compression method, speech signal compression apparatus, speech recognition method, and speech recognition apparatus

Patent number: 6477490

Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization device having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing t

Type: Grant

Filed: June 28, 2001

Date of Patent: November 5, 2002

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Yoshihisa Nakatoh, Takeshi Norimatsu, Mineo Tsushima, Tomokazu Ishikawa, Mitsuhiko Serikawa, Taro Katayama, Junichi Nakahashi, Yoriko Yagi

Audio signal compression method, audio signal compression apparatus, speech signal compression method, speech signal compression apparatus, speech recognition method, and speech recognition apparatus

Publication number: 20010044727

Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization means having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing th

Type: Application

Filed: June 28, 2001

Publication date: November 22, 2001

Inventors: Yoshihisa Nakatoh, Takeshi Norimatsu, Mineo Tsushima, Tomokazu Ishikawa, Mitsuhiko Serikawa, Taro Katayama, Junichi Nakahashi, Yoriko Yagi

Speech recognition method and apparatus using frequency warping of linear prediction coefficients

Patent number: 6311153

Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization device having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing t

Type: Grant

Filed: October 2, 1998

Date of Patent: October 30, 2001

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Yoshihisa Nakatoh, Takeshi Norimatsu, Mineo Tsushima, Tomokazu Ishikawa, Mitsuhiko Serikawa, Taro Katayama, Junichi Nakahashi, Yoriko Yagi

Apparatus for expanding narrowband speech to wideband speech by codebook correspondence of linear mapping functions

Patent number: 5978759

Abstract: Apparatus for expanding the bandwidth of speech signals such that a narrowband speech signal is input and digitized, the spectral envelope information and residual information are extracted from the digitized signal by linear predictive coding analysis, the spectral envelope information is expanded into wideband information by a spectral envelope converter, the residual information is expanded into wideband information by a residual converter, the converted spectral envelope information and residual information are combined to produce a wideband speech signal, frequency information not contained in the input signal is extracted from the obtained wideband speech signal by a filter, and the resulting signal is added to the original digitized input signal, and the obtained signal is converted into an analog signal as the output signal of the apparatus.

Type: Grant

Filed: September 21, 1998

Date of Patent: November 2, 1999

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Mineo Tsushima, Yoshihisa Nakatoh, Takeshi Norimatsu

Method and an apparatus for speech detection for determining whether an input signal is speech or nonspeech

Patent number: 5611019

Abstract: The speech detection apparatus comprises: a reference model maker for extracting a plurality of parameters for a speech detection from training data, and for making a reference model based on the parameters; a parameter extractor for extracting the plurality of parameters from each frame of an input audio signal; and a decision device for deciding whether or not the audio signal is speech, by comparing the parameters extracted from the input audio signal with the reference model. The reference model maker makes the reference model for each phoneme.

Type: Grant

Filed: May 19, 1994

Date of Patent: March 11, 1997

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Yoshihisa Nakatoh, Takeshi Norimatsu

Apparatus for and method of generating a timing signal

Patent number: 5040234

Abstract: A circuit for generating a pulse-like timing signal that drives a stepping motor used, for example, to run a magnetic tape in an audio or video tape recording and reproducing apparatus. A rewritable memory stores output pattern data and time information for generating the timing signal. This timing signal generating circuit comprises a rewrite control circuit responsive to an operation mode instructing signal, for performing a predetermined operational processing on basic pattern data and basic time information and storing the resulting output pattern data and uniquely related time information in the rewritable memory.

Type: Grant

Filed: December 5, 1989

Date of Patent: August 13, 1991

Assignee: Sharp Kabushiki Kaisha

Inventors: Tooru Yamamoto, Kazuharu Date, Yoshihisa Nakatoh, Shigeki Imai