Patents by Inventor Yoshihisa Nakatoh

Yoshihisa Nakatoh has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20090285409
    Abstract: Provided is a sound source localization device which can detect a source location of an extraction sound, including at least two microphones; an analysis unit (103) which (i) analyze frequencies of the mixed sound including the noise and received by each microphone, and (ii) generates frequency signals; and an extraction unit (105) which, for each source location candidate, (a) adjusts time axes of the frequency signals corresponding to the microphones, so that there is no time difference between when the mixed sound reaches one microphone from the source location candidate and when the mixed sound reaches another microphone from the source location candidate, and (b) determines frequency signals having a difference distance equal to or smaller than a threshold value, from among the frequency signals corresponding to the microphones with the time axis having been adjusted, the difference distance representing a degree of a difference in the frequency signals between the microphones, and (c) extracts the sourc
    Type: Application
    Filed: November 6, 2007
    Publication date: November 19, 2009
    Inventors: Shinichi Yoshizawa, Yoshihisa Nakatoh
  • Patent number: 7536303
    Abstract: An audio restoration apparatus is provided which restores an audio to be restored having a missing audio part and being included in a mixed audio. The audio restoration apparatus includes: a mixed audio separation unit which extracts the audio to be restored included in the mixed audio; an audio structure analysis unit which generates at least one of a phoneme sequence, a character sequence and a musical note sequence of the missing audio part; an unchanged audio characteristic domain analysis unit which segments the extracted audio to be restored into time domains in each of which an audio characteristic remains unchanged; an audio characteristic extraction unit which identifies a time domain where the missing audio part is located, and extracts audio characteristics of the identified time domain in the audio to be restored; and an audio restoration unit which restores the missing audio part in the audio to be restored.
    Type: Grant
    Filed: April 11, 2006
    Date of Patent: May 19, 2009
    Assignee: Panasonic Corporation
    Inventors: Shinichi Yoshizawa, Tetsu Suzuki, Yoshihisa Nakatoh
  • Publication number: 20090067647
    Abstract: A mixed audio separation system (100) which separates a specific audio from among a mixed audio (S100) includes a local frequency information generation unit (105) which obtains pieces of local frequency information (S103) corresponding to local reference waveforms (S102), based on the local reference waveforms (S102) and an analysis waveform which is the waveform of the mixed audio (S100). Each of the local reference waveforms (S102) (i) constitutes a part of a reference waveform for analyzing a predetermined frequency, (ii) has a predetermined temporal/spatial resolution and (iii) includes at least one of an amplification spectrum and a phase spectrum in the predetermined frequency.
    Type: Application
    Filed: April 11, 2006
    Publication date: March 12, 2009
    Inventors: Shinichi Yoshizawa, Tetsu Suzuki, Yoshihisa Nakatoh
  • Patent number: 7473838
    Abstract: A sound identification apparatus which reduces the chance of a drop in the identification rate, including: a frame sound feature extraction unit which extracts a sound feature per frame of an inputted audio signal; a frame likelihood calculation unit which calculates a frame likelihood of the sound feature in each frame, for each of a plurality of sound models; a confidence measure judgment unit which judges a confidence measure based on the frame likelihood; a cumulative likelihood output unit time determination unit which determines a cumulative likelihood output unit time based on the confidence measure; a cumulative likelihood calculation unit which calculates a cumulative likelihood in which the frame likelihoods of the frames included in the cumulative likelihood output unit time are cumulated, for each sound model; a sound type candidate judgment unit which determines, for each cumulative likelihood output unit time, a sound type corresponding to the sound model that has a maximum cumulative likelihood
    Type: Grant
    Filed: April 9, 2007
    Date of Patent: January 6, 2009
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Tetsu Suzuki, Yoshihisa Nakatoh, Shinichi Yoshizawa
  • Publication number: 20080304672
    Abstract: A target sound analysis apparatus capable of distinguishing between a sound having the same fundamental period as a target sound but which differs therefrom and the target sound and analyzing whether or not the target sound is contained in an evaluation sound is an target sound analysis apparatus that analyzes whether or not a target sound is included in an evaluation sound, and includes: a target sound preparation unit that prepares a target sound that is an analysis waveform to be used for analyzing a fundamental period; an evaluation sound preparation unit that prepares an evaluation sound that is an analyzed waveform in which its fundamental period will be analyzed; and an analysis unit that temporally shifts the target sound with respect to the evaluation sound to sequentially calculate differential values of the evaluation sound and the target sound at corresponding points in time, calculate an iterative interval between the points in time where the differential value is equal to or lower than a predete
    Type: Application
    Filed: September 25, 2007
    Publication date: December 11, 2008
    Inventors: Shinichi Yoshizawa, Yoshihisa Nakatoh, Tetsu Suzuki
  • Patent number: 7310601
    Abstract: The present invention provides a speech recognition apparatus which appropriately performs speech recognition by generating, in real time, language models adapted to a new topic even in the case where topics are changed.
    Type: Grant
    Filed: December 8, 2005
    Date of Patent: December 18, 2007
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Makoto Nishizaki, Yoshihisa Nakatoh, Maki Yamada, Shinichi Yoshizawa
  • Publication number: 20070192099
    Abstract: A sound identification apparatus which reduces the chance of a drop in the identification rate, including: a frame sound feature extraction unit which extracts a sound feature per frame of an inputted audio signal; a frame likelihood calculation unit which calculates a frame likelihood of the sound feature in each frame, for each of a plurality of sound models; a confidence measure judgment unit which judges a confidence measure based on the frame likelihood; a cumulative likelihood output unit time determination unit which determines a cumulative likelihood output unit time based on the confidence measure; a cumulative likelihood calculation unit which calculates a cumulative likelihood in which the frame likelihoods of the frames included in the cumulative likelihood output unit time are cumulated, for each sound model; a sound type candidate judgment unit which determines, for each cumulative likelihood output unit time, a sound type corresponding to the sound model that has a maximum cumulative likelihood
    Type: Application
    Filed: April 9, 2007
    Publication date: August 16, 2007
    Inventors: Tetsu Suzuki, Yoshihisa Nakatoh, Shinichi Yoshizawa
  • Patent number: 7243061
    Abstract: With respect to audio signal coding and decoding apparatuses, there is provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even through it does not use all of data from the coding apparatus, and a decoding apparatus corresponding to the coding apparatus. A quantization unit constituting a coding apparatus includes a first sub-quantization unit comprising sub-quantization units for low-band, intermediate-band, and high-band; a second sub-quantization unit for quantizing quantization errors from the first sub-quantization unit; and a third sub-quantization unit for quantizing quantization errors which have been processed by the first sub-quantization unit and the second sub-quantization unit.
    Type: Grant
    Filed: October 1, 2004
    Date of Patent: July 10, 2007
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Takeshi Norimatsu, Shuji Miyasaka, Yoshihisa Nakatoh, Mineo Tsushima, Tomokazu Ishikawa
  • Publication number: 20060193671
    Abstract: An audio restoration apparatus which restores an audio to be restored having a missing audio part and being included in a mixed audio.
    Type: Application
    Filed: April 11, 2006
    Publication date: August 31, 2006
    Inventors: Shinichi Yoshizawa, Tetsu Suzuki, Yoshihisa Nakatoh
  • Publication number: 20060100876
    Abstract: To provide a speech recognition apparatus which appropriately performs speech recognition by generating, in real time, language models adapted to a new topic even in the case where topics are changed.
    Type: Application
    Filed: December 8, 2005
    Publication date: May 11, 2006
    Inventors: Makoto Nishizaki, Yoshihisa Nakatoh, Maki Yamada, Shinichi Yoshizawa
  • Publication number: 20050256712
    Abstract: The speech recognition apparatus (1) is equipped with the garbage acoustic model storage unit (110) storing the garbage acoustic model which learned the collection of the unnecessary words; the feature value calculation unit (101) which calculates the feature parameter necessary for recognition by acoustically analyzing the unidentified input speech including the non-language speech per frame which is a unit for speech analysis; the garbage acoustic score calculation unit (111) which calculates the garbage acoustic score by comparing the feature parameter and the garbage acoustic model; the garbage acoustic score correction unit (113) which corrects the garbage acoustic score calculated by the garbage acoustic score calculation unit (111) so as to raise it in the frame where the non-language speech is inputted; and the recognition result output unit (105) which outputs, as the recognition result of the unidentified input speech, the word string with the highest cumulative score of the language score, the word
    Type: Application
    Filed: February 4, 2004
    Publication date: November 17, 2005
    Inventors: Maki Yamada, Makoto Nishizaki, Yoshihisa Nakatoh, Shinichi Yoshizawa
  • Patent number: 6904404
    Abstract: With respect to audio signal coding and decoding apparatuses, there is provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even through it does not use all of data from the coding apparatus, and a decoding apparatus corresponding to the coding apparatus. A quantization unit constituting a coding apparatus includes a first sub-quantization unit comprising sub-quantization units for low-band, intermediate-band, and high-band; a second sub-quantization unit for quantizing quantization errors from the first sub-quantization unit; and a third sub-quantization unit for quantizing quantization errors which have been processed by the first sub-quantization unit and the second sub-quantization unit.
    Type: Grant
    Filed: January 8, 1999
    Date of Patent: June 7, 2005
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Takeshi Norimatsu, Shuji Miyasaka, Yoshihisa Nakatoh, Mineo Tsushima, Tomokazu Ishikawa
  • Publication number: 20050060147
    Abstract: With respect to audio signal coding and decoding apparatuses, there is provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even through it does not use all of data from the coding apparatus, and a decoding apparatus corresponding to the coding apparatus. A quantization unit constituting a coding apparatus includes a first sub-quantization unit comprising sub-quantization units for low-band, intermediate-band, and high-band; a second sub-quantization unit for quantizing quantization errors from the first sub-quantization unit; and a third sub-quantization unit for quantizing quantization errors which have been processed by the first sub-quantization unit and the second sub-quantization unit.
    Type: Application
    Filed: October 1, 2004
    Publication date: March 17, 2005
    Inventors: Takeshi Norimatsu, Shuji Miyasaka, Yoshihisa Nakatoh, Mineo Tsushima, Tomokazu Ishikawa
  • Publication number: 20040117181
    Abstract: An input speech utterance is segmented into a prefixed time length to make frames, to extract an acoustic feature parameter of each frame. The acoustic feature parameter is frequency-converted by using pluralfrequency conversion coefficients previously defined. By using all combinations of plural post-conversion feature parameters obtained by the frequency conversion and at least one standard phonemic model, to compute plural similarities or distances of between the post-conversion feature parameters of each of the frames and the standard phonemic model. A frequency converting condition for normalizing the input utterance is decided by using the pluralsimilarities or distances. By using the frequency converting condition, the input utterance is normalized. With this method, even in case there is change of the speaker making a speech utterance, the individual difference of input utterance can be corrected thereby improving the performance of speech recognition.
    Type: Application
    Filed: September 24, 2003
    Publication date: June 17, 2004
    Inventors: Keiko Morii, Yoshihisa Nakatoh, Hiroyasu Kuwano
  • Patent number: 6477490
    Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization device having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing t
    Type: Grant
    Filed: June 28, 2001
    Date of Patent: November 5, 2002
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Yoshihisa Nakatoh, Takeshi Norimatsu, Mineo Tsushima, Tomokazu Ishikawa, Mitsuhiko Serikawa, Taro Katayama, Junichi Nakahashi, Yoriko Yagi
  • Publication number: 20010044727
    Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization means having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing th
    Type: Application
    Filed: June 28, 2001
    Publication date: November 22, 2001
    Inventors: Yoshihisa Nakatoh, Takeshi Norimatsu, Mineo Tsushima, Tomokazu Ishikawa, Mitsuhiko Serikawa, Taro Katayama, Junichi Nakahashi, Yoriko Yagi
  • Patent number: 6311153
    Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization device having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing t
    Type: Grant
    Filed: October 2, 1998
    Date of Patent: October 30, 2001
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Yoshihisa Nakatoh, Takeshi Norimatsu, Mineo Tsushima, Tomokazu Ishikawa, Mitsuhiko Serikawa, Taro Katayama, Junichi Nakahashi, Yoriko Yagi
  • Patent number: 5978759
    Abstract: Apparatus for expanding the bandwidth of speech signals such that a narrowband speech signal is input and digitized, the spectral envelope information and residual information are extracted from the digitized signal by linear predictive coding analysis, the spectral envelope information is expanded into wideband information by a spectral envelope converter, the residual information is expanded into wideband information by a residual converter, the converted spectral envelope information and residual information are combined to produce a wideband speech signal, frequency information not contained in the input signal is extracted from the obtained wideband speech signal by a filter, and the resulting signal is added to the original digitized input signal, and the obtained signal is converted into an analog signal as the output signal of the apparatus.
    Type: Grant
    Filed: September 21, 1998
    Date of Patent: November 2, 1999
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Mineo Tsushima, Yoshihisa Nakatoh, Takeshi Norimatsu
  • Patent number: 5611019
    Abstract: The speech detection apparatus comprises: a reference model maker for extracting a plurality of parameters for a speech detection from training data, and for making a reference model based on the parameters; a parameter extractor for extracting the plurality of parameters from each frame of an input audio signal; and a decision device for deciding whether or not the audio signal is speech, by comparing the parameters extracted from the input audio signal with the reference model. The reference model maker makes the reference model for each phoneme.
    Type: Grant
    Filed: May 19, 1994
    Date of Patent: March 11, 1997
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Yoshihisa Nakatoh, Takeshi Norimatsu
  • Patent number: 5040234
    Abstract: A circuit for generating a pulse-like timing signal driving a stepping motor which is used to drive for example a magnetic tape to run in an audio tape or video tape recording and reproducing apparatus. A rewritable memory stores output pattern data and time information for generating the timing signal. This timing signal generating circuit comprises a rewrite control circuit responsive to an operation mode instructing signal, for performing a predetermined operational processing on basic pattern data and basic time information and storing the resulting output pattern data and uniquely related time information in the rewritable memory.
    Type: Grant
    Filed: December 5, 1989
    Date of Patent: August 13, 1991
    Assignee: Sharp Kabushiki Kaisha
    Inventors: Tooru Yamamoto, Kazuharu Date, Yoshihisa Nakatoh, Shigeki Imai