Patents by Inventor Toshiaki Fukada

Toshiaki Fukada has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20010032078
    Abstract: A speech information processing apparatus synthesizes speech with natural intonation by modeling time change in fundamental frequency of a predetermined unit of phoneme. When a predetermined unit of phonological series is inputted, fundamental frequencies of respective phonemes constructing the phonological series are generated based on a segment pitch pattern model (S203). Phonemes are synthesized based on the generated fundamental frequencies of the respective phonemes (S204 and S205).
    Type: Application
    Filed: March 28, 2001
    Publication date: October 18, 2001
    Inventor: Toshiaki Fukada
  • Publication number: 20010032080
    Abstract: A speech information processing apparatus which sets the duration of phonological series with accuracy, and sets a natural phoneme duration in accordance with phonemic/linguistic environment. For this purpose, the duration of predetermined unit of phonological series is obtained based on a duration model for entire segment (S302). Then duration of each of phonemes constructing the phonological series is obtained based on the duration model for the entire segment (S303). Then duration of each phoneme is set based on the duration of the phonological series and the duration of each phoneme (S304).
    Type: Application
    Filed: March 28, 2001
    Publication date: October 18, 2001
    Inventor: Toshiaki Fukada
  • Patent number: 6041299
    Abstract: There are disclosed an apparatus for calculating a posteriori probabilities of phoneme symbols and a speech recognition apparatus using the apparatus for calculating a posteriori probabilities of phoneme symbols. A feature extracting section extracts speech feature parameters from a speech signal of an uttered speech sentence composed of an inputted character series, and a calculating section calculates a a posteriori probability of a phoneme symbol of the speech signal, by using a bidirectional recurrent neural network. The bidirectional recurrent neural network includes (a) an input layer for receiving the speech feature parameters extracted by the feature extracting means and a plurality of hypothetical phoneme symbol series signals, (b) an intermediate layer of at least one layer having a plurality of units, and (c) an output layer for outputting a a posteriori probability of each phoneme symbol.
    Type: Grant
    Filed: March 11, 1998
    Date of Patent: March 21, 2000
    Assignee: ATR Interpreting Telecommunications Research Laboratories
    Inventors: Mike Schuster, Toshiaki Fukada
  • Patent number: 5845047
    Abstract: A speech information processing apparatus includes a statistical processing unit for extracting features by performing statistical processing of a feature file formed by extracting features of speech, such as the fundamental frequency and its variations, and the power and its variations of speech, from a speech file, and a label file in which a phoneme environment, comprising the accent type, the number of moras, the mora position, phonemes and the like, is considered, and a pitch pattern forming unit for forming a pitch pattern, in which phoneme environment is considered, based on the result of the statistical processing.
    Type: Grant
    Filed: March 20, 1995
    Date of Patent: December 1, 1998
    Assignee: Canon Kabushiki Kaisha
    Inventors: Toshiaki Fukada, Yasunori Ohora, Yasuhiro Komori, Takashi Aso
  • Patent number: 5809467
    Abstract: A document inputting apparatus or speech outputting apparatus inputs and displays document data, specifies accent information, pronunciation information and syllable-length information of words or characters of the document data. The apparatus displays the document data in accordance with the specified information so that information such as the accent positions or accent intensities can be recognized. Thus formed document data is stored in a memory with the accent information, the pronunciation information or the syllable-length information. Upon reading the document data from the memory and outputting it as speech, the specified information is referred to for speech synthesizing, thus outputting speech corresponding to the correct pronunciation.
    Type: Grant
    Filed: September 5, 1997
    Date of Patent: September 15, 1998
    Assignee: Canon Kabushiki Kaisha
    Inventors: Mitsuru Otsuka, Yasunori Ohora, Takashi Aso, Toshiyuki Noguchi, Toshiaki Fukada
  • Patent number: 5806039
    Abstract: A data processing apparatus for synchronized audiovisual output has synchronizing signal bits which are assigned to bits of each sound data, represented by a 16-bit PCM code. A predetermined bit of the assigned bits having the least influence upon the human auditory sense is extracted as a synchronizing signal bit for synchronization of the image data output and sound output.
    Type: Grant
    Filed: May 20, 1997
    Date of Patent: September 8, 1998
    Assignee: Canon Kabushiki Kaisha
    Inventors: Toshiaki Fukada, Yasunori Ohora, Takashi Aso, Mitsuru Otsuka
  • Patent number: 5745650
    Abstract: A speech synthesis method and apparatus for synthesizing speech from a character series comprising a text and pitch information. The apparatus includes a parameter generator for generating power spectrum envelopes as parameters of a speech waveform to be synthesized representing the input text in accordance with the input character series. The apparatus also includes a pitch waveform generator for generating pitch waveforms whose period equals the pitch specified by the pitch information. The pitch waveform generator generates the pitch waveforms from the input pitch information and the power spectrum envelopes generated by the parameter generator. Also provided is a speech waveform output device for outputting the speech waveform obtained by connecting the generated pitch waveforms.
    Type: Grant
    Filed: May 24, 1995
    Date of Patent: April 28, 1998
    Assignee: Canon Kabushiki Kaisha
    Inventors: Mitsuru Otsuka, Yasunori Ohora, Takashi Aso, Toshiaki Fukada
  • Patent number: 5745651
    Abstract: A speech synthesis method and a speech synthesis apparatus includes a system for synthesis by rule that prevents the quality of synthesized speech from deteriorating and for reducing the number of calculations that are required for the generation of a speech waveform. The speech synthesis apparatus includes a character series input section, for inputting a character series as phonetic text, a pitch waveform generator, for generating a pitch waveform by calculating a product of a matrix, which has been acquired for each pitch, and the character series, which is input by the character series input section, and a device for connecting pitch waveforms that are generated by the pitch waveform generator and for providing a speech waveform. The calculation method for the generation of such a pitch waveform provides a great reduction in the number of calculations that are required.
    Type: Grant
    Filed: May 30, 1995
    Date of Patent: April 28, 1998
    Assignee: Canon Kabushiki Kaisha
    Inventors: Mitsuru Otsuka, Yasunori Ohora, Takashi Aso, Toshiaki Fukada
  • Patent number: 5682502
    Abstract: In a speech synthesizer, each frame for generating a speech waveform has an expansion degree to which the frame is expanded or compressed in accordance with the production speed of synthetic speech. In accordance with the set speech production speed, the time interval between beat synchronization points is determined on the basis of the speed of the speech to be produced, and the time length of each frame present between the beat synchronization points is determined on the basis of the expansion degree of the frame. Parameters for producing a speech waveform in each frame are properly generated by the time length determined for the frame. In the speech synthesizer for outputting a speech signal by coupling phonemes constituted by one or a plurality of frames having phoneme vowel-consonant combination parameters (VcV, cV, or V) of the speech waveform, the number of frames can be held constant regardless of a change in the speech production speed.
    Type: Grant
    Filed: June 14, 1995
    Date of Patent: October 28, 1997
    Assignee: Canon Kabushiki Kaisha
    Inventors: Mitsuru Ohtsuka, Yasunori Ohora, Takashi Asou, Takeshi Fujita, Toshiaki Fukada