Patents by Inventor Toshiaki Fukada

Toshiaki Fukada has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speech information processing method and apparatus and storage medium

Publication number: 20010032078

Abstract: A speech information processing apparatus synthesizes speech with natural intonation by modeling time change in fundamental frequency of a predetermined unit of phoneme. When a predetermined unit of phonological series is inputted, fundamental frequencies of respective phonemes constructing the phonological series are generated based on a segment pitch pattern model (S203). Phonemes are synthesized based on the generated fundamental frequencies of the respective phonemes (S204 and S205).

Type: Application

Filed: March 28, 2001

Publication date: October 18, 2001

Inventor: Toshiaki Fukada
Speech information processing method and apparatus and storage meidum

Publication number: 20010032080

Abstract: A speech information processing apparatus which sets the duration of phonological series with accuracy, and sets a natural phoneme duration in accordance with phonemic/linguistic environment. For this purpose, the duration of predetermined unit of phonological series is obtained based on a duration model for entire segment (S302). Then duration of each of phonemes constructing the phonological series is obtained based on the duration model for the entire segment (S303). Then duration of each phoneme is set based on the duration of the phonological series and the duration of each phoneme (S304).

Type: Application

Filed: March 28, 2001

Publication date: October 18, 2001

Inventor: Toshiaki Fukada
Apparatus for calculating a posterior probability of phoneme symbol, and speech recognition apparatus

Patent number: 6041299

Abstract: There are disclosed an apparatus for calculating a posteriori probabilities of phoneme symbols and a speech recognition apparatus using the apparatus for calculating a posteriori probabilities of phoneme symbols. A feature extracting section extracts speech feature parameters from a speech signal of an uttered speech sentence composed of an inputted character series, and a calculating section calculates a a posteriori probability of a phoneme symbol of the speech signal, by using a bidirectional recurrent neural network. The bidirectional recurrent neural network includes (a) an input layer for receiving the speech feature parameters extracted by the feature extracting means and a plurality of hypothetical phoneme symbol series signals, (b) an intermediate layer of at least one layer having a plurality of units, and (c) an output layer for outputting a a posteriori probability of each phoneme symbol.

Type: Grant

Filed: March 11, 1998

Date of Patent: March 21, 2000

Assignee: ATR Interpreting Telecommunications Research Laboratories

Inventors: Mike Schuster, Toshiaki Fukada
Method and apparatus for processing speech information using a phoneme environment

Patent number: 5845047

Abstract: A speech information processing apparatus includes a statistical processing unit for extracting features by performing statistical processing of a feature file formed by extracting features of speech, such as the fundamental frequency and its variations, and the power and its variations of speech, from a speech file, and a label file in which a phoneme environment, comprising the accent type, the number of moras, the mora position, phonemes and the like, is considered, and a pitch pattern forming unit for forming a pitch pattern, in which phoneme environment is considered, based on the result of the statistical processing.

Type: Grant

Filed: March 20, 1995

Date of Patent: December 1, 1998

Assignee: Canon Kabushiki Kaisha

Inventors: Toshiaki Fukada, Yasunori Ohora, Yasuhiro Komori, Takashi Aso
Document inputting method and apparatus and speech outputting apparatus

Patent number: 5809467

Abstract: A document inputting apparatus or speech outputting apparatus inputs and displays document data, specifies accent information, pronunciation information and syllable-length information of words or characters of the document data. The apparatus displays the document data in accordance with the specified information so that information such as the accent positions or accent intensities can be recognized. Thus formed document data is stored in a memory with the accent information, the pronunciation information or the syllable-length information. Upon reading the document data from the memory and outputting it as speech, the specified information is referred to for speech synthesizing, thus outputting speech corresponding to the correct pronunciation.

Type: Grant

Filed: September 5, 1997

Date of Patent: September 15, 1998

Assignee: Canon Kabushiki Kaisha

Inventors: Mitsuru Otsuka, Yasunori Ohora, Takashi Aso, Toshiyuki Noguchi, Toshiaki Fukada
Data processing method and apparatus for generating sound signals representing music and speech in a multimedia apparatus

Patent number: 5806039

Abstract: A data processing apparatus for synchronized audiovisual output has synchronizing signal bits which are assigned to bits of each sound data, represented by a 16-bit PCM code. A predetermined bit of the assigned bits having the least influence upon the human auditory sense is extracted as a synchronizing signal bit for synchronization of the image data output and sound output.

Type: Grant

Filed: May 20, 1997

Date of Patent: September 8, 1998

Assignee: Canon Kabushiki Kaisha

Inventors: Toshiaki Fukada, Yasunori Ohora, Takashi Aso, Mitsuru Otsuka
Speech synthesis apparatus and method for synthesizing speech from a character series comprising a text and pitch information

Patent number: 5745650

Abstract: A speech synthesis method and apparatus for synthesizing speech from a character series comprising a text and pitch information. The apparatus includes a parameter generator for generating power spectrum envelopes as parameters of a speech waveform to be synthesized representing the input text in accordance with the input character series. The apparatus also includes a pitch waveform generator for generating pitch waveforms whose period equals the pitch specified by the pitch information. The pitch waveform generator generates the pitch waveforms from the input pitch information and the power spectrum envelopes generated by the parameter generator. Also provided is a speech waveform output device for outputting the speech waveform obtained by connecting the generated pitch waveforms.

Type: Grant

Filed: May 24, 1995

Date of Patent: April 28, 1998

Assignee: Canon Kabushiki Kaisha

Inventors: Mitsuru Otsuka, Yasunori Ohora, Takashi Aso, Toshiaki Fukada
Speech synthesis apparatus and method for causing a computer to perform speech synthesis by calculating product of parameters for a speech waveform and a read waveform generation matrix

Patent number: 5745651

Abstract: A speech synthesis method and a speech synthesis apparatus includes a system for synthesis by rule that prevents the quality of synthesized speech from deteriorating and for reducing the number of calculations that are required for the generation of a speech waveform. The speech synthesis apparatus includes a character series input section, for inputting a character series as phonetic text, a pitch waveform generator, for generating a pitch waveform by calculating a product of a matrix, which has been acquired for each pitch, and the character series, which is input by the character series input section, and a device for connecting pitch waveforms that are generated by the pitch waveform generator and for providing a speech waveform. The calculation method for the generation of such a pitch waveform provides a great reduction in the number of calculations that are required.

Type: Grant

Filed: May 30, 1995

Date of Patent: April 28, 1998

Assignee: Canon Kabushiki Kaisha

Inventors: Mitsuru Otsuka, Yasunori Ohora, Takashi Aso, Toshiaki Fukada
Syllable-beat-point synchronized rule-based speech synthesis from coded utterance-speed-independent phoneme combination parameters

Patent number: 5682502

Abstract: In a speech synthesizer, each frame for generating a speech waveform has an expansion degree to which the frame is expanded or compressed in accordance with the production speed of synthetic speech. In accordance with the set speech production speed, the time interval between beat synchronization points is determined on the basis of the speed of the speech to be produced, and the time length of each frame present between the beat synchronization points is determined on the basis of the expansion degree of the frame. Parameters for producing a speech waveform in each frame are properly generated by the time length determined for the frame. In the speech synthesizer for outputting a speech signal by coupling phonemes constituted by one or a plurality of frames having phoneme vowel-consonant combination parameters (VcV, cV, or V) of the speech waveform, the number of frames can be held constant regardless of a change in the speech production speed.

Type: Grant

Filed: June 14, 1995

Date of Patent: October 28, 1997

Assignee: Canon Kabushiki Kaisha

Inventors: Mitsuru Ohtsuka, Yasunori Ohora, Takashi Asou, Takeshi Fujita, Toshiaki Fukada

prev 1 2 3 4