Patents by Inventor Soichi Toyama

Soichi Toyama has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Operator recognition device, operator recognition method and operator recognition program

Patent number: 7979718

Abstract: An operator recognition device is provided that eliminates the registration of data such as HMM data having a characteristic amount for which error in recognition occurs easily when recognizing an operator, and thus reduces the possibility of errors in recognition, and has stable recognition performance. When registering HMM data that is used when performing recognition processing, a speaker recognition device 100 eliminates the registration of HMM data of a password having a characteristic amount of the spoken voice component that is similar to a characteristic amount that is indicated by HMM data that is already registered, and does not allow the registration of HMM data for which it is estimated that error in recognition will occur easily during the recognition process.

Type: Grant

Filed: March 24, 2006

Date of Patent: July 12, 2011

Assignees: Pioneer Corporation, Tech Experts Incorporation

Inventors: Soichi Toyama, Ikuo Fujita, Mitsuya Komamura
Speech recognition device and speech recognition method

Patent number: 7813921

Abstract: There is provided a voice recognition device and a voice recognition method that enhance the function of noise adaptation processing in voice recognition processing and reduce the capacity of a memory being used. Acoustic models are subjected to clustering processing to calculate the centroid of each cluster and the differential vector between the centroid and each model, model composition between each kind of assumed noise model and the calculated centroid is carried out, and the centroid of each composition model and the differential vector are stored in a memory. In the actual recognition processing, the centroid optimal to the environment estimated by the utterance environmental estimation is extracted from the memory, model restoration is carried out on the extracted centroid by using the differential vector stored in the memory, and noise adaptation processing is executed on the basis of the restored model.

Type: Grant

Filed: March 15, 2005

Date of Patent: October 12, 2010

Assignee: Pioneer Corporation

Inventors: Hajime Kobayashi, Soichi Toyama, Yasunori Suzuki
ACOUSTIC MODEL REGISTRATION APPARATUS, TALKER RECOGNITION APPARATUS, ACOUSTIC MODEL REGISTRATION METHOD AND ACOUSTIC MODEL REGISTRATION PROCESSING PROGRAM

Publication number: 20100063817

Abstract: An acoustic model registration apparatus, an talker recognition apparatus, an acoustic model registration method and an acoustic model registration processing program, each of which prevents certainly an acoustic model having a low recognition capability for talker from being registered certainly, are provided.

Type: Application

Filed: March 14, 2007

Publication date: March 11, 2010

Applicant: Pioneer Corporation

Inventors: Soichi Toyama, Ikuo Fujita, Yukio Kamoshida
OPERATOR RECOGNITION DEVICE, OPERATOR RECOGNITION METHOD AND OPERATOR RECOGNITION PROGRAM

Publication number: 20090254757

Abstract: An operator recognition device is provided that eliminates the registration of data such as HMM data having a characteristic amount for which error in recognition occurs easily when recognizing an operator, and thus reduces the possibility of errors in recognition, and has stable recognition performance. When registering HMM data that is used when performing recognition processing, a speaker recognition device 100 eliminates the registration of HMM data of a password having a characteristic amount of the spoken voice component that is similar to a characteristic amount that is indicated by HMM data that is already registered, and does not allow the registration of HMM data for which it is estimated that error in recognition will occur easily during the recognition process.

Type: Application

Filed: March 24, 2006

Publication date: October 8, 2009

Applicants: Pioneer Corporation, Tech Experts Incorporation

Inventors: Soichi Toyama, Ikuo Fujita, Mitsuya Komamura
SPEAKER MODEL REGISTERING APPARATUS AND METHOD, AND COMPUTER PROGRAM

Publication number: 20090106025

Abstract: EN) A speaker recognition system (1) includes a speaker model registration device (10) which registers a speaker model for speaker recognition in the speaker recognition system. The speaker model registration device includes acquisition means (13) for acquiring utterances by n+? times (wherein n is an integer not smaller than 2 and ? is an integer not smaller than 1); calculation means (20) for calculating a speaker model by using the acquired utterances of n times as utterances for registration; correlation means (30) for correlating the calculated speaker model by using the acquired utterances of ? times as correlation utterances; and registration means (40) for registering those having the correlation result satisfying a predetermined reference among the correlated speaker models, as the speaker model for speaker recognition.

Type: Application

Filed: March 16, 2007

Publication date: April 23, 2009

Applicant: PIONEER CORPORATION

Inventor: Soichi Toyama
Speech Recognition Device and Speech Recognition Method

Publication number: 20080270127

Abstract: There is provided a voice recognition device and a voice recognition method that enhance the function of noise adaptation processing in voice recognition processing and reduce the capacity of a memory being used. Acoustic models are subjected to clustering processing to calculate the centroid of each cluster and the differential vector between the centroid and each model, model composition between each kind of assumed noise model and the calculated centroid is carried out, and the centroid of each composition model and the differential vector are stored in a memory. In the actual recognition processing, the centroid optimal to the environment estimated by the utterance environmental estimation is extracted from the memory, model restoration is carried out on the extracted centroid by using the differential vector stored in the memory, and noise adaptation processing is executed on the basis of the restored model.

Type: Application

Filed: March 15, 2005

Publication date: October 30, 2008

Inventors: Hajime Kobayashi, Soichi Toyama, Yasunori Suzuki
Data Selecting Apparatus, Data Selecting Method, Data Selecting Program, a Recording Medium on which the Data Selecting Program is Recorded, Navigation Apparatus, Navigation Method, Navigation Program, and Recording Medium on which the Navigation Program is Recorded

Publication number: 20080046844

Abstract: The present invention is directed to provide a data selecting apparatus and a navigation apparatus capable of easily and promptly selecting one piece of data from a plurality of pieces of data. A navigation apparatus 100 has: a display controller 111 for obtaining name data and genre information of each point data from a map data storing unit 105, generating display data for displaying names of the point data, which are arranged by the genre information at the same hierarchical level on the basis of the obtained name data and genre information, and performing the display control on the generated display data, and an operating unit 106 used for selecting a genre to which point data to be selected by the user belongs and selecting a name of one piece of the point data from the selected genre.

Type: Application

Filed: March 31, 2005

Publication date: February 21, 2008

Inventors: Shinichi Sugie, Naohiko Ichihara, Yuji Koga, Soichi Toyama
Speech Recognition Apparatus And Speech Recognition Method

Publication number: 20070203700

Abstract: A speech recognition apparatus and speech recognition method are provided for reducing such events as erroneous recognition and disabled recognition and improving a recognition efficiency.

Type: Application

Filed: March 22, 2005

Publication date: August 30, 2007

Inventor: Soichi Toyama
Apparatus and method for speech recognition

Patent number: 7257532

Abstract: Before executing a speech recognition, a composite acoustic model adapted to noise is generated by composition of a noise adaptive representative acoustic model generated by noise-adaptation of each representative acoustic model and difference models stored in advance in a storing section, respectively. Then, the noise and speaker adaptive acoustic model is generated by executing speaker-adaptation to the composite acoustic model with the feature vector series of uttered speech. The renewal difference model is generated by the difference between the noise and speaker adaptive acoustic model and the noise adaptive representative acoustic model, to replace the difference model stored in the storing section therewith. The speech recognition is performed by comparing the feature vector series of the uttered speech to be recognized with the composite acoustic model adapted to noise and speaker generated by the composition of the noise adaptive representative acoustic model and the renewal difference model.

Type: Grant

Filed: September 22, 2003

Date of Patent: August 14, 2007

Assignee: Pioneer Corporation

Inventor: Soichi Toyama
Contents presenting system and method

Patent number: 7177809

Abstract: A contents presenting system includes: an analyzing unit which collects and analyzes user's conversation to output an analysis result; a contents acquiring unit which acquires contents from a contents database based on the analysis result; and a contents presenting unit which presents the acquired contents to the user. Since the analysis result of user's conversation includes a factor representing the environment where the user is talking, by determining contents based on the analysis result, it is possible to provide the contents which is suited for the environment where the user is present.

Type: Grant

Filed: June 10, 2002

Date of Patent: February 13, 2007

Assignee: Pioneer Corporation

Inventors: Takehiko Shioda, Soichi Toyama, Keiichi Yamauchi, Hiroaki Shibasaki, Hideki Amano
Speech synthesis method

Patent number: 7130799

Abstract: A speech synthesizing method which synthesizes speech naturally is disclosed. Standardized frame power values of an n-th frame is calculated when frame power values at head and tail frames in a phoneme are standardized. An average value of the power values sampled from the power frequency characteristics in the n-th frame at a predetermined frequency interval is set as a mean frame power value. A sum of squares of signal levels in one frame of a frequency signal from a sound source is calculated as a frame power correction value. A speech envelope signal is calculated as a function having variables of the standardized frame power values, the frame power correction value and the mean frame power value. The speech envelope signal adjusts the amplitude level of a speech waveform signal supplied from a vocal tract filter according to the level of the speech envelope signal.

Type: Grant

Filed: October 10, 2000

Date of Patent: October 31, 2006

Assignee: Pioneer Corporation

Inventors: Katsumi Amano, Shisei Cho, Soichi Toyama, Hiroyuki Ishihara
Speech recognition system with an adaptive acoustic model

Patent number: 7065488

Abstract: At the time of the speaker adaptation, first feature vector generation sections (7, 8, 9) generate a feature vector series [Ci, M] from which the additive noise and multiplicative noise are removed. A second feature vector generation section (12) generates a feature vector series [Si, M] including the features of the additive noise and multiplicative noise. A path search section (10) conducts a path search by comparing the feature vector series [Ci, m] to the standard vector [an, m] of the standard voice HMM (300). When the speaker adaptation section (11) conducts correlation operation on an average feature vector [S^n, m] of the standard vector [an, m] corresponding to the path search result Dv and the feature vector series [Si, m], the adaptive vector [xn, m] is generated. The adaptive vector [xn, m] updates the feature vector of the speaker adaptive acoustic model (400) used for the speech recognition.

Type: Grant

Filed: September 28, 2001

Date of Patent: June 20, 2006

Assignee: Pioneer Corporation

Inventors: Kiyoshi Yajima, Soichi Toyama
Voice recognition system

Patent number: 7016837

Abstract: An initial combination HMM 16 is generated from a voice HMM 10 having multiplicative distortions and an initial noise HMM of additive noise, and at the same time, a Jacobian matrix J is calculated by a Jacobian matrix calculating section 19. Noise variation Namh (cep), in which an estimated value Ha^(cep) of the multiplicative distortions that are obtained from voice that is actually uttered, additive noise Na(cep) that is obtained in a non-utterance period, and additive noise Nm(cep) of the initial noise HMM 17 are combined, is multiplied by a Jacobian matrix, wherein the result of the multiplication and initial combination HMM 16 are combined, and an adaptive HMM 26 is generated. Thereby, an adaptive HMM 26 that is matched to the observation value series RNah(cep) generated from actual utterance voice can be generated in advance.

Type: Grant

Filed: September 18, 2001

Date of Patent: March 21, 2006

Assignee: Pioneer Corporation

Inventors: Hiroshi Seo, Mitsuya Komamura, Soichi Toyama
Voice recognition system

Patent number: 6937981

Abstract: A multiplicative distortion Hm(cep) is subtracted from a voice HMM 5, a multiplicative distortion Ha(cep) of the uttered voice is subtracted from a noise HMM 6 formed by HMM, and the subtraction results Sm(cep) and {Nm(cep)?Ha (cep)} are combined with each other to thereby form a combined HMM 18 in the cepstrum domain. A cepstrum R^a(cep) obtained by subtracting the multiplicative distortion Ha (cep) from the cepstrum Ra (cep) of the uttered voice is compared with the distribution R^m(cep) of the combined HMM 18 in the cepstrum domain, and the combined HMM with the maximum likelihood is output as the voice recognition result.

Type: Grant

Filed: September 18, 2001

Date of Patent: August 30, 2005

Assignee: Pioneer Corporation

Inventors: Hiroshi Seo, Mitsuya Komamura, Soichi Toyama
Voice recognition system

Publication number: 20050091053

Abstract: A trained vector creating part 15 creates a characteristic of an unvoiced sound in advance as a trained vector V. Meanwhile, a threshold value THD for distinguishing a voice from a background sound is created based on a predictive residual power ? of a sound which is created during a non-voice period. As a voice is actually uttered, an inner product computation part 18 calculates an inner product of a feature vector A of an input signal Sa and a trained vector V, and a first threshold value judging part 19 judges that it is a voice section when the inner product has a value which is equal to or larger than a predetermined value ? while a second threshold value judging part 21 judges that it is a voice section when the predictive residual power ? of the input signal Sa is larger than a threshold value THD.

Type: Application

Filed: November 24, 2004

Publication date: April 28, 2005

Inventors: Hajime Kobayashi, Mitsuya Komamura, Soichi Toyama
System and method for speech recognition

Publication number: 20040260546

Abstract: A system and method include an initial noise model produced based on pre-estimated noise of a service environment and an initial synthesized model of a voice containing noise. The system and method produce an utterance environment noise model from background noise of the service environment upon speech recognition as well as a sequence of feature vectors from noise-superimposed speech including an uttered voice and the background noise. The system and method also produce an adaptive model by adapting the initial synthesized model using the utterance environment noise model, the initial noise model, and a compensation model, so that the adaptive model is checked against the sequence of feature vectors to perform speech recognition. Upon performing the speech recognition, a compensation model is created upon which the signal to noise ratio between the background noise present at the time of actual utterance of a voice and the uttered voice is reflected.

Type: Application

Filed: April 23, 2004

Publication date: December 23, 2004

Inventors: Hiroshi Seo, Soichi Toyama
Voice recognition apparatus, voice recognition method and program for voice recognition

Publication number: 20040215458

Abstract: An apparatus for recognizing a word based on voice information, includes, a keyword model storing device, a non-keyword model storing device, a model updating device and a recognition device. The keyword model storing device stores words to be potentially spoken, as keyword models. The non-keyword model storing device stores words to be potentially spoken, as non-keyword models. The model updating device updates the recognition and non-keyword models, based on a previously recognized word. The recognition device matches the recognition and non-keyword models updated with the voice information. The model updating device updates the non-keyword models utilizing a non-keyword variation vector, which is indicative of variation of the non-keyword models, from the non-updated to the updated, and has been set to be smaller than a non-keyword variation vector applied in the updating of the non-keyword models.

Type: Application

Filed: April 26, 2004

Publication date: October 28, 2004

Inventors: Hajime Kobayashi, Soichi Toyama, Yoshihiro Kawazoe, Kengo Hanai
Apparatus and method for speech recognition

Publication number: 20040093210

Abstract: Before executing a speech recognition, a composite acoustic model adapted to noise is generated by composition of a noise adaptive representative acoustic model generated by noise-adaptation of each representative acoustic model and difference models stored in advance in a storing section, respectively. Then, the noise and speaker adaptive acoustic model is generated by executing speaker-adaptation to the composite acoustic model with the feature vector series of uttered speech. The renewal difference model is generated by the difference between the noise and speaker adaptive acoustic model and the noise adaptive representative acoustic model, to replace the difference model stored in the storing section therewith. The speech recognition is performed by comparing the feature vector series of the uttered speech to be recognized with the composite acoustic model adapted to noise and speaker generated by the composition of the noise adaptive representative acoustic model and the renewal difference model.

Type: Application

Filed: September 22, 2003

Publication date: May 13, 2004

Inventor: Soichi Toyama
Apparatus and method for speech recognition

Publication number: 20030220791

Abstract: A true/false judgment on a result of speech recognition is made with high accuracy using a less volume of processing. By comparing acoustic models HMMsb against the feature vector sequence V(n) of utterances, a recognition result RCG specifying the acoustic model HMMsb having the maximum likelihood, a first score FSCR indicting the value of the maximum likelihood, and a second score SSCR indicating the value of the second highest likelihood are found. Then, by comparing an evaluation value FSCR×(FSCR−SSCR) based on the first score FSCR and the second score SSCR with a pre-set threshold value THD, a true/false judgment on the recognition result RCG is made. When the recognition result RCG is judged as being true, speaker adaptation is applied to the acoustic models HMMsb, and when the recognition result RCG is judged as being false, speaker adaptation is not applied to the acoustic models HMMsb. It is thus possible to improve the accuracy of speaker adaptation.

Type: Application

Filed: April 25, 2003

Publication date: November 27, 2003

Applicant: Pioneer Corporation

Inventor: Soichi Toyama
Speech recognition apparatus, speech recognition method, and computer-readable recording medium in which speech recognition program is recorded

Publication number: 20030220792

Abstract: A speech recognition device comprises an HMM model database which prestores keyword HMMs which represent feature patterns of keywords to be recognized, likelihood calculator which calculates the likelihood of an extracted feature value of a speech signal in each frame by comparing it with keyword HMMs and designated-speech HMMs, extraneous-speech likelihood setting device which sets extraneous-speech likelihood based on the calculated likelihood of a match with the designated-speech HMMs, matching processor which performs a matching process based on the calculated likelihood and the extraneous-speech likelihood, and determining device which determines the keywords contained in the spontaneous speech based on the matching process.

Type: Application

Filed: May 19, 2003

Publication date: November 27, 2003

Applicant: Pioneer Corporation

Inventors: Hajime Kobayashi, Soichi Toyama

1 2 next