Patents by Inventor Gakuto Kurata

Gakuto Kurata has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20080306742
    Abstract: For the design of a speech interface accepting speech control options, speech samples are stored on a computer-readable medium. A similarity calculating unit calculates an indication of the similarity between first and second sets of the speech samples, the first set being associated with a first speech control option and the second set with a second speech control option. A display unit displays the similarity indication. In another aspect, word vectors are generated for the respective speech sample sets, indicating the frequencies of occurrence of respective words in those sets. The similarity calculating unit calculates the similarity indication responsive to the word vectors of the respective speech sample sets. In another aspect, a perplexity indication is calculated for the respective speech sample sets responsive to language models for those sets.
    Type: Application
    Filed: July 31, 2008
    Publication date: December 11, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Osamu Ichikawa, Gakuto Kurata, Masafumi Nishimura
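The word-vector aspect above can be sketched as a small similarity computation. This is a hypothetical illustration, not the patented implementation: the abstract does not name a similarity measure, so cosine similarity over word-frequency vectors is assumed here, and the sample phrases are invented.

```python
from collections import Counter
from math import sqrt

def word_vector(samples):
    """Count word occurrences across a set of transcribed speech samples."""
    counts = Counter()
    for sample in samples:
        counts.update(sample.split())
    return counts

def cosine_similarity(vec_a, vec_b):
    """Cosine similarity between two sparse word-frequency vectors."""
    dot = sum(vec_a[w] * vec_b[w] for w in vec_a if w in vec_b)
    norm_a = sqrt(sum(c * c for c in vec_a.values()))
    norm_b = sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

# Two sets of samples, each associated with one speech control option.
option_a = ["play the next song", "play next track"]
option_b = ["skip to the next track", "go to next song"]

similarity = cosine_similarity(word_vector(option_a), word_vector(option_b))
```

A designer could display this value to see whether two control options collect confusably similar phrasings.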
  • Publication number: 20080221890
    Abstract: Techniques for acquiring, from input text and input speech, a pair of a character string and its pronunciation that should be recognized as a word. A system according to the present invention: selects, from the input text, plural candidate character strings which are candidates to be recognized as words; generates plural pronunciation candidates for the selected candidate character strings; generates frequency data by combining data in which the generated pronunciation candidates are respectively associated with the character strings; generates recognition data in which character strings indicating plural words contained in the input speech are associated with pronunciations; and selects and outputs a combination contained in the recognition data, out of combinations each consisting of one of the candidate character strings and one of the pronunciation candidates.
    Type: Application
    Filed: March 6, 2008
    Publication date: September 11, 2008
    Inventors: Gakuto Kurata, Shinsuke Mori, Masafumi Nishimura
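The final selection step above can be sketched as keeping only those (character string, pronunciation) combinations that also occur in the recognition data derived from the input speech. The pronunciation-candidate table and recognized pairs below are invented toy data (readings shown in romanized Japanese), not output of the patented system.

```python
def select_word_pairs(candidate_strings, pronunciations_of, recognition_pairs):
    """Return candidate (string, pronunciation) combinations that are
    also attested in the speech-recognition output."""
    selected = []
    for s in candidate_strings:
        for p in pronunciations_of.get(s, []):
            if (s, p) in recognition_pairs:
                selected.append((s, p))
    return selected

# Toy data: candidate strings from the text with pronunciation candidates,
# and (string, pronunciation) pairs recognized from the speech.
candidates = ["東京", "京都"]
pron_candidates = {"東京": ["toukyou", "toukei"], "京都": ["kyouto"]}
recognized = {("東京", "toukyou"), ("大阪", "oosaka")}

result = select_word_pairs(candidates, pron_candidates, recognized)
# result contains only ("東京", "toukyou")
```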
  • Publication number: 20080177543
    Abstract: Training wording data indicating the wording of each of the words in a training text, training speech data indicating characteristics of the speech of each of the words, and training boundary data indicating whether each word in the training speech is a boundary of a prosodic phrase are stored. After candidates for boundary data are inputted, a first likelihood that the prosodic-phrase boundaries of the words in the inputted text agree with one of the inputted boundary data candidates is calculated, and a second likelihood is calculated likewise. Thereafter, the one boundary data candidate maximizing the product of the first and second likelihoods is found among the inputted boundary data candidates, and the result of the search is outputted.
    Type: Application
    Filed: November 27, 2007
    Publication date: July 24, 2008
    Applicant: International Business Machines Corporation
    Inventors: Tohru Nagano, Masafumi Nishimura, Ryuki Tachibana, Gakuto Kurata
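The search step above can be sketched as an argmax over candidates: among the inputted boundary-data candidates, pick the one whose product of the two likelihoods is largest. The likelihood values below are stand-in toy tables, not the trained models of the patent; each candidate marks, per word, whether a prosodic-phrase boundary follows it.

```python
def best_boundary_candidate(candidates, wording_likelihood, speech_likelihood):
    """Pick the boundary-data candidate maximizing the product of the
    wording-based and speech-based likelihoods."""
    return max(candidates,
               key=lambda c: wording_likelihood(c) * speech_likelihood(c))

# Toy candidates for a three-word sentence and made-up likelihood tables.
candidates = [(True, False, True), (False, False, True), (True, True, True)]
l1 = {(True, False, True): 0.5, (False, False, True): 0.3, (True, True, True): 0.2}
l2 = {(True, False, True): 0.4, (False, False, True): 0.9, (True, True, True): 0.1}

best = best_boundary_candidate(candidates, l1.get, l2.get)
# best is (False, False, True): product 0.27 beats 0.20 and 0.02
```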
  • Publication number: 20080154594
    Abstract: An apparatus, method and program for dividing a conversational dialog into utterances. The apparatus includes: a computer processor; a word database for storing spellings and pronunciations of words; a grammar database for storing syntactic rules on words; a pause detecting section which detects a pause location in the channel carrying the main speech among conversational dialogs inputted on at least two channels; an acknowledgement detecting section which detects an acknowledgement location in a channel not carrying the main speech; a boundary-candidate extracting section which extracts boundary candidates in the main speech by extracting pauses existing within a predetermined range before and after a base point, namely the acknowledgement location; and a recognizing unit which outputs a word string of the main speech segmented by one of the extracted boundary candidates after dividing the segmented speech into optimal utterances with reference to the word database and grammar database.
    Type: Application
    Filed: December 26, 2007
    Publication date: June 26, 2008
    Inventors: Nobuyasu Itoh, Gakuto Kurata
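The boundary-candidate extraction above can be sketched as a windowed match between pause locations in the main-speech channel and acknowledgement locations in the other channel. The 0.5-second window and all timestamps are invented for illustration; the patent speaks only of a "predetermined range".

```python
def boundary_candidates(pause_times, ack_times, window=0.5):
    """Keep pauses lying within +/- window seconds of some acknowledgement,
    using each acknowledgement location as a base point."""
    return sorted({p for p in pause_times
                   for a in ack_times
                   if abs(p - a) <= window})

pauses = [1.2, 3.8, 6.1, 9.0]  # pause locations (s) in the main channel
acks = [1.0, 6.4]              # acknowledgement ("uh-huh") locations (s)

cands = boundary_candidates(pauses, acks, window=0.5)
# cands is [1.2, 6.1]: the pauses near the two acknowledgements
```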
  • Publication number: 20080046247
    Abstract: A system for generating high-quality synthesized text-to-speech includes a learning data generating unit, a frequency data generating unit, and a setting unit. The learning data generating unit recognizes inputted speech, and then generates first learning data in which wordings of phrases are associated with readings thereof. The frequency data generating unit generates, based on the first learning data, frequency data indicating the appearance frequencies of both wordings and readings of phrases. The setting unit sets the generated frequency data for a language processing unit so that the speech outputted by text-to-speech approximates the inputted speech. Furthermore, the language processing unit generates, from a wording of text, a reading corresponding to the wording, on the basis of the appearance frequencies.
    Type: Application
    Filed: July 9, 2007
    Publication date: February 21, 2008
    Inventors: Gakuto Kurata, Toru Nagano, Masafumi Nishimura, Ryuki Tachibana
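The frequency-data generation above can be sketched as counting (wording, reading) pairs from the first learning data. The Japanese words and romanized readings are toy examples, and picking the most frequent reading is just one simple use of such frequency data, not necessarily the patented language-processing step.

```python
from collections import Counter

# First learning data: (wording, reading) pairs from recognized speech.
learning_data = [
    ("今日", "kyou"), ("今日", "kyou"), ("今日", "konnichi"),
    ("明日", "ashita"), ("明日", "asu"), ("明日", "ashita"),
]

# Frequency data: how often each wording appears with each reading.
frequency = Counter(learning_data)

def most_likely_reading(wording):
    """Pick the reading observed most often for a given wording."""
    readings = {r: c for (w, r), c in frequency.items() if w == wording}
    return max(readings, key=readings.get)
```

For example, `most_likely_reading("今日")` returns `"kyou"`, the reading seen twice, so synthesis would follow the dominant pronunciation in the inputted speech.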
  • Publication number: 20080040119
    Abstract: For the design of a speech interface accepting speech control options, speech samples are stored on a computer-readable medium. A similarity calculating unit calculates an indication of the similarity between first and second sets of the speech samples, the first set being associated with a first speech control option and the second set with a second speech control option. A display unit displays the similarity indication. In another aspect, word vectors are generated for the respective speech sample sets, indicating the frequencies of occurrence of respective words in those sets. The similarity calculating unit calculates the similarity indication responsive to the word vectors of the respective speech sample sets. In another aspect, a perplexity indication is calculated for the respective speech sample sets responsive to language models for those sets.
    Type: Application
    Filed: July 3, 2007
    Publication date: February 14, 2008
    Inventors: Osamu Ichikawa, Gakuto Kurata, Masafumi Nishimura
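The perplexity aspect of this invention (also described in publication 20080306742 above) can be sketched with a per-option language model. The abstract does not specify the model type or smoothing, so an add-alpha-smoothed unigram model is assumed here, with invented sample phrases.

```python
from collections import Counter
from math import log2

def unigram_perplexity(train_samples, test_samples, alpha=1.0):
    """Perplexity of a test sample set under an add-alpha-smoothed unigram
    language model estimated from a training sample set."""
    counts = Counter(w for s in train_samples for w in s.split())
    vocab = set(counts) | {w for s in test_samples for w in s.split()}
    total = sum(counts.values())
    denom = total + alpha * len(vocab)

    log_prob, n_words = 0.0, 0
    for s in test_samples:
        for w in s.split():
            p = (counts[w] + alpha) / denom  # smoothed unigram probability
            log_prob += log2(p)
            n_words += 1
    return 2 ** (-log_prob / n_words)

# Perplexity of one option's samples under its own language model.
samples = ["play the next song", "play next track", "skip this song"]
pp = unigram_perplexity(samples, samples)
```

A low perplexity suggests the option's phrasings are internally consistent; comparing perplexities across sets gives another similarity signal alongside the word vectors.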