Patents by Inventor Gakuto Kurata

Gakuto Kurata has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

INPUT GENERATION FOR CLASSIFIER

Publication number: 20180218736

Abstract: A computer-implemented method for generating an input for a classifier. The method includes obtaining n-best hypotheses which is an output of an automatic speech recognition (ASR) for an utterance, combining the n-best hypotheses horizontally in a predetermined order with a separator between each pair of hypotheses, and outputting the combined n-best hypotheses as a single text input to a classifier.

Type: Application

Filed: February 2, 2017

Publication date: August 2, 2018

Inventors: Nobuyasu Itoh, Gakuto Kurata, Ryuki Tachibana
SYMBOL SEQUENCE ESTIMATION IN SPEECH

Publication number: 20180204567

Abstract: Symbol sequences are estimated using a computer-implemented method including detecting one or more candidates of a target symbol sequence from a speech-to-text data, extracting a related portion of each candidate from the speech-to-text data, detecting repetition of at least a partial sequence of each candidate within the related portion of the corresponding candidate, labeling the detected repetition with a repetition indication, and estimating whether each candidate is the target symbol sequence, using the corresponding related portion including the repetition indication of each of the candidates.

Type: Application

Filed: January 18, 2017

Publication date: July 19, 2018

Inventors: Kenneth W. Church, Gakuto Kurata, Bhuvana Ramabhadran, Abhinav Sethy, Masayuki Suzuki, Ryuki Tachibana
Dialog server for handling conversation in virtual space method and computer program for having conversation in virtual space

Patent number: 10001970

Abstract: A dialog server which provides dialogs made by at least one user through their respective avatars in a virtual space. A method and a computer readable article of manufacture tangibly embodying computer readable instructions for executing the steps of the method are also provided. The dialog server includes: a position storage unit which stores positional information on the avatars; an utterance receiver which receives at least one utterance of avatars and utterance strength representing an importance or attention level of the utterance; an interest level calculator which calculates interest levels between avatars based on their positional information; a message processor which generates a message based on the utterance in accordance with a value calculated from the interest levels and the utterance strength; and a message transmitter which transmits the message to the avatars.

Type: Grant

Filed: January 12, 2017

Date of Patent: June 19, 2018

Assignee: Activision Publishing, Inc.

Inventors: Gakuto Kurata, Tohru Nagano, Michiaki Tatsubori
Method for improving acoustic model, computer for improving acoustic model and computer program thereof

Patent number: 9984681

Abstract: Embodiments include methods and systems for improving an acoustic model. Aspects include acquiring a first standard deviation value by calculating standard deviation of a feature from first training data and acquiring a second standard deviation value by calculating standard deviation of a feature from second training data acquired in a different environment from an environment of the first training data. Aspects also include creating a feature adapted to an environment where the first training data is recorded, by multiplying the feature acquired from the second training data by a ratio obtained by dividing the first standard deviation value by the second standard deviation value. Aspects further include reconstructing an acoustic model constructed using training data acquired in the same environment as the environment of the first training data using the feature adapted to the environment where the first training data is recorded.

Type: Grant

Filed: August 16, 2017

Date of Patent: May 29, 2018

Assignee: International Business Machines Corporation

Inventors: Gakuto Kurata, Toru Nagano, Masayuki Suzuki
Method for improving acoustic model, computer for improving acoustic model and computer program thereof

Patent number: 9984680

Abstract: Embodiments include methods and systems for improving an acoustic model. Aspects include acquiring a first standard deviation value by calculating standard deviation of a feature from first training data and acquiring a second standard deviation value by calculating standard deviation of a feature from second training data acquired in a different environment from an environment of the first training data. Aspects also include creating a feature adapted to an environment where the first training data is recorded, by multiplying the feature acquired from the second training data by a ratio obtained by dividing the first standard deviation value by the second standard deviation value. Aspects further include reconstructing an acoustic model constructed using training data acquired in the same environment as the environment of the first training data using the feature adapted to the environment where the first training data is recorded.

Type: Grant

Filed: August 16, 2017

Date of Patent: May 29, 2018

Assignee: International Business Machines Corporation

Inventors: Gakuto Kurata, Toru Nagano, Masayuki Suzuki
Pronunciation accuracy in speech recognition

Patent number: 9978364

Abstract: A reading accuracy-improving system includes: a reading conversion unit for retrieving a plurality of candidate word strings from speech recognition results to determine the reading of each candidate word string; a reading score calculating unit for determining the speech recognition score for each of one or more candidate word strings with the same reading to determine a reading score; and a candidate word string selection unit for selecting a candidate to output from the plurality of candidate word strings on the basis of the reading score and speech recognition score corresponding to each candidate word string.

Type: Grant

Filed: March 28, 2016

Date of Patent: May 22, 2018

Assignee: International Business Machines Corporation

Inventors: Gakuto Kurata, Masafumi Nishimura, Ryuki Tachibana
METHOD OF SELECTING TRAINING TEXT FOR LANGUAGE MODEL, AND METHOD OF TRAINING LANGUAGE MODEL USING THE TRAINING TEXT, AND COMPUTER AND COMPUTER PROGRAM FOR EXECUTING THE METHODS

Publication number: 20180114524

Abstract: Method of selecting training text for language model, and method of training language model using the training text, and computer and computer program for executing the methods. The present invention provides for selecting training text for a language model that includes: generating a template for selecting training text from a corpus in a first domain according to generation techniques of: (i) replacing one or more words in a word string selected from the corpus in the first domain with a special symbol representing any word or word string, and adopting the word string after replacement as a template for selecting the training text; and/or (ii) adopting the word string selected from the corpus in the first domain as the template for selecting the training text; and selecting text covered by the template as the training text from a corpus in a second domain different from the first domain.

Type: Application

Filed: November 30, 2017

Publication date: April 26, 2018

Inventors: Nobuyasu Itoh, Gakuto Kurata, Masafumi Nishimura
Method of selecting training text for language model, and method of training language model using the training text, and computer and computer program for executing the methods

Patent number: 9934776

Abstract: Method of selecting training text for language model, and method of training language model using the training text, and computer and computer program for executing the methods. The present invention provides for selecting training text for a language model that includes: generating a template for selecting training text from a corpus in a first domain according to generation techniques of: (i) replacing one or more words in a word string selected from the corpus in the first domain with a special symbol representing any word or word string, and adopting the word string after replacement as a template for selecting the training text; and/or (ii) adopting the word string selected from the corpus in the first domain as the template for selecting the training text; and selecting text covered by the template as the training text from a corpus in a second domain different from the first domain.

Type: Grant

Filed: July 20, 2015

Date of Patent: April 3, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Nobuyasu Itoh, Gakuto Kurata, Masafumi Nishimura
SUPERVISED TRAINING FOR WORD EMBEDDING

Publication number: 20180090128

Abstract: A method and system are provided for training word embedding of domain-specific words. The method includes training, by a processor, a first word embedding, using a general domain corpus, on one or more terms inputted by a user. The method further includes retraining, by the processor, the first word embedding, using a specific domain corpus, for a Neuro-Linguistic Programming task, to create a tuned word embedding. The method also includes training, by the processor, a Neural Network for the Neuro-Linguistic Programming task, using the specific domain corpus. The method additionally includes incorporating, by the processor, the trained Neural Network and tuned word embedding into a Neural Network-based Neuro-Linguistic Programming task. The retraining of the first word embedding and the training of the Neural Network are performed together, and the tuned word embedding is accelerated due to a change in a hyper parameter for domain-specific words.

Type: Application

Filed: September 23, 2016

Publication date: March 29, 2018

Inventors: Gakuto Kurata, Masayuki Suzuki, Ryuki Tachibana
RECURRENT NEURAL NETWORK PROCESSING POOLING OPERATION

Publication number: 20180082167

Abstract: A method and system are provided. The method includes obtaining, by a hardware processor, candidate data representing a plurality of candidates. The method further includes calculating, by the hardware processor, for each of the candidates, a temporal next state of a Recurrent Neural Network (RNN) by inputting a corresponding one of the candidates to the RNN at a current state. The method also includes merging, by the hardware processor, the temporal next state for each of the candidates to obtain the temporal next state of the RNN.

Type: Application

Filed: September 21, 2016

Publication date: March 22, 2018

Inventors: Gakuto Kurata, Masayuki Suzuki
DETECTING CUSTOMERS WITH LOW SPEECH RECOGNITION ACCURACY BY INVESTIGATING CONSISTENCY OF CONVERSATION IN CALL-CENTER

Publication number: 20180082676

Abstract: Methods and a system are provided for estimating automatic speech recognition (ASR) accuracy. A method includes obtaining transcriptions of utterances in a conversation over two channels. The method further includes sorting the transcriptions along a time axis using a forced alignment. The method also includes training a language model with the sorted transcriptions. The method additionally includes performing ASR for utterances in a conversation between a first user and a second user. The second user is a target of ASR accuracy estimation. The method further includes determining whether an ASR result of the second user is consistent or inconsistent with an ASR result of the first user using the trained language model. The method also includes estimating the ASR result of the second user as poor responsive to the ASR result of the second user being as inconsistent with the ASR result of the first user.

Type: Application

Filed: November 27, 2017

Publication date: March 22, 2018

Inventors: Gakuto Kurata, Masayuki A. Suzuki
Approach to reducing the response time of a speech interface

Patent number: 9922647

Abstract: A method for reducing response time in a speech interface including constructing a partially completed word sequence from a partially received utterance from a speaker received by an audio sensor, modeling a remainder portion using a processor based on a rich predictive model to predict the remainder portion, and responding to the partially completed word sequence and the predicted remainder portion using a natural language vocalization generator with a vocalization, wherein the vocalization is prepared before a complete utterance is received from the speaker and conveyed to the speaker by an audio transducer.

Type: Grant

Filed: January 29, 2016

Date of Patent: March 20, 2018

Assignee: International Business Machines Corporation

Inventors: Gakuto Kurata, Tohru Nagano
LEVERAGING CLASS INFORMATION TO INITIALIZE A NEURAL NETWORK LANGAUGE MODEL

Publication number: 20180060730

Abstract: Methods and systems for language processing includes initializing a word embedding matrix based on pre-determined word classes, such that matrix entries associated with a class of which a word is a member are initialized to a non-zero value and other entries are initialized to zero. A neural network is trained based on the initialized word embedding matrix to generate a neural network language model. A language processing task is performed using the neural network language model.

Type: Application

Filed: August 29, 2016

Publication date: March 1, 2018

Inventor: Gakuto Kurata
GENERATING LABELED DATA BY SEQUENCE-TO-SEQUENCE MODELING WITH ADDED PERTURBATIONS TO ENCODED INFORMATION

Publication number: 20180046912

Abstract: Methods and a system are provided for generating labeled data. A method includes encoding, by a processor-based encoder, a first labeled data into an encoded representation of the first labeled data. The method further includes modifying the encoded representation into a modified representation by adding a perturbation to the encoded representation. The method additionally includes decoding, by a processor-based decoder, the modified representation into a second labeled data.

Type: Application

Filed: August 12, 2016

Publication date: February 15, 2018

Inventor: Gakuto Kurata
TRAINING DEEP NEURAL NETWORK FOR ACOUSTIC MODELING IN SPEECH RECOGNITION

Publication number: 20180047413

Abstract: A method is provided for training a Deep Neural Network (DNN) for acoustic modeling in speech recognition. The method includes reading central frames and side frames as input frames from a memory. The side frames are preceding side frames preceding the central frames and/or succeeding side frames succeeding the central frames. The method further includes executing pre-training for only the central frames or both the central frames and the side frames and fine-tuning for the central frames and the side frames so as to emphasize connections between acoustic features in the central frames and units of the bottom layer in hidden layer of the DNN.

Type: Application

Filed: October 26, 2017

Publication date: February 15, 2018

Inventor: Gakuto Kurata
Method of selecting training text for language model, and method of training language model using the training text, and computer and computer program for executing the methods

Patent number: 9892727

Abstract: Method of selecting training text for language model, and method of training language model using the training text, and computer and computer program for executing the methods. The present invention provides for selecting training text for a language model that includes: generating a template for selecting training text from a corpus in a first domain according to generation techniques of: (i) replacing one or more words in a word string selected from the corpus in the first domain with a special symbol representing any word or word string, and adopting the word string after replacement as a template for selecting the training text; and/or (ii) adopting the word string selected from the corpus in the first domain as the template for selecting the training text; and selecting text covered by the template as the training text from a corpus in a second domain different from the first domain.

Type: Grant

Filed: December 10, 2015

Date of Patent: February 13, 2018

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Nobuyasu Itoh, Gakuto Kurata, Masafumi Nishimura
NEURAL NETWORK BASED ACOUSTIC MODELS FOR SPEECH RECOGNITION BY GROUPING CONTEXT-DEPENDENT TARGETS

Publication number: 20180039883

Abstract: Methods and systems for training a neural network include identifying weights in a neural network between a final hidden neuron layer and an output neuron layer that correspond to state matches between a neuron of the final hidden neuron layer and a respective neuron of the output neuron layer. The identified weights are initialized to a predetermined non-zero value and initializing other weights between the final hidden neuron layer and the output neuron layer to zero. The neural network is trained based on a training corpus after initialization.

Type: Application

Filed: August 2, 2016

Publication date: February 8, 2018

Inventor: Gakuto Kurata
APPROACH TO REDUCING THE RESPONSE TIME OF A SPEECH INTERFACE

Publication number: 20180033433

Abstract: A method for reducing response time in a speech interface including constructing a partially completed word sequence from a partially received utterance from a speaker received by an audio sensor, modeling a remainder portion using a processor based on a rich predictive model to predict the remainder portion, and responding to the partially completed word sequence and the predicted remainder portion using a natural language vocalization generator with a vocalization, wherein the vocalization is prepared before a complete utterance is received from the speaker and conveyed to the speaker by an audio transducer.

Type: Application

Filed: October 6, 2017

Publication date: February 1, 2018

Inventors: Gakuto Kurata, Tohru Nagano
Method for improving acoustic model, computer for improving acoustic model and computer program thereof

Patent number: 9870766

Abstract: Embodiments include methods and systems for improving an acoustic model. Aspects include acquiring a first standard deviation value by calculating standard deviation of a feature from first training data and acquiring a second standard deviation value by calculating standard deviation of a feature from second training data acquired in a different environment from an environment of the first training data. Aspects also include creating a feature adapted to an environment where the first training data is recorded, by multiplying the feature acquired from the second training data by a ratio obtained by dividing the first standard deviation value by the second standard deviation value. Aspects further include reconstructing an acoustic model constructed using training data acquired in the same environment as the environment of the first training data using the feature adapted to the environment where the first training data is recorded.

Type: Grant

Filed: October 28, 2015

Date of Patent: January 16, 2018

Assignee: International Business Machines Incorporated

Inventors: Gakuto Kurata, Toru Nagano, Masayuki Suzuki
Method for improving acoustic model, computer for improving acoustic model and computer program thereof

Patent number: 9870767

Abstract: Embodiments include methods and systems for improving an acoustic model. Aspects include acquiring a first standard deviation value by calculating standard deviation of a feature from first training data and acquiring a second standard deviation value by calculating standard deviation of a feature from second training data acquired in a different environment from an environment of the first training data. Aspects also include creating a feature adapted to an environment where the first training data is recorded, by multiplying the feature acquired from the second training data by a ratio obtained by dividing the first standard deviation value by the second standard deviation value. Aspects further include reconstructing an acoustic model constructed using training data acquired in the same environment as the environment of the first training data using the feature adapted to the environment where the first training data is recorded.

Type: Grant

Filed: December 15, 2015

Date of Patent: January 16, 2018

Assignee: International Business Machines Corporation

Inventors: Gakuto Kurata, Toru Nagano, Masayuki Suzuki

prev … 2 3 4 5 6 7 8 9 10 next