Patents by Inventor Gakuto Kurata

Gakuto Kurata has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210319787
    Abstract: A computer-implemented method of detecting a portion of audio data to be removed is provided. The method includes obtaining a recognition result of audio data. The recognition result includes recognized text data and time stamps. The method also includes extracting one or more candidate phrases from the recognition result using n-gram counts. The method further includes, for each candidate phrase, making pairs of the same phrase with different time stamps and clustering the pairs by using differences in the time stamps. The method further includes determining a portion of the audio data to be removed using results of the clustering.
    Type: Application
    Filed: April 10, 2020
    Publication date: October 14, 2021
    Inventors: Nobuyasu Itoh, Gakuto Kurata, Masayuki Suzuki
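
The abstract above describes an algorithmic pipeline: frequent n-grams become candidate phrases, and pairs of occurrences of the same phrase are clustered by their time-stamp differences. The following is a minimal illustrative sketch of that pipeline, not the patented implementation; the function names, n-gram length, and clustering tolerance are assumptions.

```python
# Illustrative sketch only: pick frequent n-grams from an ASR transcript as
# candidate phrases, then cluster the time gaps between occurrences of the
# same phrase. Tightly clustered gaps (e.g. a repeated hold message) suggest
# a portion of the audio that can be removed.
from collections import Counter
from itertools import combinations

def candidate_phrases(tokens, n=3, min_count=2):
    """tokens: list of (word, start_time). Return frequent word n-grams."""
    words = [w for w, _ in tokens]
    ngrams = Counter(tuple(words[i:i + n]) for i in range(len(words) - n + 1))
    return [g for g, c in ngrams.items() if c >= min_count]

def occurrence_times(tokens, phrase):
    """Start times of every occurrence of `phrase` in the transcript."""
    words = [w for w, _ in tokens]
    n = len(phrase)
    return [tokens[i][1] for i in range(len(words) - n + 1)
            if tuple(words[i:i + n]) == phrase]

def cluster_time_gaps(times, tolerance=1.0):
    """Group pairs of occurrences whose time gaps are similar (within tolerance)."""
    gaps = sorted(abs(a - b) for a, b in combinations(times, 2))
    clusters = []
    for g in gaps:
        if clusters and g - clusters[-1][-1] <= tolerance:
            clusters[-1].append(g)
        else:
            clusters.append([g])
    return clusters
```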
  • Patent number: 11145308
    Abstract: Symbol sequences are estimated using a computer-implemented method including detecting one or more candidates of a target symbol sequence from speech-to-text data, extracting a related portion of each candidate from the speech-to-text data, detecting repetition of at least a partial sequence of each candidate within the related portion of the corresponding candidate, labeling the detected repetition with a repetition indication, and estimating whether each candidate is the target symbol sequence using the corresponding related portion, including the repetition indication, of each of the candidates.
    Type: Grant
    Filed: September 20, 2019
    Date of Patent: October 12, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Kenneth W. Church, Gakuto Kurata, Bhuvana Ramabhadran, Abhinav Sethy, Masayuki Suzuki, Ryuki Tachibana
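
As a rough illustration of the repetition-labeling step in the abstract above (the label token and the matching rule are assumptions, and matching is simplified to exact word comparison):

```python
# Illustrative sketch only: mark repetitions of a candidate's leading
# sub-sequence inside its surrounding context with a label token, so a
# downstream classifier can use the repetition as a feature.
def label_repetition(context_words, candidate_words, label="<REP>"):
    """Append `label` to spans that repeat a leading sub-sequence (length >= 2)
    of `candidate_words` after its first occurrence in `context_words`."""
    out, i, seen = [], 0, False
    while i < len(context_words):
        matched = 0
        for k in range(len(candidate_words), 1, -1):      # longest match first
            if context_words[i:i + k] == candidate_words[:k]:
                matched = k
                break
        if matched:
            out.extend(context_words[i:i + matched])
            if seen:
                out.append(label)                          # repetition indication
            seen = True
            i += matched
        else:
            out.append(context_words[i])
            i += 1
    return out

# e.g. label_repetition("my number is one two three one two three".split(),
#                       "one two three".split())
```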
  • Publication number: 20210312294
    Abstract: A technique for training a model is disclosed. A training sample is obtained that includes an input sequence of observations and a target sequence of symbols having a length different from that of the input sequence of observations. The input sequence of observations is fed into the model to obtain a sequence of predictions. The sequence of predictions is shifted by an amount with respect to the input sequence of observations. The model is updated based on a loss computed using the shifted sequence of predictions and the target sequence of symbols.
    Type: Application
    Filed: April 3, 2020
    Publication date: October 7, 2021
    Inventors: Gakuto Kurata, Kartik Audhkhasi
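
A hedged sketch of what a shifted-prediction loss could look like in a CTC-style PyTorch setup; the use of CTC, the padding strategy, and the shift amount are assumptions rather than the patent's exact formulation.

```python
# Sketch (assumptions noted above): compute the loss on predictions shifted by
# `shift` frames relative to the input, which biases where the model places
# its symbol emissions along the input sequence.
import torch
import torch.nn as nn

def shifted_ctc_loss(log_probs, targets, input_lengths, target_lengths, shift=2):
    """log_probs: (T, N, C) log-softmax outputs; predictions shifted by `shift` frames."""
    ctc = nn.CTCLoss(blank=0, zero_infinity=True)
    if shift > 0:
        # Drop the first `shift` frames and pad the end by repeating the last
        # frame, so the prediction at time t is compared against the alignment
        # it would have produced at time t + shift.
        shifted = torch.cat([log_probs[shift:], log_probs[-1:].repeat(shift, 1, 1)], dim=0)
    else:
        shifted = log_probs
    return ctc(shifted, targets, input_lengths, target_lengths)
```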
  • Patent number: 11138965
    Abstract: A technique for estimating phonemes for a word written in a different language is disclosed. A sequence of graphemes of a given word in a source language is received. The sequence of graphemes in the source language is converted into a sequence of phonemes in the source language. One or more sequences of phonemes in a target language are generated from the sequence of phonemes in the source language by using a neural network model. One sequence of phonemes in the target language is determined for the given word. A technique for estimating graphemes of a word from phonemes in a different language is also disclosed.
    Type: Grant
    Filed: November 2, 2017
    Date of Patent: October 5, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Gakuto Kurata, Toru Nagano, Yuta Tsuboi
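
A conceptual sketch of the two-stage grapheme-to-phoneme pipeline described above. The lexicon, phone mapping, and function names are toy stand-ins; in the disclosed technique the source-to-target step is a trained neural network model that can emit several candidate sequences.

```python
# Toy stand-ins for illustration only.
SOURCE_G2P = {"tokyo": ["t", "o:", "k", "j", "o:"]}           # source G2P lexicon
SRC2TGT_PHONE = {"t": "T", "o:": "OW", "k": "K", "j": "Y"}    # toy phone mapping

def graphemes_to_source_phonemes(word):
    """Stage 1: graphemes of the source-language word -> source-language phonemes."""
    return SOURCE_G2P.get(word.lower(), list(word.lower()))

def source_to_target_phonemes(src_phones, n_best=1):
    """Stage 2: source-language phonemes -> candidate target-language phoneme
    sequences. A table lookup keeps the sketch runnable; the patent describes a
    neural network model here."""
    return [[SRC2TGT_PHONE.get(p, p) for p in src_phones]][:n_best]

print(source_to_target_phonemes(graphemes_to_source_phonemes("Tokyo")))
```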
  • Patent number: 11107473
    Abstract: A method for reducing response time in a speech interface includes constructing a partially completed word sequence from a partially received utterance of a speaker captured by an audio sensor, predicting a remainder portion with a processor using a rich predictive model, and responding to the partially completed word sequence and the predicted remainder portion using a natural language vocalization generator with a vocalization, wherein the vocalization is prepared before the complete utterance is received from the speaker and is conveyed to the speaker by an audio transducer.
    Type: Grant
    Filed: October 17, 2019
    Date of Patent: August 31, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Gakuto Kurata, Tohru Nagano
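
A minimal sketch of the predict-the-remainder idea, assuming a toy next-word table in place of the rich predictive model; all names here are hypothetical.

```python
# Toy predictive completion: guess how a partially received utterance will end
# so a response can be prepared before the speaker finishes.
NEXT_WORD = {"turn": "on", "on": "the", "the": "lights"}

def predict_remainder(partial, max_words=3):
    words = list(partial)
    for _ in range(max_words):
        nxt = NEXT_WORD.get(words[-1])
        if nxt is None:
            break
        words.append(nxt)
    return words[len(partial):]

partial = ["please", "turn"]
remainder = predict_remainder(partial)          # ['on', 'the', 'lights']
# A vocalized response for partial + remainder can be held until the complete
# utterance confirms (or revises) the prediction.
```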
  • Publication number: 20210248996
    Abstract: A computer-implemented method for generating a text is disclosed. The method includes obtaining a first text collection matched with a target domain and a second text collection including a plurality of samples, each of which describes rewriting between a first text and a second text that has a style different from the first text. The method also includes training a text generation model with the first text collection and the second text collection, in which the text generation model has, in its vocabulary, one or more operation tokens indicating rewriting. The method further includes outputting a plurality of texts obtained from the text generation model.
    Type: Application
    Filed: February 6, 2020
    Publication date: August 12, 2021
    Inventors: Nobuyasu Itoh, Gakuto Kurata, Masayuki Suzuki
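
One way to picture the operation tokens mentioned above is as explicit edit markers inside a rewriting sample. The sketch below is illustrative only; the token names and the use of difflib are assumptions, not the patent's representation.

```python
# Encode the rewrite between two styles of the same text with explicit
# <KEEP>/<DEL>/<INS>-style operation tokens that a text generation model could
# learn alongside plain in-domain text.
import difflib

def rewrite_with_operation_tokens(source_tokens, target_tokens):
    """Encode the edit from source to target with <DEL>/<INS> markers."""
    ops = difflib.SequenceMatcher(a=source_tokens, b=target_tokens).get_opcodes()
    out = []
    for tag, i1, i2, j1, j2 in ops:
        if tag == "equal":
            out += source_tokens[i1:i2]
        if tag in ("replace", "delete"):
            out += ["<DEL>"] + source_tokens[i1:i2]
        if tag in ("replace", "insert"):
            out += ["<INS>"] + target_tokens[j1:j2]
    return out

print(rewrite_with_operation_tokens("it is ten dollars".split(),
                                    "it's 10 dollars".split()))
```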
  • Patent number: 11056102
    Abstract: A computer-implemented method includes generating a single text data structure for a classifier of a speech recognition system, and sending the single text data structure to the classifier. Generating the single text data structure includes obtaining n-best hypotheses as an output of an automatic speech recognition (ASR) task for an utterance received by the speech recognition system, and combining the n-best hypotheses in a predetermined order with a separator between each pair of hypotheses to generate the single text data structure. The classifier is trained based on a single training text data structure by obtaining training source data, including selecting a first text sample and at least one similar text sample belonging to the same class as the first text sample based on a maximum number of hypotheses, and arranging the text samples based on a degree of similarity.
    Type: Grant
    Filed: September 23, 2019
    Date of Patent: July 6, 2021
    Assignee: International Business Machines Corporation
    Inventors: Nobuyasu Itoh, Gakuto Kurata, Ryuki Tachibana
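
The combination step lends itself to a very small sketch; the separator token is an assumption.

```python
# Join the ASR n-best hypotheses, in rank order, into one text string that a
# single text classifier can consume.
SEP = " <sep> "

def combine_nbest(hypotheses, max_n=5):
    """hypotheses: list of hypothesis strings ordered best-first."""
    return SEP.join(hypotheses[:max_n])

nbest = ["i want to check my balance",
         "i want to check my balits",
         "i won to check my balance"]
classifier_input = combine_nbest(nbest)
```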
  • Patent number: 11011161
    Abstract: A computer-implemented method is provided for generating a plurality of templates. The method includes obtaining, by a processor device, a Recurrent Neural Network Language Model (RNNLM) trained using a first set of text data. The method further includes adapting, by the processor device, the RNNLM using a second set of text data by adding a new node corresponding to a class in both an input layer and an output layer of the RNNLM, the class being obtained from the second set of text data. The method also includes generating, by the processor device, the plurality of templates using the adapted RNNLM.
    Type: Grant
    Filed: December 10, 2018
    Date of Patent: May 18, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Masayuki Suzuki, Toru Nagano, Nobuyasu Itoh, Gakuto Kurata
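
A rough PyTorch sketch of growing an RNNLM's input and output layers by one node for a new class token obtained from the adaptation text. The initialization of the new rows is an assumption; the patent does not prescribe this particular scheme.

```python
# Add one node for a new class token to both the input embedding and the output
# projection of an RNNLM, keeping the original parameters.
import torch
import torch.nn as nn

def add_class_node(embedding: nn.Embedding, output: nn.Linear):
    vocab, dim = embedding.num_embeddings, embedding.embedding_dim
    new_emb = nn.Embedding(vocab + 1, dim)
    new_out = nn.Linear(output.in_features, vocab + 1)
    with torch.no_grad():
        new_emb.weight[:vocab] = embedding.weight             # keep old input rows
        new_out.weight[:vocab] = output.weight                # keep old output rows
        new_out.bias[:vocab] = output.bias
        new_emb.weight[vocab] = embedding.weight.mean(dim=0)  # init new class node
        new_out.weight[vocab] = output.weight.mean(dim=0)
        new_out.bias[vocab] = output.bias.mean()
    return new_emb, new_out
```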
  • Patent number: 11011156
    Abstract: A computer-implemented method for training a model is disclosed. The model is capable of retaining a history of one or more preceding elements and has a direction of prediction. The method includes obtaining a training sequence of elements. The method also includes splitting the training sequence into a plurality of parts. The method further includes selecting one part of the plurality of parts, depending on the direction of the model, to generate modified training data. The method further includes training the model using the modified training data.
    Type: Grant
    Filed: April 11, 2019
    Date of Patent: May 18, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Gakuto Kurata
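
One plausible reading of the split-and-select step, written as a sketch; the random split point and the head/tail selection rule are assumptions.

```python
# Cut a training sequence at a random position and keep the part that matches
# the model's prediction direction.
import random

def modified_training_sequence(tokens, direction="forward", rng=random.Random(0)):
    if len(tokens) < 2:
        return list(tokens)
    cut = rng.randrange(1, len(tokens))
    # A forward model predicts later tokens from earlier ones, so keep the tail
    # (which now starts without its usual history); a backward model keeps the head.
    return tokens[cut:] if direction == "forward" else tokens[:cut]

print(modified_training_sequence("the cat sat on the mat".split()))
```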
  • Patent number: 10990902
    Abstract: A method, system, and computer program product are provided for learning a recognition model for recognition processing. The method includes preparing one or more examples for learning, each of which includes an input segment, an additional segment adjacent to the input segment, and an assigned label. The input segment and the additional segment are extracted from original training data. A classification model is trained, using the input segment and the additional segment in the examples, to initialize parameters of the classification model so that extended segments including the input segment and the additional segment are reconstructed from the input segment. Then, the classification model is tuned to predict a target label, using the input segment and the assigned label in the examples, based on the initialized parameters. At least a portion of the obtained classification model is included in the recognition model.
    Type: Grant
    Filed: September 25, 2019
    Date of Patent: April 27, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: Gakuto Kurata
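
A simplified PyTorch sketch of the two-phase training described above (reconstruction-based initialization followed by label tuning); the dimensions, architecture, and loss choices are assumptions.

```python
# Phase 1: initialize the encoder by reconstructing the extended segment
# (input + adjacent frames) from the input segment alone.
# Phase 2: reuse the encoder with a classification head to predict the label.
import torch
import torch.nn as nn

in_dim, ext_dim, hidden, n_labels = 40, 120, 256, 10

encoder = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU())
reconstructor = nn.Linear(hidden, ext_dim)          # phase 1 head
classifier = nn.Linear(hidden, n_labels)            # phase 2 head

def pretrain_step(x_in, x_ext, opt):
    loss = nn.functional.mse_loss(reconstructor(encoder(x_in)), x_ext)
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()

def finetune_step(x_in, y, opt):
    loss = nn.functional.cross_entropy(classifier(encoder(x_in)), y)
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()
```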
  • Patent number: 10991363
    Abstract: An apparatus, method, and computer program product for adapting an acoustic model to a specific environment are provided. An adapted model is obtained by adapting an original model to the specific environment using adaptation data, the original model being trained using training data and being used to calculate probabilities of context-dependent phones given an acoustic feature. Adapted probabilities are obtained by adapting original probabilities using the training data and the adaptation data, the original probabilities being trained using the training data and being prior probabilities of context-dependent phones. An adapted acoustic model is obtained from the adapted model and the adapted probabilities.
    Type: Grant
    Filed: November 6, 2017
    Date of Patent: April 27, 2021
    Assignee: International Business Machines Corporation
    Inventors: Gakuto Kurata, Bhuvana Ramabhadran, Masayuki Suzuki
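
A brief numeric sketch of how adapted priors typically enter a hybrid acoustic model; the interpolation of priors and the division of posteriors by priors are standard hybrid-ASR practice stated here as assumptions about the usage, not the patent's exact procedure.

```python
# Pseudo-likelihoods are the adapted network's posteriors divided by the adapted
# prior probabilities of the context-dependent phones.
import numpy as np

def adapted_priors(train_counts, adapt_counts, weight=0.5):
    """Interpolate phone priors estimated from training and adaptation data."""
    p_train = train_counts / train_counts.sum()
    p_adapt = adapt_counts / adapt_counts.sum()
    return (1 - weight) * p_train + weight * p_adapt

def pseudo_likelihoods(posteriors, priors, floor=1e-8):
    """posteriors: (frames, phones) from the adapted model."""
    return posteriors / np.maximum(priors, floor)
```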
  • Publication number: 20210110240
    Abstract: A computer-implemented method for training a neural network to capture a structural feature specific to a set of chemical compounds is disclosed. In the method, the computer system reads, for each chemical compound in the set, an expression describing the structure of the chemical compound and enumerates one or more combinations of a position and a type of structural element appearing in the expression. The computer system also generates training data based on the one or more enumerated combinations for each chemical compound in the set. The training data includes a fixed-length set of values, each of which indicates whether or not the corresponding type of structural element appears at the corresponding position for each combination. Furthermore, the computer system trains the neural network based on the training data for the set of chemical compounds.
    Type: Application
    Filed: December 23, 2020
    Publication date: April 15, 2021
    Inventors: Satoshi Hara, Gakuto Kurata, Shigeru Nakagawa, Seiji Takeda
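
A sketch of the enumeration and encoding step; SMILES is assumed as the expression, and the symbol inventory and fixed length are illustrative.

```python
# For every (position, symbol) combination, emit 1 if that structural element
# appears at that position of the expression, 0 otherwise.
import numpy as np

SYMBOLS = list("CNOScl()=#123456")            # toy symbol inventory
MAX_LEN = 64

def encode_expression(smiles: str) -> np.ndarray:
    """Return a (MAX_LEN * len(SYMBOLS),) 0/1 vector for one compound."""
    feat = np.zeros((MAX_LEN, len(SYMBOLS)), dtype=np.float32)
    for pos, ch in enumerate(smiles[:MAX_LEN]):
        if ch in SYMBOLS:
            feat[pos, SYMBOLS.index(ch)] = 1.0
    return feat.reshape(-1)

x = encode_expression("CC(=O)Oc1ccccc1C(=O)O")     # aspirin, for illustration
```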
  • Publication number: 20210082399
    Abstract: A technique for aligning the spike timing of models is disclosed. A first model, having a first architecture and trained with a set of training samples, is generated. Each training sample includes an input sequence of observations and an output sequence of symbols having a different length from the input sequence. Then, one or more second models are trained with the trained first model by minimizing, for each second model, a guide loss jointly with a normal loss, and a sequence recognition task is performed using the one or more second models. The guide loss evaluates dissimilarity in spike timing between the trained first model and each second model being trained.
    Type: Application
    Filed: September 13, 2019
    Publication date: March 18, 2021
    Inventors: Gakuto Kurata, Kartik Audhkhasi
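
A hedged sketch of a joint objective in the spirit of the abstract above; the exact guide-loss definition (here a frame-level KL divergence) and its weight are assumptions.

```python
# Train the second model with its normal CTC loss plus a frame-level divergence
# from the already-trained first model, pulling the symbol spikes of both models
# toward the same frames.
import torch
import torch.nn.functional as F

def joint_loss(student_log_probs, teacher_log_probs, targets,
               input_lengths, target_lengths, guide_weight=0.5):
    """log_probs: (T, N, C), already log-softmaxed."""
    normal = F.ctc_loss(student_log_probs, targets, input_lengths, target_lengths,
                        blank=0, zero_infinity=True)
    guide = F.kl_div(student_log_probs, teacher_log_probs.exp(),
                     reduction="batchmean")
    return normal + guide_weight * guide
```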
  • Patent number: 10923110
    Abstract: An apparatus, method, and computer program product for adapting an acoustic model to a specific environment are provided. An adapted model is obtained by adapting an original model to the specific environment using adaptation data, the original model being trained using training data and being used to calculate probabilities of context-dependent phones given an acoustic feature. Adapted probabilities are obtained by adapting original probabilities using the training data and the adaptation data, the original probabilities being trained using the training data and being prior probabilities of context-dependent phones. An adapted acoustic model is obtained from the adapted model and the adapted probabilities.
    Type: Grant
    Filed: August 25, 2017
    Date of Patent: February 16, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Gakuto Kurata, Bhuvana Ramabhadran, Masayuki Suzuki
  • Publication number: 20210043186
    Abstract: A technique for data augmentation for speech data is disclosed. Original speech data including a sequence of feature frames is obtained. A partially prolonged copy of the original speech data is generated by inserting one or more new frames into the sequence of feature frames. The partially prolonged copy is output as augmented speech data for training an acoustic model.
    Type: Application
    Filed: August 8, 2019
    Publication date: February 11, 2021
    Inventors: Toru Nagano, Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata
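
A minimal numpy sketch of the prolongation idea; the insertion positions and the duplicate-the-previous-frame strategy are assumptions.

```python
# Lengthen part of an utterance by inserting copies of existing feature frames
# at random positions, yielding a partially prolonged copy for augmentation.
import numpy as np

def prolong(features: np.ndarray, n_insert: int, rng=np.random.default_rng(0)):
    """features: (frames, dims). Returns a partially prolonged copy."""
    positions = np.sort(rng.integers(1, len(features), size=n_insert))
    inserted = [features[p - 1] for p in positions]   # repeat the preceding frame
    return np.insert(features, positions, inserted, axis=0)

augmented = prolong(np.random.rand(100, 40), n_insert=10)   # 110 frames
```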
  • Patent number: 10915808
    Abstract: A computer-implemented method for training a neural network to capture a structural feature specific to a set of chemical compounds is disclosed. In the method, the computer system reads, for each chemical compound in the set, an expression describing the structure of the chemical compound and enumerates one or more combinations of a position and a type of structural element appearing in the expression. The computer system also generates training data based on the one or more enumerated combinations for each chemical compound in the set. The training data includes a fixed-length set of values, each of which indicates whether or not the corresponding type of structural element appears at the corresponding position for each combination. Furthermore, the computer system trains the neural network based on the training data for the set of chemical compounds.
    Type: Grant
    Filed: July 5, 2016
    Date of Patent: February 9, 2021
    Assignee: International Business Machines Corporation
    Inventors: Satoshi Hara, Gakuto Kurata, Shigeru Nakagawa, Seiji Takeda
  • Patent number: 10909316
    Abstract: A computer-implemented method, computer program product, and system are provided for separating a word in a dictionary. The method includes reading a word from the dictionary as a source word. The method also includes searching the dictionary for another word having a substring with the same surface string and the same reading as the source word. The method additionally includes splitting that other word by the source word to obtain one or more of its remaining substrings. The method further includes registering each of the one or more remaining substrings as a new word in the dictionary.
    Type: Grant
    Filed: October 31, 2019
    Date of Patent: February 2, 2021
    Assignee: International Business Machines Corporation
    Inventors: Toru Nagano, Nobuyasu Itoh, Gakuto Kurata
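
A toy sketch of the splitting step; matching on surface string and reading is simplified here to plain substring matching.

```python
# If another dictionary word contains the source word, register the leftover
# substrings as new dictionary words.
def split_by_source(dictionary, source):
    new_words = set()
    for word in dictionary:
        if word != source and source in word:
            for piece in word.split(source):
                if piece:
                    new_words.add(piece)
    return dictionary | new_words

print(split_by_source({"smartphone", "phone"}, "phone"))   # adds 'smart'
```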
  • Publication number: 20200356850
    Abstract: Fusion of neural networks is performed by obtaining a first neural network and a second neural network. The first and the second neural networks are the result of a parent neural network subjected to different training. A similarity score is calculated of a first component of the first neural network and a corresponding second component of the second neural network. An interpolation weight is determined for the first and the second components by using the similarity score. A neural network parameter of the first component is updated based on the interpolation weight and a corresponding neural network parameter of the second component to obtain a fused neural network.
    Type: Application
    Filed: May 8, 2019
    Publication date: November 12, 2020
    Inventors: Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata
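
A rough numpy sketch of a similarity-driven fusion rule for one pair of corresponding components; the cosine-similarity choice and the mapping from similarity to interpolation weight are assumptions.

```python
# The more similar a pair of corresponding components is, the more evenly their
# parameters are averaged when producing the fused network.
import numpy as np

def fuse_component(w1: np.ndarray, w2: np.ndarray):
    sim = np.dot(w1.ravel(), w2.ravel()) / (
        np.linalg.norm(w1) * np.linalg.norm(w2) + 1e-8)
    alpha = 0.5 * (sim + 1.0) / 2.0     # map similarity in [-1, 1] to a weight in [0, 0.5]
    return (1.0 - alpha) * w1 + alpha * w2

fused = fuse_component(np.random.rand(4, 4), np.random.rand(4, 4))
```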
  • Patent number: 10832657
    Abstract: A computer-implemented method, computer program product, and apparatus are provided. The method includes generating a plurality of sequences of small unit tokens from a first language model that is trained with a small unit corpus including the small unit tokens, the small unit corpus having been derived by tokenization with a small unit. The method further includes tokenizing the plurality of sequences of small unit tokens by a large unit that is larger than the small unit, to create a derived large unit corpus including derived large unit tokens.
    Type: Grant
    Filed: March 1, 2018
    Date of Patent: November 10, 2020
    Assignee: International Business Machines Corporation
    Inventors: Masayuki Suzuki, Nobuyasu Itoh, Gakuto Kurata
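
An illustrative sketch of deriving a large-unit corpus from a small-unit language model; the character-level sampling model and whitespace tokenization are stand-ins for the patent's small and large units.

```python
# Sample character-level ("small unit") sequences from a language model, then
# re-tokenize them into word-level ("large unit") tokens to build a derived
# large-unit corpus.
import random

def sample_small_units(model, length=40, rng=random.Random(0)):
    """`model` maps a previous character to candidate next characters."""
    seq = [" "]
    for _ in range(length):
        seq.append(rng.choice(model.get(seq[-1], [" "])))
    return "".join(seq)

def to_large_units(text):
    return text.split()                 # whitespace as the large-unit boundary

toy_model = {" ": list("th"), "t": ["h", " "], "h": ["e", " "], "e": [" "]}
derived_corpus = [to_large_units(sample_small_units(toy_model)) for _ in range(3)]
```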
  • Publication number: 20200327881
    Abstract: A computer-implemented method for training a model is disclosed. The model is capable of retaining a history of one or more preceding elements and has a direction of prediction. The method includes obtaining a training sequence of elements. The method also includes splitting the training sequence into a plurality of parts. The method further includes selecting one part of the plurality of parts, depending on the direction of the model, to generate modified training data. The method further includes training the model using the modified training data.
    Type: Application
    Filed: April 11, 2019
    Publication date: October 15, 2020
    Inventor: Gakuto Kurata