Patents by Inventor Shigeki KARITA

Shigeki KARITA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Determination device, determination method, and determination program

Patent number: 11837222

Abstract: A determination device includes a memory, and processing circuitry coupled to the memory and configured to accept input of a plurality of sequences provided as candidates for a solution to one given input, and determine, for two sequences of the plurality of sequences, a sequence that has a higher accuracy than the other sequence of the two sequences, using a model expressed as a neural network.

Type: Grant

Filed: February 1, 2019

Date of Patent: December 5, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Atsunori Ogawa, Marc Delcroix, Shigeki Karita, Tomohiro Nakatani
GENERATION DEVICE, GENERATION METHOD, AND GENERATION PROGRAM

Publication number: 20230032372

Abstract: The extraction unit 132 extracts a second word corresponding to a first word included in a first text from among a plurality of words belonging to a predetermined domain. The determination unit 133 determines whether a predetermined condition for the word class of the first word is satisfied or not. When it is determined by the determination unit 133 that the condition is satisfied, the generation unit 134 generates a second text in which the first word of the first text is exchanged with the second word.

Type: Application

Filed: January 22, 2020

Publication date: February 2, 2023

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Atsunori OGAWA, Naohiro TAWARA, Shigeki KARITA, Marc DELCROIX
Learning device and method for updating a parameter of a speech recognition model

Patent number: 11551667

Abstract: A learning device (10) includes a feature extracting unit (11) that extracts features of speech from speech data for training, a probability calculating unit (12) that, on the basis of the features of speech, performs prefix searching using a speech recognition model of which a neural network is representative, and calculates a posterior probability of a recognition character string to obtain a plurality of hypothetical character strings, an error calculating unit (13) that calculates an error by word error rates of the plurality of hypothetical character strings and a correct character string for training, and obtains a parameter for the entire model that minimizes an expected value of summation of loss in the word error rates, and an updating unit (14) that updates a parameter of the model in accordance with the parameter obtained by the error calculating unit (13).

Type: Grant

Filed: February 1, 2019

Date of Patent: January 10, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Shigeki Karita, Atsunori Ogawa, Marc Delcroix, Tomohiro Nakatani
DETERMINATION DEVICE, TRAINING DEVICE, DETERMINATION METHOD, AND DETERMINATION PROGRAM

Publication number: 20220262356

Abstract: A reranking device include a hypothesis input unit configured to receive input of N-best hypotheses associated with scores of a speech recognition accuracy; a hypothesis selection unit configured to select two hypotheses to be determined from among the input N-best hypotheses. Further, there is a determination unit configured to determine which accuracy of two hypotheses is higher by using: a plurality of first auxiliary model to M-th auxiliary model represented by such a neural network as to be capable of converting, when the selected two hypotheses are given, the two hypotheses into hidden state vectors, and determining which of the two hypotheses is higher based on the hidden state vectors of the two hypotheses; and a main model represented by such a neural network as to be capable of determining which of the two hypotheses is higher based on the hidden state vectors of the two hypotheses.

Type: Application

Filed: August 8, 2019

Publication date: August 18, 2022

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Atsunori OGAWA, Marc DELCROIX, Shigeki KARITA, Tomohiro NAKATANI
Unsupervised Learning of Disentangled Speech Content and Style Representation

Publication number: 20220189456

Abstract: A linguistic content and speaking style disentanglement model includes a content encoder, a style encoder, and a decoder. The content encoder is configured to receive input speech as input and generate a latent representation of linguistic content for the input speech output. The content encoder is trained to disentangle speaking style information from the latent representation of linguistic content. The style encoder is configured to receive the input speech as input and generate a latent representation of speaking style for the input speech as output. The style encoder is trained to disentangle linguistic content information from the latent representation of speaking style. The decoder is configured to generate output speech based on the latent representation of linguistic content for the input speech and the latent representation of speaking style for the same or different input speech.

Type: Application

Filed: November 18, 2021

Publication date: June 16, 2022

Applicant: Google LLC

Inventors: Ruoming Pang, Andros Tjandra, Yu Zhang, Shigeki Karita
LEARNING DEVICE, LEARNING METHOD AND LEARNING PROGRAM

Publication number: 20210056954

Abstract: A learning device (10) includes a feature extracting unit (11) that extracts features of speech from speech data for training, a probability calculating unit (12) that, on the basis of the features of speech, performs prefix searching using a speech recognition model of which a neural network is representative, and calculates a posterior probability of a recognition character string to obtain a plurality of hypothetical character strings, an error calculating unit (13) that calculates an error by word error rates of the plurality of hypothetical character strings and a correct character string for training, and obtains a parameter for the entire model that minimizes an expected value of summation of loss in the word error rates, and an updating unit (14) that updates a parameter of the model in accordance with the parameter obtained by the error calculating unit (13).

Type: Application

Filed: February 1, 2019

Publication date: February 25, 2021

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Shigeki KARITA, Atsunori OGAWA, Marc DELCROIX, Tomohiro NAKATANI
DETERMINATION DEVICE, DETERMINATION METHOD, AND DETERMINATION PROGRAM

Publication number: 20210035564

Abstract: A determination device includes a memory, and processing circuitry coupled to the memory and configured to accept input of a plurality of sequences provided as candidates for a solution to one given input, and determine, for two sequences of the plurality of sequences, a sequence that has a higher accuracy than the other sequence of the two sequences, using a model expressed as a neural network.

Type: Application

Filed: February 1, 2019

Publication date: February 4, 2021

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Atsunori OGAWA, Marc DELCROIX, Shigeki KARITA, Tomohiro NAKATANI
LEARNING DEVICE, LEARNING METHOD, AND LEARNING PROGRAM

Publication number: 20200365143

Abstract: A learning device includes a memory, and processing circuitry coupled to the memory and configured to receive an input of a plurality of series for learning having known accuracy, and learn a model represented by a neural network, the model being capable of determining accuracy levels of two series when given feature amounts of the two series among the plurality of series.

Type: Application

Filed: February 1, 2019

Publication date: November 19, 2020

Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Atsunori OGAWA, Marc DELCROIX, Shigeki KARITA, Tomohiro NAKATANI