Patents by Inventor Eiichiro Sumita

Eiichiro Sumita has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Pseudo parallel translation data generation apparatus, machine translation processing apparatus, and pseudo parallel translation data generation method

Patent number: 11816444

Abstract: Provided are a model training method for neural machine translation that enhances an encoder using a monolingual corpus of a target language and improves the accuracy of the entire translator, and a machine translation system for performing the model training method. The machine translation system 1000 uses a monolingual corpus of the target language to obtain multiple pieces of pseudo source language data, thus allowing for obtaining a large amount of pseudo parallel corpus data having diversity. Further, the machine translation system 1000 uses both the pseudo parallel corpus data having diversity, which has been obtained in large quantities, and the base parallel corpus data in a small quantity but with high accuracy, with the applied learning rates changed accordingly, to perform the learning process (training process) for the machine translation model. This allows the machine translation system 1000 to obtain a learned model (machine translation model) with very high accuracy.

Type: Grant

Filed: February 12, 2019

Date of Patent: November 14, 2023

Assignee: NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY

Inventors: Kenji Imamura, Atsushi Fujita, Eiichiro Sumita
INFERENCER, INFERENCE METHOD AND INFERENCE PROGRAM

Publication number: 20220237380

Abstract: Improvement is made in performance of a trained neural network that uses positional information indicating a position at which each token included in an input sequence is present in the input sequence.

Type: Application

Filed: July 6, 2020

Publication date: July 28, 2022

Applicant: NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY

Inventors: Kehai CHEN, Rui WANG, Masao UCHIYAMA, Eiichiro SUMITA
Topic inferring apparatus, topic inferring method, and storage medium

Patent number: 11301624

Abstract: In order to solve a problem that the level of precision in inferring a crosslingual topic of documents or words in a non-parallel corpus associated in the document level is not high, a topic inferring apparatus includes: a word distribution information storage unit in which word distribution information is stored in association with each of two or more languages; a document topic distribution generating unit that acquires document topic distribution information of a multilingual document set; a segment topic distribution generating unit that acquires segment topic distribution information of each segment, using the document topic distribution information; and a word topic determining unit that determines, for each word contained in two or more documents contained in the multilingual document set, a topic of each word using the segment topic distribution information. Accordingly, it is possible to improve the level of precision in inferring a topic.

Type: Grant

Filed: February 10, 2017

Date of Patent: April 12, 2022

Assignee: NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY

Inventors: Akihiro Tamura, Eiichiro Sumita, Yutaka Kidawara
TOPIC INFERRING APPARATUS, TOPIC INFERRING METHOD, AND STORAGE MEDIUM

Publication number: 20210166135

Abstract: In order to solve a problem that the level of precision in inferring a crosslingual topic of documents or words in a non-parallel corpus associated in the document level is not high, a topic inferring apparatus includes: a word distribution information storage unit in which word distribution information is stored in association with each of two or more languages; a document topic distribution generating unit that acquires document topic distribution information of a multilingual document set; a segment topic distribution generating unit that acquires segment topic distribution information of each segment, using the document topic distribution information; and a word topic determining unit that determines, for each word contained in two or more documents contained in the multilingual document set, a topic of each word using the segment topic distribution information. Accordingly, it is possible to improve the level of precision in inferring a topic.

Type: Application

Filed: February 10, 2017

Publication date: June 3, 2021

Inventors: Akihiro TAMURA, Eiichiro SUMITA, Yutaka KIDAWARA
PSEUDO PARALLEL TRANSLATION DATA GENERATION APPARATUS, MACHINE TRANSLATION PROCESSING APPARATUS, AND PSEUDO PARALLEL TRANSLATION DATA GENERATION METHOD

Publication number: 20210027026

Abstract: Provided are a model training method for neural machine translation that enhances an encoder using a monolingual corpus of a target language and improves the accuracy of the entire translator, and a machine translation system for performing the model training method. The machine translation system 1000 uses a monolingual corpus of the target language to obtain multiple pieces of pseudo source language data, thus allowing for obtaining a large amount of pseudo parallel corpus data having diversity. Further, the machine translation system 1000 uses both the pseudo parallel corpus data having diversity, which has been obtained in large quantities, and the base parallel corpus data in a small quantity but with high accuracy, with the applied learning rates changed accordingly, to perform the learning process (training process) for the machine translation model. This allows the machine translation system 1000 to obtain a learned model (machine translation model) with very high accuracy.

Type: Application

Filed: February 12, 2019

Publication date: January 28, 2021

Inventors: Kenji IMAMURA, Atsushi FUJITA, Eiichiro SUMITA
Word alignment score computing apparatus, word alignment apparatus, and computer program

Patent number: 10049105

Abstract: [Object] An object is to provide an apparatus for attaining highly precise word alignment. [Solution] The apparatus includes: selecting means receiving a bilingual sentence pair and a word alignment for the bilingual sentence pair, for successively selecting words fj of a sentence in a first language in a prescribed order; and a recurrent neural network (RNN) 100, computing, for all words of the sentence in the first language, a score 102 representing a probability that a word pair consisting of the word fj and a word ea_{j} aligned with the word fj by a word alignment aj in a second language of the bilingual sentence pair is a correct word pair, and based on this score, for computing a score of the word alignment aj. When computing a score of word pair (fj, ea_{j}), RNN 100 computes a score 102 of the word pair (fj, ea_{j}) based on all word alignments a1j-1 selected by the selecting means prior to the word fj of the word pair (fj, ea_{j}), of the word alignments aj, by means of a recurrent connection 118.

Type: Grant

Filed: February 12, 2015

Date of Patent: August 14, 2018

Assignee: National Institute of Information and Communications Technology

Inventors: Akihiro Tamura, Taro Watanabe, Eiichiro Sumita
JOINT ESTIMATION METHOD AND METHOD OF TRAINING SEQUENCE-TO-SEQUENCE MODEL THEREFOR

Publication number: 20170154258

Abstract: An estimation method utilizing a pair of target-directional models 106 and 108 includes the steps 160 and 164 of decoding an input 142 utilizing the first and the second models 106 and 108, thereby producing k-best hypotheses 162 and 166 from each of the first and the second models 106 and 108; calculating a union of the k-best hypotheses, and re-scoring 168 each of the best hypotheses in the union utilizing the first and the second models; and selecting a hypothesis 144 with the highest score.

Type: Application

Filed: November 30, 2015

Publication date: June 1, 2017

Inventors: Lemao LIU, Andrew FINCH, Masao UCHIYAMA, Eiichiro SUMITA
WORD ALIGNMENT SCORE COMPUTING APPARATUS, WORD ALIGNMENT APPARATUS, AND COMPUTER PROGRAM

Publication number: 20170068665

Abstract: [Object] An object is to provide an apparatus for attaining highly precise word alignment. [Solution] The apparatus includes: selecting means receiving a bilingual sentence pair and a word alignment for the bilingual sentence pair, for successively selecting words fj of a sentence in a first language in a prescribed order; and a recurrent neural network (RNN) 100, computing, for all words of the sentence in the first language, a score 102 representing a probability that a word pair consisting of the word fj and a word ea_{j} aligned with the word fj by a word alignment aj in a second language of the bilingual sentence pair is a correct word pair, and based on this score, for computing a score of the word alignment aj. When computing a score of word pair (fj, ea_{j}), RNN 100 computes a score 102 of the word pair (fj, ea_{j}) based on all word alignments a1j-1 selected by the selecting means prior to the word fj of the word pair (fj, ea_{j}), of the word alignments aj, by means of a recurrent connection 118.

Type: Application

Filed: February 12, 2015

Publication date: March 9, 2017

Inventors: Akihiro TAMURA, Taro WATANABE, Eiichiro SUMITA
Speech translation system, dictionary server, and program

Patent number: 9442920

Abstract: A conventional speech recognition dictionary, translation dictionary and speech synthesis dictionary used in speech translation have inconsistencies.

Type: Grant

Filed: March 3, 2010

Date of Patent: September 13, 2016

Assignee: National Institute of Information and Communications Technology

Inventors: Satoshi Nakamura, Eiichiro Sumita, Yutaka Ashikari, Noriyuki Kimura, Chiori Hori
TRANSLATION APPARATUS, LEARNING APPARATUS, TRANSLATION METHOD, AND STORAGE MEDIUM

Publication number: 20160179790

Abstract: In order to solve a problem that, in the case of using a neural network that non-linearly links features, scores of translation candidates have to be calculated again during search, which requires an inordinate amount of effort, a translation apparatus includes: a parameter storage unit in which a first weight vector that is applied to a non-local feature function and a second weight vector that is applied to a local feature function can be stored; a feature function information storage unit in which non-local first feature function information and local second feature function information can be stored; a portion pair information storage unit in which two or more pieces of portion pair information such as a phrase pair or a rule pair can be stored; a score acquiring unit that acquires scores of two or more target language sentences by introducing a non-linear model to units of a phrase pair, a rule pair, or the like, and limiting the non-linear model to features closed to a phrase pair or a rule pair; a targ

Type: Application

Filed: May 23, 2014

Publication date: June 23, 2016

Inventors: Taro WATANABE, Lemao LIU, Eiichiro SUMITA
BILINGUAL PHRASE LEARNING APPARATUS, STATISTICAL MACHINE TRANSLATION APPARATUS, BILINGUAL PHRASE LEARNING METHOD, AND STORAGE MEDIUM

Publication number: 20160132491

Abstract: In order to solve a conventional problem that a translation model has to be updated each time a translation corpus is added, in a case of calculating a score of each phrase pair acquired from a j-th translation corpus (2?j?N) (added translation corpus), a score of each phrase pair corresponding to the j-th translation corpus is calculated using the one or more pieces of phrase appearance frequency information corresponding to a (j?1)-th translation corpus, the calculated score is used to generate a translation model, and the newly generated translation model is used in a state of being integrated to an original translation model. Accordingly, a translation model can be easily enhanced in a stepwise manner.

Type: Application

Filed: May 23, 2014

Publication date: May 12, 2016

Applicant: National Institute of Information and Communications Technology

Inventors: Taro WATANABE, Conghui ZHU, Eiichiro SUMITA
Speech translation system, control device, and control method

Patent number: 8954335

Abstract: Appropriate processing results or appropriate apparatuses can be selected with a control device that selects the most probable speech recognition result by using speech recognition scores received with speech recognition results from two or more speech recognition apparatuses; sends the selected speech recognition result to two or more translation apparatuses respectively; selects the most probable translation result by using translation scores received with translation results from the two or more translation apparatuses; sends the selected translation result to two or more speech synthesis apparatuses respectively; receives a speech synthesis processing result including a speech synthesis result and a speech synthesis score from each of the two or more speech synthesis apparatuses; selects the most probable speech synthesis result by using the scores; and sends the selected speech synthesis result to a second terminal apparatus.

Type: Grant

Filed: March 3, 2010

Date of Patent: February 10, 2015

Assignee: National Institute of Information and Communications Technology

Inventors: Satoshi Nakamura, Eiichiro Sumita, Yutaka Ashikari, Noriyuki Kimura, Chiori Hori
Speech translation system, first terminal apparatus, speech recognition server, translation server, and speech synthesis server

Patent number: 8862478

Abstract: In conventional network-type speech translation systems, devices or models for recognizing or synthesizing speech cannot be changed in accordance with speakers' attributes, and therefore, accuracy is reduced or inappropriate output occurs in each process of speech recognition, translation, and speech synthesis. Accuracy of each processing of speech translation, translation, or speech synthesis is improved and appropriate output is performed in a network-type speech translation system by, based on speaker attributes, appropriately changing the server to perform speech recognition or the speech recognition model, appropriately changing the translation server to perform translation or the translation model, or appropriately changing the speech synthesis server or speech synthesis model.

Type: Grant

Filed: March 3, 2010

Date of Patent: October 14, 2014

Assignee: National Institute of Information and Communications Technology

Inventors: Satoshi Nakamura, Eiichiro Sumita, Yutaka Ashikari, Noriyuki Kimura, Chiori Hori
SPEECH TRANSLATION SYSTEM, CONTROL DEVICE, AND CONTROL METHOD

Publication number: 20120221321

Abstract: Appropriate processing results or appropriate apparatuses can be selected with a control device that selects the most probable speech recognition result by using speech recognition scores received with speech recognition results from two or more speech recognition apparatuses; sends the selected speech recognition result to two or more translation apparatuses respectively; selects the most probable translation result by using translation scores received with translation results from the two or more translation apparatuses; sends the selected translation result to two or more speech synthesis apparatuses respectively; receives a speech synthesis processing result including a speech synthesis result and a speech synthesis score from each of the two or more speech synthesis apparatuses; selects the most probable speech synthesis result by using the scores; and sends the selected speech synthesis result to a second terminal apparatus.

Type: Application

Filed: March 3, 2010

Publication date: August 30, 2012

Inventors: Satoshi Nakamura, Eiichiro Sumita, Yutaka Ashikari, Noriyuki Kimura, Chiori Hori
SPEECH TRANSLATION SYSTEM, FIRST TERMINAL APPARATUS, SPEECH RECOGNITION SERVER, TRANSLATION SERVER, AND SPEECH SYNTHESIS SERVER

Publication number: 20120197629

Abstract: In conventional network-type speech translation systems, devices or models for recognizing or synthesizing speech cannot be changed in accordance with speakers' attributes, and therefore, accuracy is reduced or inappropriate output occurs in each process of speech recognition, translation, and speech synthesis. Accuracy of each processing of speech translation, translation, or speech synthesis is improved and appropriate output is performed in a network-type speech translation system by, based on speaker attributes, appropriately changing the server to perform speech recognition or the speech recognition model, appropriately changing the translation server to perform translation or the translation model, or appropriately changing the speech synthesis server or speech synthesis model.

Type: Application

Filed: March 3, 2010

Publication date: August 2, 2012

Inventors: Satoshi Nakamura, Eiichiro Sumita, Yutaka Ashikari, Noriyuki Kimura, Chiori Hori
SPEECH TRANSLATION SYSTEM, DICTIONARY SERVER, AND PROGRAM

Publication number: 20120166176

Abstract: A conventional speech recognition dictionary, translation dictionary and speech synthesis dictionary used in speech translation have inconsistencies.

Type: Application

Filed: March 3, 2010

Publication date: June 28, 2012

Inventors: Satoshi Nakamura, Eiichiro Sumita, Yutaka Ashikari, Noriyuki Kimura, Chiori Hori
Machine translation apparatus and machine translation computer program

Patent number: 7925493

Abstract: A method of machine translation, using a bilingual corpus containing translation pairs each consisting of a sentence of a first language and a sentence of a second language, for translating an input sentence of the first language to the second language, including the steps of: receiving the input sentence of the first language and extracting, from the bilingual corpus, a sentence of the second language forming a pair with a sentence of the first language with highest similarity to the input sentence; applying an arbitrary modification among a plurality of predetermined modifications to the extracted sentence of the second language, and computing likelihood of sentences resulting from the modification; selecting a prescribed number of sentences having high likelihood from among the sentences resulting from the modification; repeating, on each of the sentences selected in the step of selecting, the steps of extracting, computing and selecting, until the likelihood no longer improves; and outputting, as a transl

Type: Grant

Filed: August 13, 2004

Date of Patent: April 12, 2011

Assignee: Advanced Telecommunications Research Institute International

Inventors: Taro Watanabe, Eiichiro Sumita
System that translates by improving a plurality of candidate translations and selecting best translation

Publication number: 20050055217

Abstract: A machine translation system includes: a distributing module for distributing an input sentence to a plurality of machine translation apparatuses for generating a translation of a second language of the input sentence of a first language, and receiving the translation of the second language from each of the plurality of translation apparatuses; a translation improving module, using each of the translations of the second language received by the distributing module as a starting point, improving the translation such that an evaluation in accordance with a prescribed evaluation method is improved; and a translation selecting module for selecting, as a translation of the input sentence, a translation satisfying a prescribed condition, among the translations improved by the translation improving module.

Type: Application

Filed: August 13, 2004

Publication date: March 10, 2005

Inventors: Eiichiro Sumita, Taro Watanabe
Machine translation apparatus and machine translation computer program

Publication number: 20050049851

Abstract: A method of machine translation, using a bilingual corpus containing translation pairs each consisting of a sentence of a first language and a sentence of a second language, for translating an input sentence of the first language to the second language, including the steps of: receiving the input sentence of the first language and extracting, from the bilingual corpus, a sentence of the second language forming a pair with a sentence of the first language with highest similarity to the input sentence; applying an arbitrary modification among a plurality of predetermined modifications to the extracted sentence of the second language, and computing likelihood of sentences resulting from the modification; selecting a prescribed number of sentences having high likelihood from among the sentences resulting from the modification; repeating, on each of the sentences selected in the step of selecting, the steps of extracting, computing and selecting, until the likelihood no longer improves; and outputting, as a transl

Type: Application

Filed: August 13, 2004

Publication date: March 3, 2005

Inventors: Taro Watanabe, Eiichiro Sumita
Method and apparatus for improving translation knowledge of machine translation

Publication number: 20040255281

Abstract: A method of improving translation knowledge includes the steps of preparing a set of translation knowledge, preparing a bilingual corpus of a source language and a target language, machine-translating sentences of the source language in the bilingual corpus to the target language using a set of translation knowledge, evaluating translation quality of the resulting translations in accordance with a prescribed evaluation standard, calculating degree of contribution to translation quality of a part of the translation knowledge, and removing the corresponding part of the translation knowledge when the calculated degree of contribution of the part is negative.

Type: Application

Filed: May 7, 2004

Publication date: December 16, 2004

Applicant: ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL

Inventors: Kenji Imamura, Eiichiro Sumita

1 2 next