Patents by Inventor An Le NGUYEN

An Le NGUYEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

COMPUTER-READABLE RECORDING MEDIUM STORING LANGUAGE PROCESSING PROGRAM, LANGUAGE PROCESSING APPARATUS, AND LANGUAGE PROCESSING METHOD

Publication number: 20250094719

Abstract: A non-transitory computer-readable recording medium stores a language processing program for causing a computer to execute a process including: extracting, from a second text written in a second language, a second named entity corresponding to a first named entity contained in a first text written in a first language; associating the first text with the second text based on a similarity between the first named entity and the second named entity and an alignment probability between the first named entity and the second named entity; and outputting association information indicating a result of associating the first text with the second text.

Type: Application

Filed: August 1, 2024

Publication date: March 20, 2025

Applicant: Fujitsu Limited

Inventor: An Le NGUYEN
Machine learning method and information processing apparatus

Patent number: 12147778

Abstract: A non-transitory computer-readable recording medium stores a program for causing a computer to execute a process, the process includes acquiring training data that includes a first sentence expressed in a first language and a second sentence expressed in a second language, identifying a named entity and parts of speech from the first sentence, and generating, based on the training data, a translation model that includes an attention mechanism for the named entity and the parts of speech.

Type: Grant

Filed: January 31, 2022

Date of Patent: November 19, 2024

Assignee: FUJITSU LIMITED

Inventor: An Le Nguyen
AUTOMATIC CONSTRUCTION METHOD FOR PARALLEL CORPORA AND INFORMATION PROCESSING APPARATUS

Publication number: 20240220740

Abstract: An information processing apparatus acquires a first parallel corpus in which a first sentence, which includes a first named entity in a first language, and a second sentence, which includes a second named entity in a second language corresponding to the first named entity, are associated, extracts a third named entity whose degree of similarity with the first named entity exceeds a threshold from first dictionary data including a plurality of named entities in the first language, specifies a fourth named entity corresponding to the third named entity using second dictionary data indicating correspondence between named entities in the first language and named entities in the second language, and generates a second parallel corpus by replacing the first named entity included in the first sentence with the third named entity and replacing the second named entity included in the second sentence with the fourth named entity.

Type: Application

Filed: November 15, 2023

Publication date: July 4, 2024

Applicant: Fujitsu Limited

Inventor: An Le NGUYEN
MACHINE LEARNING METHOD AND NAMED ENTITY RECOGNITION APPARATUS

Publication number: 20230044266

Abstract: A computer divides a character string included in text data into a plurality of tokens. The computer searches, by performing matching processing between a token string indicating a specific number of consecutive tokens among the plurality of tokens and dictionary information including a plurality of named entities, the plurality of named entities for a similar named entity whose similarity to the token string is equal to or more than a threshold. The computer converts matching information indicating a result of the matching processing between the token string and the similar named entity into first vector data. The computer generates input data by using a plurality of pieces of vector data converted from the plurality of tokens and the first vector data. The computer generates a named entity recognition model that detects a named entity by performing machine learning using the input data.

Type: Application

Filed: October 17, 2022

Publication date: February 9, 2023

Applicant: FUJITSU LIMITED

Inventors: AN LE NGUYEN, Hajime Morita
MACHINE LEARNING METHOD AND INFORMATION PROCESSING APPARATUS

Publication number: 20220292267

Abstract: A non-transitory computer-readable recording medium stores a program for causing a computer to execute a process, the process includes acquiring training data that includes a first sentence expressed in a first language and a second sentence expressed in a second language, identifying a named entity and parts of speech from the first sentence, and generating, based on the training data, a translation model that includes an attention mechanism for the named entity and the parts of speech.

Type: Application

Filed: January 31, 2022

Publication date: September 15, 2022

Applicant: FUJITSU LIMITED

Inventor: An Le NGUYEN

COMPUTER-READABLE RECORDING MEDIUM STORING LANGUAGE PROCESSING PROGRAM, LANGUAGE PROCESSING APPARATUS, AND LANGUAGE PROCESSING METHOD

Machine learning method and information processing apparatus

AUTOMATIC CONSTRUCTION METHOD FOR PARALLEL CORPORA AND INFORMATION PROCESSING APPARATUS

MACHINE LEARNING METHOD AND NAMED ENTITY RECOGNITION APPARATUS

MACHINE LEARNING METHOD AND INFORMATION PROCESSING APPARATUS