Patents by Inventor An Le NGUYEN

An Le NGUYEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250094719
    Abstract: A non-transitory computer-readable recording medium stores a language processing program for causing a computer to execute a process including: extracting, from a second text written in a second language, a second named entity corresponding to a first named entity contained in a first text written in a first language; associating the first text with the second text based on a similarity between the first named entity and the second named entity and an alignment probability between the first named entity and the second named entity; and outputting association information indicating a result of associating the first text with the second text.
    Type: Application
    Filed: August 1, 2024
    Publication date: March 20, 2025
    Applicant: Fujitsu Limited
    Inventor: An Le NGUYEN
  • Patent number: 12147778
    Abstract: A non-transitory computer-readable recording medium stores a program for causing a computer to execute a process, the process includes acquiring training data that includes a first sentence expressed in a first language and a second sentence expressed in a second language, identifying a named entity and parts of speech from the first sentence, and generating, based on the training data, a translation model that includes an attention mechanism for the named entity and the parts of speech.
    Type: Grant
    Filed: January 31, 2022
    Date of Patent: November 19, 2024
    Assignee: FUJITSU LIMITED
    Inventor: An Le Nguyen
  • Publication number: 20240220740
    Abstract: An information processing apparatus acquires a first parallel corpus in which a first sentence, which includes a first named entity in a first language, and a second sentence, which includes a second named entity in a second language corresponding to the first named entity, are associated, extracts a third named entity whose degree of similarity with the first named entity exceeds a threshold from first dictionary data including a plurality of named entities in the first language, specifies a fourth named entity corresponding to the third named entity using second dictionary data indicating correspondence between named entities in the first language and named entities in the second language, and generates a second parallel corpus by replacing the first named entity included in the first sentence with the third named entity and replacing the second named entity included in the second sentence with the fourth named entity.
    Type: Application
    Filed: November 15, 2023
    Publication date: July 4, 2024
    Applicant: Fujitsu Limited
    Inventor: An Le NGUYEN
  • Publication number: 20230044266
    Abstract: A computer divides a character string included in text data into a plurality of tokens. The computer searches, by performing matching processing between a token string indicating a specific number of consecutive tokens among the plurality of tokens and dictionary information including a plurality of named entities, the plurality of named entities for a similar named entity whose similarity to the token string is equal to or more than a threshold. The computer converts matching information indicating a result of the matching processing between the token string and the similar named entity into first vector data. The computer generates input data by using a plurality of pieces of vector data converted from the plurality of tokens and the first vector data. The computer generates a named entity recognition model that detects a named entity by performing machine learning using the input data.
    Type: Application
    Filed: October 17, 2022
    Publication date: February 9, 2023
    Applicant: FUJITSU LIMITED
    Inventors: AN LE NGUYEN, Hajime Morita
  • Publication number: 20220292267
    Abstract: A non-transitory computer-readable recording medium stores a program for causing a computer to execute a process, the process includes acquiring training data that includes a first sentence expressed in a first language and a second sentence expressed in a second language, identifying a named entity and parts of speech from the first sentence, and generating, based on the training data, a translation model that includes an attention mechanism for the named entity and the parts of speech.
    Type: Application
    Filed: January 31, 2022
    Publication date: September 15, 2022
    Applicant: FUJITSU LIMITED
    Inventor: An Le NGUYEN