Patents by Inventor Tengfei Ma

Tengfei Ma has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10984196
    Abstract: Method and apparatus for generating natural language observations using sequence-to-sequence models are provided. The method includes receiving a first electronic document and dynamically generating, without requiring user intervention, a first natural language observation corresponding to a first portion of the first electronic document by processing the first portion of the first electronic document using a first sequence-to-sequence model. A second natural language observation is received for the first portion of the first electronic document, and the generated first natural language observation and the authoritative natural language observation are provided. The method further includes receiving an evaluation of the first natural language observation and the second natural language observation, and refining the first sequence-to-sequence model based on the evaluation of the first natural language observation and second natural language observation.
    Type: Grant
    Filed: January 11, 2018
    Date of Patent: April 20, 2021
    Assignee: International Business Machines Corporation
    Inventors: Patrick D. Watson, Maria D. Chang, Jae-Wook Ahn, Sharad Sundararajan, Tengfei Ma
  • Publication number: 20210098074
    Abstract: A method, computer system, and a computer program product for designing one or more folded structural proteins from at least one raw amino acid sequence is provided. The present invention may include computing one or more character embeddings based on the at least one raw amino acid sequence by utilizing a multi-scale neighborhood-based neural network (MNNN) model. The present invention may then include refining the computed one or more character embeddings with at least one set of sequence neighborhood information. The present invention may further include predicting one or more dihedral angles based on the refined one or more character embeddings.
    Type: Application
    Filed: September 27, 2019
    Publication date: April 1, 2021
    Inventors: Lingfei Wu, Siyu Huo, Tengfei Ma, Pin-Yu Chen, Zhao Qin, Eugene Jungsup Lim, Francisco Javier Martin-Martinez, Hui Sun, Benedetto Marelli, Markus Jochen Buehler
  • Publication number: 20200082272
    Abstract: Mechanisms are provided for executing a trained deep learning (DL) model. The mechanisms receive, from a trained autoencoder executing on a client computing device, one or more intermediate representation (IR) data structures corresponding to training input data input to the trained autoencoder. The mechanisms train the DL model to generate a correct output based on the IR data structures from the trained autoencoder, to thereby generate a trained DL model. The mechanisms receive, from the trained autoencoder executing on the client computing device, a new IR data structure corresponding to new input data input to the trained autoencoder. The mechanisms input the new IR data structure to the trained DL model executing on the deep learning service computing system, to generate output results for the new IR data structure. The mechanisms generate an output response based on the output results, which is transmitted to the client computing device.
    Type: Application
    Filed: September 11, 2018
    Publication date: March 12, 2020
    Inventors: Zhongshu Gu, Heqing Huang, Jialong Zhang, Cao Xiao, Tengfei Ma, Dimitrios Pendarakis, Ian M. Molloy
  • Publication number: 20200028843
    Abstract: A method for authenticating a user is presented. Responsive to a request for access to a computer resource, a computer system prompts the user making the request to access the computer resource to perform a new motion in an environment in which the user is monitored by a sensor system. Detected biometric data in the new motion performed by the user is identified by the computer system. A determination is made as to whether the user performing the new motion is an authenticated user based on comparing the detected biometric data with stored biometric data for a prior motion performed by the authenticated user. The computer system provides access to the computer resource when the user is identified as the authenticated user.
    Type: Application
    Filed: July 17, 2018
    Publication date: January 23, 2020
    Inventors: Patrick Watson, Tengfei Ma, Maria Chang, Jae-Wook Ahn, Ravi Tejwani, Aldis Sipolins
  • Publication number: 20200020243
    Abstract: Systems, methods, and computer-readable media are described for determining a score for a target student answer using unlabeled data. The target answer is provided by a student to a question for which there is no ground-truth answer data. A set of student answers serves as a set of pseudo-reference answers and a classifier is used to score each answer based on each other answer. In this manner, each student answer serves as a pseudo-reference answer for each other student answer. A clustering approach can also be employed to cluster a set of student answers into clusters. The centroids of the clusters can then serve as the set of pseudo-reference answers. Clustering improves the robustness and efficiency of the score determined for a target student answer.
    Type: Application
    Filed: July 10, 2018
    Publication date: January 16, 2020
    Inventors: Tengfei MA, Patrick WATSON, Jae-Wook AHN, Maria CHANG, Aldis SIPOLINS
  • Publication number: 20190340541
    Abstract: Techniques that facilitate layered stochastics anonymization of data are provided. In one example, a system includes a machine learning component and an evaluation component. The machine learning component performs a machine learning process for first data associated with one or more features to generate second data indicative of one or more example datasets within a degree of similarity to the first data. The first data and the second data comprise a corresponding data format. The evaluation component evaluates the second data for a particular feature from the one or more features and generates third data indicative of a confidence score for the second data.
    Type: Application
    Filed: May 3, 2018
    Publication date: November 7, 2019
    Inventors: Patrick Watson, Maria Chang, Tengfei Ma, Aldis Sipolins
  • Publication number: 20190213256
    Abstract: Method and apparatus for generating natural language observations using sequence-to-sequence models are provided. The method includes receiving a first electronic document and dynamically generating, without requiring user intervention, a first natural language observation corresponding to a first portion of the first electronic document by processing the first portion of the first electronic document using a first sequence-to-sequence model. A second natural language observation is received for the first portion of the first electronic document, and the generated first natural language observation and the authoritative natural language observation are provided. The method further includes receiving an evaluation of the first natural language observation and the second natural language observation, and refining the first sequence-to-sequence model based on the evaluation of the first natural language observation and second natural language observation.
    Type: Application
    Filed: January 11, 2018
    Publication date: July 11, 2019
    Inventors: Patrick D. WATSON, Maria D. CHANG, Jae-Wook AHN, Sharad SUNDARARAJAN, Tengfei MA
  • Patent number: 10204101
    Abstract: A computer-implemented method executed on a processor for lexicon extraction from non-parallel data is provided. The computer-implemented method includes representing each word of a plurality of words by a vector of documents in which the word appears, modeling each word as a topic distribution by using the vector of documents, receiving a first word in a source language, and finding a second word in a target language as a translation of the first word based on similarity of topic distributions of the first word and the second word.
    Type: Grant
    Filed: November 8, 2017
    Date of Patent: February 12, 2019
    Assignee: International Business Machines Corporation
    Inventor: Tengfei Ma
  • Patent number: 10204100
    Abstract: A computer-implemented method executed on a processor for lexicon extraction from non-parallel data is provided. The computer-implemented method includes representing each word of a plurality of words by a vector of documents in which the word appears, modeling each word as a topic distribution by using the vector of documents, receiving a first word in a source language, and finding a second word in a target language as a translation of the first word based on similarity of topic distributions of the first word and the second word.
    Type: Grant
    Filed: March 28, 2017
    Date of Patent: February 12, 2019
    Assignee: International Business Machines Corporation
    Inventor: Tengfei Ma
  • Publication number: 20180285350
    Abstract: A computer-implemented method executed on a processor for lexicon extraction from non-parallel data is provided. The computer-implemented method includes representing each word of a plurality of words by a vector of documents in which the word appears, modeling each word as a topic distribution by using the vector of documents, receiving a first word in a source language, and finding a second word in a target language as a translation of the first word based on similarity of topic distributions of the first word and the second word.
    Type: Application
    Filed: March 28, 2017
    Publication date: October 4, 2018
    Inventor: Tengfei Ma
  • Publication number: 20180285352
    Abstract: A computer-implemented method executed on a processor for lexicon extraction from non-parallel data is provided. The computer-implemented method includes representing each word of a plurality of words by a vector of documents in which the word appears, modeling each word as a topic distribution by using the vector of documents, receiving a first word in a source language, and finding a second word in a target language as a translation of the first word based on similarity of topic distributions of the first word and the second word.
    Type: Application
    Filed: November 8, 2017
    Publication date: October 4, 2018
    Inventor: Tengfei Ma