Patents by Inventor Tengfei Ma

Tengfei Ma has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Distributed system for evaluation and feedback of digital text-based content

Patent number: 10984196

Abstract: Method and apparatus for generating natural language observations using sequence-to-sequence models are provided. The method includes receiving a first electronic document and dynamically generating, without requiring user intervention, a first natural language observation corresponding to a first portion of the first electronic document by processing the first portion of the first electronic document using a first sequence-to-sequence model. A second natural language observation is received for the first portion of the first electronic document, and the generated first natural language observation and the authoritative natural language observation are provided. The method further includes receiving an evaluation of the first natural language observation and the second natural language observation, and refining the first sequence-to-sequence model based on the evaluation of the first natural language observation and second natural language observation.

Type: Grant

Filed: January 11, 2018

Date of Patent: April 20, 2021

Assignee: International Business Machines Corporation

Inventors: Patrick D. Watson, Maria D. Chang, Jae-Wook Ahn, Sharad Sundararajan, Tengfei Ma
DESIGNING AND FOLDING STRUCTURAL PROTEINS FROM THE PRIMARY AMINO ACID SEQUENCE

Publication number: 20210098074

Abstract: A method, computer system, and a computer program product for designing one or more folded structural proteins from at least one raw amino acid sequence is provided. The present invention may include computing one or more character embeddings based on the at least one raw amino acid sequence by utilizing a multi-scale neighborhood-based neural network (MNNN) model. The present invention may then include refining the computed one or more character embeddings with at least one set of sequence neighborhood information. The present invention may further include predicting one or more dihedral angles based on the refined one or more character embeddings.

Type: Application

Filed: September 27, 2019

Publication date: April 1, 2021

Inventors: Lingfei Wu, Siyu Huo, Tengfei Ma, Pin-Yu Chen, Zhao Qin, Eugene Jungsup Lim, Francisco Javier Martin-Martinez, Hui Sun, Benedetto Marelli, Markus Jochen Buehler
Enhancing Data Privacy in Remote Deep Learning Services

Publication number: 20200082272

Abstract: Mechanisms are provided for executing a trained deep learning (DL) model. The mechanisms receive, from a trained autoencoder executing on a client computing device, one or more intermediate representation (IR) data structures corresponding to training input data input to the trained autoencoder. The mechanisms train the DL model to generate a correct output based on the IR data structures from the trained autoencoder, to thereby generate a trained DL model. The mechanisms receive, from the trained autoencoder executing on the client computing device, a new IR data structure corresponding to new input data input to the trained autoencoder. The mechanisms input the new IR data structure to the trained DL model executing on the deep learning service computing system, to generate output results for the new IR data structure. The mechanisms generate an output response based on the output results, which is transmitted to the client computing device.

Type: Application

Filed: September 11, 2018

Publication date: March 12, 2020

Inventors: Zhongshu Gu, Heqing Huang, Jialong Zhang, Cao Xiao, Tengfei Ma, Dimitrios Pendarakis, Ian M. Molloy
Motion Based Authentication

Publication number: 20200028843

Abstract: A method for authenticating a user is presented. Responsive to a request for access to a computer resource, a computer system prompts the user making the request to access the computer resource to perform a new motion in an environment in which the user is monitored by a sensor system. Detected biometric data in the new motion performed by the user is identified by the computer system. A determination is made as to whether the user performing the new motion is an authenticated user based on comparing the detected biometric data with stored biometric data for a prior motion performed by the authenticated user. The computer system provides access to the computer resource when the user is identified as the authenticated user.

Type: Application

Filed: July 17, 2018

Publication date: January 23, 2020

Inventors: Patrick Watson, Tengfei Ma, Maria Chang, Jae-Wook Ahn, Ravi Tejwani, Aldis Sipolins
NO-GROUND TRUTH SHORT ANSWER SCORING

Publication number: 20200020243

Abstract: Systems, methods, and computer-readable media are described for determining a score for a target student answer using unlabeled data. The target answer is provided by a student to a question for which there is no ground-truth answer data. A set of student answers serves as a set of pseudo-reference answers and a classifier is used to score each answer based on each other answer. In this manner, each student answer serves as a pseudo-reference answer for each other student answer. A clustering approach can also be employed to cluster a set of student answers into clusters. The centroids of the clusters can then serve as the set of pseudo-reference answers. Clustering improves the robustness and efficiency of the score determined for a target student answer.

Type: Application

Filed: July 10, 2018

Publication date: January 16, 2020

Inventors: Tengfei MA, Patrick WATSON, Jae-Wook AHN, Maria CHANG, Aldis SIPOLINS
LAYERED STOCHASTIC ANONYMIZATION OF DATA

Publication number: 20190340541

Abstract: Techniques that facilitate layered stochastics anonymization of data are provided. In one example, a system includes a machine learning component and an evaluation component. The machine learning component performs a machine learning process for first data associated with one or more features to generate second data indicative of one or more example datasets within a degree of similarity to the first data. The first data and the second data comprise a corresponding data format. The evaluation component evaluates the second data for a particular feature from the one or more features and generates third data indicative of a confidence score for the second data.

Type: Application

Filed: May 3, 2018

Publication date: November 7, 2019

Inventors: Patrick Watson, Maria Chang, Tengfei Ma, Aldis Sipolins
DISTRIBUTED SYSTEM FOR EVALUATION AND FEEDBACK OF DIGITAL TEXT-BASED CONTENT

Publication number: 20190213256

Abstract: Method and apparatus for generating natural language observations using sequence-to-sequence models are provided. The method includes receiving a first electronic document and dynamically generating, without requiring user intervention, a first natural language observation corresponding to a first portion of the first electronic document by processing the first portion of the first electronic document using a first sequence-to-sequence model. A second natural language observation is received for the first portion of the first electronic document, and the generated first natural language observation and the authoritative natural language observation are provided. The method further includes receiving an evaluation of the first natural language observation and the second natural language observation, and refining the first sequence-to-sequence model based on the evaluation of the first natural language observation and second natural language observation.

Type: Application

Filed: January 11, 2018

Publication date: July 11, 2019

Inventors: Patrick D. WATSON, Maria D. CHANG, Jae-Wook AHN, Sharad SUNDARARAJAN, Tengfei MA
Lexicon extraction from non-parallel data

Patent number: 10204101

Abstract: A computer-implemented method executed on a processor for lexicon extraction from non-parallel data is provided. The computer-implemented method includes representing each word of a plurality of words by a vector of documents in which the word appears, modeling each word as a topic distribution by using the vector of documents, receiving a first word in a source language, and finding a second word in a target language as a translation of the first word based on similarity of topic distributions of the first word and the second word.

Type: Grant

Filed: November 8, 2017

Date of Patent: February 12, 2019

Assignee: International Business Machines Corporation

Inventor: Tengfei Ma
Lexicon extraction from non-parallel data

Patent number: 10204100

Abstract: A computer-implemented method executed on a processor for lexicon extraction from non-parallel data is provided. The computer-implemented method includes representing each word of a plurality of words by a vector of documents in which the word appears, modeling each word as a topic distribution by using the vector of documents, receiving a first word in a source language, and finding a second word in a target language as a translation of the first word based on similarity of topic distributions of the first word and the second word.

Type: Grant

Filed: March 28, 2017

Date of Patent: February 12, 2019

Assignee: International Business Machines Corporation

Inventor: Tengfei Ma
LEXICON EXTRACTION FROM NON-PARALLEL DATA

Publication number: 20180285350

Abstract: A computer-implemented method executed on a processor for lexicon extraction from non-parallel data is provided. The computer-implemented method includes representing each word of a plurality of words by a vector of documents in which the word appears, modeling each word as a topic distribution by using the vector of documents, receiving a first word in a source language, and finding a second word in a target language as a translation of the first word based on similarity of topic distributions of the first word and the second word.

Type: Application

Filed: March 28, 2017

Publication date: October 4, 2018

Inventor: Tengfei Ma
LEXICON EXTRACTION FROM NON-PARALLEL DATA

Publication number: 20180285352

Abstract: A computer-implemented method executed on a processor for lexicon extraction from non-parallel data is provided. The computer-implemented method includes representing each word of a plurality of words by a vector of documents in which the word appears, modeling each word as a topic distribution by using the vector of documents, receiving a first word in a source language, and finding a second word in a target language as a translation of the first word based on similarity of topic distributions of the first word and the second word.

Type: Application

Filed: November 8, 2017

Publication date: October 4, 2018

Inventor: Tengfei Ma

prev 1 2