Patents by Inventor Lingfei Wu

Lingfei Wu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Graph-based event schema induction for information retrieval

Patent number: 11615152

Abstract: Systems, devices, computer-implemented methods, and/or computer program products that facilitate event schema induction from unstructured or semi-structured data. In one example, a system can comprise a processor that executes computer executable components stored in memory. The computer executable components can comprise a schema component and a retrieval component. The schema component can derive an event schema for a document corpus using parsing results obtained from the document corpus. The retrieval component can populate a response to a query with a document of the document corpus using events extracted from the query and the document using the event schema.

Type: Grant

Filed: April 6, 2021

Date of Patent: March 28, 2023

Assignees: INTERNATIONAL BUSINESS MACHINES CORPORATION, THE BOARD OF TRUSTEES OF THE UNIVERSITY OF ILLINOIS

Inventors: Rajarshi Haldar, Yu Deng, Lingfei Wu, Ruchi Mahindru, Julia Constanze Hockenmaier, Sinem Guven Kaya
Conversation history within conversational machine reading comprehension

Patent number: 11593672

Abstract: Aspects described herein include a method of conversational machine reading comprehension, as well as an associated system and computer program product. The method comprises receiving a plurality of questions relating to a context, and generating a sequence of context graphs. Each of the context graphs includes encoded representations of: (i) the context, (ii) a respective question of the plurality of questions, and (iii) a respective conversation history reflecting: (a) one or more previous questions relative to the respective question, and (b) one or more previous answers to the one or more previous questions. The method further comprises identifying, using at least one graph neural network, one or more temporal dependencies between adjacent context graphs of the sequence. The method further comprises predicting, based at least on the one or more temporal dependencies, an answer for a first question of the plurality of questions.

Type: Grant

Filed: August 22, 2019

Date of Patent: February 28, 2023

Assignee: International Business Machines Corporation

Inventors: Lingfei Wu, Mohammed J Zaki, Yu Chen
NATURAL QUESTION GENERATION VIA REINFORCEMENT LEARNING BASED GRAPH-TO-SEQUENCE MODEL

Publication number: 20230055666

Abstract: For a passage text and a corresponding answer text, perform a word-level soft alignment to obtain contextualized passage embeddings and contextualized answer embeddings, and a hidden level soft alignment on the contextualized passage embeddings and the contextualized answer embeddings to obtain a passage embedding matrix. Construct a passage graph of the passage text based on the passage embedding matrix, and apply a bidirectional gated graph neural network to the passage graph until a final state embedding is determined, during which intermediate node embeddings are fused from both incoming and outgoing edges. Obtain a graph-level embedding from the final state embedding, and decode the final state embedding to generate an output sequence word-by-word. Train a machine learning model to generate at least one question corresponding to the passage text and the answer text, by evaluating the output sequence with a hybrid evaluator combining cross-entropy evaluation and reinforcement learning evaluation.

Type: Application

Filed: October 23, 2022

Publication date: February 23, 2023

Inventors: Lingfei Wu, Yu Chen, Mohammed J. Zaki
Scalable attributed graph embedding for large-scale graph analytics

Patent number: 11580322

Abstract: A computer-implemented method for calculating Scalable Attributed Graph Embedding for Large-Scale Graph Analytics that includes computing a node embedding for a first node-attributed graph in a node embedded space. One or more random attributed graphs is generated in the node embedded space. A graph embedding operation is performed using a dissimilarity measure between one or more raw graphs and the one or more generated random graphs, and an edge-attributed graph into a second node-attributed graph using an adjoint graph.

Type: Grant

Filed: May 15, 2020

Date of Patent: February 14, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor: Lingfei Wu
NON-FACTOID QUESTION ANSWERING ACROSS TASKS AND DOMAINS

Publication number: 20230012063

Abstract: An approach for a non-factoid question answering framework across tasks and domains may be provided. The approach may include training a multi-task joint learning model in a general domain. The approach may also include initializing the multi-task joint learning model in a specific target domain. The approach may include tuning the joint learning model in the target domain. The approach may include determining which task of the multiple tasks is more difficult for the multi-task joint learning model to learn. The approach may also include dynamically adjusting the weights of the multi-task joint learning model, allowing the model to concentrate on learning the more difficult learning task.

Type: Application

Filed: July 7, 2021

Publication date: January 12, 2023

Inventors: Wenhao Yu, LINGFEI WU, Yu Deng, Qingkai Zeng, Ruchi Mahindru, Sinem Guven Kaya, Meng Jiang
Natural question generation via reinforcement learning based graph-to-sequence model

Patent number: 11481418

Abstract: For a passage text and a corresponding answer text, perform a word-level soft alignment to obtain contextualized passage embeddings and contextualized answer embeddings, and a hidden level soft alignment on the contextualized passage embeddings and the contextualized answer embeddings to obtain a passage embedding matrix. Construct a passage graph of the passage text based on the passage embedding matrix, and apply a bidirectional gated graph neural network to the passage graph until a final state embedding is determined, during which intermediate node embeddings are fused from both incoming and outgoing edges. Obtain a graph-level embedding from the final state embedding, and decode the final state embedding to generate an output sequence word-by-word. Train a machine learning model to generate at least one question corresponding to the passage text and the answer text, by evaluating the output sequence with a hybrid evaluator combining cross-entropy evaluation and reinforcement learning evaluation.

Type: Grant

Filed: April 9, 2020

Date of Patent: October 25, 2022

Assignees: International Business Machines Corporation, RENSSELAER POLYTECHNIC INSTITUTE

Inventors: Lingfei Wu, Yu Chen, Mohammed J. Zaki
KNOWLEDGE GRAPH COMPRESSION

Publication number: 20220335270

Abstract: Aspects of the present disclosure relate to knowledge graph compression. An input knowledge graph (KG) can be received. The input KG can be encoded to receive a first set of node embeddings. The input KG can be compressed into an output KG. The output KG can be encoded to receive a second set of node embeddings. A model for KG compression can be trained using optimal transport based on a distance matrix between the first set of node embeddings and the second set of node embeddings.

Type: Application

Filed: April 15, 2021

Publication date: October 20, 2022

Inventors: Tengfei Ma, Manling Li, Mo Yu, Tian GAO, LINGFEI WU
GRAPH-BASED EVENT SCHEMA INDUCTION FOR INFORMATION RETRIEVAL

Publication number: 20220318316

Abstract: Systems, devices, computer-implemented methods, and/or computer program products that facilitate event schema induction from unstructured or semi-structured data. In one example, a system can comprise a processor that executes computer executable components stored in memory. The computer executable components can comprise a schema component and a retrieval component. The schema component can derive an event schema for a document corpus using parsing results obtained from the document corpus. The retrieval component can populate a response to a query with a document of the document corpus using events extracted from the query and the document using the event schema.

Type: Application

Filed: April 6, 2021

Publication date: October 6, 2022

Inventors: Rajarshi Haldar, Yu Deng, Lingfei Wu, Ruchi Mahindru, Julia Constanze Hockenmaier, Sinem Guven Kaya
VISUAL QUESTION GENERATION WITH ANSWER-AWARENESS AND REGION-REFERENCE

Publication number: 20220245838

Abstract: A computer-implemented method for visual question generation includes training an alignment module to analyze an image, an answer hint, and a visual hint with respect to the image. A k-nearest neighbors (KNN) graph is constructed by performing an aligned embedding for each region of the image. A node embedding component is generated by using a graph embedding component of the KNN graph. A visual question is generated by sequence decoding each image and graph of the image.

Type: Application

Filed: January 29, 2021

Publication date: August 4, 2022

Inventors: Lingfei Wu, Lei Yu, Chen Wang, Dakuo Wang
AUTOMATED GRAPH BASED INFORMATION EXTRACTION

Publication number: 20220245337

Abstract: A set of sentences within a natural language text document are parsed, generating a word-level graph corresponding to a sentence in the set of sentences. Within the word-level graph using a trained entity identification model, a set of entity candidates are identified. From a set of graphs modelling relationships between portions of the set of sentences, a set of embeddings is generated. From a set of pairs of embeddings in the set of embeddings using a set of deconvolution layers, a set of links between entity candidates within the set of entity candidates is extracted. From the set of links and the set of entity candidates, an output graph modelling linkages between portions of the set of sentences within the natural language text document is generated.

Type: Application

Filed: February 2, 2021

Publication date: August 4, 2022

Applicant: International Business Machines Corporation

Inventors: LINGFEI WU, Tengfei Ma, Tian GAO, Xiaojie Guo
ADAPTIVE SELF-ADVERSARIAL NEGATIVE SAMPLING FOR GRAPH NEURAL NETWORK TRAINING

Publication number: 20220245460

Abstract: A graph neural network (GNN) training method, system, and computer program product in a graph, include generating, by the computing device, one or more one or more hypothetical edges between two or more nodes of a plurality of nodes of a graph neural network, testing, by the computing device, to determine whether the one or more generated hypothetical edges should be connected by using negative sampling, and permanently connecting, by the computing device, the one or more tested hypothetical edges if the negative sampling indicates the connectivity.

Type: Application

Filed: January 29, 2021

Publication date: August 4, 2022

Inventors: Xiao Qin, Nasrullah Sheikh, Berthold Reinwald, Lingfei Wu
Time-series representation learning via random time warping

Patent number: 11366990

Abstract: Embodiments of the present invention provide a computer-implemented method for performing unsupervised time-series feature learning. The method generates a set of reference time-series of random lengths, in which each length is uniformly sampled from a predetermined minimum length to a predetermined maximum length, and in which values of each reference time-series in the set are drawn from a distribution. The method generates a feature matrix for raw time-series data based on a set of computed distances between the generated set of reference time-series and the raw time-series data. The method provides the feature matrix as an input to one or more machine learning models.

Type: Grant

Filed: May 15, 2017

Date of Patent: June 21, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Michael J. Witbrock, Lingfei Wu, Cao Xiao, Jinfeng Yi
Learning-based automation machine learning code annotation in computational notebooks

Patent number: 11360763

Abstract: One embodiment of the invention provides a method for automated code annotation in machine learning (ML) and data science. The method comprises receiving, as input, a section of executable code. The method further comprises classifying, via a ML model, the section of executable code with a stage classification label indicative of a stage within a workflow for automated ML that the executable code applies to. The method further comprises categorizing, based on the stage classification label, the section of executable code with a category of annotation that is most appropriate for the section of executable code. The method further comprises generating a suggested annotation for the section of executable code based on the category of annotation. The method further comprises providing, as output, the suggested annotation to a display of an electronic device for user review. The suggested annotation is user interactable via the electronic device.

Type: Grant

Filed: October 13, 2020

Date of Patent: June 14, 2022

Assignee: International Business Machines Corporation

Inventors: Dakuo Wang, Lingfei Wu, Yi Wang, Xuye Liu, Chuang Gan, Si Er Han, Bei Chen, Ji Hui Yang
Spatial-temporal graph-to-sequence learning based grounded video descriptions

Patent number: 11354904

Abstract: Techniques for generating a grounded video description for a video input are provided. Hierarchical Attention based Spatial-Temporal Graph-to-Sequence Learning framework for producing a GVD is provided by generating an initial graph representing a plurality of object features in a plurality of frames of a received video input and generating an implicit graph for the plurality of object features in the plurality of frames using a similarity function. The initial graph and the implicit graph are combined to form a refined graph and the refined graph is processed using attention processes, to generate an attended hierarchical graph of the plurality of object features for the plurality of frames. The grounded video description is generated for the received video input using at least the hierarchical graph of the plurality of features.

Type: Grant

Filed: July 10, 2020

Date of Patent: June 7, 2022

Assignee: International Business Machines Corporation

Inventors: Lingfei Wu, Liana Fong
Abstract Meaning Representation Parsing with Graph Translation

Publication number: 20220171923

Abstract: A computer-implemented method for generating an abstract meaning representation (“AMR”) of a sentence, comprising receiving, by a computing device, an input sentence and parsing the input sentence into one or more syntactic and/or semantic graphs. An input graph including a node set and an edge set is formed from the one or more syntactic and/or semantic graphs. Node representations are generated by natural language processing. The input graph is provided to a first neural network to provide an output graph having learned node representations aligned with the node representations in the input graph. The method further includes predicting via a second neural network, node label and predicting, via a third neural network, edge labels in the output graph. The AMR is generated based on the predicted node labels and predicted edge labels. A system and a non-transitory computer readable storage medium are also disclosed.

Type: Application

Filed: December 1, 2020

Publication date: June 2, 2022

Inventors: Lingfei Wu, Jinjun Xiong, Hongyu Gong, Suma Bhat, Wen-Mei Hwu
LEARNING-BASED AUTOMATED MACHINE LEARNING CODE ANNOTATION WITH GRAPH NEURAL NETWORK

Publication number: 20220138266

Abstract: Obtain, at a computing device, a segment of computer code. With a classification module of a machine learning system executing on the computing device, determine a required annotation category for the segment of computer code. With an annotation generation module of the machine learning system executing on the computing device, generate a natural language annotation of the segment of computer code based on the segment of computer code and the required annotation category. Provide the natural language annotation to a user interface for display adjacent the segment of computer code.

Type: Application

Filed: November 3, 2020

Publication date: May 5, 2022

Inventors: Dakuo Wang, Lingfei Wu, Xuye Liu, Yi Wang, Chuang Gan, Jing Xu, Xue Ying Zhang, Jun Wang
Rich descriptor framework to text generation using graphs and structural neural encoders

Patent number: 11321541

Abstract: Technology for using a bi-directed graph convolutional neural network (“BGCNN”) to convert RDF data into natural language text. Some embodiments perform RDF-to-Text generation by learning graph-augmented structural neural encoders, consisting of: (a) bidirected graph-based meta-paths encoder; (b) bidirected graph convolutional networks encoder, and (c) separated attention mechanism for combining encoders and decoder to translate RDF triplets to natural language description.

Type: Grant

Filed: July 2, 2020

Date of Patent: May 3, 2022

Assignee: International Business Machines Corporation

Inventors: Lingfei Wu, Chen Wang
Text style transfer using reinforcement learning

Patent number: 11314950

Abstract: A computer-implemented method is provided for transferring a target text style using Reinforcement Learning (RL). The method includes pre-determining, by a Long Short-Term Memory (LSTM) Neural Network (NN), the target text style of a target-style natural language sentence. The method further includes transforming, by a hardware processor using the LSTM NN, a source-style natural language sentence into the target-style natural language sentence that maintains the target text style of the target-style natural language sentence. The method also includes calculating an accuracy rating of a transformation of the source-style natural language sentence into the target-style natural language sentence based upon rewards relating to at least the target text style of the source-style natural language sentence.

Type: Grant

Filed: March 25, 2020

Date of Patent: April 26, 2022

Assignees: INTERNATIONAL BUSINESS MACHINES CORPORATION, THE BOARD OF TRUSTEES OF THE UNIVERSITY OF ILLINOIS

Inventors: Lingfei Wu, Jinjun Xiong, Hongyu Gong, Suma Bhat, Wen-Mei Hwu
LEARNING-BASED AUTOMATION MACHINE LEARNING CODE ANNOTATION IN COMPUTATIONAL NOTEBOOKS

Publication number: 20220113964

Abstract: One embodiment of the invention provides a method for automated code annotation in machine learning (ML) and data science. The method comprises receiving, as input, a section of executable code. The method further comprises classifying, via a ML model, the section of executable code with a stage classification label indicative of a stage within a workflow for automated ML that the executable code applies to. The method further comprises categorizing, based on the stage classification label, the section of executable code with a category of annotation that is most appropriate for the section of executable code. The method further comprises generating a suggested annotation for the section of executable code based on the category of annotation. The method further comprises providing, as output, the suggested annotation to a display of an electronic device for user review. The suggested annotation is user interactable via the electronic device.

Type: Application

Filed: October 13, 2020

Publication date: April 14, 2022

Inventors: Dakuo Wang, Lingfei Wu, Yi Wang, Xuye Liu, Chuang Gan, Si Er Han, Bei Chen, Ji Hui Yang
SEMANTIC CODE RETRIEVAL USING GRAPH MATCHING

Publication number: 20220107799

Abstract: Techniques regarding code retrieval tasks are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a code retrieval component that can execute a code retrieval machine learning task by computing an amount of similarity between neural network embeddings of graph representations of a query text and at least a portion of a computer program code.

Type: Application

Filed: October 2, 2020

Publication date: April 7, 2022

Inventors: Lingfei Wu, Liana Fong

prev 1 2 3 4 next