Patents by Inventor Lingfei Wu

Lingfei Wu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220108188
    Abstract: Techniques regarding identifying candidate knowledge graph subgraphs in a question answering over knowledge graph task are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a question answering over knowledge graph component that encodes graph structure information of a knowledge graph subgraph and a question graph into neural network embeddings.
    Type: Application
    Filed: October 1, 2020
    Publication date: April 7, 2022
    Inventors: Lingfei Wu, Chen Wang
  • Publication number: 20220027707
    Abstract: A method, a computer program product, and a system for subgraph guided knowledge graph question generation. The method includes inputting a knowledge graph subgraph and a target answer into a long short-term memory encoder. The method also includes producing embeddings relating to the nodes and the edges. The method includes indicating the embeddings associated with the target answer. The method includes applying a graph neural network encoder computation in an iterative manner to the embeddings, with updated embeddings produced by the GNN encoder acting as initial values that are applied to the GNN encoder for a next iteration, until final state embeddings are produced. The method includes computing a graph-level embedding based on the final state embeddings and computing, by a recurrent neural network decoder, a question relating to the target answer and the knowledge graph subgraph using the graph-level embedding.
    Type: Application
    Filed: July 24, 2020
    Publication date: January 27, 2022
    Inventors: Lingfei Wu, Yu Chen, Mohammed J. Zaki
  • Patent number: 11227231
    Abstract: A method and system of analyzing a symbolic sequence is provided. Metadata of a symbolic sequence is received from a computing device of an owner. A set of R random sequences are generated based on the received metadata and sent to the computing device of the owner of the symbolic sequence for computation of a feature matrix based on the set of R random sequences and the symbolic sequence. The feature matrix is received from the computing device of the owner. Upon determining that an inner product of the feature matrix is below a threshold accuracy, the iterative process returns to generating R random sequences. Upon determining that the inner product of the feature matrix is at or above the threshold accuracy, the feature matrix is categorized based on machine learning. The categorized global feature matrix is sent to be displayed on a user interface of the computing device of the owner.
    Type: Grant
    Filed: May 4, 2018
    Date of Patent: January 18, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lingfei Wu, Kun Xu, Pin-Yu Chen, Chia-Yu Chen
  • Publication number: 20220012499
    Abstract: Techniques for generating a grounded video description for a video input are provided. Hierarchical Attention based Spatial-Temporal Graph-to-Sequence Learning framework for producing a GVD is provided by generating an initial graph representing a plurality of object features in a plurality of frames of a received video input and generating an implicit graph for the plurality of object features in the plurality of frames using a similarity function. The initial graph and the implicit graph are combined to form a refined graph and the refined graph is processed using attention processes, to generate an attended hierarchical graph of the plurality of object features for the plurality of frames. The grounded video description is generated for the received video input using at least the hierarchical graph of the plurality of features.
    Type: Application
    Filed: July 10, 2020
    Publication date: January 13, 2022
    Inventors: Lingfei WU, Liana FONG
  • Publication number: 20220004720
    Abstract: Technology for using a bi-directed graph convolutional neural network (“BGCNN”) to convert RDF data into natural language text. Some embodiments perform RDF-to-Text generation by learning graph-augmented structural neural encoders, consisting of: (a) bidirected graph-based meta-paths encoder; (b) bidirected graph convolutional networks encoder, and (c) separated attention mechanism for combining encoders and decoder to translate RDF triplets to natural language description.
    Type: Application
    Filed: July 2, 2020
    Publication date: January 6, 2022
    Inventors: Lingfei Wu, Chen Wang
  • Publication number: 20210374499
    Abstract: An initial noisy graph topology is obtained and an initial adjacency matrix is generated by a similarity learning component using similarity learning and a similarity metric function. An updated adjacency matrix with node embeddings is produced from the initial adjacency matrix using a graph neural network (GNN). The node embeddings are fed back to revise the similarity learning component. The generating, producing, and feeding back operations are repeated for a plurality of iterations.
    Type: Application
    Filed: May 26, 2020
    Publication date: December 2, 2021
    Inventors: Lingfei Wu, Yu Chen, Mohammed J. Zaki
  • Publication number: 20210365306
    Abstract: Computer-implemented techniques for unsupervised event extraction are provided. In one instance, a computer implemented method can include parsing, by a system operatively coupled to a processor, unstructured text comprising event information to identify candidate event components. The computer implemented method can further include employing, by the system, one or more unsupervised machine learning techniques to generate structured event information defining events represented in the unstructured text based on the candidate event components.
    Type: Application
    Filed: May 21, 2020
    Publication date: November 25, 2021
    Inventors: Rajarshi Haldar, Yu Deng, Lingfei Wu, Ruchi Mahindru, Shu Tao
  • Publication number: 20210357681
    Abstract: A computer-implemented method for calculating Scalable Attributed Graph Embedding for Large-Scale Graph Analytics that includes computing a node embedding for a first node-attributed graph in a node embedded space. One or more random attributed graphs is generated in the node embedded space. A graph embedding operation is performed using a dissimilarity measure between one or more raw graphs and the one or more generated random graphs, and an edge-attributed graph into a second node-attributed graph using an adjoint graph.
    Type: Application
    Filed: May 15, 2020
    Publication date: November 18, 2021
    Inventor: Lingfei Wu
  • Publication number: 20210357746
    Abstract: A computer-implemented method for calculating a similarity between a pair of graph-structured objects by learning-based techniques. The operations include computing the node embeddings of a pair of graph-structured objects of two computer graphs utilizing a hierarchical graph matching network (HGMN). A first component of the HGMN performs graph matching of global-level graph interactions of the two computer graphs. A second component of the HGMN performs graph matching of cross-level node-graph interactions of the two computer graphs. There is an aggregating of features learned from the graph matching of the global-level graph interactions and the cross-level node-graph interactions. At least one of a graph-graph classification or a graph-graph regression is performed utilizing the learned features of the two computer graphs.
    Type: Application
    Filed: May 15, 2020
    Publication date: November 18, 2021
    Inventors: Lingfei Wu, Tengfei Ma
  • Publication number: 20210349895
    Abstract: Methods and systems for log message aggregation include determining a first similarity distance score for a first incoming message by comparing the first incoming message to one or more stored templates. It is determined that the first incoming message imperfectly matches a matched template of the one or more stored templates, based on the first similarity distance score. A token in the imperfectly matched template is replaced with a wildcard, to reduce the first similarity distance score.
    Type: Application
    Filed: May 5, 2020
    Publication date: November 11, 2021
    Inventors: Chen Wang, Lingfei Wu
  • Patent number: 11157705
    Abstract: Aspects described herein include a method of semantic parsing, and related system and computer program product. The method comprises receiving an input comprising a plurality of words, generating a structured representation of the plurality of words, encoding the structured representation into a latent embedding space, and decoding the encoded structured representation from the latent embedding space into a logical representation of the plurality of words.
    Type: Grant
    Filed: July 22, 2019
    Date of Patent: October 26, 2021
    Assignee: International Business Machines Corporation
    Inventors: Lingfei Wu, Wei Zhang
  • Publication number: 20210303803
    Abstract: A computer-implemented method is provided for transferring a target text style using Reinforcement Learning (RL). The method includes pre-determining, by a Long Short-Term Memory (LSTM) Neural Network (NN), the target text style of a target-style natural language sentence. The method further includes transforming, by a hardware processor using the LSTM NN, a source-style natural language sentence into the target-style natural language sentence that maintains the target text style of the target-style natural language sentence. The method also includes calculating an accuracy rating of a transformation of the source-style natural language sentence into the target-style natural language sentence based upon rewards relating to at least the target text style of the source-style natural language sentence.
    Type: Application
    Filed: March 25, 2020
    Publication date: September 30, 2021
    Inventors: Lingfei Wu, Jinjun Xiong, Hongyu Gong, Suma Bhat, Wen-Mei Hwu
  • Patent number: 11132512
    Abstract: Embodiments of the invention describe a computer-implemented method that includes receiving a query that includes a query sequence having query characters grouped into query words. A segment of program code is retrieved from a database for evaluation. The program code includes a program code sequence including program code characters grouped into program code words. The query sequence, the query word, the program code sequence, and the program code word are each converted to sequence and word representations. Query sequence-level features, query word-level features, program code sequence-level features, and program code word-level features are extracted from the sequence and word representation. Similarity between the query and the segment of program code is determined by applying a similarity metric technique to the query sequence-level features, the query word-level features, the program code sequence-level features, and the program code word-level features.
    Type: Grant
    Filed: November 8, 2019
    Date of Patent: September 28, 2021
    Assignees: INTERNATIONAL BUSINESS MACHINES CORPORATION, The Board of Trustees of the University of Illinois
    Inventors: Lingfei Wu, Jinjun Xiong, Julia Constanze Hockenmaier, Rajarshi Haldar
  • Patent number: 11080228
    Abstract: A random binning featurization process method, system, and computer program product for a distributed random binning featurization process on one or more multicore systems with a hybrid two-level parallelism, the method including in a training phase, receiving a first data matrix dividing the random binning featurization process into two orthogonal levels, in a high-level generating a randomized number of high-dimension grids and evenly partitioning the grids into nodes in a parallel system, and in a low-level, evenly partitioning dimensions in each grid to construct look-up tables of index vectors and compute a local feature matrix for each node.
    Type: Grant
    Filed: March 13, 2017
    Date of Patent: August 3, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Liana Liyow Fong, Wei Tan, Michael Witbrock, Lingfei Wu
  • Publication number: 20210209139
    Abstract: For a passage text and a corresponding answer text, perform a word-level soft alignment to obtain contextualized passage embeddings and contextualized answer embeddings, and a hidden level soft alignment on the contextualized passage embeddings and the contextualized answer embeddings to obtain a passage embedding matrix. Construct a passage graph of the passage text based on the passage embedding matrix, and apply a bidirectional gated graph neural network to the passage graph until a final state embedding is determined, during which intermediate node embeddings are fused from both incoming and outgoing edges. Obtain a graph-level embedding from the final state embedding, and decode the final state embedding to generate an output sequence word-by-word. Train a machine learning model to generate at least one question corresponding to the passage text and the answer text, by evaluating the output sequence with a hybrid evaluator combining cross-entropy evaluation and reinforcement learning evaluation.
    Type: Application
    Filed: April 9, 2020
    Publication date: July 8, 2021
    Inventors: Lingfei Wu, Yu Chen, Mohammed J. Zaki
  • Publication number: 20210149959
    Abstract: Systems, computer-implemented methods, and computer program products that facilitate random graph embedding components are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise a random graph component that can generate a random graph based on node embeddings corresponding to a data graph. The computer executable components can further comprise a graph embedding component that can compute a graph feature matrix corresponding to the data graph based on a distance between the random graph and the data graph.
    Type: Application
    Filed: December 29, 2020
    Publication date: May 20, 2021
    Inventors: Lingfei Wu, Kun Xu, Wei Zhang
  • Publication number: 20210150373
    Abstract: Generate, from a logical formula, a directed acyclic graph having a plurality of nodes and a plurality of edges. Assign an initial embedding to each mode and edge, to one of a plurality of layers. Compute a plurality of initial node states by using feed-forward networks, and construct cross-dependent embeddings between conjecture node embeddings and premise node embeddings. Topologically sort the DAG with the initial embeddings and node states. Beginning from a lowest rank, compute layer-by-layer embedding updates for each of the plurality of layers until a root is reached. Assign the embedding update for the root node as a final embedding for the DAG. Provide the final embedding for the DAG as input to a machine learning system, and carry out the automatic theorem proving with same.
    Type: Application
    Filed: November 15, 2019
    Publication date: May 20, 2021
    Inventors: Maxwell Crouse, Ibrahim Abdelaziz, Cristina Cornelio, Veronika Thost, Lingfei Wu, Bassem Makni, Kavitha Srinivas, Achille Belly Fokoue-Nkoutche
  • Publication number: 20210141863
    Abstract: Embodiments of the invention describe a computer-implemented method that includes receiving a query that includes a query sequence having query characters grouped into query words. A segment of program code is retrieved from a database for evaluation. The program code includes a program code sequence including program code characters grouped into program code words. The query sequence, the query word, the program code sequence, and the program code word are each converted to sequence and word representations. Query sequence-level features, query word-level features, program code sequence-level features, and program code word-level features are extracted from the sequence and word representation. Similarity between the query and the segment of program code is determined by applying a similarity metric technique to the query sequence-level features, the query word-level features, the program code sequence-level features, and the program code word-level features.
    Type: Application
    Filed: November 8, 2019
    Publication date: May 13, 2021
    Inventors: Lingfei Wu, Jinjun Xiong, Julia Constanze Hockenmaier, Rajarshi Haldar
  • Patent number: 10997525
    Abstract: A method and system of creating a model for large scale data analytics is provided. Training data is received in a form of a data matrix X and partitioned into a plurality of partitions. A random matrix T is generated. A feature matrix is determined based on multiplying the partitioned training data by the random matrix T. A predicted data {tilde over (y)} is determined for each partition via a stochastic average gradient (SAG) of each partition. A number of SAG values is reduced based on a number of rows n in the data matrix X. For each iteration, a sum of the reduced SAG values is determined, as well as a full gradient based on the sum of the reduced SAG values from all rows n, by distributed parallel processing. The model parameters w are updated based on the full gradient for each partition.
    Type: Grant
    Filed: November 20, 2017
    Date of Patent: May 4, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Shen Li, Xiang Ni, Michael John Witbrock, Lingfei Wu
  • Publication number: 20210098074
    Abstract: A method, computer system, and a computer program product for designing one or more folded structural proteins from at least one raw amino acid sequence is provided. The present invention may include computing one or more character embeddings based on the at least one raw amino acid sequence by utilizing a multi-scale neighborhood-based neural network (MNNN) model. The present invention may then include refining the computed one or more character embeddings with at least one set of sequence neighborhood information. The present invention may further include predicting one or more dihedral angles based on the refined one or more character embeddings.
    Type: Application
    Filed: September 27, 2019
    Publication date: April 1, 2021
    Inventors: Lingfei Wu, Siyu Huo, Tengfei Ma, Pin-Yu Chen, Zhao Qin, Eugene Jungsup Lim, Francisco Javier Martin-Martinez, Hui Sun, Benedetto Marelli, Markus Jochen Buehler