Patents by Inventor Sheng Hua Bao

Sheng Hua Bao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10558695
    Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.
    Type: Grant
    Filed: May 30, 2017
    Date of Patent: February 11, 2020
    Assignee: International Business Machines Corporation
    Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
  • Patent number: 10558756
    Abstract: A data handling system enables the unsupervised creation of an information extraction dictionary by expanding upon a word or phrase included within an expansion query. Prior to receiving the expansion query, the data handling system performs an unsupervised learning of an information corpus which includes text to assign a corpus vector to each word and phrase of the text. After the expansion query, the data handling system compares the expansion query to the corpus vectors. The data handling system ranks the corpus vectors by similarity to the expansion query and provides a ranked list of words or phrases associated with the ranked corpus vectors. The ranked list may be subsequently utilized as the information extraction dictionary.
    Type: Grant
    Filed: November 3, 2016
    Date of Patent: February 11, 2020
    Assignee: International Business Machines Corporation
    Inventors: Sheng Hua Bao, Su Yan
  • Patent number: 10558747
    Abstract: A data handling system enables the unsupervised creation of an information extraction dictionary by expanding upon a word or phrase included within an expansion query. Prior to receiving the expansion query, the data handling system performs an unsupervised learning of an information corpus which includes text to assign a corpus vector to each word and phrase of the text. After the expansion query, the data handling system compares the expansion query to the corpus vectors. The data handling system ranks the corpus vectors by similarity to the expansion query and provides a ranked list of words or phrases associated with the ranked corpus vectors. The ranked list may be subsequently utilized as the information extraction dictionary.
    Type: Grant
    Filed: November 3, 2016
    Date of Patent: February 11, 2020
    Assignee: International Business Machines Corporation
    Inventors: Sheng Hua Bao, Su Yan
  • Patent number: 10552541
    Abstract: According to an embodiment of the present invention, a natural language query including an ambiguous entity is received from a user. A meaning of the ambiguous entity is determined based on an extracted language context of the natural language query. The determined meaning, extracted language context, and contextual information of the user is applied to a machine learning model to determine a plurality of computer applications from amongst multiple computer applications to process the natural language query. The determined applications are executed to produce results for the natural language query tailored to an interest of the user in accordance with the contextual information.
    Type: Grant
    Filed: August 27, 2018
    Date of Patent: February 4, 2020
    Assignee: International Business Machines Corporation
    Inventors: Brian S. Dreher, Henry H. Chen, Sheng Hua Bao, William S. Spangler
  • Publication number: 20200036806
    Abstract: A computer system accesses a storage device. Contents of an object of a request are sorted. The contents of an object of a request are sorted. A hash key is generated to access information in the storage device based on the sorted contents of the object, wherein objects with non-critical differences are mapped to the same hash key. The information in the storage device is accessed based on the generated hash key to produce a response to the request. Embodiments of the present invention further include a method and program product for accessing a storage device in substantially the same manner described above.
    Type: Application
    Filed: July 27, 2018
    Publication date: January 30, 2020
    Inventors: Brian S. Dreher, Sheng Hua Bao, Xiaoyang Gao, Yanyan Han
  • Patent number: 10540438
    Abstract: Performing an operation comprising extracting, from an input comprising unstructured electronic text, a plurality of feature values for a plurality of features defined in a feature vector, identifying, based on a machine learning (ML) model applied to the plurality of feature values, a portion of the electronic text corresponding to an adverse event, and annotating the portion of the electronic text with an indication of the identified adverse event.
    Type: Grant
    Filed: December 22, 2017
    Date of Patent: January 21, 2020
    Assignee: International Business Machines Corporation
    Inventors: Meenakshi Nagarajan, Cartic Ramakrishnan, Sheng Hua Bao, Nakul Chakrapani
  • Publication number: 20190347324
    Abstract: A computer system processes a group of inputs. A group of entities that is input for processing is intercepted. The intercepted group is expanded into individual entities. Each of the individual entities is processed to produce results for each individual entity. The results for each individual entity are intercepted and merged to produce results for the group of entities. Embodiments of the present invention further include a method and program product for processing a group of inputs in substantially the same manner described above.
    Type: Application
    Filed: May 11, 2018
    Publication date: November 14, 2019
    Inventors: Brian S. Dreher, Sheng Hua Bao, Xiaoyang Gao, Yanyan Han
  • Publication number: 20190340487
    Abstract: Mechanisms are provided to implement a medical coding engine to perform medical coding using a neural network architecture that leverages hierarchical semantics between medical concepts. The medical coding engine configures a medical coding neural network to comprise an first layer of nodes comprising preferred terminology (PT) nodes, a second layer comprising lowest level terminology (LLT) nodes, and a third layer comprising weighted values for each connection between each PT node and each LLT node forming a PT node/LLT node connection. Responsive to receiving an adverse event from a cognitive system, a PT node is identified in the first layer associated with a citation from the adverse event. One or more nodes are identified from the second layer based on the identification PT node and a weight associated with the PT node/LLT node connection. A medical code associated with each the one or more LLT nodes is then output.
    Type: Application
    Filed: May 1, 2018
    Publication date: November 7, 2019
    Inventors: Nitish Aggarwal, Sheng Hua Bao, Pathirage Perera
  • Publication number: 20190340294
    Abstract: According to embodiments of the present invention, methods, systems and computer readable media are provided, in a cognitive data processing system, for implementing a predictive analytics system that utilizes entity and non-entity information. A collection of content is processed to extract defined entities pertaining to one or more domains. Semantic relationships are determined between objects within the collection of content, wherein the objects include undefined entities. The defined entities and objects are resolved based on entity definitions and the semantic relationships to determine defined entities and undefined entities for a resulting data set. The resulting data set is processed to identify one or more relationships between a defined entity and an undefined entity.
    Type: Application
    Filed: May 4, 2018
    Publication date: November 7, 2019
    Inventors: William S. Spangler, Richard L. Martin, David Martinez Iraola, Daniel Pierce, Sheng Hua Bao, Meenakshi Nagarajan, Michael D. Pfeifer
  • Publication number: 20190332714
    Abstract: Methods, systems and computer readable media are provided for accessing faceted information using ontologies. Information for an initial entity, including different ontologies to which the initial entity belongs, is retrieved. Entities within different ontologies are determined. The determined entities and different ontologies are displayed on a user interface to enable traversal of the different ontologies for viewing of the determined entities. Accordingly, a user may search for an initial entity. Different ontologies, linked to the initial entity may be returned. A user may select an ontology, and ontologies in which the initial entity belongs are displayed. The user may select any of the displayed ontologies to access other entities in the selected ontology.
    Type: Application
    Filed: April 27, 2018
    Publication date: October 31, 2019
    Inventors: Hrishikesh Sathe, Sheng Hua Bao, William S. Spangler, Xiaoyang Gao
  • Publication number: 20190332614
    Abstract: Methods, systems and computer readable media are provided for accessing faceted information using ontologies. Information for an initial entity, including different ontologies to which the initial entity belongs, is retrieved. Entities within different ontologies are determined. The determined entities and different ontologies are displayed on a user interface to enable traversal of the different ontologies for viewing of the determined entities. Accordingly, a user may search for an initial entity. Different ontologies, linked to the initial entity may be returned. A user may select an ontology, and ontologies in which the initial entity belongs are displayed. The user may select any of the displayed ontologies to access other entities in the selected ontology.
    Type: Application
    Filed: June 24, 2019
    Publication date: October 31, 2019
    Inventors: Hrishikesh Sathe, Sheng Hua Bao, William S. Spangler, Xiaoyang Gao
  • Publication number: 20190324976
    Abstract: Methods, systems, and computer program products are provided for processing a request regarding relationships among instances of entities. A graphical representation of instances of entities is generated and includes one or more source nodes, each representing an instance of an input entity of a request, and one or more related nodes, each representing an instance of a second entity related to one or more corresponding instances of the input entity and associated with a corresponding confidence score for the relationship. Each of the one or more related nodes associated with a confidence score satisfying a threshold is identified. One or more supplemental nodes are added to the graphical representation, each of which represents a corresponding instance of a third entity with a relationship to a corresponding instance of the second entity. The graphical representation is traversed to identify relationships between instances of entities and produce results for the request.
    Type: Application
    Filed: June 24, 2019
    Publication date: October 24, 2019
    Inventors: Yanyan Han, Xiaoyang Gao, William S. Spangler, Sheng Hua Bao, Brian S. Dreher
  • Publication number: 20190326000
    Abstract: Mechanisms are provided that implement a drug-adverse event causality evaluation engine to identify human resource selections based on a readability of unstructured text within an individual case safety report (ICSR) and a confidence value of the ICSR. The drug-adverse event causality evaluation engine receives the ICSR from a cognitive system. The drug-adverse event causality evaluation engine analyzes the ICSR to determine a readability value of the ICSR. The drug-adverse event causality evaluation engine determines whether or not an assessment, by a human reviewer, of the ICSR is required based on a combination of the readability value of the ICSR and the confidence value. The drug-adverse event causality evaluation engine outputs an indication of whether human reviewer assessment is required.
    Type: Application
    Filed: November 1, 2018
    Publication date: October 24, 2019
    Inventors: Sheng Hua Bao, Pathirage Perera, Cartic Ramakrishnan
  • Publication number: 20190325047
    Abstract: Methods, systems and computer readable media are provided for accessing data utilizing a multi-level table comprising generating a plurality of levels of the multi-level table, wherein a first level of the multi-level table includes a hyper-table with a plurality of hyper-cells each hyper-cell including information for a group of cells from an initial base table, wherein intermediate levels of the multi-level table each include a plurality of hyper-tables comprising hyper-cells with each hyper-table linked to and providing information for a corresponding hyper-cell of a hyper-table of a prior level, and wherein a plurality of tables of a terminal level includes information from cells of the initial base table with each table linked to and providing information for a corresponding hyper-cell. Data from the multi-level table is accessed by traversing links between the hyper-tables of the plurality of levels to access data within the tables of the terminal level.
    Type: Application
    Filed: April 24, 2018
    Publication date: October 24, 2019
    Inventors: Xiaoyang Gao, William Scott Spangler, Sheng Hua Bao, Yanyan Han, Brian S. Dreher
  • Publication number: 20190325063
    Abstract: Methods, systems, and computer program products are provided for processing a request regarding relationships among instances of entities. A graphical representation of instances of entities is generated and includes one or more source nodes, each representing an instance of an input entity of a request, and one or more related nodes, each representing an instance of a second entity related to one or more corresponding instances of the input entity and associated with a corresponding confidence score for the relationship. Each of the one or more related nodes associated with a confidence score satisfying a threshold is identified. One or more supplemental nodes are added to the graphical representation, each of which represents a corresponding instance of a third entity with a relationship to a corresponding instance of the second entity. The graphical representation is traversed to identify relationships between instances of entities and produce results for the request.
    Type: Application
    Filed: April 24, 2018
    Publication date: October 24, 2019
    Inventors: Yanyan Han, Xiaoyang Gao, William S. Spangler, Sheng Hua Bao, Brian S. Dreher
  • Publication number: 20190324957
    Abstract: Methods, systems and computer readable media are provided for accessing data utilizing a multi-level table comprising generating a plurality of levels of the multi-level table, wherein a first level of the multi-level table includes a hyper-table with a plurality of hyper-cells each hyper-cell including information for a group of cells from an initial base table, wherein intermediate levels of the multi-level table each include a plurality of hyper-tables comprising hyper-cells with each hyper-table linked to and providing information for a corresponding hyper-cell of a hyper-table of a prior level, and wherein a plurality of tables of a terminal level includes information from cells of the initial base table with each table linked to and providing information for a corresponding hyper-cell. Data from the multi-level table is accessed by traversing links between the hyper-tables of the plurality of levels to access data within the tables of the terminal level.
    Type: Application
    Filed: June 24, 2019
    Publication date: October 24, 2019
    Inventors: Xiaoyang Gao, William Scott Spangler, Sheng Hua Bao, Yanyan Han, Brian S. Dreher
  • Publication number: 20190325999
    Abstract: Mechanisms are provided that implement a drug-adverse event causality evaluation engine to identify human resource selections based on a readability of unstructured text within an individual case safety report (ICSR) and a confidence value of the ICSR. The drug-adverse event causality evaluation engine receives the ICSR from a cognitive system. The drug-adverse event causality evaluation engine analyzes the ICSR to determine a readability value of the ICSR. The drug-adverse event causality evaluation engine determines whether or not an assessment, by a human reviewer, of the ICSR is required based on a combination of the readability value of the ICSR and the confidence value. The drug-adverse event causality evaluation engine outputs an indication of whether human reviewer assessment is required.
    Type: Application
    Filed: April 20, 2018
    Publication date: October 24, 2019
    Inventors: Sheng Hua Bao, Pathirage Perera, Cartic Ramakrishnan
  • Publication number: 20190317957
    Abstract: Relationship extraction between descriptors in one or more lists of weather condition descriptors, and adverse event descriptors within unstructured data sources using natural language processing. Medical condition descriptor may be a descriptor that may be used to further extract relationships between weather condition descriptors and adverse event descriptors. A data object is generated, according to a data model, based on the extracted relationships between the descriptors. A set of candidate unstructured documents containing the extracted relationship between the descriptors is retrieved and filtered by selecting unstructured documents that include a precautionary measure descriptor. The filtered precautionary measure descriptors are presented to a user in a summarized message to a user device.
    Type: Application
    Filed: June 28, 2019
    Publication date: October 17, 2019
    Inventors: Shilpi Ahuja, Sheng Hua Bao, Rashmi Gangadharaiah
  • Publication number: 20190304573
    Abstract: According to embodiments of the present invention, similarity metrics or measures of similarity may be combined using an adaptive weighting scheme. A subset of entities from a first set of entities that have a known relationship is randomly selected. The subset is combined with a second set of entities that have an unknown relationship to each other and/or to the first set of entities. At least two different measures of similarity (similarity metrics) between the first set and the combined second set (including the subset) is determined for each entity in the second set. For each entity in the second set, the at least two different measures of similarity are compared, and a weight is assigned adaptively to each measure of similarity based on the magnitude of the measure of similarity. The weighted measures of similarity are combined to determine an aggregate adaptively weighted similarity score for each entity.
    Type: Application
    Filed: June 18, 2019
    Publication date: October 3, 2019
    Inventors: Yanyan Han, Sheng Hua Bao, Xiaoyang Gao, Brian S. Dreher, William S. Spangler, Feng Wang
  • Publication number: 20190303780
    Abstract: According to embodiments of the present invention, an entity may be represented by an unstructured feature vector comprising a plurality of features extracted from unstructured data using semantic analysis and a structural feature vector comprising a plurality of features from chemical structure data. A similarity matrix may be used to compare entities and generate a similarity score, based on both the unstructured feature vector and the structural feature vector for each entity. In some aspects, a user may enter a query (from which a chemical structural feature vector is dynamically generated) to compare against entities having unstructured and/or structural feature vectors, stored in a database.
    Type: Application
    Filed: June 18, 2019
    Publication date: October 3, 2019
    Inventors: William S. Spangler, Richard L. Martin, Feng Wang, Xiaoyang Gao, Sheng Hua Bao