Patents by Inventor Sheng Hua Bao

Sheng Hua Bao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11924304
    Abstract: A computer system accesses a storage device. Contents of an object of a request are sorted. The contents of an object of a request are sorted. A hash key is generated to access information in the storage device based on the sorted contents of the object, wherein objects with non-critical differences are mapped to the same hash key. The information in the storage device is accessed based on the generated hash key to produce a response to the request. Embodiments of the present invention further include a method and program product for accessing a storage device in substantially the same manner described above.
    Type: Grant
    Filed: July 27, 2018
    Date of Patent: March 5, 2024
    Assignee: International Business Machines Corporation
    Inventors: Brian S. Dreher, Sheng Hua Bao, Xiaoyang Gao, Yanyan Han
  • Patent number: 11676043
    Abstract: A mechanism is provided in a data processing system having a processor and a memory. The memory comprises instructions which are executed by the processor to cause the processor to implement a training system for finding an optimal surface for hierarchical classification task on an ontology. The training system receives a training data set and a hierarchical classification ontology data structure. The training system generates a neural network architecture based on the training data set and the hierarchical classification ontology data structure. The neural network architecture comprises an indicative layer, a parent tier (PT) output and a lower leaf tier (LLT) output. The training system trains the neural network architecture to classify the training data set to leaf nodes at the LLT output and parent nodes at the PT output. The indicative layer in the neural network architecture determines a surface that passes through each path from a root to a leaf node in the hierarchical ontology data structure.
    Type: Grant
    Filed: March 4, 2019
    Date of Patent: June 13, 2023
    Assignee: International Business Machines Corporation
    Inventors: Pathirage Dinindu Sujan Udayanga Perera, Orna Raz, Ramani Routray, Vivek Krishnamurthy, Sheng Hua Bao, Eitan D. Farchi
  • Patent number: 11556710
    Abstract: A computer system processes a group of inputs. A group of entities that is input for processing is intercepted. The intercepted group is expanded into individual entities. Each of the individual entities is processed to produce results for each individual entity. The results for each individual entity are intercepted and merged to produce results for the group of entities. Embodiments of the present invention further include a method and program product for processing a group of inputs in substantially the same manner described above.
    Type: Grant
    Filed: May 11, 2018
    Date of Patent: January 17, 2023
    Assignee: International Business Machines Corporation
    Inventors: Brian S. Dreher, Sheng Hua Bao, Xiaoyang Gao, Yanyan Han
  • Patent number: 11514691
    Abstract: A computer system trains a machine learning model. A vector representation is generated for each document in a collection of documents. The documents are clustered based on the vector representations of the documents to produce a plurality of clusters. A training set is produced by selecting one or more documents from each cluster, wherein the selected documents represent a sample of the collection of documents to train the machine learning model. The machine learning model is trained by applying the training set to the machine learning model. Embodiments of the present invention further include a method and program product for training a machine learning model in substantially the same manner described above.
    Type: Grant
    Filed: June 12, 2019
    Date of Patent: November 29, 2022
    Assignee: International Business Machines Corporation
    Inventors: Pathirage D. S. U. Perera, Eitan D. Farchi, Orna Raz, Ramani Routray, Sheng Hua Bao, Marcel Zalmanovici
  • Patent number: 11475335
    Abstract: A mechanism is provided in a data processing system for training a computer implemented model. The mechanism determines an operation for which the computer implemented model is to be trained. The mechanism performs a statistical analysis of an enterprise dataset for an enterprise to generate one or more statistical distributions of cases and features correlated with the operation for which the computer implemented model is to be trained. The mechanism selects a subset of cases in the enterprise dataset for annotation based on the one or more statistical distributions of cases and features. The mechanism annotates the selected subset of cases to generate an annotated training dataset. The mechanism trains the computer implemented model, using the annotated training dataset, to perform the operation.
    Type: Grant
    Filed: April 24, 2019
    Date of Patent: October 18, 2022
    Assignee: International Business Machines Corporation
    Inventors: Ramani Routray, Sheng Hua Bao, Claire Abu-Assal, Cartic Ramakrishnan, Pathirage Dinindu Sujan Udayanga Perera, Abhinandan Kelgere Ramesh, Bruce L. Hillsberg
  • Patent number: 11409950
    Abstract: Mechanisms are provided to implement an annotation mechanism allows users to annotate documents with annotations for processing by a cognitive medical system. The annotation mechanism receives, via a user interface, a user selection of an electronic document for annotation, and determines one or more domains associated with the selected electronic document from an analysis of metadata associated with the selected electronic document. The annotation mechanism retrieves a predefined set of annotations associated with each determined domain, and presents the predefined set of annotations as user selectable elements. The annotation mechanism receives, via the user interface, a selection of one or more annotations in the predefined set of annotations to be associated with the selected portion of the selected electronic document, and generates annotation metadata associating the selected portion using the selected one or more annotations.
    Type: Grant
    Filed: May 8, 2019
    Date of Patent: August 9, 2022
    Assignee: International Business Machines Corporation
    Inventors: Sheng Hua Bao, Xianying Liu, Nan Liu, Ramani Routray, Tongkai Shao, Feng Wang
  • Publication number: 20220215155
    Abstract: Provided are embodiments for performing data linking with visual information. Embodiments include receiving a form including at least one or more entities, determining a visual location information of the at least one or more entities in the forms, and identifying one or more attributes in the form. Embodiments also include linking the at least one entity with one or more attributes using the visual location information of the at least one entity, and providing structured data linking the at least one entity with the one or more attributes.
    Type: Application
    Filed: January 4, 2021
    Publication date: July 7, 2022
    Inventors: Pathirage Dinindu Sujan Udayanga Perera, Cartic Ramakrishnan, Sheng Hua Bao
  • Patent number: 11372905
    Abstract: From metadata corresponding to a narrative text, a first encoding is constructed, the first encoding comprising a standardized text string, the first encoding formed according to an encoding scheme. A specified portion of the standardized text string of the first encoding is marked as an anchor term. A correspondence between the first encoding and a second encoding is tested using the encoding scheme and a Natural Language Processing engine, responsive to finding the anchor term within the narrative text. The second encoding corresponds to a text window. The text window comprises a portion of the narrative text comprising an instance of the anchor term and a word within a predetermined distance from the instance. Responsive to the second encoding being identical to the first encoding, the narrative text is annotated, the annotating creating new data linking the narrative text with the second encoding.
    Type: Grant
    Filed: February 4, 2019
    Date of Patent: June 28, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Nakul Chakrapani, Ramani Routray, Pathirage Perera, Sheng Hua Bao, Orna Raz, Eitan Farchi
  • Patent number: 11269929
    Abstract: According to embodiments of the present invention, methods, systems and computer readable media are provided, in a cognitive data processing system, for implementing a predictive analytics system that utilizes entity and non-entity information. A collection of content is processed to extract defined entities pertaining to one or more domains. Semantic relationships are determined between objects within the collection of content, wherein the objects include undefined entities. The defined entities and objects are resolved based on entity definitions and the semantic relationships to determine defined entities and undefined entities for a resulting data set. The resulting data set is processed to identify one or more relationships between a defined entity and an undefined entity.
    Type: Grant
    Filed: May 4, 2018
    Date of Patent: March 8, 2022
    Assignee: International Business Machines Corporation
    Inventors: William S. Spangler, Richard L. Martin, David Martinez Iraola, Daniel Pierce, Sheng Hua Bao, Meenakshi Nagarajan, Michael D. Pfeifer
  • Patent number: 11257592
    Abstract: A method, a system, and a computer program product are provided. A machine learning model is generated to process adverse event information and produce multiple corresponding medical codes associated with the adverse event information, wherein the multiple medical codes are semantically and hierarchically related in a medical taxonomy. The machine learning model includes multiple parallel output layers, each of which is associated with a corresponding medical code. The machine learning model is trained with training data elements, each of which includes adverse event information mapped to respective multiple medical codes, wherein results from each of the output layers adjusts the machine learning model. After completing the training, information pertaining to an adverse event is applied to the machine learning model to determine the corresponding multiple medical codes within the medical taxonomy.
    Type: Grant
    Filed: February 26, 2019
    Date of Patent: February 22, 2022
    Assignee: International Business Machines Corporation
    Inventors: Pathirage D. S. U Perera, Cartic Ramakrishnan, Sheng Hua Bao, Ramani Routray
  • Patent number: 11250933
    Abstract: According to embodiments of the present invention, similarity metrics or measures of similarity may be combined using an adaptive weighting scheme. A subset of entities from a first set of entities that have a known relationship is randomly selected. The subset is combined with a second set of entities that have an unknown relationship to each other and/or to the first set of entities. At least two different measures of similarity (similarity metrics) between the first set and the combined second set (including the subset) is determined for each entity in the second set. For each entity in the second set, the at least two different measures of similarity are compared, and a weight is assigned adaptively to each measure of similarity based on the magnitude of the measure of similarity. The weighted measures of similarity are combined to determine an aggregate adaptively weighted similarity score for each entity.
    Type: Grant
    Filed: June 18, 2019
    Date of Patent: February 15, 2022
    Assignee: International Business Machines Corporation
    Inventors: Yanyan Han, Sheng Hua Bao, Xiaoyang Gao, Brian S. Dreher, William S. Spangler, Feng Wang
  • Patent number: 11244743
    Abstract: According to embodiments of the present invention, similarity metrics or measures of similarity may be combined using an adaptive weighting scheme. A subset of entities from a first set of entities that have a known relationship is randomly selected. The subset is combined with a second set of entities that have an unknown relationship to each other and/or to the first set of entities. At least two different measures of similarity (similarity metrics) between the first set and the combined second set (including the subset) is determined for each entity in the second set. For each entity in the second set, the at least two different measures of similarity are compared, and a weight is assigned adaptively to each measure of similarity based on the magnitude of the measure of similarity. The weighted measures of similarity are combined to determine an aggregate adaptively weighted similarity score for each entity.
    Type: Grant
    Filed: January 5, 2018
    Date of Patent: February 8, 2022
    Assignee: International Business Machines Corporation
    Inventors: Yanyan Han, Sheng Hua Bao, Xiaoyang Gao, Brian S. Dreher, William S. Spangler, Feng Wang
  • Publication number: 20210406640
    Abstract: Mechanisms are provided to implement a medical coding engine to perform medical coding using a neural network architecture that leverages hierarchical semantics between medical concepts. The medical coding engine configures a medical coding neural network to comprise an first layer of nodes comprising preferred terminology (PT) nodes, a second layer comprising lowest level terminology (LLT) nodes, and a third layer comprising weighted values for each connection between each PT node and each LLT node forming a PT node/LLT node connection. Responsive to receiving an adverse event from a cognitive system, a PT node is identified in the first layer associated with a citation from the adverse event. One or more LLT nodes are identified from the second layer based on the identification PT node and a weight associated with the PT node/LLT node connection. A medical code associated with each the one or more LLT nodes is then output.
    Type: Application
    Filed: September 8, 2021
    Publication date: December 30, 2021
    Inventors: Nitish Aggarwal, Sheng Hua Bao, Pathirage Perera
  • Patent number: 11188574
    Abstract: Methods, systems, and computer program products are provided for processing a request regarding relationships among instances of entities. A graphical representation of instances of entities is generated and includes one or more source nodes, each representing an instance of an input entity of a request, and one or more related nodes, each representing an instance of a second entity related to one or more corresponding instances of the input entity and associated with a corresponding confidence score for the relationship. Each of the one or more related nodes associated with a confidence score satisfying a threshold is identified. One or more supplemental nodes are added to the graphical representation, each of which represents a corresponding instance of a third entity with a relationship to a corresponding instance of the second entity. The graphical representation is traversed to identify relationships between instances of entities and produce results for the request.
    Type: Grant
    Filed: June 24, 2019
    Date of Patent: November 30, 2021
    Assignee: International Business Machines Corporation
    Inventors: Yanyan Han, Xiaoyang Gao, William S. Spangler, Sheng Hua Bao, Brian S. Dreher
  • Patent number: 11182369
    Abstract: Methods, systems and computer readable media are provided for accessing data utilizing a multi-level table comprising generating a plurality of levels of the multi-level table, wherein a first level of the multi-level table includes a hyper-table with a plurality of hyper-cells each hyper-cell including information for a group of cells from an initial base table, wherein intermediate levels of the multi-level table each include a plurality of hyper-tables comprising hyper-cells with each hyper-table linked to and providing information for a corresponding hyper-cell of a hyper-table of a prior level, and wherein a plurality of tables of a terminal level includes information from cells of the initial base table with each table linked to and providing information for a corresponding hyper-cell. Data from the multi-level table is accessed by traversing links between the hyper-tables of the plurality of levels to access data within the tables of the terminal level.
    Type: Grant
    Filed: April 24, 2018
    Date of Patent: November 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Xiaoyang Gao, William S. Spangler, Sheng Hua Bao, Yanyan Han, Brian S. Dreher
  • Patent number: 11182371
    Abstract: Methods, systems and computer readable media are provided for accessing data utilizing a multi-level table comprising generating a plurality of levels of the multi-level table, wherein a first level of the multi-level table includes a hyper-table with a plurality of hyper-cells each hyper-cell including information for a group of cells from an initial base table, wherein intermediate levels of the multi-level table each include a plurality of hyper-tables comprising hyper-cells with each hyper-table linked to and providing information for a corresponding hyper-cell of a hyper-table of a prior level, and wherein a plurality of tables of a terminal level includes information from cells of the initial base table with each table linked to and providing information for a corresponding hyper-cell. Data from the multi-level table is accessed by traversing links between the hyper-tables of the plurality of levels to access data within the tables of the terminal level.
    Type: Grant
    Filed: June 24, 2019
    Date of Patent: November 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Xiaoyang Gao, William Scott Spangler, Sheng Hua Bao, Yanyan Han, Brian S. Dreher
  • Patent number: 11176441
    Abstract: Mechanisms are provided to implement a medical coding engine to perform medical coding using a neural network architecture that leverages hierarchical semantics between medical concepts. The medical coding engine configures a medical coding neural network to comprise an first layer of nodes comprising preferred terminology (PT) nodes, a second layer comprising lowest level terminology (LLT) nodes, and a third layer comprising weighted values for each connection between each PT node and each LLT node forming a PT node/LLT node connection. Responsive to receiving an adverse event from a cognitive system, a PT node is identified in the first layer associated with a citation from the adverse event. One or more nodes are identified from the second layer based on the identification PT node and a weight associated with the PT node/LLT node connection. A medical code associated with each the one or more LLT nodes is then output.
    Type: Grant
    Filed: May 1, 2018
    Date of Patent: November 16, 2021
    Assignee: International Business Machines Corporation
    Inventors: Nitish Aggarwal, Sheng Hua Bao, Pathirage Perera
  • Patent number: 11151172
    Abstract: Methods, systems and computer readable media are provided for accessing faceted information using ontologies. Information for an initial entity, including different ontologies to which the initial entity belongs, is retrieved. Entities within different ontologies are determined. The determined entities and different ontologies are displayed on a user interface to enable traversal of the different ontologies for viewing of the determined entities. Accordingly, a user may search for an initial entity. Different ontologies, linked to the initial entity may be returned. A user may select an ontology, and ontologies in which the initial entity belongs are displayed. The user may select any of the displayed ontologies to access other entities in the selected ontology.
    Type: Grant
    Filed: June 24, 2019
    Date of Patent: October 19, 2021
    Assignee: International Business Machines Corporation
    Inventors: Hrishikesh Sathe, Sheng Hua Bao, William S. Spangler, Xiaoyang Gao
  • Patent number: 11151171
    Abstract: Methods, systems and computer readable media are provided for accessing faceted information using ontologies. Information for an initial entity, including different ontologies to which the initial entity belongs, is retrieved. Entities within different ontologies are determined. The determined entities and different ontologies are displayed on a user interface to enable traversal of the different ontologies for viewing of the determined entities. Accordingly, a user may search for an initial entity. Different ontologies, linked to the initial entity may be returned. A user may select an ontology, and ontologies in which the initial entity belongs are displayed. The user may select any of the displayed ontologies to access other entities in the selected ontology.
    Type: Grant
    Filed: April 27, 2018
    Date of Patent: October 19, 2021
    Assignee: International Business Machines Corporation
    Inventors: Hrishikesh Sathe, Sheng Hua Bao, William S. Spangler, Xiaoyang Gao
  • Patent number: 11120339
    Abstract: A method, computer system, and a computer program product for determining the reliability of a claim is provided. The present invention may include receiving an input data from a user. The present invention may also include analyzing the claim associated with the received input data to determine a reliability score associated with the input data, wherein the claim is semantically similar to the received input data. The present invention may further include generating, from a prediction model, the reliability score for the claim associated with the received input data. The present invention may also include presenting the reliability score for the claim associated with the received input data to the user.
    Type: Grant
    Filed: May 10, 2017
    Date of Patent: September 14, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Sheng Hua Bao, Rashmi Gangadharaiah, Richard L. Martin, David Martinez Iraola, Meenakshi Nagarajan, Dan G. Tecuci