Patents by Inventor William S Spangler

William S Spangler has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11651235
    Abstract: A method, computer system, and a computer program product for generating a candidate set of entities from a training set of entities is provided. The present invention may include determining an ontology class for an input entity in the training set of entities. The present invention may include adding the input entity to an ontology list. The present invention may then include assigning an entity score to the input entity. The present invention may also include normalizing the ontology list of entity scores. The present invention may lastly include selecting the candidate set of entities with the highest entity score.
    Type: Grant
    Filed: November 28, 2018
    Date of Patent: May 16, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: William S. Spangler, Alix Lacoste, Katherine Shen, Hrishikesh Sathe, Jacques Labrie
  • Patent number: 11354591
    Abstract: Mechanisms are provided to implement a genomic database curation (GDC) system. The GDC system generates a ground truth database based on a training subset of datasets from an uncurated large scale genomic database, and label metadata for the training subset. The GDC system trains at least one classification engine of the GDC system based on the training subset and the ground truth database at least by performing a machine learning operation on the at least one classification engine. The GDC system automatically applies the at least one trained classification engine on the uncurated large scale genomic database to generate an automatically curated large scale genomic database. A meta-classifier engine generates an output specifying at least one of significant gene signatures or gene pathways for at least one of diseases or drug agents based on the automatically curated large scale genomic database.
    Type: Grant
    Filed: October 11, 2018
    Date of Patent: June 7, 2022
    Assignee: International Business Machines Corporation
    Inventors: Sanjoy Dey, Achille B. Fokoue-Nkoutche, William S. Spangler, Ping Zhang
  • Patent number: 11269929
    Abstract: According to embodiments of the present invention, methods, systems and computer readable media are provided, in a cognitive data processing system, for implementing a predictive analytics system that utilizes entity and non-entity information. A collection of content is processed to extract defined entities pertaining to one or more domains. Semantic relationships are determined between objects within the collection of content, wherein the objects include undefined entities. The defined entities and objects are resolved based on entity definitions and the semantic relationships to determine defined entities and undefined entities for a resulting data set. The resulting data set is processed to identify one or more relationships between a defined entity and an undefined entity.
    Type: Grant
    Filed: May 4, 2018
    Date of Patent: March 8, 2022
    Assignee: International Business Machines Corporation
    Inventors: William S. Spangler, Richard L. Martin, David Martinez Iraola, Daniel Pierce, Sheng Hua Bao, Meenakshi Nagarajan, Michael D. Pfeifer
  • Patent number: 11250933
    Abstract: According to embodiments of the present invention, similarity metrics or measures of similarity may be combined using an adaptive weighting scheme. A subset of entities from a first set of entities that have a known relationship is randomly selected. The subset is combined with a second set of entities that have an unknown relationship to each other and/or to the first set of entities. At least two different measures of similarity (similarity metrics) between the first set and the combined second set (including the subset) is determined for each entity in the second set. For each entity in the second set, the at least two different measures of similarity are compared, and a weight is assigned adaptively to each measure of similarity based on the magnitude of the measure of similarity. The weighted measures of similarity are combined to determine an aggregate adaptively weighted similarity score for each entity.
    Type: Grant
    Filed: June 18, 2019
    Date of Patent: February 15, 2022
    Assignee: International Business Machines Corporation
    Inventors: Yanyan Han, Sheng Hua Bao, Xiaoyang Gao, Brian S. Dreher, William S. Spangler, Feng Wang
  • Patent number: 11244743
    Abstract: According to embodiments of the present invention, similarity metrics or measures of similarity may be combined using an adaptive weighting scheme. A subset of entities from a first set of entities that have a known relationship is randomly selected. The subset is combined with a second set of entities that have an unknown relationship to each other and/or to the first set of entities. At least two different measures of similarity (similarity metrics) between the first set and the combined second set (including the subset) is determined for each entity in the second set. For each entity in the second set, the at least two different measures of similarity are compared, and a weight is assigned adaptively to each measure of similarity based on the magnitude of the measure of similarity. The weighted measures of similarity are combined to determine an aggregate adaptively weighted similarity score for each entity.
    Type: Grant
    Filed: January 5, 2018
    Date of Patent: February 8, 2022
    Assignee: International Business Machines Corporation
    Inventors: Yanyan Han, Sheng Hua Bao, Xiaoyang Gao, Brian S. Dreher, William S. Spangler, Feng Wang
  • Patent number: 11188574
    Abstract: Methods, systems, and computer program products are provided for processing a request regarding relationships among instances of entities. A graphical representation of instances of entities is generated and includes one or more source nodes, each representing an instance of an input entity of a request, and one or more related nodes, each representing an instance of a second entity related to one or more corresponding instances of the input entity and associated with a corresponding confidence score for the relationship. Each of the one or more related nodes associated with a confidence score satisfying a threshold is identified. One or more supplemental nodes are added to the graphical representation, each of which represents a corresponding instance of a third entity with a relationship to a corresponding instance of the second entity. The graphical representation is traversed to identify relationships between instances of entities and produce results for the request.
    Type: Grant
    Filed: June 24, 2019
    Date of Patent: November 30, 2021
    Assignee: International Business Machines Corporation
    Inventors: Yanyan Han, Xiaoyang Gao, William S. Spangler, Sheng Hua Bao, Brian S. Dreher
  • Patent number: 11182369
    Abstract: Methods, systems and computer readable media are provided for accessing data utilizing a multi-level table comprising generating a plurality of levels of the multi-level table, wherein a first level of the multi-level table includes a hyper-table with a plurality of hyper-cells each hyper-cell including information for a group of cells from an initial base table, wherein intermediate levels of the multi-level table each include a plurality of hyper-tables comprising hyper-cells with each hyper-table linked to and providing information for a corresponding hyper-cell of a hyper-table of a prior level, and wherein a plurality of tables of a terminal level includes information from cells of the initial base table with each table linked to and providing information for a corresponding hyper-cell. Data from the multi-level table is accessed by traversing links between the hyper-tables of the plurality of levels to access data within the tables of the terminal level.
    Type: Grant
    Filed: April 24, 2018
    Date of Patent: November 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Xiaoyang Gao, William S. Spangler, Sheng Hua Bao, Yanyan Han, Brian S. Dreher
  • Patent number: 11182692
    Abstract: According to an embodiment of the present invention, a system designates each document in a collection of documents as a member of a first group containing known subjects for a concept of interest or as a member of a second group containing candidate subjects for the concept of interest and determines a subset of documents for at least one subject. The system generates a classifier based on the documents in the first and second groups and applies the classifier to a set of documents for the at least one subject to determine whether each document belong to the first and/or second group. The system generates a score for the at least one subject based on a quantity of documents for that subject assigned to the first group of documents relative to a total quantity of documents for that subject and ranks that subject based on the determined score for each subject.
    Type: Grant
    Filed: June 16, 2017
    Date of Patent: November 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Alix M. Lacoste, William S. Spangler
  • Patent number: 11151172
    Abstract: Methods, systems and computer readable media are provided for accessing faceted information using ontologies. Information for an initial entity, including different ontologies to which the initial entity belongs, is retrieved. Entities within different ontologies are determined. The determined entities and different ontologies are displayed on a user interface to enable traversal of the different ontologies for viewing of the determined entities. Accordingly, a user may search for an initial entity. Different ontologies, linked to the initial entity may be returned. A user may select an ontology, and ontologies in which the initial entity belongs are displayed. The user may select any of the displayed ontologies to access other entities in the selected ontology.
    Type: Grant
    Filed: June 24, 2019
    Date of Patent: October 19, 2021
    Assignee: International Business Machines Corporation
    Inventors: Hrishikesh Sathe, Sheng Hua Bao, William S. Spangler, Xiaoyang Gao
  • Patent number: 11151171
    Abstract: Methods, systems and computer readable media are provided for accessing faceted information using ontologies. Information for an initial entity, including different ontologies to which the initial entity belongs, is retrieved. Entities within different ontologies are determined. The determined entities and different ontologies are displayed on a user interface to enable traversal of the different ontologies for viewing of the determined entities. Accordingly, a user may search for an initial entity. Different ontologies, linked to the initial entity may be returned. A user may select an ontology, and ontologies in which the initial entity belongs are displayed. The user may select any of the displayed ontologies to access other entities in the selected ontology.
    Type: Grant
    Filed: April 27, 2018
    Date of Patent: October 19, 2021
    Assignee: International Business Machines Corporation
    Inventors: Hrishikesh Sathe, Sheng Hua Bao, William S. Spangler, Xiaoyang Gao
  • Patent number: 11120257
    Abstract: Rationalization of network predictions using similarity to known connections is provided. In various embodiments, a graph is read. The graph comprises a plurality of nodes. Each of the plurality of nodes corresponds to an entity or property. The plurality of nodes is interconnected by a plurality of edges. Each edge corresponds to a relationship between connected nodes. A new edge in the graph is predicted. The new edge corresponds to a relationship between a first node and a second node. The first node corresponds to an entity and the second node corresponds to an entity or property. One or more additional nodes connected to the second node is located. The one or more additional nodes is scored according to its connections in common with the first node. One or more sources is provided to a user describing the connection between the one or more additional node and the second node.
    Type: Grant
    Filed: February 24, 2020
    Date of Patent: September 14, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Alix Lacoste, William S. Spangler, Feng Wang
  • Patent number: 11100413
    Abstract: According to embodiments of the present invention, an entity may be represented by an unstructured feature vector comprising a plurality of features extracted from unstructured data using semantic analysis and a structural feature vector comprising a plurality of features from chemical structure data. A similarity matrix may be used to compare entities and generate a similarity score, based on both the unstructured feature vector and the structural feature vector for each entity. In some aspects, a user may enter a query (from which a chemical structural feature vector is dynamically generated) to compare against entities having unstructured and/or structural feature vectors, stored in a database.
    Type: Grant
    Filed: June 18, 2019
    Date of Patent: August 24, 2021
    Assignee: International Business Machines Corporation
    Inventors: William S. Spangler, Richard L. Martin, Feng Wang, Xiaoyang Gao, Sheng Hua Bao
  • Patent number: 11093842
    Abstract: According to embodiments of the present invention, an entity may be represented by an unstructured feature vector comprising a plurality of features extracted from unstructured data using semantic analysis and a structural feature vector comprising a plurality of features from chemical structure data. A similarity matrix may be used to compare entities and generate a similarity score, based on both the unstructured feature vector and the structural feature vector for each entity. In some aspects, a user may enter a query (from which a chemical structural feature vector is dynamically generated) to compare against entities having unstructured and/or structural feature vectors, stored in a database.
    Type: Grant
    Filed: February 13, 2018
    Date of Patent: August 17, 2021
    Assignee: International Business Machines Corporation
    Inventors: William S. Spangler, Richard L. Martin, Feng Wang, Xiaoyang Gao, Sheng Hua Bao
  • Patent number: 10810234
    Abstract: Methods, systems, and computer program products are provided for processing a request regarding relationships among instances of entities. A graphical representation of instances of entities is generated and includes one or more source nodes, each representing an instance of an input entity of a request, and one or more related nodes, each representing an instance of a second entity related to one or more corresponding instances of the input entity and associated with a corresponding confidence score for the relationship. Each of the one or more related nodes associated with a confidence score satisfying a threshold is identified. One or more supplemental nodes are added to the graphical representation, each of which represents a corresponding instance of a third entity with a relationship to a corresponding instance of the second entity. The graphical representation is traversed to identify relationships between instances of entities and produce results for the request.
    Type: Grant
    Filed: April 24, 2018
    Date of Patent: October 20, 2020
    Assignee: International Business Machines Coproration
    Inventors: Yanyan Han, Xiaoyang Gao, William S. Spangler, Sheng Hua Bao, Brian S. Dreher
  • Publication number: 20200193154
    Abstract: Rationalization of network predictions using similarity to known connections is provided. In various embodiments, a graph is read. The graph comprises a plurality of nodes. Each of the plurality of nodes corresponds to an entity or property. The plurality of nodes is interconnected by a plurality of edges. Each edge corresponds to a relationship between connected nodes. A new edge in the graph is predicted. The new edge corresponds to a relationship between a first node and a second node. The first node corresponds to an entity and the second node corresponds to an entity or property. One or more additional nodes connected to the second node is located. The one or more additional nodes is scored according to its connections in common with the first node. One or more sources is provided to a user describing the connection between the one or more additional node and the second node.
    Type: Application
    Filed: February 24, 2020
    Publication date: June 18, 2020
    Inventors: Alix Lacoste, William S. Spangler, Feng Wang
  • Publication number: 20200167663
    Abstract: A method, computer system, and a computer program product for generating a candidate set of entities from a training set of entities is provided. The present invention may include determining an ontology class for an input entity in the training set of entities. The present invention may include adding the input entity to an ontology list. The present invention may then include assigning an entity score to the input entity. The present invention may also include normalizing the ontology list of entity scores. The present invention may lastly include selecting the candidate set of entities with the highest entity score.
    Type: Application
    Filed: November 28, 2018
    Publication date: May 28, 2020
    Inventors: William S. Spangler, Alix Lacoste, Katherine Shen, Hrishikesh Sathe, Jacques Labrie
  • Publication number: 20200118040
    Abstract: Mechanisms are provided to implement a genomic database curation (GDC) system. The GDC system generates a ground truth database based on a training subset of datasets from an uncurated large scale genomic database, and label metadata for the training subset. The GDC system trains at least one classification engine of the GDC system based on the training subset and the ground truth database at least by performing a machine learning operation on the at least one classification engine. The GDC system automatically applies the at least one trained classification engine on the uncurated large scale genomic database to generate an automatically curated large scale genomic database. A meta-classifier engine generates an output specifying at least one of significant gene signatures or gene pathways for at least one of diseases or drug agents based on the automatically curated large scale genomic database.
    Type: Application
    Filed: October 11, 2018
    Publication date: April 16, 2020
    Inventors: Sanjoy Dey, Achille B. Fokoue-Nkoutche, William S. Spangler, Ping Zhang
  • Patent number: 10607720
    Abstract: The present invention relates to a method and system for associating gene expression data with a disease name. A first data set associated with a plurality of genetic probes for a plurality of biological samples may be received. The first data set may be sorted based on a normalized gene expression values for the plurality of genetic probes. A largest value gap of the normalized gene expression values may be identified. A set of expressed genes within the first data set may be identified. An indexable document may be generated for a biological sample of the plurality of biological samples comprising data associated with the set of expressed genes. A second data set associated with an expressed gene of the set of expressed genes may be searched. A disease name may be associated with an expressed gene based on a threshold correlation between the disease name and the expressed gene.
    Type: Grant
    Filed: May 11, 2016
    Date of Patent: March 31, 2020
    Assignee: International Business Machines Corporation
    Inventors: Linda H. Kato, Jeffrey T. Kreulen, Jacques Labrie, William S. Spangler, Ignacio G. Terrizzano
  • Patent number: 10607074
    Abstract: Rationalization of network predictions using similarity to known connections is provided. In various embodiments, a graph is read. The graph comprises a plurality of nodes. Each of the plurality of nodes corresponds to an entity or property. The plurality of nodes is interconnected by a plurality of edges. Each edge corresponds to a relationship between connected nodes. A new edge in the graph is predicted. The new edge corresponds to a relationship between a first node and a second node. The first node corresponds to an entity and the second node corresponds to an entity or property. One or more additional nodes connected to the second node is located. The one or more additional nodes is scored according to its connections in common with the first node. One or more sources is provided to a user describing the connection between the one or more additional node and the second node.
    Type: Grant
    Filed: November 22, 2017
    Date of Patent: March 31, 2020
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Alix Lacoste, William S. Spangler, Feng Wang
  • Publication number: 20200065385
    Abstract: According to an embodiment of the present invention, a natural language query including an ambiguous entity is received from a user. A meaning of the ambiguous entity is determined based on an extracted language context of the natural language query. The determined meaning, extracted language context, and contextual information of the user is applied to a machine learning model to determine a plurality of computer applications from amongst multiple computer applications to process the natural language query. The determined applications are executed to produce results for the natural language query tailored to an interest of the user in accordance with the contextual information.
    Type: Application
    Filed: August 27, 2018
    Publication date: February 27, 2020
    Inventors: Brian S. Dreher, Henry H. Chen, Sheng Hua Bao, William S. Spangler