Patents by Inventor William S Spangler

William S Spangler has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Processing natural language queries based on machine learning

Patent number: 10552541

Abstract: According to an embodiment of the present invention, a natural language query including an ambiguous entity is received from a user. A meaning of the ambiguous entity is determined based on an extracted language context of the natural language query. The determined meaning, extracted language context, and contextual information of the user is applied to a machine learning model to determine a plurality of computer applications from amongst multiple computer applications to process the natural language query. The determined applications are executed to produce results for the natural language query tailored to an interest of the user in accordance with the contextual information.

Type: Grant

Filed: August 27, 2018

Date of Patent: February 4, 2020

Assignee: International Business Machines Corporation

Inventors: Brian S. Dreher, Henry H. Chen, Sheng Hua Bao, William S. Spangler
ASSOCIATING GENE EXPRESSION DATA WITH A DISEASE NAME

Publication number: 20190385706

Abstract: The present invention relates to a method and system for associating gene expression data with a disease name. A first data set associated with a plurality of genetic probes for a plurality of biological samples may be received. The first data set may be sorted based on a normalized gene expression values for the plurality of genetic probes. A largest value gap of the normalized gene expression values may be identified. A set of expressed genes within the first data set may be identified. An indexable document may be generated for a biological sample of the plurality of biological samples comprising data associated with the set of expressed genes. A second data set associated with an expressed gene of the set of expressed genes may be searched. A disease name may be associated with an expressed gene based on a threshold correlation between the disease name and the expressed gene.

Type: Application

Filed: August 27, 2019

Publication date: December 19, 2019

Inventors: Linda H. Kato, Jeffrey T. Kreulen, Jacques Labrie, William S. Spangler, Ignacio G. Terrizzano
COMBINING SEMANTIC RELATIONSHIP INFORMATION WITH ENTITIES AND NON-ENTITIES FOR PREDICTIVE ANALYTICS IN A COGNITIVE SYSTEM

Publication number: 20190340294

Abstract: According to embodiments of the present invention, methods, systems and computer readable media are provided, in a cognitive data processing system, for implementing a predictive analytics system that utilizes entity and non-entity information. A collection of content is processed to extract defined entities pertaining to one or more domains. Semantic relationships are determined between objects within the collection of content, wherein the objects include undefined entities. The defined entities and objects are resolved based on entity definitions and the semantic relationships to determine defined entities and undefined entities for a resulting data set. The resulting data set is processed to identify one or more relationships between a defined entity and an undefined entity.

Type: Application

Filed: May 4, 2018

Publication date: November 7, 2019

Inventors: William S. Spangler, Richard L. Martin, David Martinez Iraola, Daniel Pierce, Sheng Hua Bao, Meenakshi Nagarajan, Michael D. Pfeifer
ACCESSING FACETED INFORMATION USING ONTOLOGIES

Publication number: 20190332614

Abstract: Methods, systems and computer readable media are provided for accessing faceted information using ontologies. Information for an initial entity, including different ontologies to which the initial entity belongs, is retrieved. Entities within different ontologies are determined. The determined entities and different ontologies are displayed on a user interface to enable traversal of the different ontologies for viewing of the determined entities. Accordingly, a user may search for an initial entity. Different ontologies, linked to the initial entity may be returned. A user may select an ontology, and ontologies in which the initial entity belongs are displayed. The user may select any of the displayed ontologies to access other entities in the selected ontology.

Type: Application

Filed: June 24, 2019

Publication date: October 31, 2019

Inventors: Hrishikesh Sathe, Sheng Hua Bao, William S. Spangler, Xiaoyang Gao
ACCESSING FACETED INFORMATION USING ONTOLOGIES

Publication number: 20190332714

Abstract: Methods, systems and computer readable media are provided for accessing faceted information using ontologies. Information for an initial entity, including different ontologies to which the initial entity belongs, is retrieved. Entities within different ontologies are determined. The determined entities and different ontologies are displayed on a user interface to enable traversal of the different ontologies for viewing of the determined entities. Accordingly, a user may search for an initial entity. Different ontologies, linked to the initial entity may be returned. A user may select an ontology, and ontologies in which the initial entity belongs are displayed. The user may select any of the displayed ontologies to access other entities in the selected ontology.

Type: Application

Filed: April 27, 2018

Publication date: October 31, 2019

Inventors: Hrishikesh Sathe, Sheng Hua Bao, William S. Spangler, Xiaoyang Gao
SEARCHING FOR AND DETERMINING RELATIONSHIPS AMONG ENTITIES

Publication number: 20190325063

Abstract: Methods, systems, and computer program products are provided for processing a request regarding relationships among instances of entities. A graphical representation of instances of entities is generated and includes one or more source nodes, each representing an instance of an input entity of a request, and one or more related nodes, each representing an instance of a second entity related to one or more corresponding instances of the input entity and associated with a corresponding confidence score for the relationship. Each of the one or more related nodes associated with a confidence score satisfying a threshold is identified. One or more supplemental nodes are added to the graphical representation, each of which represents a corresponding instance of a third entity with a relationship to a corresponding instance of the second entity. The graphical representation is traversed to identify relationships between instances of entities and produce results for the request.

Type: Application

Filed: April 24, 2018

Publication date: October 24, 2019

Inventors: Yanyan Han, Xiaoyang Gao, William S. Spangler, Sheng Hua Bao, Brian S. Dreher
SEARCHING FOR AND DETERMINING RELATIONSHIPS AMONG ENTITIES

Publication number: 20190324976

Abstract: Methods, systems, and computer program products are provided for processing a request regarding relationships among instances of entities. A graphical representation of instances of entities is generated and includes one or more source nodes, each representing an instance of an input entity of a request, and one or more related nodes, each representing an instance of a second entity related to one or more corresponding instances of the input entity and associated with a corresponding confidence score for the relationship. Each of the one or more related nodes associated with a confidence score satisfying a threshold is identified. One or more supplemental nodes are added to the graphical representation, each of which represents a corresponding instance of a third entity with a relationship to a corresponding instance of the second entity. The graphical representation is traversed to identify relationships between instances of entities and produce results for the request.

Type: Application

Filed: June 24, 2019

Publication date: October 24, 2019

Inventors: Yanyan Han, Xiaoyang Gao, William S. Spangler, Sheng Hua Bao, Brian S. Dreher
COMBINING CHEMICAL STRUCTURE DATA WITH UNSTRUCTURED DATA FOR PREDICTIVE ANALYTICS IN A COGNITIVE SYSTEM

Publication number: 20190303780

Abstract: According to embodiments of the present invention, an entity may be represented by an unstructured feature vector comprising a plurality of features extracted from unstructured data using semantic analysis and a structural feature vector comprising a plurality of features from chemical structure data. A similarity matrix may be used to compare entities and generate a similarity score, based on both the unstructured feature vector and the structural feature vector for each entity. In some aspects, a user may enter a query (from which a chemical structural feature vector is dynamically generated) to compare against entities having unstructured and/or structural feature vectors, stored in a database.

Type: Application

Filed: June 18, 2019

Publication date: October 3, 2019

Inventors: William S. Spangler, Richard L. Martin, Feng Wang, Xiaoyang Gao, Sheng Hua Bao
ADAPTIVE WEIGHTING OF SIMILARITY METRICS FOR PREDICTIVE ANALYTICS OF A COGNITIVE SYSTEM

Publication number: 20190304573

Abstract: According to embodiments of the present invention, similarity metrics or measures of similarity may be combined using an adaptive weighting scheme. A subset of entities from a first set of entities that have a known relationship is randomly selected. The subset is combined with a second set of entities that have an unknown relationship to each other and/or to the first set of entities. At least two different measures of similarity (similarity metrics) between the first set and the combined second set (including the subset) is determined for each entity in the second set. For each entity in the second set, the at least two different measures of similarity are compared, and a weight is assigned adaptively to each measure of similarity based on the magnitude of the measure of similarity. The weighted measures of similarity are combined to determine an aggregate adaptively weighted similarity score for each entity.

Type: Application

Filed: June 18, 2019

Publication date: October 3, 2019

Inventors: Yanyan Han, Sheng Hua Bao, Xiaoyang Gao, Brian S. Dreher, William S. Spangler, Feng Wang
COMBINING CHEMICAL STRUCTURE DATA WITH UNSTRUCTURED DATA FOR PREDICTIVE ANALYTICS IN A COGNITIVE SYSTEM

Publication number: 20190251455

Abstract: According to embodiments of the present invention, an entity may be represented by an unstructured feature vector comprising a plurality of features extracted from unstructured data using semantic analysis and a structural feature vector comprising a plurality of features from chemical structure data. A similarity matrix may be used to compare entities and generate a similarity score, based on both the unstructured feature vector and the structural feature vector for each entity. In some aspects, a user may enter a query (from which a chemical structural feature vector is dynamically generated) to compare against entities having unstructured and/or structural feature vectors, stored in a database.

Type: Application

Filed: February 13, 2018

Publication date: August 15, 2019

Inventors: William S. Spangler, Richard L. Martin, Feng Wang, Xiaoyang Gao, Sheng Hua Bao
ADAPTIVE WEIGHTING OF SIMILARITY METRICS FOR PREDICTIVE ANALYTICS OF A COGNITIVE SYSTEM

Publication number: 20190213303

Abstract: According to embodiments of the present invention, similarity metrics or measures of similarity may be combined using an adaptive weighting scheme. A subset of entities from a first set of entities that have a known relationship is randomly selected. The subset is combined with a second set of entities that have an unknown relationship to each other and/or to the first set of entities. At least two different measures of similarity (similarity metrics) between the first set and the combined second set (including the subset) is determined for each entity in the second set. For each entity in the second set, the at least two different measures of similarity are compared, and a weight is assigned adaptively to each measure of similarity based on the magnitude of the measure of similarity. The weighted measures of similarity are combined to determine an aggregate adaptively weighted similarity score for each entity.

Type: Application

Filed: January 5, 2018

Publication date: July 11, 2019

Inventors: Yanyan Han, Sheng Hua Bao, Xiaoyang Gao, Brian S. Dreher, William S. Spangler, Feng Wang
RATIONALIZING NETWORK PREDICTIONS USING SIMILARITY TO KNOWN CONNECTIONS

Publication number: 20190156116

Abstract: Rationalization of network predictions using similarity to known connections is provided. In various embodiments, a graph is read. The graph comprises a plurality of nodes. Each of the plurality of nodes corresponds to an entity or property. The plurality of nodes is interconnected by a plurality of edges. Each edge corresponds to a relationship between connected nodes. A new edge in the graph is predicted. The new edge corresponds to a relationship between a first node and a second node. The first node corresponds to an entity and the second node corresponds to an entity or property. One or more additional nodes connected to the second node is located. The one or more additional nodes is scored according to its connections in common with the first node. One or more sources is provided to a user describing the connection between the one or more additional node and the second node.

Type: Application

Filed: November 22, 2017

Publication date: May 23, 2019

Inventors: Alix Lacoste, William S. Spangler, Feng Wang
MACHINE LEARNING FOR RANKING CANDIDATE SUBJECTS BASED ON A TRAINING SET

Publication number: 20180365589

Abstract: According to an embodiment of the present invention, a system designates each document in a collection of documents as a member of a first group containing known subjects for a concept of interest or as a member of a second group containing candidate subjects for the concept of interest and determines a subset of documents for at least one subject. The system generates a classifier based on the documents in the first and second groups and applies the classifier to a set of documents for the at least one subject to determine whether each document belong to the first and/or second group. The system generates a score for the at least one subject based on a quantity of documents for that subject assigned to the first group of documents relative to a total quantity of documents for that subject and ranks that subject based on the determined score for each subject.

Type: Application

Filed: June 16, 2017

Publication date: December 20, 2018

Inventors: Alix M. Lacoste, William S. Spangler
ASSOCIATING GENE EXPRESSION DATA WITH A DISEASE NAME

Publication number: 20170329900

Abstract: The present invention relates to a method and system for associating gene expression data with a disease name. A first data set associated with a plurality of genetic probes for a plurality of biological samples may be received. The first data set may be sorted based on a normalized gene expression values for the plurality of genetic probes. A largest value gap of the normalized gene expression values may be identified. A set of expressed genes within the first data set may be identified. An indexable document may be generated for a biological sample of the plurality of biological samples comprising data associated with the set of expressed genes. A second data set associated with an expressed gene of the set of expressed genes may be searched. A disease name may be associated with an expressed gene based on a threshold correlation between the disease name and the expressed gene.

Type: Application

Filed: May 11, 2016

Publication date: November 16, 2017

Inventors: Linda H. Kato, Jeffrey T. Kreulen, Jacques Labrie, William S. Spangler, Ignacio G. Terrizzano
Approximate named-entity extraction

Patent number: 9311290

Abstract: According to one embodiment, a method is provided for approximate named-entity extraction from a dictionary that includes entries, where each of the entries includes one or more words. Words are read from the entries of the dictionary, and network resources are searched to determine a frequency of occurrence of the words on the network resources. In view of the frequency of occurrence of the words located on the network resources, domain relevancy of the words in the entries of the dictionary is determined. A domain repository is created using top-ranked words as determined by the domain relevancy of the words. In view of the domain repository, signatures for both the entries of the dictionary and strings of an input document are computed. The strings of the input document are filtered by comparing the signatures of the strings against the signatures of the entries to identify approximate-match entity names.

Type: Grant

Filed: August 20, 2013

Date of Patent: April 12, 2016

Assignee: International Business Machines Corporation

Inventors: Ying Chen, William S. Spangler, Su Yan
Approximate named-entity extraction

Patent number: 9251133

Abstract: According to one embodiment, approximate named-entity extraction from a dictionary that includes entries is provided, where each of the entries includes one or more words. Words are read from the entries of the dictionary, and network resources are searched to determine a frequency of occurrence of the words on the network resources. In view of the frequency of occurrence of the words located on the network resources, domain relevancy of the words in the entries of the dictionary is determined. A domain repository is created using top-ranked words as determined by the domain relevancy of the words. In view of the domain repository, signatures for both the entries of the dictionary and strings of an input document are computed. The strings of the input document are filtered by comparing the signatures of the strings against the signatures of the entries to identify approximate-match entity names.

Type: Grant

Filed: December 12, 2012

Date of Patent: February 2, 2016

Assignee: International Business Machines Corporation

Inventors: Ying Chen, William S. Spangler, Su Yan
Composing analytic solutions

Patent number: 8973012

Abstract: An approach for composing an analytic solution is provided. After associating descriptive schemas with web services and web-based applets, a set of input data sources is enumerated for selection. A desired output type is received. Based on the descriptive schemas that specify required inputs and outputs of the web services and web-based applets, combinations of web services and web-based applets are generated. The generated combinations achieve a result of the desired output type from one of the enumerated input data sources. Each combination is derived from available web services and web-based applets. The combinations include one or more workflows that provide an analytic solution. A workflow whose result satisfies the business objective may be saved. Steps in a workflow may be iteratively refined to generate a workflow whose result satisfies the business objective.

Type: Grant

Filed: October 25, 2011

Date of Patent: March 3, 2015

Assignee: International Business Machines Corporation

Inventors: Ying Chen, Thilina Gunarathne, Eugene M. Maximilien, William S. Spangler
Composing analytic solutions

Patent number: 8973013

Abstract: An approach for composing an analytic solution is provided. After associating descriptive schemas with web services and web-based applets, a set of input data sources is enumerated for selection. A desired output type is received. Based on the descriptive schemas that specify required inputs and outputs of the web services and web-based applets, combinations of web services and web-based applets are generated. The generated combinations achieve a result of the desired output type from one of the enumerated input data sources. Each combination is derived from available web services and web-based applets. The combinations include one or more workflows that provide an analytic solution. A workflow whose result satisfies the business objective may be saved. Steps in a workflow may be iteratively refined to generate a workflow whose result satisfies the business objective.

Type: Grant

Filed: August 28, 2012

Date of Patent: March 3, 2015

Assignee: International Business Machines Corporation

Inventors: Ying Chen, Thilina Gunarathne, Eugene M. Maximilien, William S. Spangler
Systems, methods and computer program products for discovering a text query from example documents

Patent number: 8862605

Abstract: Discovering a keyword query corresponding to an input collection of documents taken from a candidate pool includes selecting a document from a working set as the input set, and extracting a list of snippets in the selected document. For each snippet, executing a set of proximity queries based on selected terms in that snippet, and finding all possible proximity queries that return less than N query results from the candidate pool. A query is selected from said proximity queries, based on the selected query returning the greatest number of working set documents, and returning the smallest number of documents not in the working set. Documents returned by the selected query are removed from the working set, and the above steps are repeated until no documents remain in the working set. The disjunction of selected queries is returned as the discovered query.

Type: Grant

Filed: November 18, 2011

Date of Patent: October 14, 2014

Assignee: International Business Machines Corporation

Inventor: William S. Spangler
Systems, methods and computer program products for fast and scalable proximal search for search queries

Patent number: 8805848

Abstract: Embodiments of the invention provide a system, method and computer program products for information retrieval from multiple documents by proximity searching for search queries. A method includes generating an index for the multiple documents, wherein the index includes words in snippets in the documents. An input search query is processed against the index by searching query terms over the snippets to introduce term proximity information implicitly in the information retrieval. Results of multiple sentence level search operations are combined as output.

Type: Grant

Filed: May 24, 2012

Date of Patent: August 12, 2014

Assignee: International Business Machines Corporation

Inventors: Sumit Bhatia, Bin He, Qi He, William S. Spangler

prev 1 2 3 next