Patents by Inventor Petar Ristoski

Petar Ristoski has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

EXPERT-IN-THE-LOOP AI FOR MATERIALS DISCOVERY

Publication number: 20210303762

Abstract: A set of material candidates expected to yield materials with target properties can be generated. A subject matter expert's decision indicating accepted and rejected material candidates from the set of material candidates can be received. Based on the subject matter expert's input, a machine learning model can be trained to replicate the subject matter expert's decision.

Type: Application

Filed: March 31, 2020

Publication date: September 30, 2021

Inventors: Petar Ristoski, Dmitry Zubarev, Linda Ha Kato, Anna Lisa Gentile, Nathaniel H. Park, Daniel Gruhl, Steven R. Welch, Daniel Paul Sanders, James L. Hedrick, Chandrasekhar Narayan, Chad Eric DeLuca, Alfredo Alba
LABELING SOFTWARE APPLICATIONS RUNNING ON NODES OF A DATA CENTER

Publication number: 20210248105

Abstract: A computer-implemented method according to one embodiment includes receiving snapshot data for a node within a data center; determining one or more candidate labels for one or more software applications running on the node, utilizing the snapshot data; implementing a validation of the one or more candidate labels to determine one or more validated labels; and training a machine learning model, utilizing the one or more validated labels and the snapshot data.

Type: Application

Filed: February 10, 2020

Publication date: August 12, 2021

Inventors: Anna Lisa Gentile, Chad Eric DeLuca, Petar Ristoski, Linda Ha Kato, Alfredo Alba, Daniel Gruhl, Steven R. Welch
HANDLING INFERENCES IN AN ARTIFICIAL INTELLIGENCE SYSTEM

Publication number: 20210232955

Abstract: Technology for using a computing device to interpret entity and relationship occurrences a natural language understanding system that includes the following operations (not necessarily in the following order): (i) receiving a corpus that includes unstructured data and/or structured data; (ii) parsing the corpus to obtain parsed corpus information; (iii) applying artificial intelligence to the parsed corpus information to determine a plurality of logical relationships manifested by the corpus; and (iv) expressing, by machine logic, the plurality of logical relationships as a respectively corresponding plurality of logical rule expressions, with each logical rule expression of the plurality of logical rule expressions expressing the respectively corresponding logical relationship as fact(s) with regard to the corpus.

Type: Application

Filed: January 29, 2020

Publication date: July 29, 2021

Inventors: Alfredo Alba, Daniel Gruhl, Chad Eric DeLuca, Petar Ristoski, Christian B. Kau, Anna Lisa Gentile, Linda Ha Kato, Steven R. Welch
Dictionary expansion using neural language models

Patent number: 11030402

Abstract: Embodiments relate to a system, program product, and method for iterative expansion and application of a domain-specific dictionary. One or more dictionary instances are applied against a text corpus. The dictionary is iteratively expanded and selectively populated with one or more additional dictionary instances, including semantically similar instances to the applied dictionary instances and extension instances contextually related to the applied dictionary instances. The iteratively expanded dictionary is applied to an unexplored corpus to identify matching corpus data to populated instances of the dictionary.

Type: Grant

Filed: May 3, 2019

Date of Patent: June 8, 2021

Assignee: International Business Machines Corporation

Inventors: Petar Ristoski, Daniel Gruhl, Alfredo Alba, Anna Lisa Gentile, Ismini Lourentzou, Chad Eric DeLuca, Linda Ha Kato, Steven R. Welch, Chris Kau
On-Demand Knowledge Resource Management

Publication number: 20210081803

Abstract: Embodiments relate to a system, program product, and method for knowledge resource management. A first document is subjected to a first semantic annotation and one or more entities, relations, and textual annotations of interest are identified. A neural model is built with the first document and trained with the first document and one or more of the first semantic annotations. An un-annotated document is applied to the neural model, and one or more second semantic annotations are produced. The un-annotated document is enriched with the produced second semantic annotation(s) and is subjected to adjudication. The neural model is selectively amended responsive to the adjudication.

Type: Application

Filed: September 17, 2019

Publication date: March 18, 2021

Applicant: International Business Machines Corporation

Inventors: Petar Ristoski, Anna Lisa Gentile, Daniel Gruhl, Linda Ha Kato, Chad Eric DeLuca, Alfredo Alba, Chris Kau, Steven R. Welch
Identifying Ambiguity in Semantic Resources

Publication number: 20210034704

Abstract: Embodiments relate to a system, program product, and method for dictionary membership management directed at identifying ambiguity in semantic resources. A dictionary of seed terms is applied to a text corpus and matching items in the corpus are identified. The linguistic properties for each matching item are characterized and a context pattern of each matching item is constructed. Each context pattern is applied to the dictionary and matching content between the seed terms and the context pattern is identified and quantified. Lexicon items from the dictionary that have anomalous behavior reflected in the quantification are identified. One or more seed words identified as having anomalous behavior are selectively removed from the dictionary.

Type: Application

Filed: July 29, 2019

Publication date: February 4, 2021

Applicant: International Business Machines Corporation

Inventors: Anna Lisa Gentile, Anni R. Coden, Ismini Lourentzou, Daniel Gruhl, Chad Eric DeLuca, Petar Ristoski, Linda Ha Kato, Chris Kau, Steven R. Welch, Alfredo Alba
Collaborative Information Extraction

Publication number: 20200380311

Abstract: Embodiments relate to a system, program product, and method for information extraction and annotation of a data set. Neural models are utilized to automatically attach machine annotations to data elements within an unlabeled data set. The attached machine annotations are evaluated and a score is attached to the annotations. The score reflects a confidence of correctness of the annotations. A labeled data set is iteratively expanded with selectively evaluated annotations based on the attached score. The labeled data set is applied to an unexplored corpus to identify matching corpus data to populated instances of the labeled data set.

Type: Application

Filed: May 29, 2019

Publication date: December 3, 2020

Applicant: International Business Machines Corporation

Inventors: Ismini Lourentzou, Anna Lisa Gentile, Daniel Gruhl, Alfredo Alba, Petar Ristoski, Chad Eric DeLuca, Linda Ha Kato, Chris Kau, Steven R. Welch
Dictionary Expansion Using Neural Language Models

Publication number: 20200349226

Abstract: Embodiments relate to a system, program product, and method for iterative expansion and application of a domain-specific dictionary. One or more dictionary instances are applied against a text corpus. The dictionary is iteratively expanded and selectively populated with one or more additional dictionary instances, including semantically similar instances to the applied dictionary instances and extension instances contextually related to the applied dictionary instances. The iteratively expanded dictionary is applied to an unexplored corpus to identify matching corpus data to populated instances of the dictionary.

Type: Application

Filed: May 3, 2019

Publication date: November 5, 2020

Applicant: International Business Machines Corporation

Inventors: Petar Ristoski, Daniel Gruhl, Alfredo Alba, Anna Lisa Gentile, Ismini Lourentzou, Chad Eric DeLuca, Linda Ha Kato, Steven R. Welch, Chris Kau
USER-CENTRIC ONTOLOGY POPULATION WITH USER REFINEMENT

Publication number: 20200097602

Abstract: One embodiment provides a method that includes determining candidate ontologies for alignment from multiple available knowledge bases. An initial target ontology is selected from the candidate ontologies and correcting the initial selected ontology with received refinement input. Concepts in the selected initial ontology are aligned with concepts of the target ontology using a deep learning hierarchical classification with received review input. A user is assisted to build, change and grow the selected initial ontology exploiting both the target ontology and new facts extracted from unstructured data.

Type: Application

Filed: September 25, 2018

Publication date: March 26, 2020

Inventors: Petar Ristoski, Anna Lisa Gentile, Daniel Gruhl, Alfredo Alba, Chris Kau, Chad DeLuca, Linda Kato, Ismini Lourentzou, Steven R. Welch
ON-DEMAND RELATION EXTRACTION FROM TEXT

Publication number: 20200097597

Abstract: One embodiment provides a method for on-demand relation extraction from unstructured text that includes obtaining a text corpus of domain related unstructured text. Representations of the unstructured text that capture entity-specific syntactic knowledge are created. Initial user seeds of informative examples containing relations are received. Extraction models in a neural network are trained using the initial user seeds. Performance information and a confidence score are provided for each prediction for each extraction model. A next batch of informative examples are identified for annotation from the text corpus based on training a neural network classifier on a pool of labeled informative examples. Stopping criteria is determined based on differences of the performance information and the confidence score in relation to parameters for each extraction model. Based on the stopping criteria, it is determined whether to retrain a particular extraction model after the informative examples have been labeled.

Type: Application

Filed: September 24, 2018

Publication date: March 26, 2020

Inventors: Ismini Lourentzou, Anna Lisa Gentile, Daniel Gruhl, Alfredo Alba, Chris Kau, Chad DeLuca, Linda Kato, Petar Ristoski, Steven R. Welch
LINKED DATA SEEDED MULTI-LINGUAL LEXICON EXTRACTION

Publication number: 20200019608

Abstract: One embodiment provides a method for relevant language-independent terminology extraction from content, the method including extracting lexicon items from the content based on context extraction patterns using statistical processing. Feedback on the extracted lexicon items is received to automatically tune scores and thresholds for the context extraction patterns. Available Linked Data is leveraged for a bootstrap source. The relevant language-independent terminology extraction is bootstrapped using the bootstrap source.

Type: Application

Filed: July 11, 2018

Publication date: January 16, 2020

Inventors: Anna Lisa Gentile, Daniel Gruhl, Petar Ristoski, Steven R. Welch, Alfredo Alba, Chris Kau, Chad DeLuca, Linda Kato

prev 1 2