Patents by Inventor Petar Ristoski

Petar Ristoski has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210303762
    Abstract: A set of material candidates expected to yield materials with target properties can be generated. A subject matter expert's decision indicating accepted and rejected material candidates from the set of material candidates can be received. Based on the subject matter expert's input, a machine learning model can be trained to replicate the subject matter expert's decision.
    Type: Application
    Filed: March 31, 2020
    Publication date: September 30, 2021
    Inventors: Petar Ristoski, Dmitry Zubarev, Linda Ha Kato, Anna Lisa Gentile, Nathaniel H. Park, Daniel Gruhl, Steven R. Welch, Daniel Paul Sanders, James L. Hedrick, Chandrasekhar Narayan, Chad Eric DeLuca, Alfredo Alba
  • Publication number: 20210248105
    Abstract: A computer-implemented method according to one embodiment includes receiving snapshot data for a node within a data center; determining one or more candidate labels for one or more software applications running on the node, utilizing the snapshot data; implementing a validation of the one or more candidate labels to determine one or more validated labels; and training a machine learning model, utilizing the one or more validated labels and the snapshot data.
    Type: Application
    Filed: February 10, 2020
    Publication date: August 12, 2021
    Inventors: Anna Lisa Gentile, Chad Eric DeLuca, Petar Ristoski, Linda Ha Kato, Alfredo Alba, Daniel Gruhl, Steven R. Welch
  • Publication number: 20210232955
    Abstract: Technology for using a computing device to interpret entity and relationship occurrences a natural language understanding system that includes the following operations (not necessarily in the following order): (i) receiving a corpus that includes unstructured data and/or structured data; (ii) parsing the corpus to obtain parsed corpus information; (iii) applying artificial intelligence to the parsed corpus information to determine a plurality of logical relationships manifested by the corpus; and (iv) expressing, by machine logic, the plurality of logical relationships as a respectively corresponding plurality of logical rule expressions, with each logical rule expression of the plurality of logical rule expressions expressing the respectively corresponding logical relationship as fact(s) with regard to the corpus.
    Type: Application
    Filed: January 29, 2020
    Publication date: July 29, 2021
    Inventors: Alfredo Alba, Daniel Gruhl, Chad Eric DeLuca, Petar Ristoski, Christian B. Kau, Anna Lisa Gentile, Linda Ha Kato, Steven R. Welch
  • Patent number: 11030402
    Abstract: Embodiments relate to a system, program product, and method for iterative expansion and application of a domain-specific dictionary. One or more dictionary instances are applied against a text corpus. The dictionary is iteratively expanded and selectively populated with one or more additional dictionary instances, including semantically similar instances to the applied dictionary instances and extension instances contextually related to the applied dictionary instances. The iteratively expanded dictionary is applied to an unexplored corpus to identify matching corpus data to populated instances of the dictionary.
    Type: Grant
    Filed: May 3, 2019
    Date of Patent: June 8, 2021
    Assignee: International Business Machines Corporation
    Inventors: Petar Ristoski, Daniel Gruhl, Alfredo Alba, Anna Lisa Gentile, Ismini Lourentzou, Chad Eric DeLuca, Linda Ha Kato, Steven R. Welch, Chris Kau
  • Publication number: 20210081803
    Abstract: Embodiments relate to a system, program product, and method for knowledge resource management. A first document is subjected to a first semantic annotation and one or more entities, relations, and textual annotations of interest are identified. A neural model is built with the first document and trained with the first document and one or more of the first semantic annotations. An un-annotated document is applied to the neural model, and one or more second semantic annotations are produced. The un-annotated document is enriched with the produced second semantic annotation(s) and is subjected to adjudication. The neural model is selectively amended responsive to the adjudication.
    Type: Application
    Filed: September 17, 2019
    Publication date: March 18, 2021
    Applicant: International Business Machines Corporation
    Inventors: Petar Ristoski, Anna Lisa Gentile, Daniel Gruhl, Linda Ha Kato, Chad Eric DeLuca, Alfredo Alba, Chris Kau, Steven R. Welch
  • Publication number: 20210034704
    Abstract: Embodiments relate to a system, program product, and method for dictionary membership management directed at identifying ambiguity in semantic resources. A dictionary of seed terms is applied to a text corpus and matching items in the corpus are identified. The linguistic properties for each matching item are characterized and a context pattern of each matching item is constructed. Each context pattern is applied to the dictionary and matching content between the seed terms and the context pattern is identified and quantified. Lexicon items from the dictionary that have anomalous behavior reflected in the quantification are identified. One or more seed words identified as having anomalous behavior are selectively removed from the dictionary.
    Type: Application
    Filed: July 29, 2019
    Publication date: February 4, 2021
    Applicant: International Business Machines Corporation
    Inventors: Anna Lisa Gentile, Anni R. Coden, Ismini Lourentzou, Daniel Gruhl, Chad Eric DeLuca, Petar Ristoski, Linda Ha Kato, Chris Kau, Steven R. Welch, Alfredo Alba
  • Publication number: 20200380311
    Abstract: Embodiments relate to a system, program product, and method for information extraction and annotation of a data set. Neural models are utilized to automatically attach machine annotations to data elements within an unlabeled data set. The attached machine annotations are evaluated and a score is attached to the annotations. The score reflects a confidence of correctness of the annotations. A labeled data set is iteratively expanded with selectively evaluated annotations based on the attached score. The labeled data set is applied to an unexplored corpus to identify matching corpus data to populated instances of the labeled data set.
    Type: Application
    Filed: May 29, 2019
    Publication date: December 3, 2020
    Applicant: International Business Machines Corporation
    Inventors: Ismini Lourentzou, Anna Lisa Gentile, Daniel Gruhl, Alfredo Alba, Petar Ristoski, Chad Eric DeLuca, Linda Ha Kato, Chris Kau, Steven R. Welch
  • Publication number: 20200349226
    Abstract: Embodiments relate to a system, program product, and method for iterative expansion and application of a domain-specific dictionary. One or more dictionary instances are applied against a text corpus. The dictionary is iteratively expanded and selectively populated with one or more additional dictionary instances, including semantically similar instances to the applied dictionary instances and extension instances contextually related to the applied dictionary instances. The iteratively expanded dictionary is applied to an unexplored corpus to identify matching corpus data to populated instances of the dictionary.
    Type: Application
    Filed: May 3, 2019
    Publication date: November 5, 2020
    Applicant: International Business Machines Corporation
    Inventors: Petar Ristoski, Daniel Gruhl, Alfredo Alba, Anna Lisa Gentile, Ismini Lourentzou, Chad Eric DeLuca, Linda Ha Kato, Steven R. Welch, Chris Kau
  • Publication number: 20200097602
    Abstract: One embodiment provides a method that includes determining candidate ontologies for alignment from multiple available knowledge bases. An initial target ontology is selected from the candidate ontologies and correcting the initial selected ontology with received refinement input. Concepts in the selected initial ontology are aligned with concepts of the target ontology using a deep learning hierarchical classification with received review input. A user is assisted to build, change and grow the selected initial ontology exploiting both the target ontology and new facts extracted from unstructured data.
    Type: Application
    Filed: September 25, 2018
    Publication date: March 26, 2020
    Inventors: Petar Ristoski, Anna Lisa Gentile, Daniel Gruhl, Alfredo Alba, Chris Kau, Chad DeLuca, Linda Kato, Ismini Lourentzou, Steven R. Welch
  • Publication number: 20200097597
    Abstract: One embodiment provides a method for on-demand relation extraction from unstructured text that includes obtaining a text corpus of domain related unstructured text. Representations of the unstructured text that capture entity-specific syntactic knowledge are created. Initial user seeds of informative examples containing relations are received. Extraction models in a neural network are trained using the initial user seeds. Performance information and a confidence score are provided for each prediction for each extraction model. A next batch of informative examples are identified for annotation from the text corpus based on training a neural network classifier on a pool of labeled informative examples. Stopping criteria is determined based on differences of the performance information and the confidence score in relation to parameters for each extraction model. Based on the stopping criteria, it is determined whether to retrain a particular extraction model after the informative examples have been labeled.
    Type: Application
    Filed: September 24, 2018
    Publication date: March 26, 2020
    Inventors: Ismini Lourentzou, Anna Lisa Gentile, Daniel Gruhl, Alfredo Alba, Chris Kau, Chad DeLuca, Linda Kato, Petar Ristoski, Steven R. Welch
  • Publication number: 20200019608
    Abstract: One embodiment provides a method for relevant language-independent terminology extraction from content, the method including extracting lexicon items from the content based on context extraction patterns using statistical processing. Feedback on the extracted lexicon items is received to automatically tune scores and thresholds for the context extraction patterns. Available Linked Data is leveraged for a bootstrap source. The relevant language-independent terminology extraction is bootstrapped using the bootstrap source.
    Type: Application
    Filed: July 11, 2018
    Publication date: January 16, 2020
    Inventors: Anna Lisa Gentile, Daniel Gruhl, Petar Ristoski, Steven R. Welch, Alfredo Alba, Chris Kau, Chad DeLuca, Linda Kato