Patents by Inventor Thuany Karoline Stuart

Thuany Karoline Stuart has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method for weighting a graph

Patent number: 11941056

Abstract: The present disclosure relates to a method for weighting a graph comprising nodes representing entities and edges representing relationships between entities in accordance with one or more domains. The method comprises: pre-processing the graph comprising assigning weights to the nodes and/or the edges of the graph in accordance with a specific domain of the domains, wherein the weight indicates a domain specific data quality problem of attribute values representing an edge of the edges and/or an entity involved in that edge. The weighted graph may be provided for enabling a processing of the graph in accordance with the specific domain.

Type: Grant

Filed: April 20, 2021

Date of Patent: March 26, 2024

Assignee: International Business Machines Corporation

Inventors: Martin Oberhofer, Mike W. Grasselt, Claudio Andrea Fanconi, Thuany Karoline Stuart, Yannick Saillet, Basem Elasioty, Hemanth Kumar Babu, Robert Kern
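
Illustrative sketch (not taken from the patent): the abstract above describes assigning domain-specific weights that signal data quality problems. One way this could look is shown below, assuming the weight is the fraction of violated rules for a domain and that an edge takes the worse weight of its two endpoints. The rule sets, attribute names, and function names are all hypothetical.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical rule type: returns True when the attribute values look valid.
Rule = Callable[[dict], bool]

# Hypothetical per-domain rule sets; the patent does not specify these.
DOMAIN_RULES: Dict[str, List[Rule]] = {
    "marketing": [
        lambda attrs: bool(attrs.get("email", "")),
        lambda attrs: "@" in attrs.get("email", ""),
    ],
    "billing": [
        lambda attrs: bool(attrs.get("postal_code", "")),
    ],
}


def quality_problem_weight(attrs: dict, domain: str) -> float:
    """Fraction of the domain's rules violated by attrs (0.0 means no problem found)."""
    rules = DOMAIN_RULES[domain]
    failed = sum(1 for rule in rules if not rule(attrs))
    return failed / len(rules)


def weight_graph(
    nodes: Dict[str, dict], edges: List[Tuple[str, str]], domain: str
) -> Tuple[Dict[str, float], Dict[Tuple[str, str], float]]:
    """Assign a weight to every node and edge for one specific domain."""
    node_weights = {n: quality_problem_weight(a, domain) for n, a in nodes.items()}
    # Assumption: an edge is as problematic as its worse endpoint.
    edge_weights = {(u, v): max(node_weights[u], node_weights[v]) for u, v in edges}
    return node_weights, edge_weights
```

The same graph can be weighted again for another domain (for example "billing") without changing the graph itself, which matches the abstract's idea of pre-processing per domain.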

DUPLICATE DETERMINATION IN A GRAPH USING DIFFERENT VERSIONS OF THE GRAPH

Publication number: 20230229644

Abstract: Duplicate identification in a graph having a second version. The graph includes nodes representing entities and edges representing relationships between the entities. The method includes providing a previous first version of the graph. At least two target nodes of the graph may be identified. The first version and the second version of the graph may be compared for determining a comparison score indicative of a similarity between the two target nodes. The comparison score may be used for determining whether the two target nodes are duplicates with respect to each other.

Type: Application

Filed: January 18, 2022

Publication date: July 20, 2023

Inventors: Lars Bremer, Thuany Karoline Stuart, Hemanth Kumar Babu, Martin Anton Oberhofer

Measuring data quality of data in a graph database

Patent number: 11651055

Abstract: Methods, computer program products and/or systems are provided that perform the following operations: obtaining a first graph comprising first nodes representing first entities and first edges representing relationships between first entities, the first nodes being associated with first entity attributes descriptive of the first entities represented by the first nodes, the first edges being associated with first edge attributes descriptive of the relationships represented by the first edges; determining a first subgraph for a certain node of the first nodes of the first graph, the first subgraph including the certain node and at least one neighboring node of the certain node; and determining a data quality issue regarding the certain node based, at least in part, on applying one or more applicable rules of a set of data quality rules to first entity attribute values and first edge attribute values of the first subgraph.

Type: Grant

Filed: October 29, 2020

Date of Patent: May 16, 2023

Assignee: International Business Machines Corporation

Inventors: Yannick Saillet, Claudio Andrea Fanconi, Martin Oberhofer, Hemanth Kumar Babu, Basem Elasioty, Mike W. Grasselt, Robert Kern, Thuany Karoline Stuart
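
Illustrative sketch (not taken from the patent): the abstract above describes building a subgraph around a node and applying data quality rules to the entity and edge attribute values in it. A minimal version might look like the following, assuming a one-hop neighborhood and two made-up rules. The edge types `parent_of` and `employed_by`, the attributes `birth_year` and `start_year`, and all function names are hypothetical.

```python
from typing import Dict, List


def neighborhood(node_id: str, edges: List[dict]) -> List[dict]:
    """Edges touching node_id; together with their endpoints they form a 1-hop subgraph."""
    return [e for e in edges if node_id in (e["src"], e["dst"])]


def rule_parent_older_than_child(node_id, nodes, sub_edges) -> List[str]:
    """Entity-attribute rule: a parent must be born before the child."""
    issues = []
    for e in sub_edges:
        if e["type"] == "parent_of" and e["src"] == node_id:
            if nodes[e["src"]]["birth_year"] >= nodes[e["dst"]]["birth_year"]:
                issues.append(f"{e['src']} is not older than child {e['dst']}")
    return issues


def rule_employment_after_birth(node_id, nodes, sub_edges) -> List[str]:
    """Edge-attribute rule: employment cannot start before the person was born."""
    issues = []
    for e in sub_edges:
        if e["type"] == "employed_by" and e["src"] == node_id:
            if e["start_year"] < nodes[node_id]["birth_year"]:
                issues.append(f"{node_id} employed before birth")
    return issues


RULES = [rule_parent_older_than_child, rule_employment_after_birth]


def find_issues(node_id: str, nodes: Dict[str, dict], edges: List[dict]) -> List[str]:
    """Apply every rule to the subgraph around node_id and collect the issues."""
    sub_edges = neighborhood(node_id, edges)
    return [issue for rule in RULES for issue in rule(node_id, nodes, sub_edges)]
```

The point of the subgraph is that neither problem is visible from one node's attributes alone; each rule needs a neighbor or an edge attribute to detect the inconsistency.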

ASSESSING IF RECORDS FROM DIFFERENT DATA SOURCES REPRESENT A SAME ENTITY

Publication number: 20230110007

Abstract: In an approach, a processor receives a first record from a first data source, where the first record comprises attributes, a second record from a second data source, where the second record comprises said attributes, a first individual quality rating for the attributes of the first record, and a second individual quality rating for the attributes of the second record. A processor, in response to inputting the first record and the second record into a probabilistic matching engine, receives a matching score for each of the respective attributes. A processor calculates a weighted matching score for each of the respective attributes by weighting the matching score for each of the respective attributes with the first individual quality rating and the second individual quality rating. A processor assesses whether the first record and the second record represent the same entity based on the weighted matching score.

Type: Application

Filed: September 30, 2021

Publication date: April 13, 2023

Inventors: Martin Anton Oberhofer, Lars Bremer, Basem Elasioty, Hemanth Kumar Babu, Thuany Karoline Stuart
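
Illustrative sketch (not taken from the patent): the abstract above describes weighting each attribute's matching score with the quality ratings of both records. The snippet below assumes the weighting is a simple product and that the final decision compares the mean weighted score against a threshold. The product, the mean, and the threshold value are assumptions, as are the function names.

```python
from typing import Dict


def weighted_scores(
    match_scores: Dict[str, float],
    quality_a: Dict[str, float],
    quality_b: Dict[str, float],
) -> Dict[str, float]:
    """Scale each attribute's matching score by both records' quality ratings."""
    return {
        attr: score * quality_a[attr] * quality_b[attr]
        for attr, score in match_scores.items()
    }


def same_entity(
    match_scores: Dict[str, float],
    quality_a: Dict[str, float],
    quality_b: Dict[str, float],
    threshold: float = 0.5,  # hypothetical cut-off
) -> bool:
    """Decide on the mean of the weighted per-attribute scores (an assumed aggregation)."""
    weighted = weighted_scores(match_scores, quality_a, quality_b)
    overall = sum(weighted.values()) / len(weighted)
    return overall >= threshold
```

With this weighting, a perfect match on a low-quality attribute contributes less than a perfect match on a trusted one, so poorly maintained fields cannot dominate the decision.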

Duplicate determination in a graph

Patent number: 11531656

Abstract: Embodiments of the present invention provide for a method for duplicate determination in a graph. The graph comprises nodes representing entities and edges representing relationships between the entities. Embodiments of the present invention identify two target nodes in the graph. A neighborhood subgraph may be determined for each of the two nodes. The neighborhood subgraph includes the respective node. Embodiments of the present invention can then determine whether the two nodes are duplicates with respect to each other, based on a result of a comparison between the two subgraphs and based on a comparison of the two target nodes of the subgraphs.

Type: Grant

Filed: June 8, 2021

Date of Patent: December 20, 2022

Assignee: International Business Machines Corporation

Inventors: Lars Bremer, Thuany Karoline Stuart, Hemanth Kumar Babu, Martin Oberhofer
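
Illustrative sketch (not taken from the patent): the abstract above combines a comparison of the two neighborhood subgraphs with a comparison of the two target nodes. One simple reading, assuming one-hop neighborhoods, Jaccard overlap for the subgraphs, exact attribute equality for the nodes, and an equal-weight average, is shown below. The similarity measures, the weights, and the threshold are all assumptions.

```python
from typing import Dict, List, Set, Tuple


def neighbors(node: str, edges: List[Tuple[str, str]]) -> Set[str]:
    """Nodes directly connected to node (undirected, 1 hop)."""
    out = set()
    for u, v in edges:
        if u == node:
            out.add(v)
        elif v == node:
            out.add(u)
    return out


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Overlap of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def attribute_similarity(a: dict, b: dict) -> float:
    """Fraction of attribute keys on which both nodes hold the same value."""
    keys = set(a) | set(b)
    if not keys:
        return 1.0
    return sum(1 for k in keys if a.get(k) == b.get(k)) / len(keys)


def are_duplicates(n1, n2, nodes: Dict[str, dict], edges, threshold: float = 0.7):
    """Return (decision, score) from the subgraph and node comparisons."""
    nb1 = neighbors(n1, edges) - {n2}
    nb2 = neighbors(n2, edges) - {n1}
    score = 0.5 * jaccard(nb1, nb2) + 0.5 * attribute_similarity(nodes[n1], nodes[n2])
    return score >= threshold, score
```

Using the neighborhood as well as the node attributes lets two records with slightly different spellings still be flagged when they share the same related entities.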

DUPLICATE DETERMINATION IN A GRAPH

Publication number: 20220391365

Abstract: Embodiments of the present invention provide for a method for duplicate determination in a graph. The graph comprises nodes representing entities and edges representing relationships between the entities. Embodiments of the present invention identify two target nodes in the graph. A neighborhood subgraph may be determined for each of the two nodes. The neighborhood subgraph includes the respective node. Embodiments of the present invention can then determine whether the two nodes are duplicates with respect to each other, based on a result of a comparison between the two subgraphs and based on a comparison of the two target nodes of the subgraphs.

Type: Application

Filed: June 8, 2021

Publication date: December 8, 2022

Inventors: Lars Bremer, Thuany Karoline Stuart, Hemanth Kumar Babu, Martin Oberhofer

Method for duplicate determination in a graph

Patent number: 11500876

Abstract: Embodiments of the present invention determine duplicates in a graph. The graph comprises nodes representing entities and edges representing relationships between the entities. The method comprises identifying at least two nodes in the graph. A neighborhood subgraph may be determined for each of the two nodes. The neighborhood subgraph includes the respective node. The method further comprises determining whether the two nodes are duplicates with respect to each other, based on a result of a comparison between the two subgraphs.

Type: Grant

Filed: December 8, 2020

Date of Patent: November 15, 2022

Assignee: International Business Machines Corporation

Inventors: Thuany Karoline Stuart, Basem Elasioty, Claudio Andrea Fanconi, Mike W. Grasselt, Hemanth Kumar Babu, Yannick Saillet, Robert Kern, Martin Oberhofer, Lars Bremer, Jonathan Roesner, Jason Allen Woods

ACTIVE LEARNING IMPROVING SIMILAR TASK RECOMMENDATIONS

Publication number: 20220215286

Abstract: A method, computer system, and computer program product for training a machine learning model for use by a task management system are provided. The embodiment may include presenting a task to be resolved to a user via a user interface. The embodiment may also include presenting a further task to be resolved to the user via the user interface. The embodiment may further include predicting time to be spent on the further task presented to the user. The embodiment may also include determining actual time the user spent completing the further task. The embodiment may further include training a machine learning model for a subsequent similar task based on the predicted time and the determined actual time.

Type: Application

Filed: January 4, 2021

Publication date: July 7, 2022

Inventors: Alexandre Luz Xavier Da Costa, Lars Bremer, Karin Steckler, Thuany Karoline Stuart

DETERMINING AN ONTOLOGY FOR GRAPHS

Publication number: 20220188344

Abstract: The present disclosure relates to a method, computer program product and system. The method may comprise providing a first graph being an instance of a first ontology. Sample values of a plurality of concept attributes may be collected from the first graph. The sample values may be clustered into one or more clusters based on content and/or format of the sample values. A cluster of the clusters that contains sample values representing different concept attributes may be identified. An additional concept and associated set of relations representing the concept attribute values of the cluster may be determined and the first ontology may be updated using the additional concept and associated set of relations.

Type: Application

Filed: December 14, 2020

Publication date: June 16, 2022

Inventors: Thuany Karoline Stuart, Martin Oberhofer, Lars Bremer, Hemanth Kumar Babu
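
Illustrative sketch (not taken from the patent): the abstract above clusters sample attribute values by content and/or format and looks for clusters that mix values from different concept attributes. The snippet below assumes a very simple format-based clustering in which digits map to `9` and letters to `A`. The signature scheme, attribute names, and function names are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Set


def format_signature(value: str) -> str:
    """Reduce a value to its shape: digits become '9', letters 'A', the rest is kept."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append("9")
        elif ch.isalpha():
            out.append("A")
        else:
            out.append(ch)
    return "".join(out)


def shared_format_clusters(samples: Dict[str, List[str]]) -> Dict[str, Set[str]]:
    """Group attributes by value format; keep clusters fed by more than one attribute."""
    clusters: Dict[str, Set[str]] = defaultdict(set)
    for attribute, values in samples.items():
        for value in values:
            clusters[format_signature(value)].add(attribute)
    return {sig: attrs for sig, attrs in clusters.items() if len(attrs) > 1}
```

A cluster shared by, say, a person's phone attribute and a company's phone attribute would be a candidate for a new common concept (for example a phone number concept) in the updated ontology.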

MEASURING DATA QUALITY OF DATA IN A GRAPH DATABASE

Publication number: 20220138512

Abstract: Methods, computer program products and/or systems are provided that perform the following operations: obtaining a first graph comprising first nodes representing first entities and first edges representing relationships between first entities, the first nodes being associated with first entity attributes descriptive of the first entities represented by the first nodes, the first edges being associated with first edge attributes descriptive of the relationships represented by the first edges; determining a first subgraph for a certain node of the first nodes of the first graph, the first subgraph including the certain node and at least one neighboring node of the certain node; and determining a data quality issue regarding the certain node based, at least in part, on applying one or more applicable rules of a set of data quality rules to first entity attribute values and first edge attribute values of the first subgraph.

Type: Application

Filed: October 29, 2020

Publication date: May 5, 2022

Inventors: Yannick Saillet, Claudio Andrea Fanconi, Martin Oberhofer, Hemanth Kumar Babu, Basem Elasioty, Mike W. Grasselt, Robert Kern, Thuany Karoline Stuart

MACHINE LEARNING MODEL TRAINING FROM MANUAL DECISIONS

Publication number: 20220092469

Abstract: In an approach to improving machine learning model training for data matching from manual decisions, one or more computer processors detect a correction made to two data records. One or more computer processors determine a common attribute between the two data records. One or more computer processors identify a first machine learning model associated with the common attribute. One or more computer processors add comparison data of the two data records to training data for the machine learning model, where the comparison data includes the correction.

Type: Application

Filed: September 23, 2020

Publication date: March 24, 2022

Inventors: Lars Bremer, Hemanth Kumar Babu, Thuany Karoline Stuart, Martin Oberhofer

SAMPLE PAIR SELECTION IN ENTITY MATCHING ANALYSIS

Publication number: 20220092064

Abstract: Selecting entity matching systems sample record pairs by selecting at least one first record pair from entity matching system data records, scoring attribute compare methods of the at least one first record pair according to the entity matching system, adding the at least one first record pair to a no-match set according to the attribute matching score, selecting at least one second record pair from an entity matching system data record bucket, scoring attribute compare methods of the at least one second record pair according to the entity matching system, adding the at least one second record pair to record pair set, according to the second record pair attribute compare method score, and providing the record pair set to a user.

Type: Application

Filed: September 21, 2020

Publication date: March 24, 2022

Inventors: Lars Bremer, Martin Oberhofer, Thuany Karoline Stuart, Claudio Andrea Fanconi, Jonathan Roesner, Daniel Suski

METHOD FOR WEIGHTING A GRAPH

Publication number: 20210342397

Abstract: The present disclosure relates to a method for weighting a graph comprising nodes representing entities and edges representing relationships between entities in accordance with one or more domains. The method comprises: pre-processing the graph comprising assigning weights to the nodes and/or the edges of the graph in accordance with a specific domain of the domains, wherein the weight indicates a domain specific data quality problem of attribute values representing an edge of the edges and/or an entity involved in that edge. The weighted graph may be provided for enabling a processing of the graph in accordance with the specific domain.

Type: Application

Filed: April 20, 2021

Publication date: November 4, 2021

Inventors: Martin Oberhofer, Mike W. Grasselt, Claudio Andrea Fanconi, Thuany Karoline Stuart, Yannick Saillet, Basem Elasioty, Hemanth Kumar Babu, Robert Kern

METHOD FOR DUPLICATE DETERMINATION IN A GRAPH

Publication number: 20210342352

Abstract: Embodiments of the present invention determine duplicates in a graph. The graph comprises nodes representing entities and edges representing relationships between the entities. The method comprises identifying at least two nodes in the graph. A neighborhood subgraph may be determined for each of the two nodes. The neighborhood subgraph includes the respective node. The method further comprises determining whether the two nodes are duplicates with respect to each other, based on a result of a comparison between the two subgraphs.

Type: Application

Filed: December 8, 2020

Publication date: November 4, 2021

Inventors: Thuany Karoline Stuart, Basem Elasioty, Claudio Andrea Fanconi, Mike W. Grasselt, Hemanth Kumar Babu, Yannick Saillet, Robert Kern, Martin Oberhofer, Lars Bremer, Jonathan Roesner, Jason Allen Woods