Patents by Inventor Thuany Karoline Stuart
Thuany Karoline Stuart has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11941056
Abstract: The present disclosure relates to a method for a weighting graph comprising nodes representing entities and edges representing relationships between entities in accordance with one or more domains. The method comprises: pre-processing the graph comprising assigning weights to the nodes and/or the edges of the graph in accordance with a specific domain of the domains, wherein the weight indicates a domain specific data quality problem of attribute values representing an edge of the edges and/or an entity involved in that edge. The weighted graph may be provided for enabling a processing of the graph in accordance with the specific domain.
Type: Grant
Filed: April 20, 2021
Date of Patent: March 26, 2024
Assignee: International Business Machines Corporation
Inventors: Martin Oberhofer, Mike W. Grasselt, Claudio Andrea Fanconi, Thuany Karoline Stuart, Yannick Saillet, Basem Elasioty, Hemanth Kumar Babu, Robert Kern
-
Publication number: 20230229644
Abstract: Duplicating identification in a graph having a second version. The graph includes nodes representing entities and edges representing relationships between the entities. The method includes: providing a previous first version of the graph. At least two target nodes of the graph may be identified. The first version and the second version of the graph may be compared for determining a comparison score indicative of a similarity between the two target nodes. The comparison score may be used for determining whether the two target nodes are duplicates with respect to each other.
Type: Application
Filed: January 18, 2022
Publication date: July 20, 2023
Inventors: Lars Bremer, Thuany Karoline Stuart, Hemanth Kumar Babu, Martin Anton Oberhofer
-
Patent number: 11651055
Abstract: Methods, computer program products and/or systems are provided that perform the following operations: obtaining a first graph comprising first nodes representing first entities and first edges representing relationships between first entities, the first nodes being associated with first entity attributes descriptive of the first entities represented by the first nodes, the first edges being associated with first edge attributes descriptive of the relationships represented by the first edges; determining a first subgraph for a certain node of the first nodes of the first graph, the first subgraph including the certain node and at least one neighboring node of the certain node; and determining a data quality issue regarding the certain node based, at least in part, on applying one or more applicable rules of a set of data quality rules to first entity attribute values and first edge attribute values of the first subgraph.
Type: Grant
Filed: October 29, 2020
Date of Patent: May 16, 2023
Assignee: International Business Machines Corporation
Inventors: Yannick Saillet, Claudio Andrea Fanconi, Martin Oberhofer, Hemanth Kumar Babu, Basem Elasioty, Mike W. Grasselt, Robert Kern, Thuany Karoline Stuart
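Illustrative sketch (not part of the patent record): the abstract for 11651055 describes taking a node, building a subgraph from it and its neighbors, and applying data quality rules to the entity and edge attribute values in that subgraph. The Python below is one possible reading of that idea. The data layout, the one-hop subgraph, the `parent_of` edge type, the `birth_year` attribute, and every function name are assumptions made for illustration only.

```python
# Hypothetical illustration; none of these names or rules come from the patent.

def one_hop_subgraph(node_id, edges):
    """Return the edges touching node_id, i.e. the node plus its direct neighbors."""
    return [e for e in edges if node_id in (e["src"], e["dst"])]


def rule_parent_older_than_child(node_id, nodes, sub_edges):
    """Flag 'parent_of' edges from node_id where the parent is not older than the child."""
    issues = []
    for e in sub_edges:
        if e["type"] != "parent_of" or e["src"] != node_id:
            continue
        parent, child = nodes[e["src"]], nodes[e["dst"]]
        if parent["birth_year"] >= child["birth_year"]:
            issues.append(f"{e['src']} is not older than child {e['dst']}")
    return issues


def find_issues(node_id, nodes, edges, rules):
    """Apply every rule to the one-hop subgraph of node_id and collect the issues."""
    sub_edges = one_hop_subgraph(node_id, edges)
    issues = []
    for rule in rules:
        issues.extend(rule(node_id, nodes, sub_edges))
    return issues


# Made-up sample data: node "a" is recorded as the parent of an older node "b".
nodes = {
    "a": {"birth_year": 1990},
    "b": {"birth_year": 1985},
    "c": {"birth_year": 2015},
}
edges = [
    {"src": "a", "dst": "b", "type": "parent_of"},
    {"src": "a", "dst": "c", "type": "parent_of"},
]
RULES = [rule_parent_older_than_child]

print(find_issues("a", nodes, edges, RULES))
```

The rule needs both an entity attribute (`birth_year`) and an edge attribute (`type`), which is why it is evaluated on the subgraph rather than on the node alone.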
-
Publication number: 20230110007
Abstract: In an approach, a processor receives a first record from a first data source, where the first record comprises attributes, a second record from a second data source, where the second record comprises said attributes, a first individual quality rating for the attributes of the first record, and a second individual quality rating for the attributes of the second record. A processor, in response to inputting the first record and the second record into a probabilistic matching engine, receives a matching score for each of the respective attributes. A processor calculates a weighted matching score for each of the respective attributes by weighting the matching score for each of the respective attributes with the first individual quality rating and the second individual quality rating. A processor assesses whether the first record and the second record represent the same entity based on the weighted matching score.
Type: Application
Filed: September 30, 2021
Publication date: April 13, 2023
Inventors: Martin Anton Oberhofer, Lars Bremer, Basem Elasioty, Hemanth Kumar Babu, Thuany Karoline Stuart
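Illustrative sketch (not part of the publication): the abstract for 20230110007 says each per-attribute matching score is weighted with the quality ratings of both records, and the weighted scores are then used to decide whether the records represent the same entity. The abstract does not say how the weighting or the final decision is computed. The Python below assumes, purely for illustration, that the two ratings are multiplied into the score, that the overall score is a weight-normalized sum, and that a fixed threshold decides the match. All names and numbers are hypothetical.

```python
# Hypothetical illustration; the multiplication, normalization and threshold
# are assumptions, not details taken from the publication.

def weighted_match(scores, quality_a, quality_b, threshold=0.75):
    """Weight per-attribute match scores by both records' quality ratings.

    scores, quality_a and quality_b map attribute name -> value in [0, 1].
    Returns (weighted score per attribute, overall score, match decision).
    """
    weights = {attr: quality_a[attr] * quality_b[attr] for attr in scores}
    weighted = {attr: scores[attr] * weights[attr] for attr in scores}
    total_weight = sum(weights.values())
    overall = sum(weighted.values()) / total_weight if total_weight else 0.0
    return weighted, overall, overall >= threshold


# Made-up inputs: the phone attribute matches poorly but is also low quality
# in both records, so it contributes little to the overall score.
scores = {"name": 0.9, "phone": 0.4}
quality_a = {"name": 1.0, "phone": 0.5}
quality_b = {"name": 1.0, "phone": 0.5}

weighted, overall, is_match = weighted_match(scores, quality_a, quality_b)
print(weighted, overall, is_match)
```

With these inputs an unweighted average of the two scores would be 0.65, while the quality-weighted overall score is about 0.8, which shows how low-quality attributes are discounted under the assumed scheme.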
-
Patent number: 11531656
Abstract: Embodiments of the present invention provide for a method for duplicate determination in a graph. The graph comprises nodes representing entities and edges representing relationships between the entities. Embodiments of the present invention identify two target nodes in the graph. A neighborhood subgraph may be determined for each of the two nodes. The neighborhood subgraph includes the respective node. Embodiments of the present invention can then determine whether the two nodes are duplicates with respect to each other, based on a result of a comparison between the two subgraphs and based on a comparison of the two target nodes of the subgraphs.
Type: Grant
Filed: June 8, 2021
Date of Patent: December 20, 2022
Assignee: International Business Machines Corporation
Inventors: Lars Bremer, Thuany Karoline Stuart, Hemanth Kumar Babu, Martin Oberhofer
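Illustrative sketch (not part of the patent record): the abstract for 11531656 combines two comparisons, one between the neighborhood subgraphs of two target nodes and one between the target nodes themselves. The Python below is a minimal reading of that idea. The one-hop neighborhood, the Jaccard overlap, the attribute-equality ratio, the equal-weight average, and the threshold are all assumptions chosen for illustration; the patent abstract does not specify any of them.

```python
# Hypothetical illustration; the similarity measures and threshold are assumptions.

def neighborhood(adj, node):
    """One-hop neighborhood subgraph, represented as a set of node ids including node."""
    return {node} | set(adj.get(node, ()))


def jaccard(a, b):
    """Jaccard overlap of two sets; two empty sets count as identical."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def attribute_similarity(x, y):
    """Fraction of attribute keys on which the two nodes hold equal values."""
    keys = set(x) | set(y)
    if not keys:
        return 1.0
    return sum(1 for k in keys if x.get(k) == y.get(k)) / len(keys)


def are_duplicates(adj, attrs, n1, n2, threshold=0.7):
    """Average the subgraph comparison and the node comparison, then threshold."""
    # Exclude the targets themselves so only the shared context is compared.
    sub1 = neighborhood(adj, n1) - {n1, n2}
    sub2 = neighborhood(adj, n2) - {n1, n2}
    sub_score = jaccard(sub1, sub2)
    node_score = attribute_similarity(attrs[n1], attrs[n2])
    return (sub_score + node_score) / 2 >= threshold


# Made-up sample graph: p1 and p2 share an employer and a city.
adj = {"p1": ["acme", "berlin"], "p2": ["acme", "berlin"], "p3": ["globex"]}
attrs = {
    "p1": {"name": "J. Smith", "dob": "1980-01-01"},
    "p2": {"name": "John Smith", "dob": "1980-01-01"},
    "p3": {"name": "Ann Lee", "dob": "1975-05-05"},
}

print(are_duplicates(adj, attrs, "p1", "p2"))
print(are_duplicates(adj, attrs, "p1", "p3"))
```

In this sample, `p1` and `p2` differ in the spelling of the name, so the node comparison alone is inconclusive; the identical neighborhoods lift the combined score over the assumed threshold.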
-
Publication number: 20220391365
Abstract: Embodiments of the present invention provide for a method for duplicate determination in a graph. The graph comprises nodes representing entities and edges representing relationships between the entities. Embodiments of the present invention identify two target nodes in the graph. A neighborhood subgraph may be determined for each of the two nodes. The neighborhood subgraph includes the respective node. Embodiments of the present invention can then determine whether the two nodes are duplicates with respect to each other, based on a result of a comparison between the two subgraphs and based on a comparison of the two target nodes of the subgraphs.
Type: Application
Filed: June 8, 2021
Publication date: December 8, 2022
Inventors: Lars Bremer, Thuany Karoline Stuart, Hemanth Kumar Babu, Martin Oberhofer
-
Patent number: 11500876
Abstract: Embodiments of the present invention determines duplicates in a graph. The graph comprises nodes representing entities and edges representing relationships between the entities. The method comprises: identifying at least two nodes in the graph. A neighborhood subgraph may be determined for each of the two nodes. The neighborhood subgraph includes the respective node. The method further comprises determining whether the two nodes are duplicates with respect to each other, based on a result of a comparison between the two subgraphs.
Type: Grant
Filed: December 8, 2020
Date of Patent: November 15, 2022
Assignee: International Business Machines Corporation
Inventors: Thuany Karoline Stuart, Basem Elasioty, Claudio Andrea Fanconi, Mike W. Grasselt, Hemanth Kumar Babu, Yannick Saillet, Robert Kern, Martin Oberhofer, Lars Bremer, Jonathan Roesner, Jason Allen Woods
-
Publication number: 20220215286
Abstract: A method, computer system, and computer program product for training a machine learning model for use by a task management system are provided. The embodiment may include presenting a task to be resolved to a user via a user interface. The embodiment may also include presenting a further task to be resolved to the user via the user interface. The embodiment may further include predicting time to be spent on the further task presented to the user. The embodiment may also include determining actual time the user spent completing the further task. The embodiment may further include training a machine learning model for a subsequent similar task based on the predicted time and the determined actual time.
Type: Application
Filed: January 4, 2021
Publication date: July 7, 2022
Inventors: Alexandre Luz Xavier Da Costa, Lars Bremer, Karin Steckler, Thuany Karoline Stuart
-
Publication number: 20220188344
Abstract: The present disclosure relates to a method, computer program product and system. The method may comprise providing a first graph being an instance of a first ontology. Sample values of a plurality of concept attributes may be collected from the first graph. The sample values may be clustered into one or more clusters based on content and/or format of the sample values. A cluster of the clusters that contains sample values representing different concept attributes may be identified. An additional concept and associated set of relations representing the concept attribute values of the cluster may be determined and the first ontology may be updated using the additional concept and associated set of relations.
Type: Application
Filed: December 14, 2020
Publication date: June 16, 2022
Inventors: Thuany Karoline Stuart, Martin Oberhofer, Lars Bremer, Hemanth Kumar Babu
-
Publication number: 20220138512
Abstract: Methods, computer program products and/or systems are provided that perform the following operations: obtaining a first graph comprising first nodes representing first entities and first edges representing relationships between first entities, the first nodes being associated with first entity attributes descriptive of the first entities represented by the first nodes, the first edges being associated with first edge attributes descriptive of the relationships represented by the first edges; determining a first subgraph for a certain node of the first nodes of the first graph, the first subgraph including the certain node and at least one neighboring node of the certain node; and determining a data quality issue regarding the certain node based, at least in part, on applying one or more applicable rules of a set of data quality rules to first entity attribute values and first edge attribute values of the first subgraph.
Type: Application
Filed: October 29, 2020
Publication date: May 5, 2022
Inventors: Yannick Saillet, Claudio Andrea Fanconi, Martin Oberhofer, Hemanth Kumar Babu, Basem Elasioty, Mike W. Grasselt, Robert Kern, Thuany Karoline Stuart
-
Publication number: 20220092469
Abstract: In an approach to improving machine learning model training for data matching from manual decisions, one or more computer processors detect a correction made to two data records. One or more computer processors determine a common attribute between the two data records. One or more computer processors identify a first machine learning model associated with the common attribute. One or more computer processors add comparison data of the two data records to training data for the machine learning model, where the comparison data includes the correction.
Type: Application
Filed: September 23, 2020
Publication date: March 24, 2022
Inventors: Lars Bremer, Hemanth Kumar Babu, Thuany Karoline Stuart, Martin Oberhofer
-
Publication number: 20220092064
Abstract: Selecting entity matching systems sample record pairs by selecting at least one first record pair from entity matching system data records, scoring attribute compare methods of the at least one first record pair according to the entity matching system, adding the at least one first record pair to a no-match set according to the attribute matching score, selecting at least one second record pair from an entity matching system data record bucket, scoring attribute compare methods of the at least one second record pair according to the entity matching system, adding the at least one second record pair to record pair set, according to the second record pair attribute compare method score, and providing the record pair set to a user.
Type: Application
Filed: September 21, 2020
Publication date: March 24, 2022
Inventors: Lars Bremer, Martin Oberhofer, Thuany Karoline Stuart, Claudio Andrea Fanconi, Jonathan Roesner, Daniel Suski
-
Publication number: 20210342397
Abstract: The present disclosure relates to a method for a weighting graph comprising nodes representing entities and edges representing relationships between entities in accordance with one or more domains. The method comprises: pre-processing the graph comprising assigning weights to the nodes and/or the edges of the graph in accordance with a specific domain of the domains, wherein the weight indicates a domain specific data quality problem of attribute values representing an edge of the edges and/or an entity involved in that edge. The weighted graph may be provided for enabling a processing of the graph in accordance with the specific domain.
Type: Application
Filed: April 20, 2021
Publication date: November 4, 2021
Inventors: Martin Oberhofer, Mike W. Grasselt, Claudio Andrea Fanconi, Thuany Karoline Stuart, Yannick Saillet, Basem Elasioty, Hemanth Kumar Babu, Robert Kern
-
Publication number: 20210342352
Abstract: Embodiments of the present invention determines duplicates in a graph. The graph comprises nodes representing entities and edges representing relationships between the entities. The method comprises: identifying at least two nodes in the graph. A neighborhood subgraph may be determined for each of the two nodes. The neighborhood subgraph includes the respective node. The method further comprises determining whether the two nodes are duplicates with respect to each other, based on a result of a comparison between the two subgraphs.
Type: Application
Filed: December 8, 2020
Publication date: November 4, 2021
Inventors: Thuany Karoline Stuart, Basem Elasioty, Claudio Andrea Fanconi, Mike W. Grasselt, Hemanth Kumar Babu, Yannick Saillet, Robert Kern, Martin Oberhofer, Lars Bremer, Jonathan Roesner, Jason Allen Woods