Patents by Inventor James Albert O'Neill, Jr.
James Albert O'Neill, Jr. has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11720807
Abstract: Techniques for data evaluation are provided. A plurality of data records is received, and a first comparison vector is generated by comparing a first and a second data record of the plurality of data records, where the first comparison vector indicates differences between the first and second data records. A machine learning model is trained based at least in part on the first comparison vector. The plurality of data records is evaluated using the machine learning model, and at least two of the plurality of data records are linked based on the evaluation.
Type: Grant
Filed: March 4, 2020
Date of Patent: August 8, 2023
Assignee: International Business Machines Corporation
Inventors: Abhishek Seth, James Albert O'Neill, Jr., Scott Schumacher, Soma Shekar Naganna, Mohammad Khatibi
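As an illustration of the comparison vector mentioned in this abstract, the following is a minimal sketch, not the patented method. It assumes a binary encoding (1 where two records differ on a field, 0 where they agree); the abstract does not specify the encoding, and the function name, field names, and sample records are hypothetical.

```python
from typing import Dict, List


def comparison_vector(a: Dict[str, str], b: Dict[str, str], fields: List[str]) -> List[int]:
    """Return 1 for each field where the two records differ, 0 where they agree.

    The binary encoding is an assumption made for this sketch.
    """
    return [0 if a.get(f) == b.get(f) else 1 for f in fields]


# Hypothetical field list and records, for illustration only.
FIELDS = ["name", "city", "phone"]
rec1 = {"name": "Ann Lee", "city": "Austin", "phone": "555-0100"}
rec2 = {"name": "Ann Lee", "city": "Dallas", "phone": "555-0100"}

vec = comparison_vector(rec1, rec2, FIELDS)
print(vec)  # the records differ only on "city"
```

A vector like this could serve as one feature row when training a classifier that decides whether two records should be linked.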
-
Patent number: 11687828
Abstract: A method for relating different types of records. The method may include providing comparison functions, wherein each comparison function corresponds to a semantical class, and wherein a computational cost is associated with each comparison function. The method may include determining one or more attribute pairs between the different types of records. The method may include sorting the comparison functions according to a determined accuracy. The method may include selecting a set of comparison functions associated with semantical classes according to a predefined rule. The method may include determining a total computational cost based on the computational cost of the selected set of comparison functions. The method may include determining whether two or more records are related using the selected set of comparison functions. The method may include relating the two or more records. The method may include determining a rate of false negative records.
Type: Grant
Filed: October 11, 2019
Date of Patent: June 27, 2023
Assignee: International Business Machines Corporation
Inventors: Martin Oberhofer, Sergio Luis Olvera Gutierrez, Soma Shekar Naganna, Abhishek Seth, James Albert O'Neill, Jr.
-
Patent number: 11681689
Abstract: A method for receiving an additional dataset including a plurality of additional data records; determining a record type using classifiers and an internal domain knowledge corpus; dividing the plurality of additional data records into a plurality of indexing groups; assigning the given additional data record to a match set based on completeness and similarity of natures of attributes of the given additional data record; and assigning the given additional data record to and a comparison group based on completeness and similarity of natures of attributes of the given additional data record.
Type: Grant
Filed: June 9, 2021
Date of Patent: June 20, 2023
Assignee: International Business Machines Corporation
Inventors: Abhishek Seth, Soma Shekar Naganna, James Albert O'Neill, Jr., Geetha Sravanthi Pulipaty, Neeraj Ramkrishna Singh
-
Patent number: 11615064
Abstract: Configuring a data management system by receiving user interaction data associated with search results associated with a first system configuration, identifying a usage pattern in the user interaction data using a first machine learning model, and altering the first system configuration according to the usage pattern.
Type: Grant
Filed: May 3, 2021
Date of Patent: March 28, 2023
Assignee: International Business Machines Corporation
Inventors: Soma Shekar Naganna, James Albert O'Neill, Jr., Geetha Sravanthi Pulipaty, Abhishek Seth, Neeraj Ramkrishna Singh
-
Publication number: 20230029643
Abstract: A method, a structure, and a computer system for mapping data fields. The exemplary embodiments may include, based on determining that a first data set and a second data set contain homogenous data, mapping at least one column of the first data set to at least one column of the second data set based on comparing at least one of relative column position and unique value sets. Based on determining that the first data set and the second data set contain heterogeneous data, the exemplary embodiments may include mapping the at least one column of the first data set to the at least one column of the second data set based on a difference between distribution signatures of unique value sets within each of the first data set and the second data set being less than a threshold.
Type: Application
Filed: July 26, 2021
Publication date: February 2, 2023
Inventors: Neeraj Ramkrishna Singh, James Albert O'Neill, Jr., Soma Shekar Naganna, Geetha Sravanthi Pulipaty, Abhishek Seth
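The heterogeneous-data case in this abstract, mapping columns whose distribution signatures differ by less than a threshold, can be sketched as follows. This is an illustrative reading only: the abstract does not define a "distribution signature", so the sketch assumes it is the sorted list of relative frequencies of a column's unique values, and that the difference is an L1 distance. All function names, the threshold value, and the sample data are hypothetical, and columns are assumed to be non-empty.

```python
from collections import Counter
from itertools import zip_longest


def distribution_signature(values):
    """Sorted relative frequencies of the unique values (assumed definition)."""
    counts = Counter(values)
    total = len(values)
    return sorted((c / total for c in counts.values()), reverse=True)


def signature_distance(sig_a, sig_b):
    """L1 distance between signatures, padding the shorter one with zeros."""
    return sum(abs(x - y) for x, y in zip_longest(sig_a, sig_b, fillvalue=0.0))


def map_columns(left, right, threshold=0.2):
    """Map each left column to its closest right column if under the threshold."""
    mapping = {}
    for lname, lvals in left.items():
        lsig = distribution_signature(lvals)
        distances = {
            rname: signature_distance(lsig, distribution_signature(rvals))
            for rname, rvals in right.items()
        }
        best = min(distances, key=distances.get)
        if distances[best] < threshold:
            mapping[lname] = best
    return mapping


# Hypothetical data sets whose values differ but whose distributions agree.
left = {"gender": ["M", "F", "M", "F"], "id": ["1", "2", "3", "4"]}
right = {"sex": ["male", "female", "female", "male"], "key": ["a", "b", "c", "d"]}
print(map_columns(left, right))
```

Comparing frequency shapes rather than raw values is what lets the sketch pair "gender" with "sex" even though the two columns share no literal values.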
-
Publication number: 20230027115
Abstract: A method, a structure, and a computer system for event-based record matching. The exemplary embodiments may include extracting one or more events from a first record and a second record, as well as calculating an event-based score based on comparing the one or more events extracted from the first record with the one or more events extracted from the second record. The exemplary embodiments may further include matching the first record to the second record based on the event-based score exceeding a threshold.
Type: Application
Filed: July 26, 2021
Publication date: January 26, 2023
Inventors: Geetha Sravanthi Pulipaty, Soma Shekar Naganna, Neeraj Ramkrishna Singh, Abhishek Seth, James Albert O'Neill, Jr.
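The event-based score and threshold test described in this abstract might look like the sketch below. The abstract does not say how events are represented or scored, so this example assumes events are already extracted as (type, date) tuples and uses Jaccard overlap as the score; both choices, along with the function names and the 0.5 threshold, are assumptions for illustration.

```python
def event_score(events_a, events_b):
    """Jaccard overlap of two event collections (assumed scoring rule)."""
    a, b = set(events_a), set(events_b)
    if not a and not b:
        return 0.0  # no events on either side: nothing to compare
    return len(a & b) / len(a | b)


def records_match(events_a, events_b, threshold=0.5):
    """Match when the event-based score exceeds the threshold."""
    return event_score(events_a, events_b) > threshold


# Hypothetical events already extracted from two records.
first = [("moved", "2020-01"), ("married", "2021-06")]
second = [("moved", "2020-01"), ("married", "2021-06"), ("hired", "2022-03")]

print(event_score(first, second))    # 2 shared events out of 3 distinct
print(records_match(first, second))
```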
-
Publication number: 20220398241
Abstract: A method for receiving an additional dataset including a plurality of additional data records; determining a record type using classifiers and an internal domain knowledge corpus; dividing the plurality of additional data records into a plurality of indexing groups; assigning the given additional data record to a match set based on completeness and similarity of natures of attributes of the given additional data record; and assigning the given additional data record to and a comparison group based on completeness and similarity of natures of attributes of the given additional data record.
Type: Application
Filed: June 9, 2021
Publication date: December 15, 2022
Inventors: Abhishek Seth, Soma Shekar Naganna, James Albert O'Neill, Jr., Geetha Sravanthi Pulipaty, Neeraj Ramkrishna Singh
-
Publication number: 20220350782
Abstract: Configuring a data management system by receiving user interaction data associated with search results associated with a first system configuration, identifying a usage pattern in the user interaction data using a first machine learning model, and altering the first system configuration according to the usage pattern.
Type: Application
Filed: May 3, 2021
Publication date: November 3, 2022
Inventors: Soma Shekar Naganna, James Albert O'Neill, Jr., Geetha Sravanthi Pulipaty, Abhishek Seth, Neeraj Ramkrishna Singh
-
Patent number: 11386090
Abstract: Matching records in an entity resolution system by defining entity attribute feature vectors, determining an entity attribute matching score according to a distance between two entity attribute feature vectors, assigning a statistical weight to an entity attribute matching score, adjusting the entity attribute matching score according to the statistical weight and an entity attribute frequency of occurrence, and determining an aggregate entity attribute matching score.
Type: Grant
Filed: April 8, 2020
Date of Patent: July 12, 2022
Assignee: International Business Machines Corporation
Inventors: Abhishek Seth, Soma Shekar Naganna, James Albert O'Neill, Jr., Lars Bremer, Mariya Chkalova
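The scoring steps named in this abstract (distance-based attribute score, statistical weight, frequency adjustment, aggregate score) can be sketched as below. None of the formulas come from the patent: this example assumes the score is 1 / (1 + Euclidean distance), that frequent values are discounted by a factor of (1 - frequency), and that the aggregate is a plain sum. All names and numbers are hypothetical.

```python
import math


def attribute_score(u, v):
    """Similarity in (0, 1] from Euclidean distance (assumed formula)."""
    return 1.0 / (1.0 + math.dist(u, v))


def adjusted_score(score, weight, frequency):
    """Apply a statistical weight and discount common attribute values.

    `frequency` is the value's relative frequency of occurrence in [0, 1).
    """
    return score * weight * (1.0 - frequency)


def aggregate_score(attributes):
    """Sum adjusted scores over (vec_a, vec_b, weight, frequency) tuples."""
    return sum(
        adjusted_score(attribute_score(u, v), w, f) for u, v, w, f in attributes
    )


# Hypothetical feature vectors for two attributes of a record pair.
pair = [
    ([1.0, 0.0], [1.0, 0.0], 2.0, 0.25),  # identical vectors, fairly rare value
    ([0.0, 0.0], [3.0, 4.0], 1.0, 0.5),   # distance 5, common value
]
print(aggregate_score(pair))
```

The frequency discount reflects the idea that agreeing on a common value, such as a very frequent surname, is weaker evidence of a match than agreeing on a rare one.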
-
Publication number: 20220164396
Abstract: A method, apparatus, computer system, and computer program product for managing information. A set of bucket hashes and comparison information for a data record are identified by a computer system. The set of bucket hashes is generated from the comparison information, wherein the set of bucket hashes and the comparison information form a metadata record. A number of candidate metadata records in a metadata database is identified by the computer system using the set of bucket hashes, wherein the number of candidate metadata records comprises a set of candidate bucket hashes and candidate comparison information. An entity membership is identified by the computer system for the data record from a comparison of the comparison information in the metadata record with the candidate comparison information in the number of candidate metadata records.
Type: Application
Filed: November 25, 2020
Publication date: May 26, 2022
Inventors: Abhishek Seth, Soma Shekar Naganna, Randy Hu, James Albert O'Neill, Jr.
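The candidate lookup by bucket hashes described in this abstract resembles blocking in record linkage, and a rough sketch follows. The blocking keys (a three-letter name prefix and a ZIP code), the truncated SHA-1 hashes, and the in-memory index standing in for the metadata database are all assumptions of this example, as are the class and function names.

```python
import hashlib
from collections import defaultdict


def bucket_hashes(comparison_info):
    """Derive a set of bucket hashes from comparison information.

    The two blocking keys used here are hypothetical choices.
    """
    keys = [
        "name:" + comparison_info["name"].lower()[:3],
        "zip:" + comparison_info["zip"],
    ]
    return {hashlib.sha1(k.encode("utf-8")).hexdigest()[:12] for k in keys}


class MetadataIndex:
    """In-memory stand-in for a metadata database keyed by bucket hash."""

    def __init__(self):
        self._by_bucket = defaultdict(list)

    def add(self, record_id, comparison_info):
        for h in bucket_hashes(comparison_info):
            self._by_bucket[h].append((record_id, comparison_info))

    def candidates(self, comparison_info):
        """Return records sharing at least one bucket hash with the query."""
        found = {}
        for h in bucket_hashes(comparison_info):
            for record_id, info in self._by_bucket.get(h, []):
                found[record_id] = info
        return found


index = MetadataIndex()
index.add("r1", {"name": "Johnson", "zip": "78701"})
index.add("r2", {"name": "Smith", "zip": "10001"})

# The query shares the "joh" name prefix with r1 but no bucket with r2.
hits = index.candidates({"name": "Johnsen", "zip": "99999"})
print(sorted(hits))
```

Only the returned candidates would then go through a detailed comparison of their comparison information, which avoids comparing the new record against every stored record.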
-
Publication number: 20210319026
Abstract: Matching records in an entity resolution system by defining entity attribute feature vectors, determining an entity attribute matching score according to a distance between two entity attribute feature vectors, assigning a statistical weight to an entity attribute matching score, adjusting the entity attribute matching score according to the statistical weight and an entity attribute frequency of occurrence, and determining an aggregate entity attribute matching score.
Type: Application
Filed: April 8, 2020
Publication date: October 14, 2021
Inventors: Abhishek Seth, Soma Shekar Naganna, James Albert O'Neill, Jr., Lars Bremer, Mariya Chkalova
-
Publication number: 20210279604
Abstract: Techniques for data evaluation are provided. A plurality of data records is received, and a first comparison vector is generated by comparing a first and a second data record of the plurality of data records, where the first comparison vector indicates differences between the first and second data records. A machine learning model is trained based at least in part on the first comparison vector. The plurality of data records is evaluated using the machine learning model, and at least two of the plurality of data records are linked based on the evaluation.
Type: Application
Filed: March 4, 2020
Publication date: September 9, 2021
Inventors: Abhishek Seth, James Albert O'Neill, Jr., Scott Schumacher, Soma Shekar Naganna, Mohammad Khatibi
-
Publication number: 20210110295
Abstract: A method for relating different types of records. The method may include providing comparison functions, wherein each comparison function corresponds to a semantical class, and wherein a computational cost is associated with each comparison function. The method may include determining one or more attribute pairs between the different types of records. The method may include sorting the comparison functions according to a determined accuracy. The method may include selecting a set of comparison functions associated with semantical classes according to a predefined rule. The method may include determining a total computational cost based on the computational cost of the selected set of comparison functions. The method may include determining whether two or more records are related using the selected set of comparison functions. The method may include relating the two or more records. The method may include determining a rate of false negative records.
Type: Application
Filed: October 11, 2019
Publication date: April 15, 2021
Inventors: Martin Oberhofer, Sergio Luis Olvera Gutierrez, Soma Shekar Naganna, Abhishek Seth, James Albert O'Neill, Jr.