Patents by Inventor James Albert O'Neill, JR.

James Albert O'Neill, JR. has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11720807
    Abstract: Techniques for data evaluation are provided. A plurality of data records is received, and a first comparison vector is generated by comparing a first and a second data record of the plurality of data records, where the first comparison vector indicates differences between the first and second data records. A machine learning model is trained based at least in part on the first comparison vector. The plurality of data records is evaluated using the machine learning model, and at least two of the plurality of data records are linked based on the evaluation.
    Type: Grant
    Filed: March 4, 2020
    Date of Patent: August 8, 2023
    Assignee: International Business Machines Corporation
    Inventors: Abhishek Seth, James Albert O'Neill, Jr., Scott Schumacher, Soma Shekar Naganna, Mohammad Khatibi
  • Patent number: 11687828
    Abstract: A method for relating different types of records. The method may include providing comparison functions, wherein each comparison function corresponds to a semantical class, and wherein a computational cost is associated with each comparison function. The method may include determining one or more attribute pairs between the different types of records. The method may include sorting the comparison functions according to a determined accuracy. The method may include selecting a set of comparison functions associated with semantical classes according to a predefined rule. The method may include determining a total computational cost based on the computational cost of the selected set of comparison functions. The method may include determining whether two or more records are related using the selected set of comparison functions. The method may include relating the two or more records. The method may include determining a rate of false negative records.
    Type: Grant
    Filed: October 11, 2019
    Date of Patent: June 27, 2023
    Assignee: International Business Machines Corporation
    Inventors: Martin Oberhofer, Sergio Luis Olvera Gutierrez, Soma Shekar Naganna, Abhishek Seth, James Albert O'Neill, Jr.
  • Patent number: 11681689
    Abstract: A method for receiving an additional dataset including a plurality of additional data records; determining a record type using classifiers and an internal domain knowledge corpus; dividing the plurality of additional data records into a plurality of indexing groups; assigning the given additional data record to a match set based on completeness and similarity of natures of attributes of the given additional data record; and assigning the given additional data record to and a comparison group based on completeness and similarity of natures of attributes of the given additional data record.
    Type: Grant
    Filed: June 9, 2021
    Date of Patent: June 20, 2023
    Assignee: International Business Machines Corporation
    Inventors: Abhishek Seth, Soma Shekar Naganna, James Albert O'Neill, Jr., Geetha Sravanthi Pulipaty, Neeraj Ramkrishna Singh
  • Patent number: 11615064
    Abstract: Configuring a data management system by receiving user interaction data associated with search results associated with a first system configuration, identifying a usage pattern in the user interaction data using a first machine learning model, and altering the first system configuration according to the usage pattern.
    Type: Grant
    Filed: May 3, 2021
    Date of Patent: March 28, 2023
    Assignee: International Business Machines Corporation
    Inventors: Soma Shekar Naganna, James Albert O'Neill, Jr., Geetha Sravanthi Pulipaty, Abhishek Seth, Neeraj Ramkrishna Singh
  • Publication number: 20230029643
    Abstract: A method, a structure, and a computer system for mapping data fields. The exemplary embodiments may include, based on determining that a first data set and a second data set contain homogenous data, mapping at least one column of the first data set to at least one column of the second data set based on comparing at least one of relative column position and unique value sets. Based on determining that the first data set and the second data set contain heterogeneous data, the exemplary embodiments may include mapping the at least one column of the first data set to the at least one column of the second data set based on a difference between distribution signatures of unique value sets within each of the first data set and the second data set being less than a threshold.
    Type: Application
    Filed: July 26, 2021
    Publication date: February 2, 2023
    Inventors: Neeraj Ramkrishna Singh, James Albert O'Neill, JR., Soma Shekar Naganna, Geetha Sravanthi Pulipaty, Abhishek Seth
  • Publication number: 20230027115
    Abstract: A method, a structure, and a computer system for event-based record matching. The exemplary embodiments may include extracting one or more events from a first record and a second record, as well as calculating an event-based score based on comparing the one or more events extracted from the first record with the one or more events extracted from the second record. The exemplary embodiments may further include matching the first record to the second record based on the event-based score exceeding a threshold.
    Type: Application
    Filed: July 26, 2021
    Publication date: January 26, 2023
    Inventors: Geetha Sravanthi Pulipaty, Soma Shekar Naganna, Neeraj Ramkrishna Singh, Abhishek Seth, James Albert O'Neill, Jr.
  • Publication number: 20220398241
    Abstract: A method for receiving an additional dataset including a plurality of additional data records; determining a record type using classifiers and an internal domain knowledge corpus; dividing the plurality of additional data records into a plurality of indexing groups; assigning the given additional data record to a match set based on completeness and similarity of natures of attributes of the given additional data record; and assigning the given additional data record to and a comparison group based on completeness and similarity of natures of attributes of the given additional data record.
    Type: Application
    Filed: June 9, 2021
    Publication date: December 15, 2022
    Inventors: Abhishek Seth, Soma Shekar Naganna, James Albert O'Neill, JR., Geetha Sravanthi Pulipaty, Neeraj Ramkrishna Singh
  • Publication number: 20220350782
    Abstract: Configuring a data management system by receiving user interaction data associated with search results associated with a first system configuration, identifying a usage pattern in the user interaction data using a first machine learning model, and altering the first system configuration according to the usage pattern.
    Type: Application
    Filed: May 3, 2021
    Publication date: November 3, 2022
    Inventors: Soma Shekar Naganna, James Albert O'Neill, JR., Geetha Sravanthi Pulipaty, Abhishek Seth, Neeraj Ramkrishna Singh
  • Patent number: 11386090
    Abstract: Matching records in an entity resolution system by defining entity attribute feature vectors, determining an entity attribute matching score according to a distance between two entity attribute feature vectors, assigning a statistical weight to an entity attribute matching score, adjusting the entity attribute matching score according to the statistical weight and an entity attribute frequency of occurrence, and determining an aggregate entity attribute matching score.
    Type: Grant
    Filed: April 8, 2020
    Date of Patent: July 12, 2022
    Assignee: International Business Machines Corporation
    Inventors: Abhishek Seth, Soma Shekar Naganna, James Albert O'Neill, Jr., Lars Bremer, Mariya Chkalova
  • Publication number: 20220164396
    Abstract: A method, apparatus, computer system, and computer program product for managing information. A set of bucket hashes and comparison information for a data record are identified by a computer system. The set of bucket hashes is generated from the comparison information, wherein the set of bucket hashes and the comparison information form a metadata record. A number of candidate metadata records in a metadata database is identified by the computer system using the set of bucket hashes, wherein the number of candidate metadata records comprises a set of candidate bucket hashes and candidate comparison information. An entity membership is identified by the computer system for the data record from a comparison of the comparison information in the metadata record with the candidate comparison information in the number of candidate metadata records.
    Type: Application
    Filed: November 25, 2020
    Publication date: May 26, 2022
    Inventors: Abhishek Seth, Soma Shekar Naganna, Randy Hu, James Albert O'Neill, JR.
  • Publication number: 20210319026
    Abstract: Matching records in an entity resolution system by defining entity attribute feature vectors, determining an entity attribute matching score according to a distance between two entity attribute feature vectors, assigning a statistical weight to an entity attribute matching score, adjusting the entity attribute matching score according to the statistical weight and an entity attribute frequency of occurrence, and determining an aggregate entity attribute matching score.
    Type: Application
    Filed: April 8, 2020
    Publication date: October 14, 2021
    Inventors: Abhishek Seth, Soma Shekar Naganna, James Albert O'Neill, JR., Lars Bremer, Mariya Chkalova
  • Publication number: 20210279604
    Abstract: Techniques for data evaluation are provided. A plurality of data records is received, and a first comparison vector is generated by comparing a first and a second data record of the plurality of data records, where the first comparison vector indicates differences between the first and second data records. A machine learning model is trained based at least in part on the first comparison vector. The plurality of data records is evaluated using the machine learning model, and at least two of the plurality of data records are linked based on the evaluation.
    Type: Application
    Filed: March 4, 2020
    Publication date: September 9, 2021
    Inventors: Abhishek SETH, James Albert O'NEILL, JR., Scott SCHUMACHER, Soma Shekar NAGANNA, Mohammad KHATIBI
  • Publication number: 20210110295
    Abstract: A method for relating different types of records. The method may include providing comparison functions, wherein each comparison function corresponds to a semantical class, and wherein a computational cost is associated with each comparison function. The method may include determining one or more attribute pairs between the different types of records. The method may include sorting the comparison functions according to a determined accuracy. The method may include selecting a set of comparison functions associated with semantical classes according to a predefined rule. The method may include determining a total computational cost based on the computational cost of the selected set of comparison functions. The method may include determining whether two or more records are related using the selected set of comparison functions. The method may include relating the two or more records. The method may include determining a rate of false negative records.
    Type: Application
    Filed: October 11, 2019
    Publication date: April 15, 2021
    Inventors: Martin Oberhofer, Sergio Luis Olvera Gutierrez, Soma Shekar Naganna, Abhishek Seth, James Albert O'Neill, JR.