Patents by Inventor George Anwar Dany Beskales

George Anwar Dany Beskales has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11948055
    Abstract: Record clustering is performed for a collection of records using training rules, training-rule labels, training data created from a sample of pairs of records, a pair-wise classifier, and a clustering algorithm. Record clustering is also performed for a collection of records using prediction rules, prediction-rule labels, a pair-wise classifier, and a clustering algorithm.
    Type: Grant
    Filed: March 1, 2023
    Date of Patent: April 2, 2024
    Assignee: TAMR, INC.
    Inventors: George Anwar Dany Beskales, Nikolaus Bates-Haus, Ihab F. Ilyas
  • Patent number: 11416780
    Abstract: A collection of clusters are selected to be used in training in an active learning workflow when using clusters to train supervised entity resolution in data sets. A collection of records is provided wherein each record in the collection has a cluster membership. A collection of record pairs is also provided, each record pair containing two distinct records from the collection of records, and each record pair having a similarity score. A collection of clusters is generated with uncertainty from the collection of records and the collection of record pairs. A subset of the collection of clusters with uncertainty is then selected using weighted sampling, wherein a function of the cluster uncertainty is used as the weight in the weighted sampling. The subset of the collection of clusters with uncertainty is the collection of clusters for training in and active learning workflow when using clusters to train supervised entity resolution in data sets.
    Type: Grant
    Filed: September 22, 2021
    Date of Patent: August 16, 2022
    Assignee: TAMR, INC.
    Inventor: George Anwar Dany Beskales
  • Patent number: 11321359
    Abstract: Methods are provided to represent proposed changes to clusterings for ease of review, as well as tools to help subject matter experts identify clusters that warrant review versus those that do not. These tools make overall assessment of proposed clustering changes and targeted curation practical at large scale. Use of these tools and method enables efficient data management operations when dealing with extreme scale, such as where entity resolution involves clusterings created from data sources involving millions of entities.
    Type: Grant
    Filed: December 6, 2019
    Date of Patent: May 3, 2022
    Assignee: TAMR, INC.
    Inventors: Timothy Kwok Webber, George Anwar Dany Beskales, Dennis Cunningham, Alan Benjamin Wagner Rodriguez, Liam Cleary
  • Patent number: 11294937
    Abstract: A method is provided for producing a record clustering with estimated accuracy metrics with confidence intervals. These metrics can be used to determine whether a clustering should be accepted as the output of the system, and whether model training is necessary to meet desired clustering accuracy. A collection of test records is used in the process, wherein each test record is a member of a collection of input records.
    Type: Grant
    Filed: October 4, 2021
    Date of Patent: April 5, 2022
    Assignee: TAMR, INC.
    Inventors: George Anwar Dany Beskales, Alexandra V. Batchelor, Brian A. Long
  • Publication number: 20220004565
    Abstract: Methods are provided to represent proposed changes to clusterings for ease of review, as well as tools to help subject matter experts identify clusters that warrant review versus those that do not. These tools make overall assessment of proposed clustering changes and targeted curation practical at large scale. Use of these tools and method enables efficient data management operations when dealing with extreme scale, such as where entity resolution involves clusterings created from data sources involving millions of entities.
    Type: Application
    Filed: December 6, 2019
    Publication date: January 6, 2022
    Inventors: Timothy Kwok WEBBER, George Anwar Dany BESKALES, Dennis CUNNINGHAM, Alan Benjamin WAGNER RODRIGUEZ, Liam CLEARY
  • Patent number: 11049028
    Abstract: Record clustering is performed by learning from verified clusters which are used as the source of training data in a deduplication workflow utilizing supervised machine learning.
    Type: Grant
    Filed: March 9, 2021
    Date of Patent: June 29, 2021
    Assignee: TAMR, INC.
    Inventors: George Anwar Dany Beskales, Pedro Giesemann Cattori, Alexandra V. Batchelor, Brian A. Long, Nikolaus Bates-Haus
  • Patent number: 10877948
    Abstract: Given a local distance metric for geospatial features, a binning is produced that is guaranteed to label features within a given distance threshold with the same bin, while labeling a minimum number of features separated by a distance that is greater than the threshold with the same bin.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: December 29, 2020
    Assignee: TAMR, INC.
    Inventors: George Anwar Dany Beskales, Nikolaus Bates-Haus