Patents by Inventor Nikolaus Bates-Haus

Nikolaus Bates-Haus has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11948055
    Abstract: Record clustering is performed for a collection of records using training rules, training-rule labels, training data created from a sample of pairs of records, a pair-wise classifier, and a clustering algorithm. Record clustering is also performed for a collection of records using prediction rules, prediction-rule labels, a pair-wise classifier, and a clustering algorithm.
    Type: Grant
    Filed: March 1, 2023
    Date of Patent: April 2, 2024
    Assignee: TAMR, INC.
    Inventors: George Anwar Dany Beskales, Nikolaus Bates-Haus, Ihab F. Ilyas
  • Patent number: 11500818
    Abstract: An end-to-end data curation system and the various methods used in linking, matching, and cleaning large-scale data sources. The goal of this system is to provide scalable and efficient record deduplication. The system uses a crowd of experts to train the system. The system operator can optionally provide a set of hints to reduce the number of questions send to the experts. The system solves the problem of schema mapping and record deduplication a holistic way by unifying these problems into a unified linkage problem.
    Type: Grant
    Filed: February 19, 2021
    Date of Patent: November 15, 2022
    Assignee: TAMR, INC.
    Inventors: Nikolaus Bates-Haus, George Beskales, Daniel Meir Bruckner, Ihab F. Ilyas, Alexander Richter Pagan, Michael Ralph Stonebraker
  • Patent number: 11049028
    Abstract: Record clustering is performed by learning from verified clusters which are used as the source of training data in a deduplication workflow utilizing supervised machine learning.
    Type: Grant
    Filed: March 9, 2021
    Date of Patent: June 29, 2021
    Assignee: TAMR, INC.
    Inventors: George Anwar Dany Beskales, Pedro Giesemann Cattori, Alexandra V. Batchelor, Brian A. Long, Nikolaus Bates-Haus
  • Patent number: 11042523
    Abstract: A data curation system is provided that includes various methods to enable efficient reuse of human and machine effort. To reuse effort, various facilities are presented that model, save, and allow for querying of provenance and state information of a curation workflow and allow for incremental, stateful transitions of the data and metadata thereof.
    Type: Grant
    Filed: December 11, 2019
    Date of Patent: June 22, 2021
    Assignee: TAMR, INC.
    Inventors: Vladimir Gluzman Peregrine, Ihab F. Ilyas, Michael Ralph Stonebraker, Stan Zdonik, Andrew H. Palmer, Alexander Richter Pagan, Daniel Meir Bruckner, George Beskales, Aizana Turmukhametova, Tianyu Zhu, Kanak Kshetri, Jason Liu, Nikolaus Bates-Haus
  • Publication number: 20210173817
    Abstract: An end-to-end data curation system and the various methods used in linking, matching, and cleaning large-scale data sources. The goal of this system is to provide scalable and efficient record deduplication. The system uses a crowd of experts to train the system. The system operator can optionally provide a set of hints to reduce the number of questions send to the experts. The system solves the problem of schema mapping and record deduplication a holistic way by unifying these problems into a unified linkage problem.
    Type: Application
    Filed: February 19, 2021
    Publication date: June 10, 2021
    Inventors: Nikolaus BATES-HAUS, George BESKALES, Daniel Meir BRUCKNER, Ihab F. ILYAS, Alexander Richter PAGAN, Michael Ralph STONEBRAKER
  • Patent number: 10929348
    Abstract: An end-to-end data curation system and the various methods used in linking, matching, and cleaning large-scale data sources. The goal of this system is to provide scalable and efficient record deduplication. The system uses a crowd of experts to train the system. The system operator can optionally provide a set of hints to reduce the number of questions sent to the experts. The system solves the problem of schema mapping and record deduplication in a holistic way by unifying these problems into a unified linkage problem.
    Type: Grant
    Filed: November 23, 2016
    Date of Patent: February 23, 2021
    Assignee: TAMR, INC.
    Inventors: Nikolaus Bates-Haus, George Beskales, Daniel Meir Bruckner, Ihab F. Ilyas, Alexander Richter Pagan, Michael Ralph Stonebraker
  • Patent number: 10877948
    Abstract: Given a local distance metric for geospatial features, a binning is produced that is guaranteed to label features within a given distance threshold with the same bin, while labeling a minimum number of features separated by a distance that is greater than the threshold with the same bin.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: December 29, 2020
    Assignee: TAMR, INC.
    Inventors: George Anwar Dany Beskales, Nikolaus Bates-Haus
  • Publication number: 20200117643
    Abstract: A data curation system is provided that includes various methods to enable efficient reuse of human and machine effort. To reuse effort, various facilities are presented that model, save, and allow for querying of provenance and state information of a curation workflow and allow for incremental, stateful transitions of the data and metadata thereof.
    Type: Application
    Filed: December 11, 2019
    Publication date: April 16, 2020
    Inventors: Vladimir Gluzman PEREGRINE, Ihab F. ILYAS, Michael Ralph STONEBRAKER, Stan ZDONIK, Andrew H. PALMER, Alexander Richter PAGAN, Daniel Meir BRUCKNER, George BESKALES, Aizana TURMUKHAMETOVA, Tianyu ZHU, Kanak KSHETRI, Jason LIU, Nikolaus BATES-HAUS
  • Publication number: 20180341667
    Abstract: A data curation system that includes various methods to enable efficient reuse of human and machine effort. To reuse effort, various facilities are presented that model, save, and allow the querying of provenance and state information of a curation workflow and allow for incremental, stateful transitions of the data and the metadata.
    Type: Application
    Filed: August 2, 2018
    Publication date: November 29, 2018
    Inventors: Vladimir Gluzman Peregrine, Ihab F. Ilyas, Michael Ralph Stonebraker, Stan Zdonik, Andrew H. Palmer, Alexander Richter Pagan, Daniel Meir Bruckner, George Beskales, Aizana Turmukhametova, Tianyu Zhu, Kanak Kshetri, Jason Liu, Nikolaus Bates-Haus
  • Publication number: 20170075918
    Abstract: An end-to-end data curation system and the various methods used in linking, matching, and cleaning large-scale data sources. The goal of this system is to provide scalable and efficient record deduplication. The system uses a crowd of experts to train the system. The system operator can optionally provide a set of hints to reduce the number of questions sent to the experts. The system solves the problem of schema mapping and record deduplication in a holistic way by unifying these problems into a unified linkage problem.
    Type: Application
    Filed: November 23, 2016
    Publication date: March 16, 2017
    Inventors: Nikolaus Bates-Haus, George Beskales, Daniel Meir Bruckner, Ihab F. Ilyas, Alexander Richter Pagan, Michael Ralph Stonebraker
  • Patent number: 9542412
    Abstract: An end-to-end data curation system and the various methods used in linking, matching, and cleaning large-scale data sources. The goal of this system is to provide scalable and efficient record deduplication. The system uses a crowd of experts to train the system. The system operator can optionally provide a set of hints to reduce the number of questions send to the experts. The system solves the problem of schema mapping and record deduplication a holistic way by unifying these problems into a unified linkage problem.
    Type: Grant
    Filed: March 28, 2014
    Date of Patent: January 10, 2017
    Assignee: Tamr, Inc.
    Inventors: Nikolaus Bates-Haus, George Beskales, Daniel Meir Bruckner, Ihab F. Ilyas, Alexander Richter Pagan, Michael Ralph Stonebraker
  • Publication number: 20160048542
    Abstract: A data curation system that includes various methods to enable efficient reuse of human and machine effort. To reuse effort, various facilities are presented that model, save, and allow the querying of provenance and state information of a curation workflow and allow for incremental, stateful transitions of the data and the metadata.
    Type: Application
    Filed: September 2, 2014
    Publication date: February 18, 2016
    Inventors: Vladimir Gluzman Peregrine, Ihab F. Ilyas, Michael Ralph Stonebraker, Stan Zdonik, Andrew H. Palmer, Alexander Richter Pagan, Daniel Meir Bruckner, George Beskales, Aizana Turmukhametova, Tianyu Zhu, Kanak Kshetri, Jason Liu, Nikolaus Bates-Haus
  • Publication number: 20150278241
    Abstract: An end-to-end data curation system and the various methods used in linking, matching, and cleaning large-scale data sources. The goal of this system is to provide scalable and efficient record deduplication. The system uses a crowd of experts to train the system. The system operator can optionally provide a set of hints to reduce the number of questions send to the experts. The system solves the problem of schema mapping and record deduplication a holistic way by unifying these problems into a unified linkage problem.
    Type: Application
    Filed: March 28, 2014
    Publication date: October 1, 2015
    Applicant: DATATAMER, INC.
    Inventors: Nikolaus Bates-Haus, George Beskales, Daniel Meir Bruckner, Ihab F. Ilyas, Alexander Richter Pagan, Michael Ralph Stonebraker