Patents by Inventor Christopher Re

Christopher Re has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Declarative framework for deduplication

Patent number: 8200640

Abstract: A system, framework, and algorithms for data deduplication are described. A declarative language, such as a Datalog-type logic language, is provided. Programs in the language describe data to be deduplicated and soft and hard constraints that must/should be satisfied by data deduplicated according to the program. To execute the programs, algorithms for performing graph clustering are described.

Type: Grant

Filed: June 15, 2009

Date of Patent: June 12, 2012

Assignee: Microsoft Corporation

Inventors: Arvind Arasu, Christopher Re, Dan Suciu
DECLARATIVE FRAMEWORK FOR DEDUPLICATION

Publication number: 20100318499

Abstract: A system, framework, and algorithms for data deduplication are described. A declarative language, such as a Datalog-type logic language, is provided. Programs in the language describe data to be deduplicated and soft and hard constraints that must/should be satisfied by data deduplicated according to the program. To execute the programs, algorithms for performing graph clustering are described.

Type: Application

Filed: June 15, 2009

Publication date: December 16, 2010

Applicant: MICROSOFT CORPORATION

Inventors: Arvind Arasu, Christopher Re, Dan Suciu
Efficient top-K query evaluation on probabilistic data

Patent number: 7814113

Abstract: A novel approach that computes and efficiently ranks the top-k answers to a query on a probabilistic database. The approach identifies the top-k answers, since imprecisions in the data often lead to a large number of answers of low quality. The algorithm is used to run several Monte Carlo simulations in parallel, one for each candidate answer, and approximates the probability of each only to the extent needed to correctly determine the top-k answers. The algorithm is provably optimal and scales to large databases. A more general application can identify a number of top-rated entities of a group that satisfy a condition, based on a criteria or score computed for the entities. Also disclosed are several optimization techniques. One option is to rank the top-rated results; another option provides for interrupting the iteration to return the number of top-rated entities that have thus far been identified.

Type: Grant

Filed: November 5, 2007

Date of Patent: October 12, 2010

Assignee: University of Washington through its Center for Commercialization

Inventors: Dan Suciu, Christopher Re
EFFICIENT TOP-K QUERY EVALUATION ON PROBABILISTIC DATA

Publication number: 20080109428

Abstract: A novel approach that computes and efficiently ranks the top-k answers to a query on a probabilistic database. The approach identifies the top-k answers, since imprecisions in the data often lead to a large number of answers of low quality. The algorithm is used to run several Monte Carlo simulations in parallel, one for each candidate answer, and approximates the probability of each only to the extent needed to correctly determine the top-k answers. The algorithm is provably optimal and scales to large databases. A more general application can identify a number of top-rated entities of a group that satisfy a condition, based on a criteria or score computed for the entities. Also disclosed are several optimization techniques. One option is to rank the top-rated results; another option provides for interrupting the iteration to return the number of top-rated entities that have thus far been identified.

Type: Application

Filed: November 5, 2007

Publication date: May 8, 2008

Applicant: University of Washington

Inventors: Dan Suciu, Christopher Re

Declarative framework for deduplication

DECLARATIVE FRAMEWORK FOR DEDUPLICATION

Efficient top-K query evaluation on probabilistic data

EFFICIENT TOP-K QUERY EVALUATION ON PROBABILISTIC DATA