Patents by Inventor Felix NAUMANN

Felix NAUMANN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Apparatus, system, and method for cross-platform data processing

Patent number: 11288270

Abstract: The present disclosure generally relates to a cost-based optimizer for efficiently processing data through the use of multiple different data processing platforms. The cost-based optimizer may receive an input plan for processing data that includes a number of base operators. The cost-based optimizer may then determine execution operators for each base operator, where each execution operator corresponds to a different data processing platform. From the execution operators, the cost-based optimizer may determine possible subplans for executing the input plan on one or more data processing platforms. The cost-based optimizer may determine the cost of executing each possible subplan and choose the subplan with the lowest cost.

Type: Grant

Filed: May 9, 2019

Date of Patent: March 29, 2022

Assignees: QATAR FOUNDATION FOR EDUCATION, SCIENCE AND COMMUNITY DEVELOPMENT, HASSO PLATTNER INSTITUTE

Inventors: Jorge Arnulfo Quiane Ruiz, Sebastian Kruse, Zoi Kaoudi, Sanjay Chawla, Bertty Contreras, Felix Naumann
Systems and methods for data integration

Patent number: 10528532

Abstract: A computer implemented method for integrating data into a target database may include: providing a plurality of source databases which each may include a relational schema and data for integration into the target database; generating at least one complexity model based on the relational schema and data of each source database, each complexity model indicating at least one inconsistency between two or more of the data sources which may be require to be resolved to integrate the data from the data sources into the target database; and generating an effort model that may include an effort value for each inconsistency indicated by each complexity model, each effort value indicating at least one of a time period and a financial cost to resolve the inconsistency to integrate data from the data sources into the target database.

Type: Grant

Filed: June 30, 2014

Date of Patent: January 7, 2020

Assignees: Qatar Foundation, Hasso-Plattner-Institut Für Softwaresystemtechnik GmbH

Inventors: Paolo Papotti, Felix Naumann, Sebastian Kruse, El Kindi Rezig
Datasets profiling tools, methods, and systems

Patent number: 10318388

Abstract: A dataset profiling tool configured to identify unique and non-unique column combinations in a dataset which includes a plurality of tuples, the tool including: an inserts handler module configured to: receive one or more new tuples for insertion into the dataset, receive one or more minimal uniques and one or more maximal non-uniques for the dataset, identify and group, for each minimal unique, any tuples of the dataset and any of the one or more new tuples which contain duplicate values in the column combinations of the minimal unique, to form grouped tuples which are grouped according to the minimal unique to which the tuples relate, validate the grouped tuples to identify supersets of the minimal uniques for which duplicate values were identified, to generate a new set of one or more minimal uniques and one or more maximal non-uniques, and output the new set of one or more updated minimal uniques and one or more maximal non-uniques.

Type: Grant

Filed: May 20, 2014

Date of Patent: June 11, 2019

Assignee: Qatar Foundation

Inventors: Jorge Arnulfo Quiané Ruiz, Felix Naumann, Ziawasch Abedjan
SYSTEMS AND METHODS FOR DATA INTEGRATION

Publication number: 20160154830

Abstract: A computer implemented method for integrating data into a target database may include: providing a plurality of source databases which each may include a relational schema and data for integration into the target database; generating at least one complexity model based on the relational schema and data of each source database, each complexity model indicating at least one inconsistency between two or more of the data sources which may be require to be resolved to integrate the data from the data sources into the target database; and generating an effort model that may include an effort value for each inconsistency indicated by each complexity model, each effort value indicating at least one of a time period and a financial cost to resolve the inconsistency to integrate data from the data sources into the target database.

Type: Application

Filed: June 30, 2014

Publication date: June 2, 2016

Inventors: Paolo Papotti, Felix Naumann, Sebastian Kruse, EI Kindi Rezig
DATASETS PROFILING TOOLS, METHODS, AND SYSTEMS

Publication number: 20160139997

Abstract: A dataset profiling tool configured to identify unique and non-unique column combinations in a dataset which comprises a plurality of tuples, the tool including: an inserts handler module configured to: receive one or more new tuples for insertion into the dataset, receive one or more minimal uniques and one or more maximal non-uniques for the dataset, identify and group, for each minimal unique, any tuples of the dataset and any of the one or more new tuples which contain duplicate values in the column combinations of the minimal unique, to form grouped tuples which are grouped according to the minimal unique to which the tuples relate, validate the grouped tuples to identify supersets of the minimal uniques for which duplicate values were identified, to generate a new set of one or more minimal uniques and one or more maximal non-uniques, and output the new set of one or more updated minimal uniques and one or more maximal non-uniques.

Type: Application

Filed: May 20, 2014

Publication date: May 19, 2016

Inventors: Jorge Arnulfo Quiané Ruiz, Felix Naumann, Ziawasch Abedjan
METHOD AND SYSTEM TO DISCOVER DEPENDENCIES IN DATASETS

Publication number: 20160117415

Abstract: A method of processing data stored in a database which comprises a plurality of rows and columns, the method comprising identifying a plurality of sets of column combinations, each set of column combinations comprising an identifier of at least one column allocating each set of column combinations to one of a plurality of nodes mapping the nodes to a lattice structure in which the nodes are connected in a superset or subset relationship according to the set of column combinations of each node selecting a current node processing the data in the set of columns of the current node to detect if the column combination is unique or non-unique traversing the lattice to a next node which is connected to the current node processing the data in the set of columns of the next node to detect if the column combination of the next node is unique or non-unique; and storing a record of whether each processed set of column combinations is unique or non-unique.

Type: Application

Filed: July 10, 2013

Publication date: April 28, 2016

Inventors: Jorge Arnulfo QUIANÉ RUIZ, Felix NAUMANN, Arvid HEISE

Apparatus, system, and method for cross-platform data processing

Systems and methods for data integration

Datasets profiling tools, methods, and systems

SYSTEMS AND METHODS FOR DATA INTEGRATION

DATASETS PROFILING TOOLS, METHODS, AND SYSTEMS

METHOD AND SYSTEM TO DISCOVER DEPENDENCIES IN DATASETS