Patents by Inventor Grzegorz S. Milka

Grzegorz S. Milka has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Optimizing an order of execution of multiple join operations

Patent number: 10061804

Abstract: A computer-implemented method, system, and/or computer program product optimizes an order of execution of column join operations. A first partitioning of the first data column splits the first data column into first subsets of rows. A second partitioning of the second data column splits the second data column into a second subsets of rows. Cardinalities of sub-tables derived by a respective joining of the subsets of rows of the first and second data columns are estimated, based on the first and second value frequency information. An order of execution of multiple join operations is then optimized based on the estimated cardinalities of the sub-tables.

Type: Grant

Filed: October 30, 2017

Date of Patent: August 28, 2018

Assignee: International Business Machines Corporation

Inventors: Marek Grochowski, Artur M. Gruszecki, Tomasz Kazalski, Grzegorz S. Milka, Konrad K. Skibski, Tomasz Stradomski
OPTIMIZING AN ORDER OF EXECUTION OF MULTIPLE JOIN OPERATIONS

Publication number: 20180046674

Abstract: A computer-implemented method, system, and/or computer program product optimizes an order of execution of column join operations. A first partitioning of the first data column splits the first data column into first subsets of rows. A second partitioning of the second data column splits the second data column into a second subsets of rows. Cardinalities of sub-tables derived by a respective joining of the subsets of rows of the first and second data columns are estimated, based on the first and second value frequency information. An order of execution of multiple join operations is then optimized based on the estimated cardinalities of the sub-tables.

Type: Application

Filed: October 30, 2017

Publication date: February 15, 2018

Inventors: MAREK GROCHOWSKI, ARTUR M. GRUSZECKI, TOMASZ KAZALSKI, GRZEGORZ S. MILKA, KONRAD K. SKIBSKI, TOMASZ STRADOMSKI
Optimizing an order of execution of multiple join operations

Patent number: 9852181

Abstract: A computer-implemented method, system, and/or computer program product optimizes an order of execution of column join operations. A first partitioning of the first data column splits the first data column into first subsets of rows. A second partitioning of the second data column splits the second data column into a second subsets of rows. A first value frequency information indicates a frequency of attribute values within a subset of rows of the first data column processed. A second value frequency information indicates a frequency of attribute values within a subset of rows of the second data column. Cardinalities of sub-tables derived by a respective joining of the subsets of rows of the first and second data columns are estimated, based on the first and second value frequency information. An order of execution of multiple join operations is then optimized based on the estimated cardinalities of the sub-tables.

Type: Grant

Filed: November 11, 2013

Date of Patent: December 26, 2017

Assignee: International Business Machines Corporation

Inventors: Marek Grochowski, Artur M. Gruszecki, Tomasz Kazalski, Grzegorz S. Milka, Konrad K. Skibski, Tomasz Stradomski
Avoidance of intermediate data skew in a massive parallel processing environment

Patent number: 9569493

Abstract: A computer-implemented method for minimizing join operation processing time within a database system based on estimated joined table spread of the database system has been provided. The computer-implemented method includes, estimating value distribution of data in a joined table, wherein the joined table is a result of join operation between two instances of tables of a database system. The computer-implemented method further includes determining boundaries for partitioning at least one range of attributes of the estimated value distribution, wherein the boundaries for partitioning at least one range of attributes of the estimated value distribution corresponds to a same number of rows of the joined table. The computer-implemented method further includes determining at least one assignment of the determined partition of the at least one range of attributes to processing units of the database system.

Type: Grant

Filed: December 31, 2013

Date of Patent: February 14, 2017

Assignee: International Business Machines Corporatin

Inventors: Lukasz Gaza, Artur M. Gruszecki, Tomasz Kazalski, Grzegorz S. Milka, Konrad K. Skibski, Tomasz Stradomski
Avoidance of intermediate data skew in a massive parallel processing environment

Patent number: 9569494

Abstract: A computer-implemented method for minimizing join operation processing time within a database system based on estimated joined table spread of the database system has been provided. The computer-implemented method includes estimating value distribution of data in a joined table, wherein the joined table is a result of join operation between two instances of tables of a database system. The computer-implemented method further includes determining boundaries for partitioning at least one range of attributes of the estimated value distribution, wherein the boundaries for partitioning at least one range of attributes of the estimated value distribution corresponds to a same number of rows of the joined table. The computer-implemented method further includes determining at least one assignment of the determined partition of the at least one range of attributes to processing units of the database system.

Type: Grant

Filed: June 24, 2014

Date of Patent: February 14, 2017

Assignee: International Business Machines Corporation

Inventors: Lukasz Gaza, Artur M. Gruszecki, Tomasz Kazalski, Grzegorz S. Milka, Konrad K. Skibski, Tomasz Stradomski
Method and system for estimating the size of a joined table

Patent number: 9460153

Abstract: A method, system, and/or computer program product estimate a cardinality of a joined table (T) obtained by joining at least a first data column (R) and a second data column (S), where R and S each comprise attribute values. A first density distribution function f(x) describes a frequency of the attribute values of R. A second density distribution function (g(x)) describes the frequency of the attribute values of S. A first information on values in R is based on a sample of values of R. A second information on values in S is based on a sample of values of S. One or more processors then estimate a cardinality of a joined table (T) based on the first and second density distribution function (f(x), g(x)) and the first and second information on values.

Type: Grant

Filed: October 14, 2013

Date of Patent: October 4, 2016

Assignee: International Business Machines Corporation

Inventors: Artur M. Gruszecki, Tomasz Kazalski, Grzegorz S. Milka, Konrad K. Skibski, Tomasz Stradomski
Functionality of decomposition data skew in asymmetric massively parallel processing databases

Patent number: 9355127

Abstract: Database queries are optimized through the functionality of decomposition data skew in an asymmetric massively parallel processing database system. A table having data skew is restructured by (1) storing original data values of a distribution key in a special switch column added to the table, (2) replacing the original data values of the distribution key with modified data values such as randomly generated data values, and (3) partitioning the rows across the nodes of the asymmetric massively parallel processing database system based on the distribution key. The original data values that are stored and replaced may only comprise a subset of the original data values that cause data skew in the table. Data skew is reduced, which improves performance, yet the original data values remain available, which reduces the impact on collocated joins.

Type: Grant

Filed: October 12, 2012

Date of Patent: May 31, 2016

Assignee: International Business Machines Corporation

Inventors: Lukasz Gaza, Artur M. Gruszecki, Tomasz Kazalski, Grzegorz S. Milka, Konrad Krzysztof Skibski, Tomasz Stradomski, Natalya A. Yanayt
AVOIDANCE OF INTERMEDIATE DATA SKEW IN A MASSIVE PARALLEL PROCESSING ENVIRONMENT

Publication number: 20150186465

Abstract: A computer-implemented method for minimizing join operation processing time within a database system based on estimated joined table spread of the database system has been provided. The computer-implemented method includes, estimating value distribution of data in a joined table, wherein the joined table is a result of join operation between two instances of tables of a database system. The computer-implemented method further includes determining boundaries for partitioning at least one range of attributes of the estimated value distribution, wherein the boundaries for partitioning at least one range of attributes of the estimated value distribution corresponds to a same number of rows of the joined table. The computer-implemented method further includes determining at least one assignment of the determined partition of the at least one range of attributes to processing units of the database system.

Type: Application

Filed: December 31, 2013

Publication date: July 2, 2015

Applicant: International Business Machines Corporation

Inventors: Lukasz Gaza, ARTUR M. GRUSZECKI, TOMASZ KAZALSKI, GRZEGORZ S. MILKA, KONRAD K. SKIBSKI, TOMASZ STRADOMSKI
AVOIDANCE OF INTERMEDIATE DATA SKEW IN A MASSIVE PARALLEL PROCESSING ENVIRONMENT

Publication number: 20150186466

Abstract: A computer-implemented method for minimizing join operation processing time within a database system based on estimated joined table spread of the database system has been provided. The computer-implemented method includes estimating value distribution of data in a joined table, wherein the joined table is a result of join operation between two instances of tables of a database system. The computer-implemented method further includes determining boundaries for partitioning at least one range of attributes of the estimated value distribution, wherein the boundaries for partitioning at least one range of attributes of the estimated value distribution corresponds to a same number of rows of the joined table. The computer-implemented method further includes determining at least one assignment of the determined partition of the at least one range of attributes to processing units of the database system.

Type: Application

Filed: June 24, 2014

Publication date: July 2, 2015

Inventors: Lukasz Gaza, ARTUR M. GRUSZECKI, TOMASZ KAZALSKI, GRZEGORZ S. MILKA, KONRAD K. SKIBSKI, TOMASZ STRADOMSKI
OPTIMIZING AN ORDER OF EXECUTION OF MULTIPLE JOIN OPERATIONS

Publication number: 20140156635

Abstract: A computer-implemented method, system, and/or computer program product optimizes an order of execution of column join operations. A first partitioning of the first data column splits the first data column into first subsets of rows. A second partitioning of the second data column splits the second data column into a second subsets of rows. A first value frequency information indicates a frequency of attribute values within a subset of rows of the first data column processed. A second value frequency information indicates a frequency of attribute values within a subset of rows of the second data column. Cardinalities of sub-tables derived by a respective joining of the subsets of rows of the first and second data columns are estimated, based on the first and second value frequency information. An order of execution of multiple join operations is then optimized based on the estimated cardinalities of the sub-tables.

Type: Application

Filed: November 11, 2013

Publication date: June 5, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: MAREK GROCHOWSKI, ARTUR M. GRUSZECKI, TOMASZ KAZALSKI, GRZEGORZ S. MILKA, KONRAD K. SKIBSKI, TOMASZ STRADOMSKI
METHOD AND SYSTEM FOR ESTIMATING THE SIZE OF A JOINED TABLE

Publication number: 20140149388

Abstract: A method, system, and/or computer program product estimate a cardinality of a joined table (T) obtained by joining at least a first data column (R) and a second data column (S), where R and S each comprise attribute values. A first density distribution function f(x) describes a frequency of the attribute values of R. A second density distribution function (g(x)) describes the frequency of the attribute values of S. A first information on values in R is based on a sample of values of R. A second information on values in S is based on a sample of values of S. One or more processors then estimate a cardinality of a joined table (T) based on the first and second density distribution function (f(x), g(x)) and the first and second information on values.

Type: Application

Filed: October 14, 2013

Publication date: May 29, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: ARTUR M. GRUSZECKI, TOMASZ KAZALSKI, GRZEGORZ S. MILKA, KONRAD K. SKIBSKI, TOMASZ STRADOMSKI
FUNCTIONALITY OF DECOMPOSITION DATA SKEW IN ASYMMETRIC MASSIVELY PARALLEL PROCESSING DATABASES

Publication number: 20140108459

Abstract: Database queries are optimized through the functionality of decomposition data skew in an asymmetric massively parallel processing database system. A table having data skew is restructured by (1) storing original data values of a distribution key in a special switch column added to the table, (2) replacing the original data values of the distribution key with modified data values such as randomly generated data values, and (3) partitioning the rows across the nodes of the asymmetric massively parallel processing database system based on the distribution key. The original data values that are stored and replaced may only comprise a subset of the original data values that cause data skew in the table. Data skew is reduced, which improves performance, yet the original data values remain available, which reduces the impact on collocated joins.

Type: Application

Filed: October 12, 2012

Publication date: April 17, 2014

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lukasz Gaza, Artur M. Gruszecki, Tomasz Kazalski, Grzegorz S. Milka, Konrad Krzysztof Skibski, Tomasz Stradomski, Natalya A. Yanayt