Patents by Inventor Konrad Emanowicz

Konrad Emanowicz has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Runtime control of automation accuracy using adjustable thresholds

Patent number: 11468192

Abstract: A computer-implemented method, computer program product and system for identifying pseudonymized data within data sources. One or more data repositories within one or more of the data sources are selected. One or more privacy data models are provided, where each of the privacy data models includes pattern(s) and/or parameter(s). One or more of the one or more privacy data models are selected. Data identification information is generated, where the data identification information indicates a presence or absence of pseudonymized data and of non-pseudonymized data within the one or more of the data sources. The data identification information is generated utilizing the pattern(s) and/or the parameter(s) to determine pseudonymized data.

Type: Grant

Filed: March 25, 2020

Date of Patent: October 11, 2022

Inventors: Pedro Barbas, Austin Clifford, Konrad Emanowicz, Patrick G. O'Sullivan
Test data generation and scale up for database testing using unique common factor sequencing

Patent number: 11163744

Abstract: Embodiments of the present invention provide a method, system and computer program product for test data generation using unique common factor sequencing. In an embodiment of the invention, a method for test data generation using unique common factor sequencing is provided. The method includes loading a table for population with test data in a test data generation tool executing in memory of a computer. A column set of multiple columns in the table associated with a key to the table can be selected for processing and different cardinality sequence values are assigned to the columns in the set such that the cardinality sequence values do not share a common factor except for unity as in the case of prime numbers.

Type: Grant

Filed: July 8, 2019

Date of Patent: November 2, 2021

Assignee: International Business Machines Corporation

Inventors: Austin Clifford, Konrad Emanowicz, Enda McCallig, Gary Murtagh, Clare Scally
Columnar database compression

Patent number: 11036684

Abstract: Disclosed is an approach comprising a column partitioned into a plurality of partitions including an empty partition and a plurality of filled partitions each comprising data entries associated with a set of parameters having parameter values, the data entries compressed in accordance with a compression dictionary. The approach comprises receiving forecasted parameter values for an expected set of data entries to be stored in an empty partition; predicting a recurrence frequency of the data entries in the expected set using the forecasted parameter values by evaluating the respective compression dictionaries of the filled partitions with a machine learning algorithm; generating a predictive compression dictionary for the expected set of data entries based on the predicted recurrence frequency of the data entries in the expected set; receiving the expected set of data entries; and compressing at least part of the received expected set of data entries using the predictive compression dictionary.

Type: Grant

Filed: November 29, 2018

Date of Patent: June 15, 2021

Assignee: International Business Machines Corporation

Inventors: Sami Abed, Pedro Barbas, Austin Clifford, Konrad Emanowicz
Test data management

Patent number: 10866973

Abstract: As disclosed herein, a method includes receiving a plurality of datasets from a database, wherein each dataset comprises one or more data fields represented in a single data format, and wherein the data fields from at least two of the datasets are represented in different data formats, combining the plurality of datasets to provide a created data column corresponding to all of the data fields from the plurality of datasets, organizing the data column into data clusters, wherein each data cluster includes data fields represented in a single data format, and wherein each data field belongs to a data cluster, providing a key-value map referencing data fields with respect to their corresponding data formats, and verifying the database with respect to the created column. A corresponding computer program product and computer system are also disclosed.

Type: Grant

Filed: December 7, 2017

Date of Patent: December 15, 2020

Assignee: International Business Machines Corporation

Inventors: Pedro M. Barbas, Konrad Emanowicz, Enda McCallig, Aslam F. Nomani, Lei Pan
Test data management

Patent number: 10860616

Abstract: As disclosed herein, a method includes receiving a plurality of datasets from a database, wherein each dataset comprises one or more data fields represented in a single data format, and wherein the data fields from at least two of the datasets are represented in different data formats, combining the plurality of datasets to provide a created data column corresponding to all of the data fields from the plurality of datasets, organizing the data column into data clusters, wherein each data cluster includes data fields represented in a single data format, and wherein each data field belongs to a data cluster, providing a key-value map referencing data fields with respect to their corresponding data formats, and verifying the database with respect to the created column. A corresponding computer program product and computer system are also disclosed.

Type: Grant

Filed: June 6, 2016

Date of Patent: December 8, 2020

Assignee: International Business Machines Corporation

Inventors: Pedro M. Barbas, Konrad Emanowicz, Enda McCallig, Aslam F. Nomani, Lei Pan
Data migration in a networked computer environment

Patent number: 10783124

Abstract: Data structures stored on a source database are migrated to a destination database in which the data are structured in a different format than that of the source database. Dictionaries are stored that are based on representations of the data structures stored on the source database that are formatted in other than the structural format used on the source database for the data structures. One of the data structures and a corresponding one of the dictionaries are transferred from the source database to a destination database. The transferred data structure is loaded onto the destination database in accordance with the transferred dictionary.

Type: Grant

Filed: December 6, 2017

Date of Patent: September 22, 2020

Assignee: International Business Machines Corporation

Inventors: Pedro M. Barbas, Konrad Emanowicz, Gareth Jenkins, Enda McCallig, Lei Pan
Identification of pseudonymized data within data sources

Patent number: 10747903

Abstract: A computer-implemented method, computer program product and system for identifying pseudonymized data within data sources. One or more data repositories within one or more of the data sources are selected. One or more privacy data models are provided, where each of the privacy data models includes pattern(s) and/or parameter(s). One or more of the one or more privacy data models are selected. Data identification information is generated, where the data identification information indicates a presence or absence of pseudonymized data and of non-pseudonymized data within the one or more of the data sources. The data identification information is generated utilizing the pattern(s) and/or the parameter(s) to determine pseudonymized data.

Type: Grant

Filed: April 25, 2019

Date of Patent: August 18, 2020

Assignee: International Business Machines Corporation

Inventors: Pedro Barbas, Austin Clifford, Konrad Emanowicz, Patrick G. O'Sullivan
RUNTIME CONTROL OF AUTOMATION ACCURACY USING ADJUSTABLE THRESHOLDS

Publication number: 20200226289

Abstract: A computer-implemented method, computer program product and system for identifying pseudonymized data within data sources. One or more data repositories within one or more of the data sources are selected. One or more privacy data models are provided, where each of the privacy data models includes pattern(s) and/or parameter(s). One or more of the one or more privacy data models are selected. Data identification information is generated, where the data identification information indicates a presence or absence of pseudonymized data and of non-pseudonymized data within the one or more of the data sources. The data identification information is generated utilizing the pattern(s) and/or the parameter(s) to determine pseudonymized data.

Type: Application

Filed: March 25, 2020

Publication date: July 16, 2020

Inventors: Pedro Barbas, Austin Clifford, Konrad Emanowicz, Patrick G. O'Sullivan
Identification of pseudonymized data within data sources

Patent number: 10657287

Abstract: A computer-implemented method, computer program product and system for identifying pseudonymized data within data sources. One or more data repositories within one or more of the data sources are selected. One or more privacy data models are provided, where each of the privacy data models includes pattern(s) and/or parameter(s). One or more of the one or more privacy data models are selected. Data identification information is generated, where the data identification information indicates a presence or absence of pseudonymized data and of non-pseudonymized data within the one or more of the data sources. The data identification information is generated utilizing the pattern(s) and/or the parameter(s) to determine pseudonymized data.

Type: Grant

Filed: November 1, 2017

Date of Patent: May 19, 2020

Assignee: International Business Machines Corporation

Inventors: Pedro Barbas, Austin Clifford, Konrad Emanowicz, Patrick G. O'Sullivan
Data migration in a networked computer environment

Patent number: 10649965

Abstract: Data structures stored on a source database are migrated to a destination database in which the data are structured in a different format than that of the source database. Dictionaries are stored that are based on representations of the data structures stored on the source database that are formatted in other than the structural format used on the source database for the data structures. One of the data structures and a corresponding one of the dictionaries are transferred from the source database to a destination database. The transferred data structure is loaded onto the destination database in accordance with the transferred dictionary.

Type: Grant

Filed: November 14, 2016

Date of Patent: May 12, 2020

Assignee: International Business Machines Corporation

Inventors: Pedro M. Barbas, Konrad Emanowicz, Gareth Jenkins, Enda McCallig, Lei Pan
TEST DATA GENERATION AND SCALE UP FOR DATABASE TESTING USING UNIQUE COMMON FACTOR SEQUENCING

Publication number: 20190332592

Abstract: Embodiments of the present invention provide a method, system and computer program product for test data generation using unique common factor sequencing. In an embodiment of the invention, a method for test data generation using unique common factor sequencing is provided. The method includes loading a table for population with test data in a test data generation tool executing in memory of a computer. A column set of multiple columns in the table associated with a key to the table can be selected for processing and different cardinality sequence values are assigned to the columns in the set such that the cardinality sequence values do not share a common factor except for unity as in the case of prime numbers.

Type: Application

Filed: July 8, 2019

Publication date: October 31, 2019

Inventors: Austin CLIFFORD, Konrad Emanowicz, Enda McCallig, Gary Murtagh, Clare Scally
RUNTIME CONTROL OF AUTOMATION ACCURACY USING ADJUSTABLE THRESHOLDS

Publication number: 20190251292

Abstract: A computer-implemented method, computer program product and system for identifying pseudonymized data within data sources. One or more data repositories within one or more of the data sources are selected. One or more privacy data models are provided, where each of the privacy data models includes pattern(s) and/or parameter(s). One or more of the one or more privacy data models are selected. Data identification information is generated, where the data identification information indicates a presence or absence of pseudonymized data and of non-pseudonymized data within the one or more of the data sources. The data identification information is generated utilizing the pattern(s) and/or the parameter(s) to determine pseudonymized data.

Type: Application

Filed: April 25, 2019

Publication date: August 15, 2019

Inventors: Pedro Barbas, Austin Clifford, Konrad Emanowicz, Patrick G. O'Sullivan
Test data generation and scale up for database testing using unique common factor sequencing

Patent number: 10346380

Abstract: Embodiments of the present invention provide a method, system and computer program product for test data generation using unique common factor sequencing. In an embodiment of the invention, a method for test data generation using unique common factor sequencing is provided. The method includes loading a table for population with test data in a test data generation tool executing in a memory of a computer. A column set of multiple columns in the table associated with a key to the table is selected for processing and different cardinality sequence values are assigned to the columns in the set such that the cardinality sequence values do not share a common factor except for unity as in the case of prime numbers.

Type: Grant

Filed: September 19, 2015

Date of Patent: July 9, 2019

Assignee: International Business Machines Corporation

Inventors: Austin Clifford, Konrad Emanowicz, Enda McCallig, Gary Murtagh, Clare Scally
RUNTIME CONTROL OF AUTOMATION ACCURACY USING ADJUSTABLE THRESHOLDS

Publication number: 20190130132

Abstract: A computer-implemented method, computer program product and system for identifying pseudonymized data within data sources. One or more data repositories within one or more of the data sources are selected. One or more privacy data models are provided, where each of the privacy data models includes pattern(s) and/or parameter(s). One or more of the one or more privacy data models are selected. Data identification information is generated, where the data identification information indicates a presence or absence of pseudonymized data and of non-pseudonymized data within the one or more of the data sources. The data identification information is generated utilizing the pattern(s) and/or the parameter(s) to determine pseudonymized data.

Type: Application

Filed: November 1, 2017

Publication date: May 2, 2019

Inventors: Pedro Barbas, Austin Clifford, Konrad Emanowicz, Patrick G. O'Sullivan
COLUMNAR DATABASE COMPRESSION

Publication number: 20190095461

Abstract: Disclosed is an approach comprising a column partitioned into a plurality of partitions including an empty partition and a plurality of filled partitions each comprising data entries associated with a set of parameters having parameter values, the data entries compressed in accordance with a compression dictionary. The approach comprises receiving forecasted parameter values for an expected set of data entries to be stored in an empty partition; predicting a recurrence frequency of the data entries in the expected set using the forecasted parameter values by evaluating the respective compression dictionaries of the filled partitions with a machine learning algorithm; generating a predictive compression dictionary for the expected set of data entries based on the predicted recurrence frequency of the data entries in the expected set; receiving the expected set of data entries; and compressing at least part of the received expected set of data entries using the predictive compression dictionary.

Type: Application

Filed: November 29, 2018

Publication date: March 28, 2019

Inventors: Sami Abed, Pedro Barbas, Austin Clifford, Konrad Emanowicz
Columnar database compression

Patent number: 10169361

Abstract: Disclosed is a computer-implemented method of compressing data in a columnar database comprising at least one column partitioned into a plurality of partitions including at least one empty partition and a plurality of filled partitions each comprising data entries associated with a set of parameters having parameter values relevant to the recurrence frequency of the data entry in the partition, the data entries being compressed in accordance with a compression dictionary based on the respective recurrence frequencies of the data entries in the filled partition.

Type: Grant

Filed: November 16, 2015

Date of Patent: January 1, 2019

Assignee: International Business Machines Corporation

Inventors: Sami Abed, Pedro M Barbas, Austin Clifford, Konrad Emanowicz
DATA MIGRATION IN A NETWORKED COMPUTER ENVIRONMENT

Publication number: 20180137114

Abstract: Data structures stored on a source database are migrated to a destination database in which the data are structured in a different format than that of the source database. Dictionaries are stored that are based on representations of the data structures stored on the source database that are formatted in other than the structural format used on the source database for the data structures. One of the data structures and a corresponding one of the dictionaries are transferred from the source database to a destination database. The transferred data structure is loaded onto the destination database in accordance with the transferred dictionary.

Type: Application

Filed: December 6, 2017

Publication date: May 17, 2018

Inventors: Pedro M. Barbas, Konrad Emanowicz, Gareth Jenkins, Enda McCallig, Lei Pan
DATA MIGRATION IN A NETWORKED COMPUTER ENVIRONMENT

Publication number: 20180137112

Abstract: Data structures stored on a source database are migrated to a destination database in which the data are structured in a different format than that of the source database. Dictionaries are stored that are based on representations of the data structures stored on the source database that are formatted in other than the structural format used on the source database for the data structures. One of the data structures and a corresponding one of the dictionaries are transferred from the source database to a destination database. The transferred data structure is loaded onto the destination database in accordance with the transferred dictionary.

Type: Application

Filed: November 14, 2016

Publication date: May 17, 2018

Inventors: Pedro M. Barbas, Konrad Emanowicz, Gareth Jenkins, Enda McCallig, Lei Pan
TEST DATA MANAGEMENT

Publication number: 20180096051

Abstract: As disclosed herein, a method includes receiving a plurality of datasets from a database, wherein each dataset comprises one or more data fields represented in a single data format, and wherein the data fields from at least two of the datasets are represented in different data formats, combining the plurality of datasets to provide a created data column corresponding to all of the data fields from the plurality of datasets, organizing the data column into data clusters, wherein each data cluster includes data fields represented in a single data format, and wherein each data field belongs to a data cluster, providing a key-value map referencing data fields with respect to their corresponding data formats, and verifying the database with respect to the created column. A corresponding computer program product and computer system are also disclosed.

Type: Application

Filed: December 7, 2017

Publication date: April 5, 2018

Inventors: Pedro M. Barbas, Konrad Emanowicz, Enda McCallig, Aslam F. Nomani, Lei Pan
TEST DATA MANAGEMENT

Publication number: 20170351746

Abstract: As disclosed herein, a method includes receiving a plurality of datasets from a database, wherein each dataset comprises one or more data fields represented in a single data format, and wherein the data fields from at least two of the datasets are represented in different data formats, combining the plurality of datasets to provide a created data column corresponding to all of the data fields from the plurality of datasets, organizing the data column into data clusters, wherein each data cluster includes data fields represented in a single data format, and wherein each data field belongs to a data cluster, providing a key-value map referencing data fields with respect to their corresponding data formats, and verifying the database with respect to the created column. A corresponding computer program product and computer system are also disclosed.

Type: Application

Filed: June 6, 2016

Publication date: December 7, 2017

Inventors: Pedro M. Barbas, Konrad Emanowicz, Enda McCallig, Aslam F. Nomani, Lei Pan

1 2 next