Patents by Inventor Austin Clifford
Austin Clifford has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10346380Abstract: Embodiments of the present invention provide a method, system and computer program product for test data generation using unique common factor sequencing. In an embodiment of the invention, a method for test data generation using unique common factor sequencing is provided. The method includes loading a table for population with test data in a test data generation tool executing in a memory of a computer. A column set of multiple columns in the table associated with a key to the table is selected for processing and different cardinality sequence values are assigned to the columns in the set such that the cardinality sequence values do not share a common factor except for unity as in the case of prime numbers.Type: GrantFiled: September 19, 2015Date of Patent: July 9, 2019Assignee: International Business Machines CorporationInventors: Austin Clifford, Konrad Emanowicz, Enda McCallig, Gary Murtagh, Clare Scally
-
Publication number: 20190130132Abstract: A computer-implemented method, computer program product and system for identifying pseudonymized data within data sources. One or more data repositories within one or more of the data sources are selected. One or more privacy data models are provided, where each of the privacy data models includes pattern(s) and/or parameter(s). One or more of the one or more privacy data models are selected. Data identification information is generated, where the data identification information indicates a presence or absence of pseudonymized data and of non-pseudonymized data within the one or more of the data sources. The data identification information is generated utilizing the pattern(s) and/or the parameter(s) to determine pseudonymized data.Type: ApplicationFiled: November 1, 2017Publication date: May 2, 2019Inventors: Pedro Barbas, Austin Clifford, Konrad Emanowicz, Patrick G. O'Sullivan
-
Publication number: 20190095461Abstract: Disclosed is an approach comprising a column partitioned into a plurality of partitions including an empty partition and a plurality of filled partitions each comprising data entries associated with a set of parameters having parameter values, the data entries compressed in accordance with a compression dictionary. The approach comprises receiving forecasted parameter values for an expected set of data entries to be stored in an empty partition; predicting a recurrence frequency of the data entries in the expected set using the forecasted parameter values by evaluating the respective compression dictionaries of the filled partitions with a machine learning algorithm; generating a predictive compression dictionary for the expected set of data entries based on the predicted recurrence frequency of the data entries in the expected set; receiving the expected set of data entries; and compressing at least part of the received expected set of data entries using the predictive compression dictionary.Type: ApplicationFiled: November 29, 2018Publication date: March 28, 2019Inventors: Sami Abed, Pedro Barbas, Austin Clifford, Konrad Emanowicz
-
Patent number: 10169361Abstract: Disclosed is a computer-implemented method of compressing data in a columnar database comprising at least one column partitioned into a plurality of partitions including at least one empty partition and a plurality of filled partitions each comprising data entries associated with a set of parameters having parameter values relevant to the recurrence frequency of the data entry in the partition, the data entries being compressed in accordance with a compression dictionary based on the respective recurrence frequencies of the data entries in the filled partition.Type: GrantFiled: November 16, 2015Date of Patent: January 1, 2019Assignee: International Business Machines CorporationInventors: Sami Abed, Pedro M Barbas, Austin Clifford, Konrad Emanowicz
-
Patent number: 9773027Abstract: In an exemplary embodiment of this disclosure, a method for loading data from a backup image of a database includes selecting a subset statement defining a subset of the data in the database. Tables of the database are identified based on metadata of the database. A target database is written having the structure but not the data of the identified tables. One or more table statements are constructed, by a computer processor, defining a subset of each identified table based on the subset statement. Selected data is unloaded from a backup image into the target database using respective table statements as filters.Type: GrantFiled: September 11, 2015Date of Patent: September 26, 2017Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Sami Abed, Austin Clifford, Konrad Emanowicz, Gareth Jenkins
-
Publication number: 20170139947Abstract: Disclosed is a computer-implemented method of compressing data in a columnar database comprising at least one column partitioned into a plurality of partitions including at least one empty partition and a plurality of filled partitions each comprising data entries associated with a set of parameters having parameter values relevant to the recurrence frequency of the data entry in the partition, the data entries being compressed in accordance with a compression dictionary based on the respective recurrence frequencies of the data entries in the filled partition.Type: ApplicationFiled: November 16, 2015Publication date: May 18, 2017Inventors: Sami Abed, Pedro M. Barbas, Austin Clifford, Konrad Emanowicz
-
Patent number: 9589019Abstract: A method and system are provided for performance analysis of a database. The method includes receiving a proposed data model, generating a hypothetical query workload using a plurality of sample query templates representing different query constructs for the proposed data model, generating hypothetical optimizer statistics using predefined generating rules that include a projected cardinality for the proposed data model and creating a sample empty database and database schema using the proposed data model. The method also includes applying the hypothetical optimizer statistics to the sample empty database, based on generating the hypothetical optimizer statistics, applying each query construct of the hypothetical query workload to the database schema and estimating a cost of the hypothetical query workload for the proposed data model.Type: GrantFiled: May 6, 2013Date of Patent: March 7, 2017Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Austin Clifford, Konrad Emanowicz, Enda McCallig
-
Patent number: 9471607Abstract: In an exemplary embodiment of this disclosure, a method for loading data from a backup image of a database includes selecting a subset statement defining a subset of the data in the database. Tables of the database are identified based on metadata of the database. A target database is written having the structure but not the data of the identified tables. One or more table statements are constructed, by a computer processor, defining a subset of each identified table based on the subset statement. Selected data is unloaded from a backup image into the target database using respective table statements as filters.Type: GrantFiled: August 14, 2013Date of Patent: October 18, 2016Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Sami Abed, Austin Clifford, Konrad Emanowicz, Gareth Jenkins
-
Publication number: 20160299952Abstract: Provided are a system, method and computer program product for redistribution of data in an online shared nothing database, said shared nothing database comprising a plurality of original partitions and at least one new partition.Type: ApplicationFiled: December 9, 2014Publication date: October 13, 2016Applicant: International Business Machines CorporationInventors: Enzo Cialini, Austin Clifford, Garrett Fitzsimons
-
Patent number: 9465840Abstract: Dynamically identifying and preventing skewed partitions in a shared-nothing database is provided. The database management system software receives a parameter for identifying a threshold value associated with at least one distribution key value. Optimizer statistics are gathered on a first table that is distributed across one or more partitions in the shared-nothing database, wherein the first table includes a first table name. Distribution key skew is identified based on the gathered optimizer statistics indicating the threshold value being exceeded. A second table with an alternate distribution key, is created having a second table name for receiving overflow data rows associated with the at least one distribution key value based on the identified distribution key skew. A union all view is created based on the first and the second table.Type: GrantFiled: March 14, 2014Date of Patent: October 11, 2016Assignee: International Business Machines CorporationInventors: Austin Clifford, Konrad Emanowicz, Enda McCallig
-
Patent number: 9460152Abstract: Dynamically identifying and preventing skewed partitions in a shared-nothing database is provided. The database management system software receives a parameter for identifying a threshold value associated with at least one distribution key value. Optimizer statistics are gathered on a first table that is distributed across one or more partitions in the shared-nothing database, wherein the first table includes a first table name. Distribution key skew is identified based on the gathered optimizer statistics indicating the threshold value being exceeded. A second table with an alternate distribution key, is created having a second table name for receiving overflow data rows associated with the at least one distribution key value based on the identified distribution key skew. A union all view is created based on the first and the second table.Type: GrantFiled: November 7, 2014Date of Patent: October 4, 2016Assignee: International Business Machines CorporationInventors: Austin Clifford, Konrad Emanowicz, Enda McCallig
-
Patent number: 9436734Abstract: Embodiments of the present invention provide a method, system and computer program product for pre-migration performance prediction of a database management system (DBMS). In an embodiment of the invention, a method for pre-migration performance prediction of a DBMS can include executing a calibration workload in a target DBMS to produce a conversion factor of cost of executing the calibration workload to temporal performance of executing the calibration workload. The method also can include subsequently submitting a sample workload from a database of a source DBMS for evaluation of cost of execution on an empty replica of the database in the target DBMS. Finally, the method can include predicting a temporal performance of the sample workload in the target DBMS as a product of the conversion factor and the cost of execution of the sample workload on the empty replica of the database in the target DBMS.Type: GrantFiled: October 20, 2013Date of Patent: September 6, 2016Assignee: International Business Machines CorporationInventors: Austin Clifford, Enda McCallig
-
Publication number: 20160188418Abstract: In an exemplary embodiment of this disclosure, a method for loading data from a backup image of a database includes selecting a subset statement defining a subset of the data in the database. Tables of the database are identified based on metadata of the database. A target database is written having the structure but not the data of the identified tables. One or more table statements are constructed, by a computer processor, defining a subset of each identified table based on the subset statement. Selected data is unloaded from a backup image into the target database using respective table statements as filters.Type: ApplicationFiled: September 11, 2015Publication date: June 30, 2016Inventors: Sami Abed, Austin Clifford, Konrad Emanowicz, Gareth Jenkins
-
Publication number: 20160012093Abstract: Embodiments of the present invention provide a method, system and computer program product for test data generation using unique common factor sequencing. In an embodiment of the invention, a method for test data generation using unique common factor sequencing is provided. The method includes loading a table for population with test data in a test data generation tool executing in memory of a computer. A column set of multiple columns in the table associated with a key to the table can be selected for processing and different cardinality sequence values are assigned to the columns in the set such that the cardinality sequence values do not share a common factor except for unity as in the case of prime numbers.Type: ApplicationFiled: September 19, 2015Publication date: January 14, 2016Inventors: Austin CLIFFORD, Konrad EMANOWICZ, Enda McCALLIG, Gary MURTAGH, Clare SCALLY
-
Patent number: 9171025Abstract: Embodiments of the present invention provide a system and computer program product for test data generation using unique common factor sequencing. In an embodiment of the invention, a computer program product for test data generation using unique common factor sequencing is provided. The computer program product includes loading a table for population with test data in a test data generation tool executing in a memory of a computer. A column set of multiple columns in the table associated with a key to the table is selected for processing and different cardinality sequence values are assigned to the columns in the set such that the cardinality sequence values do not share a common factor except for unity as in the case of prime numbers.Type: GrantFiled: August 23, 2013Date of Patent: October 27, 2015Assignee: International Business Machines CorporationInventors: Austin Clifford, Konrad Emanowicz, Enda McCallig, Gary Murtagh, Clare Scally
-
Patent number: 9171026Abstract: Embodiments of the present invention provide a method for test data generation using unique common factor sequencing. In an embodiment of the invention, a method for test data generation using unique common factor sequencing is provided. The method includes loading a table for population with test data in a test data generation tool executing in a memory of a computer. A column set of multiple columns in the table associated with a key to the table is selected for processing and different cardinality sequence values are assigned to the columns in the set such that the cardinality sequence values do not share a common factor except for unity as in the case of prime numbers.Type: GrantFiled: August 23, 2013Date of Patent: October 27, 2015Assignee: International Business Machines CorporationInventors: Austin Clifford, Konrad Emanowicz, Enda McCallig, Gary Murtagh, Clare Scally
-
Publication number: 20150261816Abstract: Dynamically identifying and preventing skewed partitions in a shared-nothing database is provided. The database management system software receives a parameter for identifying a threshold value associated with at least one distribution key value. Optimizer statistics are gathered on a first table that is distributed across one or more partitions in the shared-nothing database, wherein the first table includes a first table name. Distribution key skew is identified based on the gathered optimizer statistics indicating the threshold value being exceeded. A second table with an alternate distribution key, is created having a second table name for receiving overflow data rows associated with the at least one distribution key value based on the identified distribution key skew. A union all view is created based on the first and the second table.Type: ApplicationFiled: March 14, 2014Publication date: September 17, 2015Applicant: International Business Machines CorporationInventors: Austin Clifford, Konrad Emanowicz, Enda McCallig
-
Publication number: 20150261840Abstract: Dynamically identifying and preventing skewed partitions in a shared-nothing database is provided. The database management system software receives a parameter for identifying a threshold value associated with at least one distribution key value. Optimizer statistics are gathered on a first table that is distributed across one or more partitions in the shared-nothing database, wherein the first table includes a first table name. Distribution key skew is identified based on the gathered optimizer statistics indicating the threshold value being exceeded. A second table with an alternate distribution key, is created having a second table name for receiving overflow data rows associated with the at least one distribution key value based on the identified distribution key skew. A union all view is created based on the first and the second table.Type: ApplicationFiled: November 7, 2014Publication date: September 17, 2015Inventors: Austin Clifford, Konrad Emanowicz, Enda McCallig
-
Patent number: 9135315Abstract: Aspects include data masking in database operations including intercepting a database query and identifying masked data in the query. The masked data in the query is unmasked by substituting the masked data with equivalent real values. The unmasked query is sent to the database engine, and a result provided by the database engine in response to the unmasked query is intercepted. The values in the result that correspond to sensitive information are identified. The result is masked by substituting the values in the result corresponding to sensitive information with masked equivalents. The masked result is returned to a user.Type: GrantFiled: February 26, 2013Date of Patent: September 15, 2015Assignee: Internatonal Business Machines CorporationInventors: Pedro Barbas, Austin Clifford, Gareth Jenkins, Brian McKeown
-
Patent number: 9110948Abstract: Embodiments of the present invention provide a method, system and computer program product for pre-migration performance prediction of a database management system (DBMS). In an embodiment of the invention, a method for pre-migration performance prediction of a DBMS can include executing a calibration workload in a target DBMS to produce a conversion factor of cost of executing the calibration workload to temporal performance of executing the calibration workload. The method also can include subsequently submitting a sample workload from a database of a source DBMS for evaluation of cost of execution on an empty replica of the database in the target DBMS. Finally, the method can include predicting a temporal performance of the sample workload in the target DBMS as a product of the conversion factor and the cost of execution of the sample workload on the empty replica of the database in the target DBMS.Type: GrantFiled: November 9, 2012Date of Patent: August 18, 2015Assignee: International Business Machines CorporationInventors: Austin Clifford, Enda McCallig