Patents by Inventor Vincent Corvinelli
Vincent Corvinelli has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250013641Abstract: Aspects of the invention include techniques for robust query execution plan selection. A non-limiting example method includes training a model to estimate, for an input including a query and one or more plans, an execution time for each plan and a respective uncertainty of the execution time. A new query and a search space including a plurality of candidate plans are input to the model. Given an estimated distribution for the execution time of each candidate plan, a suboptimality risk is computed for each candidate plan. A plan of the plurality of candidate plans is selected according to a plan selection policy. The plan selection policy includes at least one of: selecting a plan by assuming that plans have higher costs proportional to an estimated standard deviation of their respective uncertainty; and selecting a plan with minimum risk using model uncertainty, data uncertainty, or a total uncertainty.Type: ApplicationFiled: August 10, 2023Publication date: January 9, 2025Inventors: SEYED MOHAMMAD AMIN KAMALI, CALISTO ZUZARTE, Vincent Corvinelli, Brandon Lewis Frendo, Vasiliki Kantere, Ning WANG
-
Publication number: 20240427768Abstract: Aspects of the invention include techniques for providing a learned join cardinality estimation using a join graph representation. A non-limiting example method includes building a join cardinality estimation model. The model can be built by generating a training query having a known join cardinality, generating an adjacency matrix encoding a join graph of the training query, encoding one side of a diagonal axis of the adjacency matrix, and training the join cardinality estimation model using the encoded adjacency matrix and the known join cardinality. The method includes performing an inference using the join cardinality estimation model. The inference includes a predicted join cardinality for a query. The method includes executing a query execution plan for the query using the predicted join cardinality.Type: ApplicationFiled: June 22, 2023Publication date: December 26, 2024Inventors: SEYED MOHAMMAD AMIN KAMALI, Vincent Corvinelli, CALISTO ZUZARTE
-
Publication number: 20240086405Abstract: Examples described herein provide a computer-implemented method that includes training a machine learning model. The model is trained by generating a set of training queries using at least one of a query workload and relationships between tables in a database, building a query graph for each of the set of training queries, computing, for each training query of the set of training queries, a selectivity based at least in part on the query graph, and building, based at least in part on the set of training queries, an initial join result distribution as a collection of query graphs.Type: ApplicationFiled: September 14, 2022Publication date: March 14, 2024Inventors: Mohammed Fahd Alhamid, Vincent Corvinelli, Calisto Zuzarte
-
Patent number: 11921719Abstract: Examples described herein provide a computer-implemented method that includes training a machine learning model. The model is trained by generating a set of training queries using at least one of a query workload and relationships between tables in a database, building a query graph for each of the set of training queries, computing, for each training query of the set of training queries, a selectivity based at least in part on the query graph, and building, based at least in part on the set of training queries, an initial join result distribution as a collection of query graphs.Type: GrantFiled: September 14, 2022Date of Patent: March 5, 2024Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Mohammed Fahd Alhamid, Vincent Corvinelli, Calisto Zuzarte
-
Patent number: 11720565Abstract: A method, a computer system, and a computer program product for cardinality estimation is provided. Embodiments of the present invention includes accessing database relations. The database relations are utilized to collect a random sample from each of the database relations. Training data is then generated from the random sample. The training data is used to build a cumulative frequency function (CFF) model. The cumulative frequency function (CFF) model then provides a cardinality estimation for an output for SQL operators.Type: GrantFiled: August 27, 2020Date of Patent: August 8, 2023Assignee: International Business Machines CorporationInventors: Mohamad F. Kalil, Calisto Zuzarte, Mustafa Dawoud, Mohammed Fahd Alhamid, Vincent Corvinelli, Wai Keat Tan, Ronghao Yang
-
Patent number: 11593372Abstract: In an approach to improve query optimization in a database management system, embodiments identify opportunities for improvement in a cardinality estimate using a workload feedback process using a query feedback performed during query compilation. Embodiments identify correlations and relationships based on the structure of the query feedback and the runtime feedback performed, and collects data from the execution of a query to identify errors in estimates of the query optimizer. Further, embodiments submit the query feedback and the runtime feedback to a machine learning engine to update a set of models. Additionally, embodiments update a set of models based on the submitted query feedback and runtime feedback, and output a new, updated, or re-trained model based on collected data from the execution of the query to identify the errors in estimates of the query optimizer, the submitted query feedback and the runtime feedback, or a trained generated mode.Type: GrantFiled: July 1, 2020Date of Patent: February 28, 2023Assignee: International Business Machines CorporationInventors: Vincent Corvinelli, Mohammed Fahd Alhamid, Mohamad F. Kalil, Calisto Zuzarte
-
Publication number: 20220067045Abstract: A method, a computer system, and a computer program product for cardinality estimation is provided. Embodiments of the present invention includes accessing database relations. The database relations are utilized to collect a random sample from each of the database relations. Training data is then generated from the random sample. The training data is used to build a cumulative frequency function (CFF) model. The cumulative frequency function (CFF) model then provides a cardinality estimation for an output for SQL operators.Type: ApplicationFiled: August 27, 2020Publication date: March 3, 2022Inventors: MOHAMAD F. KALIL, CALISTO ZUZARTE, MUSTAFA DAWOUD, MOHAMMED FAHD ALHAMID, Vincent Corvinelli, Wai Keat Tan, Ronghao Yang
-
Publication number: 20220004553Abstract: In an approach to improve query optimization in a database management system, embodiments identify opportunities for improvement in a cardinality estimate using a workload feedback process using a query feedback performed during query compilation. Embodiments identify correlations and relationships based on the structure of the query feedback and the runtime feedback performed, and collects data from the execution of a query to identify errors in estimates of the query optimizer. Further, embodiments submit the query feedback and the runtime feedback to a machine learning engine to update a set of models. Additionally, embodiments update a set of models based on the submitted query feedback and runtime feedback, and output a new, updated, or re-trained model based on collected data from the execution of the query to identify the errors in estimates of the query optimizer, the submitted query feedback and the runtime feedback, or a trained generated mode.Type: ApplicationFiled: July 1, 2020Publication date: January 6, 2022Inventors: Vincent Corvinelli, Mohammed Fahd Alhamid, Mohamad F. Kalil, Calisto Zuzarte
-
Patent number: 11210290Abstract: A maintenance subsystem of a database-management system (DBMS) receives a database query that requests access to data stored in a database column. The subsystem retrieves or infers frequent-value statistics for that column, each of which specifies the number of times one distinct value is stored in the column. The statistics are partitioned into Keep and Discard clusters and, using statistical or other computational methods based on the column's data distribution, the subsystem determines an optimal number of the statistics that should be kept by the DBMS in order to minimize cost, errors, or other parameters desired by an implementer. The subsystem then directly or indirectly directs a query-optimizer component of the DBMS to consider the optimal number of frequent-value statistics when selecting an optimal data-access plan. The selected plan is then used by the DBMS's storage-manager component to access the column when servicing the received query.Type: GrantFiled: January 6, 2020Date of Patent: December 28, 2021Assignee: International Business Machines CorporationInventors: Mohamad F. Kalil, Vincent Corvinelli, Calisto Zuzarte, Petrus Chan
-
Publication number: 20210209110Abstract: A maintenance subsystem of a database-management system (DBMS) receives a database query that requests access to data stored in a database column. The subsystem retrieves or infers frequent-value statistics for that column, each of which specifies the number of times one distinct value is stored in the column. The statistics are partitioned into Keep and Discard clusters and, using statistical or other computational methods based on the column's data distribution, the subsystem determines an optimal number of the statistics that should be kept by the DBMS in order to minimize cost, errors, or other parameters desired by an implementer. The subsystem then directly or indirectly directs a query-optimizer component of the DBMS to consider the optimal number of frequent-value statistics when selecting an optimal data-access plan. The selected plan is then used by the DBMS's storage-manager component to access the column when servicing the received query.Type: ApplicationFiled: January 6, 2020Publication date: July 8, 2021Inventors: Mohamad F. Kalil, Vincent Corvinelli, Calisto Zuzarte, Petrus Chan
-
Patent number: 11030521Abstract: A database query comprising predicates may be received. Each predicate may operate on database columns. The database query may be determined to comprise strict operators. An upper bound neural network may be defined for calculating an adjacent upper bound and a lower bound neural network may be defined for calculating an adjacent lower bound. The upper bound neural network and the lower bound neural network may be trained using a selected value from data of a database table associated with the database query to be executed through the upper bound neural network and the lower bound neural network. The upper bound neural network and the lower bound neural network may be adjusted by passing in an expected value using an error found in expressions. The adjacent lower bound and the adjacent upper bound may be calculated in response to completion of initial training for the database columns.Type: GrantFiled: February 20, 2018Date of Patent: June 8, 2021Assignee: International Business Machines CorporationInventors: Vincent Corvinelli, Huaxin Liu, Mingbin Xu, Ziting Yu, Calisto P. Zuzarte
-
Publication number: 20200409948Abstract: A computer implemented method for processing database queries includes receiving a query and a set of runtime metrics corresponding to the query, wherein the query includes a set of elements, generating a set of encoded elements corresponding to the set of elements, processing the set of encoded elements and the set of runtime metrics to identify one or more possibly query classifications, determining a query execution plan according to the identified one or more possible query classifications, and executing the query according to the determined query execution plan. The computer implemented method may additionally include providing one or more runtime metrics corresponding to the executed query. A computer program product and a computer system corresponding to the method are also disclosed.Type: ApplicationFiled: June 26, 2019Publication date: December 31, 2020Inventors: Vincent Corvinelli, Calisto Zuzarte, Vinith Suriyakumar, Joel Raymond Scarfone, Diana Koval
-
Patent number: 10706354Abstract: A database query comprising predicates may be received. Each predicate may operate on database columns. The database query may be determined to comprise strict operators. An upper bound neural network may be defined for calculating an adjacent upper bound and a lower bound neural network may be defined for calculating an adjacent lower bound. The upper bound neural network and the lower bound neural network may be trained using a selected value from data of a database table associated with the database query to be executed through the upper bound neural network and the lower bound neural network. The upper bound neural network and the lower bound neural network may be adjusted by passing in an expected value using an error found in expressions. The adjacent lower bound and the adjacent upper bound may be calculated in response to completion of initial training for the database columns.Type: GrantFiled: May 6, 2016Date of Patent: July 7, 2020Assignee: International Business Machines CorporationInventors: Vincent Corvinelli, Huaxin Liu, Mingbin Xu, Ziting Yu, Calisto P. Zuzarte
-
Patent number: 10643132Abstract: In an approach for generating a selectivity estimation, one or more processors generate an artificial neural network and receive a DBMS query comprising one or more predicates. One or more processors replace one or more predicates in the one or more predicates that have strict operators with one or more predicates that have non-strict operators. One or more processors generate a selectivity function from the one or more predicates that has one or more arguments that are each comprised of an upper bound and a lower bound for a value in a predicate. One or more processors generate a training data set from a data distribution in the database and train the artificial neural network on the training data set to compute the selectivity function. One or more processors generate a selectivity estimation with the artificial neural network for one or more predicates in the DBMS query.Type: GrantFiled: March 23, 2016Date of Patent: May 5, 2020Assignee: International Business Machines CorporationInventors: Vincent Corvinelli, Huaxin Liu, Mingbin Xu, Ziting Yu, Calisto P. Zuzarte
-
Patent number: 10318866Abstract: In an approach for generating a selectivity estimation, one or more processors generate an artificial neural network and receive a DBMS query comprising one or more predicates. One or more processors replace one or more predicates in the one or more predicates that have strict operators with one or more predicates that have non-strict operators. One or more processors generate a selectivity function from the one or more predicates that has one or more arguments that are each comprised of an upper bound and a lower bound for a value in a predicate. One or more processors generate a training data set from a data distribution in the database and train the artificial neural network on the training data set to compute the selectivity function. One or more processors generate a selectivity estimation with the artificial neural network for one or more predicates in the DBMS query.Type: GrantFiled: March 5, 2015Date of Patent: June 11, 2019Assignee: International Business Machines CorporationInventors: Vincent Corvinelli, Huaxin Liu, Mingbin Xu, Ziting Yu, Calisto P. Zuzarte
-
Publication number: 20180174048Abstract: A database query comprising predicates may be received. Each predicate may operate on database columns. The database query may be determined to comprise strict operators. An upper bound neural network may be defined for calculating an adjacent upper bound and a lower bound neural network may be defined for calculating an adjacent lower bound. The upper bound neural network and the lower bound neural network may be trained using a selected value from data of a database table associated with the database query to be executed through the upper bound neural network and the lower bound neural network. The upper bound neural network and the lower bound neural network may be adjusted by passing in an expected value using an error found in expressions. The adjacent lower bound and the adjacent upper bound may be calculated in response to completion of initial training for the database columns.Type: ApplicationFiled: February 20, 2018Publication date: June 21, 2018Inventors: Vincent Corvinelli, Huaxin Liu, Mingbin Xu, Ziting Yu, Calisto P. Zuzarte
-
Publication number: 20170323200Abstract: A database query comprising predicates may be received. Each predicate may operate on database columns. The database query may be determined to comprise strict operators. An upper bound neural network may be defined for calculating an adjacent upper bound and a lower bound neural network may be defined for calculating an adjacent lower bound. The upper bound neural network and the lower bound neural network may be trained using a selected value from data of a database table associated with the database query to be executed through the upper bound neural network and the lower bound neural network. The upper bound neural network and the lower bound neural network may be adjusted by passing in an expected value using an error found in expressions. The adjacent lower bound and the adjacent upper bound may be calculated in response to completion of initial training for the database columns.Type: ApplicationFiled: May 6, 2016Publication date: November 9, 2017Inventors: Vincent Corvinelli, Huaxin Liu, Mingbin Xu, Ziting Yu, Calisto P. Zuzarte
-
Publication number: 20160275398Abstract: In an approach for generating a selectivity estimation, one or more processors generate an artificial neural network and receive a DBMS query comprising one or more predicates. One or more processors replace one or more predicates in the one or more predicates that have strict operators with one or more predicates that have non-strict operators. One or more processors generate a selectivity function from the one or more predicates that has one or more arguments that are each comprised of an upper bound and a lower bound for a value in a predicate. One or more processors generate a training data set from a data distribution in the database and train the artificial neural network on the training data set to compute the selectivity function. One or more processors generate a selectivity estimation with the artificial neural network for one or more predicates in the DBMS query.Type: ApplicationFiled: March 23, 2016Publication date: September 22, 2016Inventors: Vincent Corvinelli, Huaxin Liu, Mingbin Xu, Ziting Yu, Calisto P. Zuzarte
-
Publication number: 20160260011Abstract: In an approach for generating a selectivity estimation, one or more processors generate an artificial neural network and receive a DBMS query comprising one or more predicates. One or more processors replace one or more predicates in the one or more predicates that have strict operators with one or more predicates that have non-strict operators. One or more processors generate a selectivity function from the one or more predicates that has one or more arguments that are each comprised of an upper bound and a lower bound for a value in a predicate. One or more processors generate a training data set from a data distribution in the database and train the artificial neural network on the training data set to compute the selectivity function. One or more processors generate a selectivity estimation with the artificial neural network for one or more predicates in the DBMS query.Type: ApplicationFiled: March 5, 2015Publication date: September 8, 2016Inventors: Vincent Corvinelli, Huaxin Liu, Mingbin Xu, Ziting Yu, Calisto P. Zuzarte
-
Patent number: 8630999Abstract: In accordance with aspects of the present invention, provided are systems and computer program products for incrementally estimating the cardinality of a derived relation including statistically correlated partially applicable predicates for a range-partitioned table. During the generation of a QEP a cardinality estimate is calculated in which one or more partially applicable predicates is correlated to another partially applicable predicate and/or to one or more fully applicable predicates. The cardinality includes a number of rows expected to be returned by the QEP and is computed in an incremental fashion for each operator of the QEP.Type: GrantFiled: January 30, 2012Date of Patent: January 14, 2014Assignee: International Business Machines CorporationInventors: Vincent Corvinelli, John Frederick Hornibrook, Bingjie Miao