Patents by Inventor Mohamad F. Kalil
Mohamad F. Kalil has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11928107Abstract: Methods and systems for similarity-based value-to-column classification are disclosed. A method includes: receiving, by a computing device, a natural language search query; determining, by the computing device, a filtering phrase in the natural language search query using a natural language understanding model; encoding, by the computing device, the filtering phrase; retrieving, by the computing device, a plurality of encoded columns; for each of the plurality of encoded columns, the computing device determining a similarity score based on a similarity between the encoded filtering phrase and the encoded column; and outputting, by the computing device, a column corresponding to an encoded column of the plurality of encoded columns having a highest similarity score.Type: GrantFiled: May 22, 2020Date of Patent: March 12, 2024Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Mohammed Mostafa, Ali Khanafer, Mohamad F. Kalil, Mohamed El Gemaiey, Morvarid Sehatkar
-
Patent number: 11720565Abstract: A method, a computer system, and a computer program product for cardinality estimation is provided. Embodiments of the present invention includes accessing database relations. The database relations are utilized to collect a random sample from each of the database relations. Training data is then generated from the random sample. The training data is used to build a cumulative frequency function (CFF) model. The cumulative frequency function (CFF) model then provides a cardinality estimation for an output for SQL operators.Type: GrantFiled: August 27, 2020Date of Patent: August 8, 2023Assignee: International Business Machines CorporationInventors: Mohamad F. Kalil, Calisto Zuzarte, Mustafa Dawoud, Mohammed Fahd Alhamid, Vincent Corvinelli, Wai Keat Tan, Ronghao Yang
-
Patent number: 11593372Abstract: In an approach to improve query optimization in a database management system, embodiments identify opportunities for improvement in a cardinality estimate using a workload feedback process using a query feedback performed during query compilation. Embodiments identify correlations and relationships based on the structure of the query feedback and the runtime feedback performed, and collects data from the execution of a query to identify errors in estimates of the query optimizer. Further, embodiments submit the query feedback and the runtime feedback to a machine learning engine to update a set of models. Additionally, embodiments update a set of models based on the submitted query feedback and runtime feedback, and output a new, updated, or re-trained model based on collected data from the execution of the query to identify the errors in estimates of the query optimizer, the submitted query feedback and the runtime feedback, or a trained generated mode.Type: GrantFiled: July 1, 2020Date of Patent: February 28, 2023Assignee: International Business Machines CorporationInventors: Vincent Corvinelli, Mohammed Fahd Alhamid, Mohamad F. Kalil, Calisto Zuzarte
-
Patent number: 11520842Abstract: Embodiments of the present invention disclose a method, computer program product, and system for determining at least one characteristic about a figure and searching a data set based on an indicated search area for at least one entry that falls within a threshold value of the determined at least one characteristic about the figure, wherein the search area indicates which part of the data set to be searched. Displaying the at least one entry from the data set that falls within a threshold value of the determined at least one characteristic about the figure.Type: GrantFiled: July 16, 2020Date of Patent: December 6, 2022Assignee: International Business Machines CorporationInventors: Joyce Miryam Habbouche, Mohamad F. Kalil, Stephen David Gibson
-
Publication number: 20220067045Abstract: A method, a computer system, and a computer program product for cardinality estimation is provided. Embodiments of the present invention includes accessing database relations. The database relations are utilized to collect a random sample from each of the database relations. Training data is then generated from the random sample. The training data is used to build a cumulative frequency function (CFF) model. The cumulative frequency function (CFF) model then provides a cardinality estimation for an output for SQL operators.Type: ApplicationFiled: August 27, 2020Publication date: March 3, 2022Inventors: MOHAMAD F. KALIL, CALISTO ZUZARTE, MUSTAFA DAWOUD, MOHAMMED FAHD ALHAMID, Vincent Corvinelli, Wai Keat Tan, Ronghao Yang
-
Publication number: 20220019633Abstract: Embodiments of the present invention disclose a method, computer program product, and system for determining at least one characteristic about a figure and searching a data set based on an indicated search area for at least one entry that falls within a threshold value of the determined at least one characteristic about the figure, wherein the search area indicates which part of the data set to be searched. Displaying the at least one entry from the data set that falls within a threshold value of the determined at least one characteristic about the figure.Type: ApplicationFiled: July 16, 2020Publication date: January 20, 2022Inventors: Joyce Miryam Habbouche, Mohamad F. Kalil, Stephen David Gibson
-
Publication number: 20220004553Abstract: In an approach to improve query optimization in a database management system, embodiments identify opportunities for improvement in a cardinality estimate using a workload feedback process using a query feedback performed during query compilation. Embodiments identify correlations and relationships based on the structure of the query feedback and the runtime feedback performed, and collects data from the execution of a query to identify errors in estimates of the query optimizer. Further, embodiments submit the query feedback and the runtime feedback to a machine learning engine to update a set of models. Additionally, embodiments update a set of models based on the submitted query feedback and runtime feedback, and output a new, updated, or re-trained model based on collected data from the execution of the query to identify the errors in estimates of the query optimizer, the submitted query feedback and the runtime feedback, or a trained generated mode.Type: ApplicationFiled: July 1, 2020Publication date: January 6, 2022Inventors: Vincent Corvinelli, Mohammed Fahd Alhamid, Mohamad F. Kalil, Calisto Zuzarte
-
Patent number: 11210290Abstract: A maintenance subsystem of a database-management system (DBMS) receives a database query that requests access to data stored in a database column. The subsystem retrieves or infers frequent-value statistics for that column, each of which specifies the number of times one distinct value is stored in the column. The statistics are partitioned into Keep and Discard clusters and, using statistical or other computational methods based on the column's data distribution, the subsystem determines an optimal number of the statistics that should be kept by the DBMS in order to minimize cost, errors, or other parameters desired by an implementer. The subsystem then directly or indirectly directs a query-optimizer component of the DBMS to consider the optimal number of frequent-value statistics when selecting an optimal data-access plan. The selected plan is then used by the DBMS's storage-manager component to access the column when servicing the received query.Type: GrantFiled: January 6, 2020Date of Patent: December 28, 2021Assignee: International Business Machines CorporationInventors: Mohamad F. Kalil, Vincent Corvinelli, Calisto Zuzarte, Petrus Chan
-
Publication number: 20210365443Abstract: Methods and systems for similarity-based value-to-column classification are disclosed. A method includes: receiving, by a computing device, a natural language search query; determining, by the computing device, a filtering phrase in the natural language search query using a natural language understanding model; encoding, by the computing device, the filtering phrase; retrieving, by the computing device, a plurality of encoded columns; for each of the plurality of encoded columns, the computing device determining a similarity score based on a similarity between the encoded filtering phrase and the encoded column; and outputting, by the computing device, a column corresponding to an encoded column of the plurality of encoded columns having a highest similarity score.Type: ApplicationFiled: May 22, 2020Publication date: November 25, 2021Inventors: Mohammed MOSTAFA, Ali KHANAFER, Mohamad F. KALIL, Mohamed EL GEMAIEY, Morvarid SEHATKAR
-
Publication number: 20210209110Abstract: A maintenance subsystem of a database-management system (DBMS) receives a database query that requests access to data stored in a database column. The subsystem retrieves or infers frequent-value statistics for that column, each of which specifies the number of times one distinct value is stored in the column. The statistics are partitioned into Keep and Discard clusters and, using statistical or other computational methods based on the column's data distribution, the subsystem determines an optimal number of the statistics that should be kept by the DBMS in order to minimize cost, errors, or other parameters desired by an implementer. The subsystem then directly or indirectly directs a query-optimizer component of the DBMS to consider the optimal number of frequent-value statistics when selecting an optimal data-access plan. The selected plan is then used by the DBMS's storage-manager component to access the column when servicing the received query.Type: ApplicationFiled: January 6, 2020Publication date: July 8, 2021Inventors: Mohamad F. Kalil, Vincent Corvinelli, Calisto Zuzarte, Petrus Chan