Patents by Inventor Awny Kayed Al-Omari
Awny Kayed Al-Omari has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240012826
Abstract: A method and apparatus for optimizing a query in a relational database management system (RDBMS) when a predicate on a data column in the query has a correlation to a partitioning attribute of a partitioning column in data retrieved from a cloud-based store, wherein the optimizing uses the correlation between the data column in the query to the partitioning column in the data retrieved from the cloud-based store for data elimination when processing the query. The correlation is defined in a formula or lookup data structure that maps or range-maps from the data column to the partitioning column.
Type: Application
Filed: September 26, 2023
Publication date: January 11, 2024
Applicant: Teradata US, Inc.
Inventors: Mohamed Ahmed Yassin Eltabakh, Mohammed Al-Kateb, Sanjay Nair, Awny Kayed Al-Omari
-
Patent number: 11775546
Abstract: A method and apparatus for optimizing a query in a relational database management system (RDBMS) when a predicate on a data column in the query has a correlation to a partitioning attribute of a partitioning column in data retrieved from a cloud-based store, wherein the optimizing uses the correlation between the data column in the query to the partitioning column in the data retrieved from the cloud-based store for data elimination when processing the query. The correlation is defined in a formula or lookup data structure that maps or range-maps from the data column to the partitioning column.
Type: Grant
Filed: December 30, 2020
Date of Patent: October 3, 2023
Assignee: Teradata US, Inc.
Inventors: Mohamed Ahmed Yassin Eltabakh, Mohammed Al-Kateb, Sanjay Nair, Awny Kayed Al-Omari
-
Patent number: 11709891
Abstract: In some examples, a system receives function descriptors for different types of functions to be used when processing database queries, each function descriptor of the function descriptors comprising information relating to a respective function of the different types of functions. The system computes, based on a first function descriptor for a first function of the different types of functions, an estimate of a runtime metric associated with execution of the first function for processing a database query.
Type: Grant
Filed: December 31, 2020
Date of Patent: July 25, 2023
Assignee: Teradata US, Inc.
Inventors: Awny Kayed Al-Omari, Mohammed Al-Kateb, Mohamed Ahmed Yassin Eltabakh, Douglas Paul Brown
-
Patent number: 11675792
Abstract: In some examples, a database system receives data relating to plural micro-models that apply respective analytics, and distributes a plurality of data segments of the received data across the plurality of processing engines based on values of a segmentation key included in the received data. A plurality of processing engines, performs in parallel, operations associated with the plural micro-models using respective data segments of the plurality of data segments, where different processing engines of the plurality of processing engines perform operations associated with respective micro-models of the plural micro-models.
Type: Grant
Filed: December 28, 2020
Date of Patent: June 13, 2023
Assignee: Teradata US, Inc.
Inventors: Maria Guadalupe Castellanos, Xiang Zuo, Faraz Ahmad, Awny Kayed Al-Omari
-
Patent number: 11593371
Abstract: A relational database management system (RDBMS) accepts a workload comprised of one or more queries against a relational database. The RDBMS evolves a default cost profile into a plurality of cost profiles using fixed or dynamic evolution, wherein each of the cost profiles captures one or more cost parameters for the workload. The cost profiles are represented by a multi-dimensional matrix that has one or more dimensions, and each of the dimensions represents one of the cost parameters. The RDBMS dynamically determines which of the cost profiles is an optimal cost profile for the workload by mapping the cost profiles to the workload using a random walk scoring algorithm or a biased walk scoring algorithm that searches the multi-dimensional matrix to identify the optimal cost profile. The RDBMS selects and performs one or more query execution plans for the workload based on the optimal cost profile for the workload.
Type: Grant
Filed: August 18, 2020
Date of Patent: February 28, 2023
Assignee: Teradata US, Inc.
Inventors: Wellington Marcos Cabrera Arevalo, Kassem Awada, Mahbub Hasan, Allen N. Diaz, Mohammed Al-Kateb, Awny Kayed Al-Omari
-
Patent number: 11526505
Abstract: A database system receives a request from a user. The request invokes a data set function (DSF) and uses a property to be provided by the DSF. The database system determines that a function descriptor is available for the DSF. The function descriptor is expressed as markup language instructions. The function descriptor defines the property of the DSF. The database system uses the function descriptor to define a property for the DSF.
Type: Grant
Filed: December 5, 2019
Date of Patent: December 13, 2022
Assignee: Teradata US, Inc.
Inventors: B. Anantha Subramanian, Mohamed Yassin Eltabakh, Mahbub Hasan, Robert Matthew Wehrmeister, Awny Kayed Al-Omari, Sanjay Sukumaran Nair, Kashif Abdullah Siddiqui, Mohammed Yassin Al-Kateb
-
Patent number: 11520756
Abstract: Improved techniques for processing large-scale data and various large-scale data applications (e.g., large-scale Data Mining (DM), large-scale data analysis (LSDA)) in computing systems (e.g., Data Information Systems, Database Systems) are disclosed. Redundancy-reduced data (RRDS) can be provided as data that can be used more efficiently by various applications, especially, large-scale data applications. In doing so, at least one assumption about the distribution of a multi-dimensional data set (MDDS) and its corresponding set of responses (Y) can be made in order to reduce the multi-dimensional data set (MDDS). For example, a normal distribution (e.g.
Type: Grant
Filed: December 23, 2019
Date of Patent: December 6, 2022
Assignee: Teradata US, Inc.
Inventors: Choudur K. Lakshminarayan, Thiagarajan Ramakrishnan, Awny Kayed Al-Omari
-
Patent number: 11468102
Abstract: A relational database management system (RDBMS) optimizes limit queries over analytical functions, wherein the limit queries include an output clause comprising a LIMIT, TOP and SAMPLE clause with an expression specifying a limit that is a number K or a percentage ? %. The optimizations of the limit queries include: (1) static compile-time optimizations, and (2) dynamic run-time optimizations, based on semantic properties of “granularity” and “input-to-output cardinality” for the analytical functions.
Type: Grant
Filed: September 28, 2020
Date of Patent: October 11, 2022
Assignee: Teradata US, Inc.
Inventors: Mohamed Ahmed Yassin Eltabakh, Mahbub Hasan, Awny Kayed Al-Omari, Mohammed Al-Kateb
-
Patent number: 11409745
Abstract: Execution of a query invoking an analytical function (AF) is optimized. The query includes a join operation between an AF table and an AuxiliaryTable. A determination is made that the AF includes a plurality of AF properties. Query-level properties about the query are inferred. A determination is made to change an order of the join operation from the plurality of AF properties and query-level properties.
Type: Grant
Filed: December 5, 2019
Date of Patent: August 9, 2022
Assignee: Teradata US, Inc.
Inventors: Christina Pavlopoulou, Mahbub Hasan, B. Anantha Subramanian, Mohammed Al-Kateb, Awny Kayed Al-Omari, Kashif Abdullah Siddiqui, Robert Matthew Wehrmeister, Mohamed Yassin Eltabakh
-
Patent number: 11409743
Abstract: In some examples, a system learns properties of an analytical function based on information of queries invoking the analytical function that have been previously executed, creates a function descriptor for the analytical function based on the learning, and provides the function descriptor for use by an optimizer in generating an execution plan for a received database query that includes the analytical function.
Type: Grant
Filed: December 23, 2019
Date of Patent: August 9, 2022
Assignee: Teradata US, Inc.
Inventors: Mohamed Ahmed Yassin Eltabakh, Mohammed Al-Kateb, Awny Kayed Al-Omari, Sanjay Nair
-
Publication number: 20220207045
Abstract: In some examples, a database system receives data relating to plural micro-models that apply respective analytics, and distributes a plurality of data segments of the received data across the plurality of processing engines based on values of a segmentation key included in the received data. A plurality of processing engines, performs in parallel, operations associated with the plural micro-models using respective data segments of the plurality of data segments, where different processing engines of the plurality of processing engines perform operations associated with respective micro-models of the plural micro-models.
Type: Application
Filed: December 28, 2020
Publication date: June 30, 2022
Inventors: Maria Guadalupe Castellanos, Xiang Zuo, Faraz Ahmad, Awny Kayed Al-Omari
-
Publication number: 20220207052
Abstract: A method and apparatus for optimizing a query in a relational database management system (RDBMS) when a predicate on a data column in the query has a correlation to a partitioning attribute of a partitioning column in data retrieved from a cloud-based store, wherein the optimizing uses the correlation between the data column in the query to the partitioning column in the data retrieved from the cloud-based store for data elimination when processing the query. The correlation is defined in a formula or lookup data structure that maps or range-maps from the data column to the partitioning column.
Type: Application
Filed: December 30, 2020
Publication date: June 30, 2022
Applicant: Teradata US, Inc.
Inventors: Mohamed Ahmed Yassin Eltabakh, Mohammed Al-Kateb, Sanjay Nair, Awny Kayed Al-Omari
-
Publication number: 20220207084
Abstract: In some examples, a system receives function descriptors for different types of functions to be used when processing database queries, each function descriptor of the function descriptors comprising information relating to a respective function of the different types of functions. The system computes, based on a first function descriptor for a first function of the different types of functions, an estimate of a runtime metric associated with execution of the first function for processing a database query.
Type: Application
Filed: December 31, 2020
Publication date: June 30, 2022
Inventors: Awny Kayed Al-Omari, Mohammed Al-Kateb, Mohamed Ahmed Yassin Eltabakh, Douglas Paul Brown
-
Publication number: 20210397975
Abstract: Hyperparameter tuning for a machine learning model is performed in a massively parallel database system. A computer system comprised of a plurality of compute units executes a relational database management system (RDBMS), wherein the RDBMS manages a relational database comprised of one or more tables storing data. One or more of the compute units perform the hyperparameter tuning for the machine learning model, wherein the hyperparameters are control parameters used in construction of the model, and the tuning of the hyperparameters is implemented as an operation in the RDBMS that accepts training and scoring data for the model, constructs the model using the hyperparameters and the training data, and generates goodness metrics for the model using the scoring data.
Type: Application
Filed: December 16, 2020
Publication date: December 23, 2021
Applicant: Teradata US, Inc.
Inventors: Awny Kayed Al-Omari, Maksym Sergiyovych Oblogin, Khaled Bouaziz, Michael James Hanlon, Kashif Abdullah Siddiqui
-
Publication number: 20210382920
Abstract: A relational database management system (RDBMS) optimizes limit queries over analytical functions, wherein the limit queries include an output clause comprising a LIMIT, TOP and SAMPLE clause with an expression specifying a limit that is a number K or a percentage ? %. The optimizations of the limit queries include: (1) static compile-time optimizations, and (2) dynamic run-time optimizations, based on semantic properties of “granularity” and “input-to-output cardinality” for the analytical functions.
Type: Application
Filed: September 28, 2020
Publication date: December 9, 2021
Applicant: Teradata US, Inc.
Inventors: Mohamed Ahmed Yassin Eltabakh, Mahbub Hasan, Awny Kayed Al-Omari, Mohammed Al-Kateb
-
Publication number: 20210191912
Abstract: Improved techniques for reducing the size of data of the multidimensional data are disclosed. The improved techniques are highly suitable for processing large-scale data and various large-scale data applications (e.g., large-scale Data Mining (DM), large-scale data analysis (LSDA)) in computing systems (e.g., Data Information Systems, Database Systems). Redundancy-reduced data can be provided (RRDS) thereby providing data that can be used more efficiently by various applications, especially, large-scale data applications. At least one assumption about the distribution of a multi-dimensional data set (MDDS) and its corresponding set of responses (Y) can be made in order to reduce the multi-dimensional data set (MDDS). It should be noted that the assumption can be made after effectively combining multi-dimensional data set (MDDS) and its corresponding set of responses (Y) so that the set of responses (Y) can also be considered in effectively reducing the size of the multi-dimensional data set (MDDS).
Type: Application
Filed: December 23, 2019
Publication date: June 24, 2021
Applicant: Teradata US, Inc.
Inventors: Choudur K. Lakshminarayan, Thiagarajan Ramakrishnan, Awny Kayed Al-Omari
-
Publication number: 20210117422
Abstract: A relational database management system (RDBMS) accepts a workload comprised of one or more queries against a relational database. The RDBMS evolves a default cost profile into a plurality of cost profiles using fixed or dynamic evolution, wherein each of the cost profiles captures one or more cost parameters for the workload. The cost profiles are represented by a multi-dimensional matrix that has one or more dimensions, and each of the dimensions represents one of the cost parameters. The RDBMS dynamically determines which of the cost profiles is an optimal cost profile for the workload by mapping the cost profiles to the workload using a random walk scoring algorithm or a biased walk scoring algorithm that searches the multi-dimensional matrix to identify the optimal cost profile. The RDBMS selects and performs one or more query execution plans for the workload based on the optimal cost profile for the workload.
Type: Application
Filed: August 18, 2020
Publication date: April 22, 2021
Applicant: Teradata US, Inc.
Inventors: Wellington Marcos Cabrera Arevalo, Kassem Awada, Mahbub Hasan, Allen N. Diaz, Mohammed Al-Kateb, Awny Kayed Al-Omari
-
Publication number: 20210034624
Abstract: In some examples, a system learns properties of an analytical function based on information of queries invoking the analytical function that have been previously executed, creates a function descriptor for the analytical function based on the learning, and provides the function descriptor for use by an optimizer in generating an execution plan for a received database query that includes the analytical function.
Type: Application
Filed: December 23, 2019
Publication date: February 4, 2021
Inventors: Mohamed Ahmed Yassin Eltabakh, Mohammed Al-Kateb, Awny Kayed Al-Omari, Sanjay Nair
-
Publication number: 20200278966
Abstract: As an abstract representation, a set of equivalent logical structures representative of multiple execution plans for execution of a database query can be used to optimize a database query. A logical structure can include one or more logical operators each representing multiple physical operators for executing the database query. Group and Operator Rules can be applied as rules to the set of equivalent logical structures to obtain additional equivalent logical structures and logical operator until no additional logical operators can be obtained. A set of possible implementation plans for the total number of the obtained logical operators can be obtained, for example, based on physical and/or implementation context.
Type: Application
Filed: April 28, 2020
Publication date: September 3, 2020
Inventors: Awny Kayed Al-Omari, Robert Matthew Wehrmeister, Kashif Abdullah Siddiqui
-
Patent number: 10740329
Abstract: As an abstract representation, a set of equivalent logical structures representative of multiple execution plans for execution of a database query can be used to optimize a database query. A logical structure can include one or more logical operators each representing multiple physical operators for executing the database query. Group and Operator Rules can be applied as rules to the set of equivalent logical structures to obtain additional equivalent logical structures and logical operator until no additional logical operators can be obtained. A set of possible implementation plans for the total number of the obtained logical operators can be obtained, for example, based on physical and/or implementation context.
Type: Grant
Filed: March 12, 2014
Date of Patent: August 11, 2020
Assignee: Teradata US, Inc.
Inventors: Awny Kayed Al-Omari, Robert Matthew Wehrmeister, Kashif Abdullah Siddiqui