Patents by Inventor Yuanzhe Bei
Yuanzhe Bei has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11971793Abstract: Current physical resources utilization of a computing system as a whole is monitored. The number of queries concurrently being executed against a database by a database management system (DBMS) running on a computing system is monitored. A query plan for a received query to be executed against the database is generated. The query plan includes operators; the generation of the query plan includes generation of query-based statistics for the received query on a per-operator basis without consideration of the queries concurrently being executed. An estimated execution time of the received query is dynamically predicted using a machine-learning model based on the query-based statistics generated for the received query on the per-operator basis, the current physical resources utilization of the computing system, and the number of queries concurrently being executed. The received query is executed against the database based on the dynamically predicted estimated execution time for the received query.Type: GrantFiled: March 5, 2019Date of Patent: April 30, 2024Assignee: Micro Focus LLCInventors: Yuanzhe Bei, Zhihao Zheng
-
Patent number: 11880362Abstract: A query plan includes steps to implement a query and debug steps interleaved among the steps. An execution engine of a database system executes each step of the query plan to realize the query. The execution engine executes each debug step of the query plan to generate debugging information for the step preceding the debug step within the query plan. The debugging information is queryable.Type: GrantFiled: January 10, 2022Date of Patent: January 23, 2024Assignee: Micro Focus LLCInventors: Devosh Mathivanan, Manasee Moghey, Yuanzhe Bei
-
Publication number: 20230229629Abstract: A distributed database system maintains a database including a data shard for which a primary computing node is responsible. The primary computing node identifies a data storage plan for the data shard. The plan identifies a file subset of data storage files of the shard to be merged into a larger data storage file, and a node subset of computing nodes of the system that subscribe to the data shard. The primary node identifies which computing nodes of the node subset each have sufficient computing resources to execute the plan, as candidate computing nodes. The primary node identifies which files of the file subset each candidate computing node locally caches. The primary node selects one candidate computing node to execute the plan, based on the files of the file subset that each candidate computing node locally caches. The primary node causes the selected candidate computing node to execute the plan.Type: ApplicationFiled: January 19, 2022Publication date: July 20, 2023Inventors: Ning Deng, Yuanzhe Bei, Alexander Kalinin
-
Publication number: 20230222048Abstract: A query plan includes steps to implement a query and debug steps interleaved among the steps. An execution engine of a database system executes each step of the query plan to realize the query. The execution engine executes each debug step of the query plan to generate debugging information for the step preceding the debug step within the query plan. The debugging information is queryable.Type: ApplicationFiled: January 10, 2022Publication date: July 13, 2023Inventors: Devosh Mathivanan, Manasee Moghey, Yuanzhe Bei
-
Publication number: 20230034257Abstract: According to examples, an apparatus may include a processor and a memory on which are stored machine-readable instructions that when executed by the processor cause the processor to receive parameters for a partition range for a partition key associated with a table. The table may have a vertical table structure. The processor may generate an index of vertical table columns based on the received parameters for the partition range. The generated index may be a physical store for data from the table and may have a subset of rows of the table correlating to the partition range. The processor may receive a query having a predicate associated with the partition key associated with the table and, based on a determination that the predicate included in the received query is within a partition range for the generated index, the processor may execute the query using the generated index.Type: ApplicationFiled: July 28, 2021Publication date: February 2, 2023Applicant: MICRO FOCUS LLCInventors: Yuanzhe BEI, Pan YE
-
Patent number: 11379464Abstract: In some examples, asymmetric quorum protocol based distributed transaction database consistency control may include determining, by a set of primary participants of a cluster, whether a cluster-wide quorum has been reached to commit a transaction associated with a distributed transactional database. Further, based on the determination by the set of primary participants that the cluster-wide quorum has been reached to commit the transaction, instructions to a set of secondary participants of the cluster may be sent to commit the transaction. The set of secondary participants may be configured to comply with a determination by the set of primary participants to commit or to not-commit the transaction.Type: GrantFiled: December 12, 2019Date of Patent: July 5, 2022Assignee: MICRO FOCUS LLCInventors: Niccolo Meneghetti, Yuanzhe Bei
-
Patent number: 11288266Abstract: In some examples, candidate projection enumeration based query response generation may include obtaining a query, and identifying, based on the query, a plurality of candidate columns from at least one table to respond to the query. Using a tree-based model, the candidate columns may be analyzed to generate a candidate set of projections to respond to the query. Each candidate projection of the candidate set of projections may include a sort order associated with at least one candidate column of the candidate columns, a segmentation key value associated with at least one candidate column of the candidate columns, and a replication value associated with the candidate projection. Based on the candidate set of projections, a response to the query may be generated.Type: GrantFiled: December 17, 2019Date of Patent: March 29, 2022Assignee: MICRO FOCUS LLCInventors: Chi Zhang, Zhibo Peng, Yuanzhe Bei, Olga Papaemmanouil
-
Patent number: 11068451Abstract: According to an example, a system may receive a refresh column command for a column-oriented database. The refresh column command may identify a table in the column-oriented database and a denormalized column in the table. The system may determine column definitions for the column, generate a query to populate values from a data source associated with the column into the column based on the column definitions, and may execute the query to replace entire set of existing column values with the data source values.Type: GrantFiled: August 28, 2017Date of Patent: July 20, 2021Assignee: MICRO FOCUS LLCInventors: Michael Leuchtenburg, Yuanzhe Bei, Charles Edward Bear, Nga Tran
-
Patent number: 11055304Abstract: The present disclosure includes systems and methods for performing a plurality of base statistical analysis on a dataset to yield multiple, different base statistics with respect to a first predetermined number of features. The dataset and the base statistics can be used to find a transformation that determines significant features while controlling a false discovery rate (FDR) to be below a target FDR threshold (?), wherein a second predetermined number of features are assumed to be independent from each other. Using the transformation, a composite index can be generated that represents a synthesis of the base statistics and a report can be generated that indicates at least the relative presence of the significant features in the dataset using the composite index.Type: GrantFiled: October 16, 2015Date of Patent: July 6, 2021Assignee: BRANDEIS UNIVERSITYInventors: Pengyu Hong, Yuanzhe Bei
-
Publication number: 20210182270Abstract: In some examples, asymmetric quorum protocol based distributed transaction database consistency control may include determining, by a set of primary participants of a cluster, whether a cluster-wide quorum has been reached to commit a transaction associated with a distributed transactional database. Further, based on the determination by the set of primary participants that the cluster-wide quorum has been reached to commit the transaction, instructions to a set of secondary participants of the cluster may be sent to commit the transaction. The set of secondary participants may be configured to comply with a determination by the set of primary participants to commit or to not-commit the transaction.Type: ApplicationFiled: December 12, 2019Publication date: June 17, 2021Applicant: MICRO FOCUS LLCInventors: Niccolo MENEGHETTI, Yuanzhe BEI
-
Publication number: 20210182293Abstract: In some examples, candidate projection enumeration based query response generation may include obtaining a query, and identifying, based on the query, a plurality of candidate columns from at least one table to respond to the query. Using a tree-based model, the candidate columns may be analyzed to generate a candidate set of projections to respond to the query. Each candidate projection of the candidate set of projections may include a sort order associated with at least one candidate column of the candidate columns, a segmentation key value associated with at least one candidate column of the candidate columns, and a replication value associated with the candidate projection. Based on the candidate set of projections, a response to the query may be generated.Type: ApplicationFiled: December 17, 2019Publication date: June 17, 2021Applicant: MICRO FOCUS LLCInventors: Chi Zhang, Zhibo Peng, Yuanzhe Bei, Olga Papaemmanouil
-
Patent number: 10872071Abstract: A flattened table (FT) of a database of a database management system (DBMS) is defined. The FT logically materializes a number of columns over a number of partitions. The columns include normalized columns, denormalized columns from a plurality of source tables of the database, as well as an aggregate column defining an aggregation of a selected normalized column over a selected denormalized column. A live-aggregate projection (LAP) is defined on the FT of the database. The LAP corresponds to the aggregate column and physically materializes the aggregation of the selected denormalized column over the selected denormalized column, as defined by the aggregate column. The FT is refreshed on-demand, on a per-column, per-partition basis. Responsive to the FT being refreshed on-demand, the LAP is automatically refreshed.Type: GrantFiled: March 28, 2019Date of Patent: December 22, 2020Assignee: MICRO FOCUS LLCInventors: Thao Nguyen Pham, Yuanzhe Bei, Michael Leuchtenburg, Hoa Binh Nga Tran
-
Publication number: 20200311043Abstract: A flattened table (FT) of a database of a database management system (DBMS) is defined. The FT logically materializes a number of columns over a number of partitions. The columns include normalized columns, denormalized columns from a plurality of source tables of the database, as well as an aggregate column defining an aggregation of a selected normalized column over a selected denormalized column. A live-aggregate projection (LAP) is defined on the FT of the database. The LAP corresponds to the aggregate column and physically materializes the aggregation of the selected denormalized column over the selected denormalized column, as defined by the aggregate column. The FT is refreshed on-demand, on a per-column, per-partition basis. Responsive to the FT being refreshed on-demand, the LAP is automatically refreshed.Type: ApplicationFiled: March 28, 2019Publication date: October 1, 2020Inventors: Thao Nguyen Pham, Yuanzhe Bei, Michael Leuchtenburg, Hoa Binh Nga Tran
-
Publication number: 20200285642Abstract: Current physical resources utilization of a computing system as a whole is monitored. The number of queries concurrently being executed against a database by a database management system (DBMS) running on a computing system is monitored. A query plan for a received query to be executed against the database is generated. The query plan includes operators; the generation of the query plan includes generation of query-based statistics for the received query on a per-operator basis without consideration of the queries concurrently being executed. An estimated execution time of the received query is dynamically predicted using a machine-learning model based on the query-based statistics generated for the received query on the per-operator basis, the current physical resources utilization of the computing system, and the number of queries concurrently being executed. The received query is executed against the database based on the dynamically predicted estimated execution time for the received query.Type: ApplicationFiled: March 5, 2019Publication date: September 10, 2020Inventors: Yuanzhe Bei, Zhihao Zheng
-
Publication number: 20190065538Abstract: According to an example, a system may receive a refresh column command for a column-oriented database. The refresh column command may identify a table in the column-oriented database and a denormalized column in the table. The system may determine column definitions for the column, generate a query to populate values from a data source associated with the column into the column based on the column definitions, and may execute the query to replace entire set of existing column values with the data source values.Type: ApplicationFiled: August 28, 2017Publication date: February 28, 2019Applicant: EntlT Software LLCInventors: Michael LEUCHTENBURG, Yuanzhe Bei, Charles Edward Bear, Nga Tran
-
Publication number: 20170308594Abstract: The present disclosure includes systems and methods for performing a plurality of base statistical analysis on a dataset to yield multiple, different base statistics with respect to a first predetermined number of features. The dataset and the base statistics can be used to find a transformation that determines significant features while controlling a false discovery rate (FDR) to be below a target FDR threshold (?), wherein a second predetermined number of features are assumed to be independent from each other. Using the transformation, a composite index can be generated that represents a synthesis of the base statistics and a report can be generated that indicates at least the relative presence of the significant features in the dataset using the composite index.Type: ApplicationFiled: October 16, 2015Publication date: October 26, 2017Inventors: Pengyu Hong, Yuanzhe Bei