Patents by Inventor Qingqing Zhou

Qingqing Zhou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20200210273
    Abstract: A method for data redistribution of a job data in a first datanode (DN) to at least one additional DN in a Massively Parallel Processing (MPP) Database (DB) is provided. The method includes recording a snapshot of the job data, creating a first data portion in the first DN and a redistribution data portion in the first DN, collecting changes to a job data copy stored in a temporary table, and initiating transfer of the redistribution data portion to the at least one additional DN.
    Type: Application
    Filed: January 8, 2020
    Publication date: July 2, 2020
    Inventors: Le Cai, Qingqing Zhou, Yang Sun
  • Patent number: 10545815
    Abstract: A method for data redistribution of a job data in a first datanode (DN) to at least one additional DN in a Massively Parallel Processing (MPP) Database (DB) is provided. The method includes recording a snapshot of the job data, creating a first data portion in the first DN and a redistribution data portion in the first DN, collecting changes to a job data copy stored in a temporary table, and initiating transfer of the redistribution data portion to the at least one additional DN.
    Type: Grant
    Filed: August 3, 2016
    Date of Patent: January 28, 2020
    Assignee: Futurewei Technologies, Inc.
    Inventors: Le Cai, QingQing Zhou, Yang Sun
  • Patent number: 10360267
    Abstract: Data messages having different priorities may be stored in different communication buffers of a network node. The data messages may then be forwarded from the communication buffers to working buffers as space becomes available in the working buffers. After being forwarded to the working buffers, the data messages may be available to be processed by upper-layer operations of the network node. Priorities may be assigned to the data messages based on a priority level of a query associated with the data messages, a priority level of an upper-layer operation assigned to process the data messages, or combinations thereof.
    Type: Grant
    Filed: June 10, 2015
    Date of Patent: July 23, 2019
    Assignee: Futurewei Technologies, Inc.
    Inventors: Yu Dong, Qingqing Zhou, Guogen Zhang
  • Publication number: 20190205291
    Abstract: Architecture that includes an index creation algorithm that utilizes available resources and dynamically adjusts to successfully scale with increased resources and be able to do so for any data distribution. The resources can be processing resources, memory, and/or input/output, for example. A finer level of granularity, called a segment, is utilized to process tuples in a partition while creating an index. The segment also aligns with compression techniques for the index. By choosing an appropriate size for a segment and using load balancing the overall time for index creation can be reduced. Each segment can then be processed by a single thread thereby limiting segment skew. Skew is further limited by breaking down the work done by a thread into parallelizable stages.
    Type: Application
    Filed: January 10, 2019
    Publication date: July 4, 2019
    Inventors: Artem Oks, Qingqing Zhou, Srikumar Rangarajan, Aleksandras Surna, Wey Guy, Cipri Clinciu, Ashit Gosalia
  • Publication number: 20190163773
    Abstract: A computer-implemented method of relocating data in a distributed database comprises: creating, by one or more processors, a second table in the distributed database, the second table including all columns from a first table; copying, by the one or more processors, a first set of tuples from the first table to the second table; modifying, by the one or more processors, during the copying of the first set of tuples, data of the first table according to a modification; after the copying of the first set of tuples, modifying, by the one or more processors, data of the second table according to the modification; and switching, by the one or more processors, the second table for the first table in a catalog of the distributed database.
    Type: Application
    Filed: November 30, 2017
    Publication date: May 30, 2019
    Inventors: Jason Yang Sun, Cheng Zhu, Le Cai, Qingqing Zhou
  • Patent number: 10303654
    Abstract: System and method for hybrid distribution mode in massively parallel processing (MPP) database preventing storage imbalance issues caused by data skew. Key values of the database are identified as outliers if records of those keys cause database skew. In hybrid mode, records having the outlier key values are distributed using a random distribution scheme. Other records are distributed using a hash distribution scheme. A threshold skew amount is configurable for the system. Record lookups, insertions, deletions, and updates are processed according to a query plan optimized for the distribution mode of the records referenced in a database query.
    Type: Grant
    Filed: February 23, 2015
    Date of Patent: May 28, 2019
    Assignee: Futurewei Technologies, Inc.
    Inventors: Jason Yang Sun, Qingqing Zhou
  • Publication number: 20190129811
    Abstract: A method for processing query fault, where a database server receives a query statement and generates a corresponding query plan tree including multiple layers of operators in a pipeline relationship, and each layer includes operation symbols having logical relationship with each other. The server executes the query statement according to the query plan tree, extracts intermediate status information of a faulty operator when a fault occurs in a process of executing the query statement, updates operation symbols of the faulty operator and a logical relationship among the operation symbols according to the query plan tree and the intermediate status information to obtain a reconstructed query plan tree, and continues to execute the query statement according to the reconstructed query plan tree after the fault is recovered.
    Type: Application
    Filed: December 27, 2018
    Publication date: May 2, 2019
    Inventors: Jinwei Zhu, Qingqing Zhou, Pinggao Zhou
  • Patent number: 10216777
    Abstract: Architecture that includes an index creation algorithm that utilizes available resources and dynamically adjusts to successfully scale with increased resources and be able to do so for any data distribution. The resources can be processing resources, memory, and/or input/output, for example. A finer level of granularity, called a segment, is utilized to process tuples in a partition while creating an index. The segment also aligns with compression techniques for the index. By choosing an appropriate size for a segment and using load balancing the overall time for index creation can be reduced. Each segment can then be processed by a single thread thereby limiting segment skew. Skew is further limited by breaking down the work done by a thread into parallelizable stages.
    Type: Grant
    Filed: January 16, 2017
    Date of Patent: February 26, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Artem Oks, Qingqing Zhou, Srikumar Rangarajan, Aleksandras Surna, Wey Guy, Cipri Clinciu, Ashit Gosalia
  • Publication number: 20180329951
    Abstract: The disclosure relates to technology for estimating a number of samples satisfying a database query. One or more subsets from a sample dataset of a collection of all data are randomly drawn. The one or more subsets are queried to determine a number of cardinalities as training data. A prediction model based on the training data is then trained using machine learning or statistical methods, and a sample size satisfying the database query of the collection of all data is estimated using the trained prediction model.
    Type: Application
    Filed: May 11, 2017
    Publication date: November 15, 2018
    Applicant: Futurewei Technologies, Inc.
    Inventors: Jiangsheng Yu, Shijun Ma, Qingqing Zhou
  • Publication number: 20180293272
    Abstract: A method for cloning data samples in a data set based on statistic information of the data samples. The method does not use any of the data samples to perform the cloning. The statistic information includes a first set of statistic parameters obtained from a data matrix formed by data entries of the data samples based on Eckart-Young theorem, and a second set of statistic parameters indicating statistical properties of the data entries of the data samples. The data samples are reconstructed using the first and the second sets of statistic parameters based on Eckart-Young theorem.
    Type: Application
    Filed: April 5, 2017
    Publication date: October 11, 2018
    Inventors: Jiangsheng Yu, Shijun Ma, Qingqing Zhou, Ting Yu Cliff Leung
  • Publication number: 20180205672
    Abstract: A computer-implemented method and system are provided, including executing an application programming interface (API) in a network switch to define at least one of one or more database functions, performing, using one or more processors, the one or more database functions on at least a portion of data contained in a data message received at the switch, to generate result data, and routing the result data to one or more destination nodes. A database function-defined network switch includes a network switch and one or more processors to perform a pre-defined database function on query data contained in data messages received at the switch, to produce result data, wherein the pre-defined database function is performed on the query data in a first mode of operation to a state of full completion, generating complete result data and no skipped query data, or to a state of partial completion, generating partially completed result data and skipped query data.
    Type: Application
    Filed: January 17, 2017
    Publication date: July 19, 2018
    Inventors: Yu Dong, Qingqing Zhou, Guogen Zhang
  • Publication number: 20180203895
    Abstract: A computer-implemented method and system at a network switch provides using one or more processors to perform a pre-defined database function on query data contained in data messages received at the network switch, with the performing producing result data, and wherein the pre-defined database function is performed on the query data in a first mode of operation to a state of full completion, generating complete result data and no skipped query data, and in a second mode of operation to a state of partial completion, generating partially complete result data and skipped query data. Further, the method and system performing one or more network switch functions to route the complete result data, and/or route the partially complete result data and skipped query data, to one or more destination nodes. In addition, an application programming interface (API) is used to define the database function.
    Type: Application
    Filed: January 17, 2017
    Publication date: July 19, 2018
    Inventors: Yu Dong, Qingqing Zhou
  • Patent number: 9998145
    Abstract: A data processing method and device are provided, where the method includes encoding data by using an arithmetic coding algorithm, so as to obtain a code value interval. When a code value corresponding to the data exists in the code value interval, obtaining the code value according to the code value interval. The method further includes comparing a quantity of bits of the code value with a quantity of bits of the data, so as to obtain a comparison result, and performing a storage operation according to the comparison result.
    Type: Grant
    Filed: August 3, 2017
    Date of Patent: June 12, 2018
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Jinwei Zhu, Long Yan, Qingqing Zhou
  • Patent number: 9959332
    Abstract: In one embodiment, a method includes determining a number of initial servers in a massively parallel processing (MPP) database cluster and determining an initial bucket configuration of the MPP database cluster, where the initial bucket configuration has a number of initial buckets. The method also includes adding a number of additional servers to the MPP database cluster to produce a number of updated servers, where the updated servers include the initial servers and the additional servers and creating an updated bucket configuration in accordance with the number of initial servers, the initial bucket configuration, and the number of additional servers, where the updated bucket configuration has a number of updated buckets. Additionally, the method includes redistributing data of the MPP cluster in accordance with the updated bucket configuration.
    Type: Grant
    Filed: January 21, 2015
    Date of Patent: May 1, 2018
    Assignee: Futurewei Technologies, Inc.
    Inventors: Le Cai, Qingqing Zhou, Kamini Jagtiani
  • Publication number: 20180039534
    Abstract: A method for data redistribution of a job data in a first datanode (DN) to at least one additional DN in a Massively Parallel Processing (MPP) Database (DB) is provided. The method includes recording a snapshot of the job data, creating a first data portion in the first DN and a redistribution data portion in the first DN, collecting changes to a job data copy stored in a temporary table, and initiating transfer of the redistribution data portion to the at least one additional DN.
    Type: Application
    Filed: August 3, 2016
    Publication date: February 8, 2018
    Applicant: Futurewei Technologies, Inc.
    Inventors: Le CAI, QingQing ZHOU, Yang SUN
  • Patent number: 9875186
    Abstract: The present technology relates to managing data caching in processing nodes of a massively parallel processing (MPP) database system. A directory is maintained that includes a list and a storage location of the data pages in the MPP database system. Memory usage is monitored in processing nodes by exchanging memory usage information with each other. Each of the processing nodes manages a list and a corresponding amount of available memory in each of the processing nodes based on the memory usage information. Data pages are read from a memory of the processing nodes in response to receiving a request to fetch the data pages, and a remote memory manager is queried for available memory in each of the processing nodes in response to receiving the request. The data pages are distributed to the memory of the processing nodes having sufficient space available for storage during data processing.
    Type: Grant
    Filed: July 8, 2015
    Date of Patent: January 23, 2018
    Assignee: FutureWei Technologies, Inc.
    Inventors: Huaizhi Li, Qingqing Zhou, Guogen Zhang
  • Publication number: 20170331492
    Abstract: A data processing method and device are provided, where the method includes encoding data by using an arithmetic coding algorithm, so as to obtain a code value interval. When a code value corresponding to the data exists in the code value interval, obtaining the code value according to the code value interval. The method further includes comparing a quantity of bits of the code value with a quantity of bits of the data, so as to obtain a comparison result, and performing a storage operation according to the comparison result.
    Type: Application
    Filed: August 3, 2017
    Publication date: November 16, 2017
    Inventors: Jinwei Zhu, Long Yan, Qingqing Zhou
  • Publication number: 20170124128
    Abstract: Architecture that includes an index creation algorithm that utilizes available resources and dynamically adjusts to successfully scale with increased resources and be able to do so for any data distribution. The resources can be processing resources, memory, and/or input/output, for example. A finer level of granularity, called a segment, is utilized to process tuples in a partition while creating an index. The segment also aligns with compression techniques for the index. By choosing an appropriate size for a segment and using load balancing the overall time for index creation can be reduced. Each segment can then be processed by a single thread thereby limiting segment skew. Skew is further limited by breaking down the work done by a thread into parallelizable stages.
    Type: Application
    Filed: January 16, 2017
    Publication date: May 4, 2017
    Inventors: Artem Oks, Qingqing Zhou, Srikumar Rangarajan, Aleksandras Surna, Wey Guy, Cipri Clinciu, Ashit Gosalia
  • Publication number: 20170097972
    Abstract: System and method embodiments are provided for using different storage formats for a primary database and its replicas in a database managed replication (DMR) system. As such, the advantages of both formats can be combined with suitable design complexity and implementation. In an embodiment, data is arranged in a sequence of rows and stored in a first storage format at the primary database. The data arranged in the sequence of rows is also stored in a second storage format at the replica database. The sequence of rows is determined according to the first storage format or the second storage format. The first storage format is a row store (RS) and the second storage format is a column store (CS), or vice versa. In an embodiment, the sequence of rows is determined to improve compression efficiency at the CS.
    Type: Application
    Filed: October 1, 2015
    Publication date: April 6, 2017
    Inventors: Qingqing Zhou, Yang Sun, Guogen Zhang
  • Publication number: 20170075657
    Abstract: This disclosure provides a clustering storage method and apparatus. The method includes: storing to-be-stored first data row by row into a local memory in a database system; determining a first sorting column, where the first sorting column is used to sort data that has been cached in the local memory; sorting second data according to the first sorting column if the second data that has been cached in the local memory meets a preset condition, where the second data is data, which has been cached into the local memory, in the first data; and storing the sorted second data in a clustering manner into a storage medium in the database system.
    Type: Application
    Filed: November 23, 2016
    Publication date: March 16, 2017
    Applicant: HUAWEI TECHNOLOGIES CO.,LTD.
    Inventors: Jinyu Zhang, Qingqing Zhou, Zhongliang Zhang