Patents by Inventor Jason Yang SUN
Jason Yang SUN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11789936Abstract: The present disclosure describes storage techniques for hybrid transactional and analytical processing. Data captured by a first processing engine may be received. The first processing engine may be configured to perform online transactional processing). Multiple replicas of logical logs generated based on the data may be distributed to a Delta Store by applying a quorum protocol on the multiple replicas. Data in the Delta Store are stored in a row format and are visible to a query for online analytical processing performed by a second processing engine. Data may be flushed from the Delta Store to a Base Store based on one or more predetermined rules. Data in the Base Store are stored in a columnar format and may be accessible by the second processing engine.Type: GrantFiled: August 31, 2021Date of Patent: October 17, 2023Assignee: LEMON INC.Inventors: Jianjun Chen, Yonghua Ding, Ye Liu, Fangshi Li, Li Zhang, Mingyi Zhang, Kui Wei, Wei Ding, Kai Wu, Jason Yang Sun
-
Publication number: 20230063730Abstract: The present disclosure describes storage techniques for hybrid transactional and analytical processing. Data captured by a first processing engine may be received. The first processing engine may be configured to perform online transactional processing). Multiple replicas of logical logs generated based on the data may be distributed to a Delta Store by applying a quorum protocol on the multiple replicas. Data in the Delta Store are stored in a row format and are visible to a query for online analytical processing performed by a second processing engine. Data may be flushed from the Delta Store to a Base Store based on one or more predetermined rules. Data in the Base Store are stored in a columnar format and may be accessible by the second processing engine.Type: ApplicationFiled: August 31, 2021Publication date: March 2, 2023Inventors: Jianjun Chen, Yonghua Ding, Ye Liu, Fangshi Li, Li Zhang, Mingyi Zhang, Kui Wei, Wei Ding, Kai Wu, Jason Yang Sun
-
Patent number: 11151111Abstract: A computer-implemented method of relocating data in a distributed database comprises: creating, by one or more processors, a second table in the distributed database, the second table including all columns from a first table; copying, by the one or more processors, a first set of tuples from the first table to the second table; modifying, by the one or more processors, during the copying of the first set of tuples, data of the first table according to a modification; after the copying of the first set of tuples, modifying, by the one or more processors, data of the second table according to the modification; and switching, by the one or more processors, the second table for the first table in a catalog of the distributed database.Type: GrantFiled: November 30, 2017Date of Patent: October 19, 2021Assignee: Futurewei Technologies, Inc.Inventors: Jason Yang Sun, Cheng Zhu, Le Cai, Qingqing Zhou
-
Patent number: 10810268Abstract: A system and method for improved online transaction processing (OLTP) in a sharded database is provided. Overhead associated with a global transaction manager is reduced and scalability improved by determining whether incoming queries are single-shard transactions or multi-shard transactions. For multi-shard transactions, a distributed transaction ID (DXID) is requested from the GTM, and then forwarded with the query to one or more data notes. For single-shard transactions, the query is sent to a data node without requesting a DXID from the GTM.Type: GrantFiled: December 6, 2017Date of Patent: October 20, 2020Assignee: FUTUREWEI TECHNOLOGIES, INC.Inventors: Le Cai, Jianjun Chen, Mason Sharp, Yu Chen, Jun Chen, Jason Yang Sun, Yongyan Wang
-
Patent number: 10649996Abstract: A massively parallel processing shared nothing relational database management system includes a plurality of storages assigned to a plurality of compute nodes. The system comprises a non-transitory memory having instructions and one or more processors in communication with the memory. The one or more processors execute the instructions to store a set of data in a first set of storages in the plurality of storages. The first set of data is hashed into a repartitioned set of data. The first set of storages is reassigned to a second set of compute nodes in the plurality of compute nodes. The repartitioned set of data is distributed to the second set of compute nodes and a database operation is performed on the repartitioned set of data by the second set of compute nodes.Type: GrantFiled: December 9, 2016Date of Patent: May 12, 2020Assignee: Futurewei Technologies, Inc.Inventors: Li Zhang, Jason Yang Sun, Yonghua Ding
-
Publication number: 20190171763Abstract: A system and method for improved online transaction processing (OLTP) in a sharded database is provided. Overhead associated with a global transaction manager is reduced and scalability improved by determining whether incoming queries are single-shard transactions or multi-shard transactions. For multi-shard transactions, a distributed transaction ID (DXID) is requested from the GTM, and then forwarded with the query to one or more data notes. For single-shard transactions, the query is sent to a data node without requesting a DXID from the GTM.Type: ApplicationFiled: December 6, 2017Publication date: June 6, 2019Inventors: Le Cai, Jianjun Chen, Mason Sharp, Yu Chen, Jun Chen, Jason Yang Sun, Yongyan Wang
-
Publication number: 20190163773Abstract: A computer-implemented method of relocating data in a distributed database comprises: creating, by one or more processors, a second table in the distributed database, the second table including all columns from a first table; copying, by the one or more processors, a first set of tuples from the first table to the second table; modifying, by the one or more processors, during the copying of the first set of tuples, data of the first table according to a modification; after the copying of the first set of tuples, modifying, by the one or more processors, data of the second table according to the modification; and switching, by the one or more processors, the second table for the first table in a catalog of the distributed database.Type: ApplicationFiled: November 30, 2017Publication date: May 30, 2019Inventors: Jason Yang Sun, Cheng Zhu, Le Cai, Qingqing Zhou
-
Patent number: 10303654Abstract: System and method for hybrid distribution mode in massively parallel processing (MPP) database preventing storage imbalance issues caused by data skew. Key values of the database are identified as outliers if records of those keys cause database skew. In hybrid mode, records having the outlier key values are distributed using a random distribution scheme. Other records are distributed using a hash distribution scheme. A threshold skew amount is configurable for the system. Record lookups, insertions, deletions, and updates are processed according to a query plan optimized for the distribution mode of the records referenced in a database query.Type: GrantFiled: February 23, 2015Date of Patent: May 28, 2019Assignee: Futurewei Technologies, Inc.Inventors: Jason Yang Sun, Qingqing Zhou
-
Patent number: 10055458Abstract: A method includes dividing a dataset into partitions by hashing a specified key, selecting a set of distributed file system nodes as a primary node group for storage of the partitions, and causing a primary copy of the partitions to be stored on the primary node group by a distributed storage system file server such that the location of each partition is known by hashing of the specified key.Type: GrantFiled: July 30, 2015Date of Patent: August 21, 2018Assignee: Futurewei Technologies, Inc.Inventors: Jason Yang Sun, Guogen Zhang, Le Cai
-
Publication number: 20180165331Abstract: A massively parallel processing shared nothing relational database management system includes a plurality of storages assigned to a plurality of compute nodes. The system comprises a non-transitory memory having instructions and one or more processors in communication with the memory. The one or more processors execute the instructions to store a set of data in a first set of storages in the plurality of storages. The first set of data is hashed into a repartitioned set of data. The first set of storages is reassigned to a second set of compute nodes in the plurality of compute nodes. The repartitioned set of data is distributed to the second set of compute nodes and a database operation is performed on the repartitioned set of data by the second set of compute nodes.Type: ApplicationFiled: December 9, 2016Publication date: June 14, 2018Applicant: Futurewei Technologies, Inc.Inventors: Li Zhang, Jason Yang Sun, Yonghua Ding
-
Patent number: 9934051Abstract: The disclosure relates to technology for query compilation in a database management system. A first execution time of code for at least one database query without applying a code generation method is estimated and in response to receiving the at least one database query, and for one or more code generation methods, a compilation cost and a second execution time of the code as modified by the code generation methods is estimated. A cost savings for each of the one or more code generation methods is calculated, where the cost savings is calculated as the first execution time less the second execution time of the code generation method, less the compilation cost of the code generation method. One of the code generation methods or the no code generation method with the highest cost savings is then selected.Type: GrantFiled: April 17, 2017Date of Patent: April 3, 2018Assignee: FUTUREWEI TECHNOLOGIES, INC.Inventors: Yonghua Ding, Jason Yang Sun, Li Zhang
-
Publication number: 20170031988Abstract: A method includes dividing a dataset into partitions by hashing a specified key, selecting a set of distributed file system nodes as a primary node group for storage of the partitions, and causing a primary copy of the partitions to be stored on the primary node group by a distributed storage system file server such that the location of each partition is known by hashing of the specified key.Type: ApplicationFiled: July 30, 2015Publication date: February 2, 2017Inventors: Jason Yang Sun, Guogen Zhang, Le Cai
-
Publication number: 20160378824Abstract: A system and method for parallelizing hash-based operators in symmetric multiprocessing (SMP) databases is provided. In an embodiment, a method in a device for performing hash based database operations includes receiving at the device an database query; creating a plurality of execution workers to process the query; and building by the execution workers a hash table from a database table, the database table comprising one of a plurality of partitions and a plurality of scan units, the hash table shared by the execution workers, each execution worker scanning a corresponding partition and adding entries to the hash table if the database table is partitioned, each execution worker scanning an unprocessed scan unit and adding entries to the hash table according to the scan unit if the database table comprises scan units, and the workers performing the scanning and the adding in a parallel manner.Type: ApplicationFiled: June 24, 2015Publication date: December 29, 2016Inventors: Huaizhi Li, Guogen Zhang, Jason Yang Sun
-
Publication number: 20160306810Abstract: System and method for storing statistical data of records stored in a distributed file system. In one aspect a statistical data block is allocated in a memory of a data node for storing statistical data of records stored in a storage disk of the data node. Each data block of the plurality of data blocks in the data node has a respective entry in the statistical data block, which is collocated with data blocks on the data node. Statistical data of records stored in the distributed file system are collected, and written to statistical data block in the memory of the data node.Type: ApplicationFiled: April 15, 2015Publication date: October 20, 2016Inventors: Demai NI, Guogen ZHANG, Qingqing ZHOU, Jason Yang SUN
-
Publication number: 20160246785Abstract: System and method for hybrid distribution mode in massively parallel processing (MPP) database preventing storage imbalance issues caused by data skew. Key values of the database are identified as outliers if records of those keys cause database skew. In hybrid mode, records having the outlier key values are distributed using a random distribution scheme. Other records are distributed using a hash distribution scheme. A threshold skew amount is configurable for the system. Record lookups, insertions, deletions, and updates are processed according to a query plan optimized for the distribution mode of the records referenced in a database query.Type: ApplicationFiled: February 23, 2015Publication date: August 25, 2016Inventors: Jason Yang SUN, Qingqing ZHOU
-
Patent number: 9311354Abstract: Queries may be processed more efficiently in an massively parallel processing (MPP) database by locally optimizing the global execution plan. The global execution plan and a semantic tree may be provided to MPP data nodes by an MPP coordinator. The MPP data nodes may then use the global execution plan and the semantic tree to generate a local execution plan. Thereafter, the MPP data nodes may select either the global execution plan or the local execution plan is accordance with a cost evaluation.Type: GrantFiled: December 29, 2012Date of Patent: April 12, 2016Assignee: Futurewei Technologies, Inc.Inventors: Jason Yang Sun, Qingqing Zhou, Mohan Singamshetty
-
Publication number: 20160092488Abstract: Presented systems and methods can facilitate efficient and effective information storage management. A system may include a plurality of nodes, shared storage and a centralized lock manager. A storage management method can include: receiving an access request to information, performing a lock resolution process; and performing an access operation (e.g., read, information update, etc.). The information can be associated with a shared storage component. The lock resolution process can include participating in a lock management process that manages a physical lock (P-lock), wherein the lock management process utilizes transaction information associated with an implicit lock process and proceeds without communication overhead associated with explicit requests for a logical lock.Type: ApplicationFiled: September 26, 2014Publication date: March 31, 2016Inventors: Jason Yang SUN, Guogen ZHANG