Patents by Inventor Qiuye WANG

Qiuye WANG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method for interconnecting data lake and relational database

Patent number: 11914609

Abstract: The present disclosure provides a method for interconnecting a data lake and a relational database, including the following steps: S1: adding a data source class of a relational database to a data lake; S2: matching and using, by the data lake, a data source class of the relational database; and S3: determining and loading a corresponding driver according to the data source class, so as to connect the corresponding relational database. A data source registering configuration file, a relational database configuration file and a driver package catalog are cascaded by parameter passing, so that no specific database needs to be designated when the data lake is started; the corresponding database is used directly. Nor does the configuration file need to be traversed; instead, the user acquires configuration information as required through parameter passing.

Type: Grant

Filed: December 21, 2022

Date of Patent: February 27, 2024

Assignees: NANHU LABORATORY, BEIJING BIG DATA ADVANCED TECHNOLOGY RESEARCH INSTITUTE

Inventors: Hao Liu, Tao Zhang, Lei Zhang, Peng Wang, Zhefeng Liu, Zhiling Chen, Qiuye Wang, Wei Chen, Yinlong Liu, Chenxi Yu
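
The three steps in the abstract above (register a data source class, match it, then load the matching driver) can be sketched as a keyed lookup. This is a minimal illustration under stated assumptions, not the patented implementation: `DataSource`, `DataSourceRegistry`, `connect`, the JDBC URLs and the driver jar paths are all hypothetical names chosen for the example, and no driver is actually loaded.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataSource:
    """One registered relational data source (all fields are illustrative)."""
    source_class: str  # name the data lake matches on (S1/S2)
    jdbc_url: str      # stands in for the relational database configuration
    driver_jar: str    # stands in for an entry in the driver package catalog


class DataSourceRegistry:
    """Hypothetical registry that cascades name -> configuration -> driver."""

    def __init__(self):
        self._sources = {}

    def register(self, source):
        # S1: add a data source class to the data lake.
        self._sources[source.source_class] = source

    def resolve(self, source_class):
        # S2: match the class by the name passed in as a parameter.
        # A dictionary lookup avoids traversing every configuration entry.
        try:
            return self._sources[source_class]
        except KeyError:
            raise LookupError(
                f"no data source class registered for {source_class!r}"
            ) from None


def connect(registry, source_class):
    # S3: determine the driver for the matched class.
    # A real system would load the driver here; this sketch only
    # reports which driver and URL would be used.
    source = registry.resolve(source_class)
    return f"load {source.driver_jar} -> {source.jdbc_url}"
```

Because the caller passes the data source class as a parameter, nothing has to be fixed at startup: registering `DataSource("mysql", ...)` and `DataSource("postgresql", ...)` side by side and then calling `connect(registry, "mysql")` selects only the MySQL entry.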

High-performance data lake system and data storage method

Patent number: 11789899

Abstract: The present disclosure provides a high-performance data lake system and a data storage method. The data storage method includes the following steps: S1: converting a file into a file stream; S2: converting the file stream into an array in which multiple subarrays are nested; and S3: converting the array into a resilient distributed dataset (RDD), and storing the RDD in a storage layer of a data lake. The present disclosure provides a nested field structure, which lays the foundation for parallel processing in reading, and effectively improves read performance. Furthermore, the present disclosure flexibly generates a number of nested subarrays according to the number of hardware cores, such that the data lake achieves better scalability, and can keep optimal writing efficiency for different users.

Type: Grant

Filed: November 17, 2022

Date of Patent: October 17, 2023

Assignees: Nanhu Laboratory, Advanced Institute of Big Data, Beijing

Inventors: Hao Liu, Zhiling Chen, Tao Zhang, Peng Wang, Qiuye Wang, Chenxi Yu, Wei Chen, Yinlong Liu, Zhefeng Liu, Yonggang Tu
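
Steps S1 and S2 of the storage method above can be sketched in plain Python. This is a rough illustration under stated assumptions, not the patented implementation: the function names are hypothetical, the file is modeled as in-memory bytes, and the subarray count defaults to `os.cpu_count()` as one plausible reading of "according to hardware cores". Step S3 is left as a comment because it needs a running Spark context.

```python
import io
import os


def file_to_stream(data: bytes) -> io.BytesIO:
    # S1: expose the file contents as a readable stream.
    return io.BytesIO(data)


def stream_to_nested_array(stream, num_subarrays=None):
    # S2: split the stream into an array of nested subarrays.
    # The default count follows the number of CPU cores (an assumption).
    if num_subarrays is None:
        num_subarrays = os.cpu_count() or 1
    payload = stream.read()
    if not payload:
        return []
    # Ceiling division, so at most num_subarrays chunks are produced.
    chunk = -(-len(payload) // num_subarrays)
    return [list(payload[i:i + chunk]) for i in range(0, len(payload), chunk)]


# S3 (not executed here): with PySpark available, the nested array could be
# handed to SparkContext.parallelize, for example
#     rdd = sc.parallelize(nested, numSlices=len(nested))
# and the resulting RDD written to the storage layer.
```

Each subarray can then be read or written independently, which is what makes the nested layout a natural fit for parallel processing.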

METHOD FOR INTERCONNECTING DATA LAKE AND RELATIONAL DATABASE

Publication number: 20230222138

Abstract: The present disclosure provides a method for interconnecting a data lake and a relational database, including the following steps: S1: adding a data source class of a relational database to a data lake; S2: matching and using, by the data lake, a data source class of the relational database; and S3: determining and loading a corresponding driver according to the data source class, so as to connect the corresponding relational database. A data source registering configuration file, a relational database configuration file and a driver package catalog are cascaded by parameter passing, so that no specific database needs to be designated when the data lake is started; the corresponding database is used directly. Nor does the configuration file need to be traversed; instead, the user acquires configuration information as required through parameter passing.

Type: Application

Filed: December 21, 2022

Publication date: July 13, 2023

Inventors: Hao LIU, Tao ZHANG, Lei ZHANG, Peng WANG, Zhefeng LIU, Zhiling CHEN, Qiuye WANG, Wei CHEN, Yinlong LIU, Chenxi YU

HIGH-PERFORMANCE DATA LAKE SYSTEM AND DATA STORAGE METHOD

Publication number: 20230153267

Abstract: The present disclosure provides a high-performance data lake system and a data storage method. The data storage method includes the following steps: S1: converting a file into a file stream; S2: converting the file stream into an array in which multiple subarrays are nested; and S3: converting the array into a resilient distributed dataset (RDD), and storing the RDD in a storage layer of a data lake. The present disclosure provides a nested field structure, which lays the foundation for parallel processing in reading, and effectively improves read performance. Furthermore, the present disclosure flexibly generates a number of nested subarrays according to the number of hardware cores, such that the data lake achieves better scalability, and can keep optimal writing efficiency for different users.

Type: Application

Filed: November 17, 2022

Publication date: May 18, 2023

Applicants: Nanhu Laboratory, Advanced Institute of Big Data, Beijing

Inventors: Hao LIU, Zhiling CHEN, Tao ZHANG, Peng WANG, Qiuye WANG, Chenxi YU, Wei CHEN, Yinlong LIU, Zhefeng LIU, Yonggang TU
