Patents by Inventor Ke Wei Wei
Ke Wei Wei has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11947538
Abstract: A method for processing a plurality of queries is provided according to embodiments of the present disclosure. In this method, based on a plurality of queries and an execution plan for the plurality of queries, a plurality of record identification (ID) numbers can be stored into a pool in a numerical order. Each of the plurality of record ID numbers can identify a data record in a database. Then, the execution plan can be performed to batch a plurality of data records corresponding to the plurality of record ID numbers in the database based on a distribution of the plurality of record ID numbers in the pool.
Type: Grant
Filed: May 4, 2022
Date of Patent: April 2, 2024
Assignee: International Business Machines Corporation
Inventors: Ke Wei Wei, Shuang Yu, Zhenyu Shi, Ji Gao Fu, Heng Liu
-
Publication number: 20230359621
Abstract: A method for processing a plurality of queries is provided according to embodiments of the present disclosure. In this method, based on a plurality of queries and an execution plan for the plurality of queries, a plurality of record identification (ID) numbers can be stored into a pool in a numerical order. Each of the plurality of record ID numbers can identify a data record in a database. Then, the execution plan can be performed to batch a plurality of data records corresponding to the plurality of record ID numbers in the database based on a distribution of the plurality of record ID numbers in the pool.
Type: Application
Filed: May 4, 2022
Publication date: November 9, 2023
Inventors: Ke Wei Wei, SHUANG YU, Zhenyu Shi, Ji Gao Fu, Heng Liu
-
Patent number: 11743133
Abstract: A method includes generating a plurality of vectors representing words in a plurality of documents about an information technology (IT) system and clustering the plurality of vectors to produce a plurality of clusters. The method also includes identifying a cluster of the plurality of clusters that contains a plurality of clustered vectors, generating a feature based on a plurality of words represented by the plurality of clustered vectors, and training a machine learning model to identify an anomaly in the IT system based on the feature.
Type: Grant
Filed: September 14, 2021
Date of Patent: August 29, 2023
Assignee: International Business Machines Corporation
Inventors: Ke Wei Wei, Wei Liu, Guo Ran Sun, Shuang YS Yu, Meichi Maggie Lin, Yi Dai
-
Publication number: 20230214454
Abstract: An embodiment generates an initial set of training data from monitoring data. The initial set of training data is generated by combining outputs from a plurality of pretrained classifiers. The embodiment trains a new classification model using the initial set of training data to identify anomalies in monitoring data. The embodiment performs a multiple-level clustering of the data samples resulting in a plurality of clusters of sub-clusters of data samples, and generates a review list of data samples by selecting a representative data sample from each of the clusters. The embodiment receives an updated data sample from the expert review that includes a revised target classification for at least one of the data samples of the expert review list. The embodiment then trains another replacement classification model using a revised set of training data that includes the updated data sample and associated revised target classification.
Type: Application
Filed: January 4, 2022
Publication date: July 6, 2023
Applicant: International Business Machines Corporation
Inventors: Ke Wei Wei, Jun Wang, Shuang YS Yu, Guang Ming Zhang, Yuan Feng, Yi Dai, Ling Zhuo, Jing Xu
-
Publication number: 20230078661
Abstract: A method includes generating a plurality of vectors representing words in a plurality of documents about an information technology (IT) system and clustering the plurality of vectors to produce a plurality of clusters. The method also includes identifying a cluster of the plurality of clusters that contains a plurality of clustered vectors, generating a feature based on a plurality of words represented by the plurality of clustered vectors, and training a machine learning model to identify an anomaly in the IT system based on the feature.
Type: Application
Filed: September 14, 2021
Publication date: March 16, 2023
Inventors: Ke Wei WEI, Wei LIU, Guo Ran SUN, Shuang YS YU, Meichi Maggie LIN, Yi DAI
-
Patent number: 11568272
Abstract: Aspects of the invention include a computer-implemented method that receives, by a processor, an ensemble decision tree and generates, by the processor, native code from the ensemble decision tree. The method compiles, by the processor, the native code into machine language and scores, by the processor, the execution time of the native code. The method dynamically reoptimizes, by the processor, portions of the native code corresponding to the most traversed portion of the ensemble decision tree.
Type: Grant
Filed: June 9, 2020
Date of Patent: January 31, 2023
Assignee: International Business Machines Corporation
Inventors: Jean-François Puget, Ke Wei Wei, Xinke Wang, Qi Wen, Chu Yun Tong, Tian Tian, Chi Liu
-
Publication number: 20220318652
Abstract: An approach is provided in which a method, system, and program product analyze, while training a machine learning model, a set of first data transformation operators in a first data preparation pipeline that generates a plurality of constructed features from a set of training data. The method, system, and program product create a plurality of second data preparation pipelines from the first data preparation pipeline, wherein the set of first data transformation operators are converted to a set of second data transformation operators and each assigned to one of the plurality of second data preparation pipelines. The method, system, and program product deploy the plurality of second data preparation pipelines to a runtime system.
Type: Application
Filed: March 31, 2021
Publication date: October 6, 2022
Inventors: Ke Wei Wei, Hong Min, Shuang YS Yu, Qi Zhang, Meichi Maggie Lin, Peter Bendel, Heng Liu
-
Publication number: 20210383244
Abstract: Aspects of the invention include a computer-implemented method that receives, by a processor, an ensemble decision tree and generates, by the processor, native code from the ensemble decision tree. The method compiles, by the processor, the native code into machine language and scores, by the processor, the execution time of the native code. The method dynamically reoptimizes, by the processor, portions of the native code corresponding to the most traversed portion of the ensemble decision tree.
Type: Application
Filed: June 9, 2020
Publication date: December 9, 2021
Inventors: Jean-François Puget, Ke Wei Wei, Xinke Wang, Qi Wen, Chu Yun Tong, Tian Tian, Chi Liu
-
Patent number: 11176128
Abstract: Methods, computer program products, and systems are presented. The methods, computer program products, and systems can include, for instance: receiving, by a production database, a query statement for processing by the production database; the production database in performing runtime execution processing of the query statement identifying one or more query statement value of the query statement; the production database in performing runtime execution processing of the query statement establishing, using a predictive model, an access path based on the identified one or more query statement value; and the production database using the established access path in performing runtime execution processing of the query statement.
Type: Grant
Filed: May 15, 2018
Date of Patent: November 16, 2021
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Xiaobo Wang, Shuo Li, Ke Wei Wei, Heng Liu
-
Patent number: 11138166
Abstract: Database configuration tuning is provided. A set of database nodes having similar data factors is selected in a centerless network of database nodes. Configuration models corresponding to the set of database nodes are trained using data parallelism. Trained configuration models corresponding to the set of database nodes are combined to form a federated configuration model. It is determined whether performance indicators corresponding to the set of database nodes are greater than a performance threshold level. In response to determining that the performance indicators corresponding to the set of database nodes are greater than the performance threshold level, a database configuration corresponding to the federated configuration model is recommended to a new database node. The new database node is joined to the centerless network.
Type: Grant
Filed: August 26, 2019
Date of Patent: October 5, 2021
Assignee: International Business Machines Corporation
Inventors: ShengYan Sun, Ke Wei Wei, Meng Wan, Shuo Li, Xiaobo Wang
-
Patent number: 10983994
Abstract: In an approach to determining an access method for a partition in a partition table, a computer receives a query and determines if there is a partition table utilized by the query. When there is a partition table utilized by the query, then the computer determines that a partition in the partition table meets the plurality of conditions of the query. The computer collects a plurality of partition level statistics for the partition that meets the plurality of conditions of the query. Additionally, the computer determines, based, at least in part, on the plurality of partition level statistics, a cost for one or more access methods for the partition that meets the plurality of conditions of the query. Furthermore, the computer determines, based, at least in part, on the cost for each access method, an access method for the partition that meets the plurality of conditions of the query.
Type: Grant
Filed: September 21, 2018
Date of Patent: April 20, 2021
Assignee: International Business Machines Corporation
Inventors: Shuo Li, Ping Liang, Ke Wei Wei, Xin Ying Yang
-
Publication number: 20210064591
Abstract: Database configuration tuning is provided. A set of database nodes having similar data factors is selected in a centerless network of database nodes. Configuration models corresponding to the set of database nodes are trained using data parallelism. Trained configuration models corresponding to the set of database nodes are combined to form a federated configuration model. It is determined whether performance indicators corresponding to the set of database nodes are greater than a performance threshold level. In response to determining that the performance indicators corresponding to the set of database nodes are greater than the performance threshold level, a database configuration corresponding to the federated configuration model is recommended to a new database node. The new database node is joined to the centerless network.
Type: Application
Filed: August 26, 2019
Publication date: March 4, 2021
Inventors: ShengYan Sun, Ke Wei Wei, Meng Wan, Shuo Li, Xiaobo Wang
-
Patent number: 10891300
Abstract: A computer-implemented method includes identifying a primary database interaction statement and determining one or more intermediate database interaction statements based on the primary database interaction statement. The computer-implemented method further includes determining, for each of the one or more intermediate database interaction statements, an intermediate statement transformation indicator. The computer-implemented method further includes determining one or more transformed database interaction statements based on each intermediate statement transformation indicator.
Type: Grant
Filed: October 27, 2017
Date of Patent: January 12, 2021
Assignee: International Business Machines Corporation
Inventors: Dong Jie Wei, Ke Wei Wei, Xin Ying Yang, Miao Zheng
-
Patent number: 10762099
Abstract: A computer-implemented method includes identifying a primary database interaction statement and determining one or more intermediate database interaction statements based on the primary database interaction statement. The computer-implemented method further includes determining, for each of the one or more intermediate database interaction statements, an intermediate statement transformation indicator. The computer-implemented method further includes determining one or more transformed database interaction statements based on each intermediate statement transformation indicator. A corresponding computer program product and computer system are also disclosed.
Type: Grant
Filed: June 7, 2016
Date of Patent: September 1, 2020
Assignee: International Business Machines Corporation
Inventors: Dong Jie Wei, Ke Wei Wei, Xin Ying Yang, Miao Zheng
-
Patent number: 10747766
Abstract: A computer-implemented method includes receiving a batch processing job. The method identifies multiple access paths for the batch processing job. The method determines one or more runtime costs for each of the multiple access paths. The method determines whether the batch processing job includes comparable access paths. The method responds to the batch processing job not including comparable access paths by receiving a query and selecting a desired access path based on the one or more runtime costs. The method responds to the batch processing job including comparable access paths by determining one or more context costs for each of the one or more access paths, receiving a query, identifying a context for the query, and selecting a desired access path for the query. The desired access path is based on the one or more runtime costs, the one or more context costs, and the context for the query.
Type: Grant
Filed: May 11, 2018
Date of Patent: August 18, 2020
Assignee: International Business Machines Corporation
Inventors: Shuo Li, Ke Wei Wei, Li Yang, Xin Ying Yang, Jian Wei Zhang
-
Patent number: 10585890
Abstract: Aspects of the present disclosure relate to an approach for generating query execution plans for a database query. A computer-implemented method comprises determining whether a size of a file to be generated during execution of a database query is variable in response to obtaining the database query. The method further comprises determining a plurality of ranges for the size of the file in response to determining that the size of the file is variable. The method further comprises generating a plurality of query execution plans corresponding to the plurality of ranges. Accordingly, a plurality of query execution plans corresponding to different file sizes can be generated for the database query, and an efficient and cost effective query execution plan may be selected based on the actual file size during the execution of the database query.
Type: Grant
Filed: October 27, 2016
Date of Patent: March 10, 2020
Assignee: International Business Machines Corporation
Inventors: Shuo Li, Ke Wei Wei, Xin Ying Yang, Chen Xin Yu
-
Patent number: 10572480
Abstract: A computer-implemented method includes identifying a query, including one or more predicates and one or more branches, wherein one or more branches includes one or more legs. The computer-implemented method further includes, for each branch, in parallel: determining a risk, determining a return row threshold, estimating a number of return rows; terminating access if the return rows exceed the threshold. The computer-implemented method further includes, for each leg, in parallel: determining a leg return row threshold; accessing the leg; fetching one or more return rows into one or more leg return row pages; terminating access if the return rows exceed the threshold; intersecting one or more leg return row pages into one or more intersected leg return row pages; and applying the one or more predicates to the one or more intersected leg return row pages. The method may be embodied in a corresponding computer system or computer program product.
Type: Grant
Filed: November 10, 2017
Date of Patent: February 25, 2020
Assignee: International Business Machines Corporation
Inventors: Shuo Li, Heng Liu, Ke Wei Wei, Xin Ying Yang
-
Patent number: 10565201
Abstract: Disclosed aspects relate to query processing management in a database management system (DBMS). The DBMS may detect that a query has a set of subqueries. The DBMS may detect that the set of subqueries indicates a set of data structures. A lead data structure may be selected from the set of data structures using a selection criterion. A set of input columns may be identified from the lead data structure of the set of data structures. A set of output columns may be identified from the set of data structures other than the lead data structure. A hash data structure may be constructed using the set of input columns and the set of output columns. A cache may be built based on the hash data structure.
Type: Grant
Filed: November 4, 2016
Date of Patent: February 18, 2020
Assignee: International Business Machines Corporation
Inventors: Thomas A. Beavin, Zhang Hao, Shuo Li, Xiaobo Wang, Ke Wei Wei
-
Patent number: 10528562
Abstract: A computer-implemented method includes identifying a query, including one or more predicates and one or more branches, wherein one or more branches includes one or more legs. The computer-implemented method further includes, for each branch, in parallel: determining a risk, determining a return row threshold, estimating a number of return rows; terminating access if the return rows exceed the threshold. The computer-implemented method further includes, for each leg, in parallel: determining a leg return row threshold; accessing the leg; fetching one or more return rows into one or more leg return row pages; terminating access if the return rows exceed the threshold; intersecting one or more leg return row pages into one or more intersected leg return row pages; and applying the one or more predicates to the one or more intersected leg return row pages. The method may be embodied in a corresponding computer system or computer program product.
Type: Grant
Filed: September 12, 2016
Date of Patent: January 7, 2020
Assignee: International Business Machines Corporation
Inventors: Shuo Li, Heng Liu, Ke Wei Wei, Xin Ying Yang
-
Publication number: 20190354621
Abstract: Methods, computer program products, and systems are presented. The methods, computer program products, and systems can include, for instance: receiving, by a production database, a query statement for processing by the production database; the production database in performing runtime execution processing of the query statement identifying one or more query statement value of the query statement; the production database in performing runtime execution processing of the query statement establishing, using a predictive model, an access path based on the identified one or more query statement value; and the production database using the established access path in performing runtime execution processing of the query statement.
Type: Application
Filed: May 15, 2018
Publication date: November 21, 2019
Inventors: Xiaobo WANG, Shuo LI, Ke Wei WEI, Heng LIU