Patents by Inventor Sheng Yan Sun

Sheng Yan Sun has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Accelerating fetching of result sets

Patent number: 11960544

Abstract: A computer-implemented method processes a query. A number of processor units processes the query to identify a result set in response to receiving the query from a first client. The number of processor units stores the result set in a shared cache assigned to a group of clients, wherein the result set stored in the shared cache is accessible by the group of clients. The number of processor units returns the result set to a second client in the group of clients from the shared cache in response to receiving the query from the second client in the group of clients.
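As an illustration only, the shared-cache flow in this abstract might look like the minimal Python sketch below. The class name `SharedResultCache`, the `run_query` callback, and the use of the query text as the cache key are assumptions made for the example and are not taken from the patent.

```python
from typing import Callable, Dict, List, Set


class SharedResultCache:
    """Hypothetical result-set cache shared by all clients in one group."""

    def __init__(self, group: Set[str], run_query: Callable[[str], List[tuple]]):
        self.group = set(group)          # clients allowed to read the cache
        self.run_query = run_query       # assumed backend that executes a query
        self._cache: Dict[str, List[tuple]] = {}
        self.executions = 0              # how many times the backend really ran

    def fetch(self, client: str, query: str) -> List[tuple]:
        if client not in self.group:
            raise PermissionError(f"{client} is not in the cache group")
        if query not in self._cache:
            # First client to send this query: execute it and store the result.
            self.executions += 1
            self._cache[query] = self.run_query(query)
        # Later clients in the group are served from the shared cache.
        return self._cache[query]


cache = SharedResultCache({"client_a", "client_b"}, lambda q: [(1,), (2,)])
first = cache.fetch("client_a", "SELECT id FROM t")
second = cache.fetch("client_b", "SELECT id FROM t")
```

In this sketch the second client receives the stored result set without the query being executed again; cache invalidation is deliberately left out.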

Type: Grant

Filed: October 28, 2021

Date of Patent: April 16, 2024

Assignee: International Business Machines Corporation

Inventors: Sheng Yan Sun, Shuo Li, Xiaobo Wang, Hong Mei Zhang

Computer Memory Management With Efficient Index Access

Publication number: 20240111773

Abstract: Computer technology for retrieving data stored in a table that includes the following computer operations: receiving, from persistent storage of a computer, an original Index Tree data structure; storing, in volatile memory, a memory-based Index Tree data structure based on the original Index Tree data structure, with the memory-based Index Tree data structure including: a root node, a set of hierarchically arranged intermediate layer(s) with each intermediate layer including a plurality of non-leaf nodes, and a leaf layer including a plurality of leaf nodes; and retrieving data from a table in a database, with the retrieval including traversing the memory-based Index Tree data structure through a plurality of child lock pointers to locate pages including the data to be retrieved from the table.

Type: Application

Filed: September 29, 2022

Publication date: April 4, 2024

Inventors: Shuo Li, Xiaobo Wang, Sheng Yan Sun, Ying Zhang

Heterogeneous schema discovery for unstructured data

Patent number: 11947561

Abstract: An embodiment for analyzing and tracking data flow to determine proper schemas for unstructured data. The embodiment may automatically use a sidecar to collect schema discovery rules during conversion of raw data to unstructured data. The embodiment may automatically generate multiple schemas for different tenants using the collected schema discovery rules. The embodiment may automatically use ETL to export unstructured data to SQL databases with the generated multiple schemas for the different tenants. The embodiment may automatically monitor usage data of the SQL databases and collect the usage data. The embodiment may automatically optimize schema discovery using the collected usage data. The embodiment may automatically discover schemas with hot usage and apply the discovered schemas with hot usage to other tenants for consumption and further monitoring.

Type: Grant

Filed: June 21, 2022

Date of Patent: April 2, 2024

Assignee: International Business Machines Corporation

Inventors: Peng Hui Jiang, Jun Su, Sheng Yan Sun, Hong Mei Zhang, Meng Wan

INTELLIGENTLY SCALING DATABASE AS A SERVICE RESOURCES IN A CLOUD PLATFORM

Publication number: 20240103896

Abstract: A computer-implemented method, system and computer program product for scaling a resource of a Database as a Service (DBaaS) cluster in a cloud platform. User service requests from a service cluster to be processed by the DBaaS cluster are received. A first set of tracing data is generated from the user service requests by a service mesh, which facilitates service-to-service communication between the service cluster and the DBaaS cluster. A second set of tracing data is generated by the DBaaS cluster from handling the user service requests. A dependency tree is then generated to discover application relationships to identify potential bottlenecks in nodes of the DBaaS cluster based on these sets of tracing data. The pod(s) of a DBaaS node are then scaled based on the dependency tree, which is used, in part, to predict the utilization of the resources of the DBaaS node identified as being a potential bottleneck.

Type: Application

Filed: September 24, 2022

Publication date: March 28, 2024

Inventors: Peng Hui Jiang, Yue Wang, Jun Su, Su Liu, Sheng Yan Sun

Database compression oriented to combinations of record fields

Patent number: 11940998

Abstract: This disclosure provides a computer-implemented method, a computer system and a computer program product for database compression oriented to combinations of fields of a database record. One or more combinations of fields of a record of a database are determined that satisfy a frequency criterion indicating that access frequencies of the one or more combinations of fields are higher than an access frequency threshold. The record is reorganized based on the one or more combinations of fields to store fields of each combination of the one or more combinations of fields in a respective contiguous storage space. The reorganized record is compressed by applying a compression scheme to the one or more combinations of fields.

Type: Grant

Filed: June 10, 2022

Date of Patent: March 26, 2024

Assignee: International Business Machines Corporation

Inventors: Ying Zhang, Xiaobo Wang, Shuo Li, Sheng Yan Sun

Database distribution to avoid contention

Patent number: 11940975

Abstract: A computer-implemented method that includes receiving an ingestion request to ingest data to a database comprising physical shards and detecting that the ingestion request is directed to a first hotspot shard. The first hotspot shard has a contention level over a threshold value. The method also detects context characteristics within the data and generates a first virtual shard based on a first virtual shard key selected from the detected context characteristics. The first virtual shard virtually duplicates at least a portion of the first hotspot shard. The method also includes ingesting the data to the first virtual shard.

Type: Grant

Filed: September 28, 2020

Date of Patent: March 26, 2024

Assignee: International Business Machines Corporation

Inventors: Shuo Li, Peng Hui Jiang, Xiaobo Wang, Sheng Yan Sun

Log content modeling

Patent number: 11934359

Abstract: A method, a computer system, and a computer program product are provided for computer log management. In one embodiment, in response to receiving a log request from a user, an input content is analyzed and adjusted according to input contents and the user's previous activities. A similarity analysis and a fairness analysis are performed to determine similarities between the input content, as adjusted, and a plurality of log records in an object library. The similarity analysis includes analyzing any patterns and attributes. The attributes have a dimension, and each dimension has a predefined weight (W). The fairness analysis ensures that one type of log is not favored over others. A best possible match is then determined, and one or more logs providing the best possible match are presented to the user.

Type: Grant

Filed: November 18, 2022

Date of Patent: March 19, 2024

Assignee: International Business Machines Corporation

Inventors: Sheng Yan Sun, Peng Hui Jiang, Meng Wan, Hong Mei Zhang

SOURCE CODE REPOSITORY DEBUG CHAINING

Publication number: 20240086306

Abstract: One or more computer processors generate a debug chain from one or more similar resource bound breakpoints, wherein the debug chain provides dynamic code flow. The one or more computer processors distribute the generated debug chain to one or more tenants.

Type: Application

Filed: September 13, 2022

Publication date: March 14, 2024

Inventors: Peng Hui Jiang, Jun Su, Sheng Yan Sun, Hong Mei Zhang, Meng Wan

Dynamically changing query mini-plan with trustworthy AI

Patent number: 11914594

Abstract: A disclosed database system and enhanced methods implement enhanced mini-plans and dynamically changing a query mini-plan with trustworthy Artificial Intelligence (AI) to improve query execution performance in a database system. An AI cost model evaluates candidate mini-plans for executing a query. AI truth monitors evaluate the execution of the mini-plans, such as predicted input factors and adjusted mini-plans of one or more AI running data models. The AI truth monitors provide feedback to adjust the AI cost model based on evaluating the execution of the mini-plans. The AI truth monitors validate adjusted mini-plans, provide feedback to the AI cost model with improved overall prediction accuracy, and enhanced mini-plans to gain query performance.

Type: Grant

Filed: December 28, 2022

Date of Patent: February 27, 2024

Assignee: International Business Machines Corporation

Inventors: Hong Mei Zhang, Meng Wan, Sheng Yan Sun, Peng Hui Jiang

Column based database locks

Patent number: 11914573

Abstract: Disclosed are techniques for relational database locks based on columns. Database transactions may be targeted to specific columns of one or more records, instead of the entire row for those records, using primary keys. Column locks on specific keys are stored separately from column locks on ranges of keys, and both are checked when requesting a new column lock for either a single key or a range of keys. When a threshold number of columns for a given record, or range of records/keys, have been locked, the column locks for that record, or range of records, can be combined into a single row-level lock to reduce resource costs for maintaining multiple concurrent locks.
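The escalation step in this abstract can be sketched roughly as follows. This is a simplified illustration under stated assumptions: it handles single-key locks only (range locks are omitted), it escalates only when one owner holds every column lock on the row, and the names `ColumnLockTable`, `lock`, and `escalation_threshold` are hypothetical.

```python
from collections import defaultdict


class ColumnLockTable:
    """Hypothetical single-key column lock table with row-lock escalation."""

    def __init__(self, escalation_threshold: int):
        self.threshold = escalation_threshold
        self.column_locks = defaultdict(dict)  # key -> {column: owner}
        self.row_locks = {}                    # key -> owner

    def lock(self, owner: str, key: int, column: str) -> bool:
        # A row-level lock covers every column of that record.
        row_owner = self.row_locks.get(key)
        if row_owner is not None:
            return row_owner == owner
        holder = self.column_locks[key].get(column)
        if holder is not None and holder != owner:
            return False  # column already locked by another transaction
        self.column_locks[key][column] = owner
        self._maybe_escalate(owner, key)
        return True

    def _maybe_escalate(self, owner: str, key: int) -> None:
        columns = self.column_locks[key]
        # Assumption: escalate only if this owner holds all column locks.
        if len(columns) >= self.threshold and all(
            o == owner for o in columns.values()
        ):
            self.row_locks[key] = owner
            del self.column_locks[key]


shared = ColumnLockTable(escalation_threshold=2)
escalated = ColumnLockTable(escalation_threshold=2)
escalated.lock("tx1", 1, "name")
escalated.lock("tx1", 1, "email")  # second column reaches the threshold
```

Two transactions can hold locks on different columns of the same row, while a transaction that reaches the threshold on its own ends up holding one row lock instead of several column locks.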

Type: Grant

Filed: March 23, 2022

Date of Patent: February 27, 2024

Assignee: International Business Machines Corporation

Inventors: Shuo Li, Xiaobo Wang, Hong Mei Zhang, Sheng Yan Sun

Automated partitioning of a distributed database system

Patent number: 11914586

Abstract: An embodiment includes generating a partition schema for a distributed database based on historical usage data indicative of usage of the distributed database, where the generating of the partition schema comprises determining a partition range of a partition of the partition schema. The embodiment also includes generating a node identifier for the partition using a hash function and a first weight value assigned to the partition. The embodiment also includes monitoring performance data indicative of a performance of the distributed database, the monitoring comprising detecting a failure of the performance to satisfy a performance threshold. The embodiment also includes initiating, responsive to detecting the failure, a redistribution procedure by changing the node identifier of the partition by replacing the first weight value with a second weight value.
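One way to picture the weighted node identifier and the redistribution step is the sketch below. The abstract does not say how the hash and the weight are combined or how the second weight is chosen, so the SHA-256 construction, the linear search for a new weight, and the names `node_id` and `rebalance_if_slow` are all assumptions made for illustration.

```python
import hashlib


def node_id(partition: str, weight: int, num_nodes: int) -> int:
    """Map a partition to a node by hashing its name together with a weight."""
    digest = hashlib.sha256(f"{partition}:{weight}".encode()).digest()
    return int.from_bytes(digest[:8], "big") % num_nodes


def rebalance_if_slow(partition: str, weight: int, num_nodes: int,
                      latency_ms: float, threshold_ms: float) -> int:
    """Return the weight to use next; it changes only when performance fails."""
    if latency_ms <= threshold_ms:
        return weight  # performance threshold satisfied, keep the placement
    current = node_id(partition, weight, num_nodes)
    # Assumed strategy: try successive weights until the partition moves.
    for candidate in range(weight + 1, weight + 1001):
        if node_id(partition, candidate, num_nodes) != current:
            return candidate
    raise RuntimeError("no weight found that moves the partition")


kept = rebalance_if_slow("orders_2022", 1, 4, latency_ms=10, threshold_ms=50)
moved = rebalance_if_slow("orders_2022", 1, 4, latency_ms=80, threshold_ms=50)
```

Replacing the first weight with the second changes the hash input, which is what moves the partition to a different node in this sketch.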

Type: Grant

Filed: March 31, 2022

Date of Patent: February 27, 2024

Assignee: International Business Machines Corporation

Inventors: Hong Mei Zhang, Sheng Yan Sun, Meng Wan, Peng Hui Jiang

INTELLIGENT WORKLOAD ROUTING FOR MICROSERVICES

Publication number: 20240062069

Abstract: A system may include a memory and a processor in communication with the memory. The processor may be configured to perform operations. The operations may include training a model. The operations may include enhancing the model with reinforcement learning and improving stability of the model with a graph neural network model. The operations may include predicting, with the model, a resource cost of a node and deploying the node.

Type: Application

Filed: August 19, 2022

Publication date: February 22, 2024

Inventors: Peng Hui Jiang, Sheng Yan Sun, Jun Su, Su Liu, Jeremy R. Fox, Hamid Majdabadi

PERFORMING AN OPERATION IN A TREE STRUCTURE

Publication number: 20240045852

Abstract: A computer-implemented method for performing an operation in a tree structure is provided according to embodiments of the present disclosure. In the computer-implemented method, an operation to be performed in a tree structure may be received. The tree structure may comprise a plurality of non-leaf nodes and a plurality of leaf nodes. The operation may be associated with a record comprising a pair of key and value. One of the non-leaf nodes may be determined based on the key of the record. Then, the operation may be performed in the determined non-leaf node.

Type: Application

Filed: August 8, 2022

Publication date: February 8, 2024

Inventors: Shuo Li, Xiaobo Wang, Leilei Li, Ping Wang, Sheng Yan Sun

BUILDING AND USING A SPARSE TIME SERIES DATABASE (TSDB)

Publication number: 20240045878

Abstract: Provided are techniques for building and using a sparse Time Series Database (TSDB). Time series records are received from a native TSDB, where each of the time series records includes a timestamp and one or more tags. Timeslots are determined for shards for the sparse TSDB based on the timestamp included in each of the time series records. The sparse TSDB is built by creating the shards for the determined timeslots and storing the time series records in the shards, while filling in empty ranges in the shards. A query that specifies at least one of the one or more tags is received. It is determined whether to execute the query against the sparse TSDB, and, in response to a determination to execute the query against the sparse TSDB, the query is executed against the sparse TSDB to generate results that are returned.
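The timeslot-to-shard step can be illustrated with a short sketch. It assumes fixed-width timeslots, timestamps in seconds, and records shaped as `(timestamp, tags, value)` tuples; the function names are hypothetical, and the filling of empty ranges mentioned in the abstract is not modeled.

```python
from collections import defaultdict


def build_shards(records, slot_seconds):
    """Group time series records into shards keyed by timeslot start time."""
    shards = defaultdict(list)
    for timestamp, tags, value in records:
        slot_start = timestamp - timestamp % slot_seconds
        shards[slot_start].append((timestamp, tags, value))
    return dict(shards)


def query_by_tag(shards, tag, tag_value):
    """Return records whose tag matches, scanning shards in time order."""
    return [
        record
        for slot_start in sorted(shards)
        for record in shards[slot_start]
        if record[1].get(tag) == tag_value
    ]


records = [
    (5, {"host": "a"}, 1.0),
    (65, {"host": "b"}, 2.0),
    (70, {"host": "a"}, 3.0),
]
shards = build_shards(records, slot_seconds=60)
host_a = query_by_tag(shards, "host", "a")
```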

Type: Application

Filed: August 5, 2022

Publication date: February 8, 2024

Inventors: Peng Hui Jiang, Jun Su, Sheng Yan Sun, Hong Mei Zhang, Meng Wan

Enhancing database query processing

Patent number: 11893020

Abstract: A system, program product, and method for enhancing automatic multidimensional query processing. The method includes executing a database query including semi-joining a plurality of dimension tables with a fact table. The method also includes identifying for extraction one or more data values from each dimension table of the plurality of dimension tables. The data values from each dimension table of the plurality of dimension tables are associated with a respective record identification (RID), thereby defining one or more RIDs. The method further includes generating a plurality of RID lists. Each RID list of the plurality of RID lists includes a collection of the one or more RIDs for the respective dimension table. The method also includes merging the plurality of RID lists, sorting, subject to the merging, the plurality of RIDs as a function of data location, and fetching the data values from the fact table.
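The merge, sort, and fetch sequence can be sketched as below. Two points are assumptions rather than statements from the abstract: merging is modeled as an intersection (a fact row must match every dimension), and a RID is modeled as a `(page, slot)` pair so that sorting RIDs orders them by data location. The function names are hypothetical.

```python
def merge_rid_lists(rid_lists):
    """Intersect the per-dimension RID lists (assumed merge semantics)."""
    merged = set(rid_lists[0])
    for rids in rid_lists[1:]:
        merged &= set(rids)
    return merged


def fetch_in_location_order(fact_table, rid_lists):
    """Fetch matching fact rows in (page, slot) order to favor sequential I/O."""
    ordered_rids = sorted(merge_rid_lists(rid_lists))
    return [fact_table[rid] for rid in ordered_rids]


fact_table = {(1, 0): "row_a", (1, 2): "row_b", (0, 5): "row_c"}
rid_lists = [
    [(1, 2), (0, 5), (1, 0)],  # RIDs qualifying for the first dimension
    [(0, 5), (1, 2)],          # RIDs qualifying for the second dimension
]
rows = fetch_in_location_order(fact_table, rid_lists)
```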

Type: Grant

Filed: January 7, 2022

Date of Patent: February 6, 2024

Assignee: International Business Machines Corporation

Inventors: Sheng Yan Sun, Xiaobo Wang, Hong Mei Zhang, Shuo Li

PREFETCHING MANAGEMENT IN DATABASE SYSTEM

Publication number: 20240028515

Abstract: This disclosure provides a method, a computing system, and a computer program product for managing prefetching of pages in a database system. The method comprises obtaining shared information associated with page access, wherein the shared information associated with the page access includes information associated with the page access from a plurality of computing nodes. The method further comprises determining whether to prefetch a number of pages into a global buffer pool based at least on the shared information associated with the page access using a sequential prefetching method.

Type: Application

Filed: July 19, 2022

Publication date: January 25, 2024

Inventors: Sheng Yan Sun, Xiaobo Wang, Shuo Li, Chun Lei Xu

Efficient job writing for database member

Patent number: 11874830

Abstract: In a computer-implemented method for improving performance of a database, a processor receives batch jobs for a relational database. The batch jobs may include a first member with a first buffer pool, and a second member with a second buffer pool. The processor may also identify a first actual object and an isolation level for the batch jobs, generate related queries based on the first actual object and the isolation level, calculate a cost for the first member and the second member to run the batch jobs based on the related queries, and assign the batch jobs to the first member based on a lower calculated cost.

Type: Grant

Filed: March 9, 2022

Date of Patent: January 16, 2024

Assignee: International Business Machines Corporation

Inventors: Sheng Yan Sun, Hong Mei Zhang, Meng Wan, Peng Hui Jiang

Buffer pool management

Patent number: 11874765

Abstract: A processor may allocate a first buffer segment from a buffer pool. The first buffer segment may be configured with a first contiguous range of memory for a first data partition of a data table. The first data partition comprises a first plurality of data blocks. A processor may store the first plurality of data blocks in order into the first buffer segment. A processor may retrieve a target data block of the first plurality of data blocks from the first buffer segment in response to a data access request for the target data block.
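Storing blocks in order in a contiguous range means a block can be located by offset arithmetic alone. The sketch below illustrates that idea under assumptions of its own: fixed-size blocks, consecutive block identifiers within a partition, and a hypothetical `BufferSegment` class backed by a Python `bytearray`.

```python
class BufferSegment:
    """Hypothetical contiguous buffer segment holding one partition's blocks."""

    def __init__(self, first_block_id: int, block_size: int, num_blocks: int):
        self.first_block_id = first_block_id
        self.block_size = block_size
        self.num_blocks = num_blocks
        self.memory = bytearray(block_size * num_blocks)  # one contiguous range

    def store(self, blocks) -> None:
        """Copy the partition's blocks into the segment in order."""
        size = self.block_size
        for index, block in enumerate(blocks):
            if index >= self.num_blocks or len(block) > size:
                raise ValueError("block does not fit in the segment")
            # Pad short blocks with zero bytes so every slot has equal width.
            self.memory[index * size:(index + 1) * size] = block.ljust(size, b"\0")

    def get(self, block_id: int) -> bytes:
        """Locate a block by offset; no per-block lookup table is needed."""
        index = block_id - self.first_block_id
        if not 0 <= index < self.num_blocks:
            raise KeyError(block_id)
        size = self.block_size
        return bytes(self.memory[index * size:(index + 1) * size])


segment = BufferSegment(first_block_id=100, block_size=4, num_blocks=3)
segment.store([b"aaaa", b"bbbb", b"cc"])
target = segment.get(101)
```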

Type: Grant

Filed: May 28, 2021

Date of Patent: January 16, 2024

Assignee: International Business Machines Corporation

Inventors: Shuo Li, Xiaobo Wang, Sheng Yan Sun, Hong Mei Zhang

Dynamically inheriting accumulated attribution

Patent number: 11853697

Abstract: An approach is provided in which a method, system, and program product build a time series prediction model based on one or more relationships between a first set of keywords in a set of first news articles and a second set of keywords in a set of second news articles. The time series prediction model includes a time-based interest level adjustment corresponding to a publication time between the set of first news articles and the set of second news articles. The method, system, and program product use the time series prediction model to compute an inherited initial interest level of a third news article that includes a set of new keywords based on the set of new keywords and the time-based interest level adjustment. The method, system, and program product assign the inherited initial interest level to the third news article.

Type: Grant

Filed: April 23, 2021

Date of Patent: December 26, 2023

Assignee: International Business Machines Corporation

Inventors: Shuo Li, June-Ray Lin, Sheng Yan Sun, Xiaobo Wang

DATA MANAGEMENT

Publication number: 20230409602

Abstract: Embodiments of the present disclosure relate to a method, system, and computer program product for data management. According to the method, one or more processors divide data into a plurality of partitions. The one or more processors store the plurality of partitions in a plurality of nodes of a mixed distributed database system, wherein a first node of the mixed distributed database system comprises a plurality of databases, and wherein at least a part of the plurality of partitions is shared by the plurality of databases of the first node and is not shared by other nodes of the plurality of nodes.

Type: Application

Filed: June 21, 2022

Publication date: December 21, 2023

Inventors: Hong Mei Zhang, Sheng Yan Sun, Meng Wan, Peng Hui Jiang
