Patents by Inventor Jian Ling Shi

Jian Ling Shi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250202970
    Abstract: A computer-implemented method may include generating a rule operator comprising a rule table; generating a rule operator group comprising rule operators containing similar rule tables; communicating the rule operator group to a shared compute node; retrieving table data associated with the rule table of a first rule operator within the rule operator group; reusing the table data associated with the first rule operator in a first runtime of additional rule operators corresponding to the rule operator group; determining at least one checker metric associated with a second runtime of a data rule; determining a workload state for the shared compute node based on the at least one checker metric; determining time taken and resource usage for a data rule during the second runtime; and balancing a workload of each compute node based on the workload state, time taken, and resource usage for the data rule during the second runtime.
    Type: Application
    Filed: December 14, 2023
    Publication date: June 19, 2025
    Inventors: Chun Hua Sun, Xu Bin Cai, Yi Yang Ren, Wei Wang, Pin Lv, Jian Ling Shi, Chun Leng
  • Publication number: 20240386032
    Abstract: New data class generation is provided. A dimension score is generated for each respective dimension of a plurality of predefined dimensions as relating to column attributes of a data asset while performing a static reference data analysis of the data asset. The dimension score of each respective dimension is added together to obtain a total dimension score for the data asset. It is determined whether the total dimension score of the data asset is greater than a predefined minimum dimension score threshold level. The data asset is identified as new static reference data in response to determining that the total dimension score of the data asset is greater than the predefined minimum dimension score threshold level. A new data class is generated based on the new static reference data.
    Type: Application
    Filed: May 15, 2023
    Publication date: November 21, 2024
    Inventors: Chun Hua Sun, Xu Bin Cai, Chun Leng, Wei Wang, Yi Yang Ren, Jian Ling Shi, Pin Lv, Xin Yu Wang, Yi Wang, Tao Zhuang
  • Publication number: 20240320234
    Abstract: An approach is disclosed that receives a new ETL job. The job includes a number of intermediate database files descriptors corresponding to a plurality of intermediate database files that are used to accomplish the new ETL. A new data lineage graph is created that pertains to the new ETL job. The new data lineage graph is compared to a number of existing data lineage graphs with each of the existing data lineage graphs corresponding to an existing ETL job. The approach substitutes existing database files found in the existing data lineage graphs for one or more intermediate database files found in the new data lineage graph. The new ETL job is then run by utilizing the substituted database files, the result being a new final database file.
    Type: Application
    Filed: March 24, 2023
    Publication date: September 26, 2024
    Inventors: Yi Yang Ren, Chun Hua Sun, Xu Bin Cai, Wei Wang, Jian Ling Shi, Chun Leng, Pin Lv
  • Patent number: 11972368
    Abstract: Methods, systems, computer program products for determining the source of activity during interaction with a user interface are provided. The method comprises selecting one or more input devices from a plurality of available input devices coupled to the user interface and receiving respective measurements for the selected one or more input devices. Based on the received respective measurements, respective feature vectors for the one or more input devices are generated and then inputted to a pre-defined regression model. Then, the source of activity is determined based on a result received from the regression model.
    Type: Grant
    Filed: September 20, 2019
    Date of Patent: April 30, 2024
    Assignee: International Business Machines Corporation
    Inventors: Liang LL Lu, Sun Chun Hua, Jian Ling Shi, Yi Yang Ren
  • Publication number: 20230385252
    Abstract: An approach is provided that retrieves fingerprint configuration sets corresponding to a received data source and uses the configuration sets to generate fingerprints that correspond to the data source. These fingerprints are compared to a number of fingerprints that are stored in a repository. If a match is found, then the data quality configuration set is retrieved from the repository and used to perform a data quality analysis. On the other hand, if a match is not found, then one of the configuration sets is selected to perform the data quality analysis on the received data source and the repository is updated so that the selected fingerprint configuration set corresponds to the received data source.
    Type: Application
    Filed: May 25, 2022
    Publication date: November 30, 2023
    Inventors: Xu Bin Cai, Wei Wang, Chun Hua Sun, Chun Leng, Pin Lv, Yi Yang Ren, Jian Ling Shi, YI WANG, Tao Zhuang
  • Patent number: 11573983
    Abstract: Provided is a method, computer program product, and system for classifying a set of data items based on format organizations. A processor may determine at least one format organization of a set of data items. The format organization of a data item indicates a symbol type of at least one continuous symbol in the data item and a number of the at least one continuous symbol. The processor may determine at least one candidate data class for the set of data items from a plurality of predetermined data classes based on the at least one format organization. The processor may classify the set of data items into at least one target data class selected from the at least one candidate data class. In this way, the set of data items can be efficiently classified.
    Type: Grant
    Filed: July 2, 2020
    Date of Patent: February 7, 2023
    Assignee: International Business Machines Corporation
    Inventors: Liang Lu, Yue Wang, Sun Chun Hua, Jian Ling Shi, Yi Yang Ren, Chun Leng
  • Patent number: 11514013
    Abstract: A computer-implemented method includes: reading a vector of a first table in a database, the vector including counts of a plurality of keywords in the first table, the plurality of keywords including a first keyword and a second keyword; determining a first custom attribute describing the first table, the first custom attribute having a vector including counts of at least a first portion of the plurality of keywords in the first table; determining a multiplier of the first custom attribute, the multiplier being a number of other tables that reference the first custom attribute; and revising the vector of the first table based on the first custom attribute.
    Type: Grant
    Filed: January 8, 2020
    Date of Patent: November 29, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Liang Lu, Sun Chun Hua, Jian Ling Shi, Yi Yang Ren, Chun Leng
  • Publication number: 20220004566
    Abstract: Provided is a method, computer program product, and system for classifying a set of data items based on format organizations. A processor may determine at least one format organization of a set of data items. The format organization of a data item indicates a symbol type of at least one continuous symbol in the data item and a number of the at least one continuous symbol. The processor may determine at least one candidate data class for the set of data items from a plurality of predetermined data classes based on the at least one format organization. The processor may classify the set of data items into at least one target data class selected from the at least one candidate data class. In this way, the set of data items can be efficiently classified.
    Type: Application
    Filed: July 2, 2020
    Publication date: January 6, 2022
    Inventors: Liang Lu, Yue Wang, Sun Chun Hua, Jian Ling Shi, Yi Yang Ren, Chun Leng
  • Publication number: 20210209083
    Abstract: A computer-implemented method includes: reading a vector of a first table in a database, the vector including counts of a plurality of keywords in the first table, the plurality of keywords including a first keyword and a second keyword; determining a first custom attribute describing the first table, the first custom attribute having a vector including counts of at least a first portion of the plurality of keywords in the first table; determining a multiplier of the first custom attribute, the multiplier being a number of other tables that reference the first custom attribute; and revising the vector of the first table based on the first custom attribute.
    Type: Application
    Filed: January 8, 2020
    Publication date: July 8, 2021
    Inventors: Liang LU, Sun Chun HUA, Jian Ling SHI, Yi Yang REN, Chun LENG
  • Publication number: 20210089949
    Abstract: Methods, systems, computer program products for determining the source of activity during interaction with a user interface are provided. The method comprises selecting one or more input devices from a plurality of available input devices coupled to the user interface and receiving respective measurements for the selected one or more input devices. Based on the received respective measurements, respective feature vectors for the one or more input devices are generated and then inputted to a pre-defined regression model.
    Type: Application
    Filed: September 20, 2019
    Publication date: March 25, 2021
    Inventors: Liang LL Lu, Sun Chun Hua, Jian Ling Shi, Yi Yang Ren