Patents by Inventor Jing James Xu

Jing James Xu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250117443
    Abstract: A computer-implemented method for performing data difference evaluation is provided. Aspects include obtaining a first data set and a second data set, creating a first plurality of feature vectors by inputting the first data set into each of a plurality of models, and creating a second plurality of feature vectors by inputting the second data set into each of the plurality of models. Aspects also include identifying a mapping between elements of the first plurality of vectors and elements the second plurality of feature vectors created by a same model of the plurality of models, calculating, for each of the plurality of models based at least in part on the mapping, a model distance between the first data set and the second data set, and calculating, based at least in part on the model distances, an ensemble distance between first data set and the second data set.
    Type: Application
    Filed: October 9, 2023
    Publication date: April 10, 2025
    Inventors: Lei Tian, Han Zhang, Jing James Xu, Xue Ying Zhang, Si Er Han
  • Patent number: 12249012
    Abstract: A method, computer system, and a computer program product are provided for post-modeling feature evaluation. In one embodiment, at least at least one post model visual output and associated data is obtained that at least includes an individual conditional expectation (ICE) plot and a partial dependence (PDP) plot. Using the associated data and the plots, a Feature Importance (PI) plot is provided. A plurality of features is then determined for each PI, PDP and ICE plots to calculate at least one Interesting Value for each plot. An overall score is also calculated for each plurality of features based on the associated Interesting Values for each PDP, ICE and PI plots. At least one top feature is selected based on said scores. A final plot is then generated at least reflecting the top feature. The final plot combines the PI, PDP and ICE plots together.
    Type: Grant
    Filed: November 17, 2022
    Date of Patent: March 11, 2025
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Xiao Ming Ma, Wen Pei Yu, Jing James Xu, Xue Ying Zhang, Si Er Han, Jing Xu, Jun Wang
  • Publication number: 20240427684
    Abstract: A computer-implemented method, a system and a computer program product for abnormal point simulation are disclosed. A processor analyzes a plurality of data blocks in first time series data to determine traits of respective data blocks. For the respective data blocks, a processor simulates one or more abnormal points based on the traits of the respective data blocks.
    Type: Application
    Filed: June 20, 2023
    Publication date: December 26, 2024
    Inventors: Si Er Han, Xiao Ming Ma, Jun Wang, Wen Pei Yu, Xue Ying Zhang, Jing James Xu, Jing Xu
  • Publication number: 20240411783
    Abstract: A computer-implemented method for treating post-modeling data includes computing, sequentially for each category of a feature, a category importance (CI) value. The CI value is based on a model accuracy change when records of a category being examined are reassigned to a remaining set of categories of the feature according to a cumulative distribution of records among the remaining set of categories of the feature, wherein the remaining set of categories include all categories of the feature, except for the category being examined. A post-modeling category is performed to merge of each category having the CI value less than a CI value threshold.
    Type: Application
    Filed: June 12, 2023
    Publication date: December 12, 2024
    Inventors: Xue Ying Zhang, Si Er Han, Jing Xu, Xiao Ming Ma, Wen Pei Yu, Jing James Xu, Jun Wang, Ji Hui Yang
  • Patent number: 12153953
    Abstract: Mechanisms are provided for intelligently identifying an execution environment to execute a computing job. An execution time of the computing job in each execution environment of a plurality of execution environments is predicted by applying a set of existing machine learning models matching execution context information and key parameters of the computing job and execution environment information of the execution environment. The predicted execution time of the machine learning models is aggregated. The aggregated predicted execution times of the computing job are summarized for the plurality of execution environments. Responsive to a selection of an execution environment from the plurality of execution environments based on the summary of the aggregated predicted execution times of the computing job, the computing job is executed in the selected execution environment. Related data during the execution of the computing job in the selected execution environment is collected.
    Type: Grant
    Filed: April 8, 2021
    Date of Patent: November 26, 2024
    Assignee: International Business Machines Corporation
    Inventors: A Peng Zhang, Lei Gao, Jin Wang, Jing James Xu, Jun Wang, Dong Hai Yu
  • Patent number: 12056622
    Abstract: A method for identifying influential effects that contribute most to a status change of a target index for goal seeking analysis. The method includes generating a candidate list of significant changed predictors between the normal and abnormal status time periods in collected data, and building a plurality of regression models from the collected data. The method determines a first value (trend value or Pearson correlation value) for each of the significant changed predictors based on whether at least one of the significant changed predictors have a significant change trend using the regression models. The method obtains a second predictor importance value for each of the significant changed predictors from a single model built on all the collected data. The method generates a final predictor value for each of the significant changed predictors by combining the first value with the second predictor importance value for each of the significant changed predictors.
    Type: Grant
    Filed: February 3, 2021
    Date of Patent: August 6, 2024
    Assignee: International Business Machines Corporation
    Inventors: Jing James Xu, Lei Gao, A Peng Zhang, Rui Wang, Si Er Han, Xiao Ming Ma
  • Publication number: 20240256637
    Abstract: A computer implemented method manages an ensemble model system to classify records. A number of processor units cluster records into groups of records based on classification predictions generated by base models in the ensemble model system for the records. The number of processor units determines sets of weights for the base models that increase a probability that the base models in the ensemble model system correctly predict the groups of records. Each set of weights in the sets of weights is associated with a group of records in the groups of records.
    Type: Application
    Filed: January 27, 2023
    Publication date: August 1, 2024
    Inventors: Si Er Han, Xue Ying Zhang, Jing Xu, Jing James Xu, Xiao Ming Ma, Wen Pei Yu, Jun Wang, Ji Hui Yang
  • Patent number: 12014026
    Abstract: Using a set of menu to key process mappings, historical menu usage data for an application is aggregated into aggregated key process usage data. A set of key process association rules, each comprising a consequent key process given a particular antecedent key process, is generated. From the set of key process association rules and a set of ranked menus by frequency of usage within each key process, a set of model menu recommendations is generated. According to an application usage history, a menu frequency ratio, and a confidence value of a modelled next menu, the set of menu recommendations is scored. A scored menu recommendation having a rank below a threshold rank is pruned from a set of menu items of the application ranked according to their scores. The pruned set of scored menu recommendations is presented for selection instead of the set of menu items.
    Type: Grant
    Filed: April 21, 2023
    Date of Patent: June 18, 2024
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Long Fan, Yang Yang, Ye Fan, Juan Wu, Qi Mao, Jing James Xu
  • Publication number: 20240193830
    Abstract: In an approach for post-modeling data visualization and analysis, a processor presents a first visualization of a training dataset in a first plot. Responsive to receiving a selection of a data group of the training dataset to analyze, a processor identifies three or fewer key model features of the data group of the training dataset. A processor ascertains a representative record of each key model feature of the three or fewer key model features using a Local Interpretable Model-Agnostic Explanation technique. A processor presents a second visualization of the three or fewer key model features and the representative record of each key model feature in a second plot.
    Type: Application
    Filed: December 13, 2022
    Publication date: June 13, 2024
    Inventors: Wen Pei Yu, Xiao Ming Ma, Xue Ying Zhang, Si Er Han, Jing James Xu, Jing Xu, Jun Wang
  • Publication number: 20240169614
    Abstract: A method, computer system, and a computer program product are provided for post-modeling feature evaluation. In one embodiment, at least at least one post model visual output and associated data is obtained that at least includes an individual conditional expectation (ICE) plot and a partial dependence (PDP) plot. Using the associated data and the plots, a Feature Importance (PI) plot is provided. A plurality of features is then determined for each PI, PDP and ICE plots to calculate at least one Interesting Value for each plot. An overall score is also calculated for each plurality of features based on the associated Interesting Values for each PDP, ICE and PI plots. At least one top feature is selected based on said scores. A final plot is then generated at least reflecting the top feature. The final plot combines the PI, PDP and ICE plots together.
    Type: Application
    Filed: November 17, 2022
    Publication date: May 23, 2024
    Inventors: Xiao Ming Ma, Wen Pei Yu, Jing James Xu, Xue Ying Zhang, Si Er Han, Jing Xu, Jun Wang
  • Patent number: 11971796
    Abstract: An approach is provided in which the approach builds a combination model that includes a normal status model and an abnormal status model. The normal status model is built from a set of time-sequenced normal status records and the abnormal status model is built from a set of time-sequenced abnormal status records. The approach computes a set of time-sequenced coefficient combination values of the normal status model and the abnormal status model based on applying a set of fitting coefficient characteristics to the normal status model and the abnormal status model. The approach performs goal seek analysis on a system using the combination model and the set of time-sequenced coefficient combination values.
    Type: Grant
    Filed: May 18, 2021
    Date of Patent: April 30, 2024
    Assignee: International Business Machines Corporation
    Inventors: Xiao Ming Ma, Si Er Han, Lei Gao, A Peng Zhang, Chun Lei Xu, Rui Wang, Jing James Xu
  • Patent number: 11966340
    Abstract: To automate time series forecasting machine learning pipeline generation, a data allocation size of time series data may be determined based on one or more characteristics of a time series data set. The time series data may be allocated for use by candidate machine learning pipelines based on the data allocation size. Features for the time series data may be determined and cached by the candidate machine learning pipelines. Predictions of each of the candidate machine learning pipelines using at least the one or more features may be evaluated. A ranked list of machine learning pipelines may be automatically generated from the candidate machine learning pipelines for time series forecasting based upon evaluating predictions of each of the one or more candidate machine learning pipelines.
    Type: Grant
    Filed: March 15, 2022
    Date of Patent: April 23, 2024
    Assignee: International Business Machines Corporation
    Inventors: Long Vu, Bei Chen, Xuan-Hong Dang, Peter Daniel Kirchner, Syed Yousaf Shah, Dhavalkumar C. Patel, Si Er Han, Ji Hui Yang, Jun Wang, Jing James Xu, Dakuo Wang, Gregory Bramble, Horst Cornelius Samulowitz, Saket K. Sathe, Wesley M. Gifford, Petros Zerfos
  • Patent number: 11947449
    Abstract: Embodiments of the present disclosure relate to a method, system and computer program product for semantic search based on a graph database. In some embodiments, a method is disclosed. According to the method, the user jobs of a user are obtained from a first software product. Based on the user jobs, target test cases are selected from a plurality of test cases associated with the first software product and a second software product. The target test cases are applied to the first software product and the second software product, and in accordance with a determination that a result of applying the target test cases satisfies a predetermined criterion, an instruction is provided to indicate migrating from the first software product to the second software product. In other embodiments, a system and a computer program product are disclosed.
    Type: Grant
    Filed: July 7, 2022
    Date of Patent: April 2, 2024
    Assignee: International Business Machines Corporation
    Inventors: Lei Gao, Jin Wang, A Peng Zhang, Kai Li, Jun Wang, Jing James Xu, Rui Wang, Xin Feng Zhu
  • Patent number: 11928156
    Abstract: Obtain, at a computing device, a segment of computer code. With a classification module of a machine learning system executing on the computing device, determine a required annotation category for the segment of computer code. With an annotation generation module of the machine learning system executing on the computing device, generate a natural language annotation of the segment of computer code based on the segment of computer code and the required annotation category. Provide the natural language annotation to a user interface for display adjacent the segment of computer code.
    Type: Grant
    Filed: November 3, 2020
    Date of Patent: March 12, 2024
    Assignee: International Business Machines Corporation
    Inventors: Dakuo Wang, Lingfei Wu, Xuye Liu, Yi Wang, Chuang Gan, Jing Xu, Xue Ying Zhang, Jun Wang, Jing James Xu
  • Patent number: 11907099
    Abstract: Embodiments of the present invention disclose a method, computer program product, and system for estimating the results of a performance test on an updated software application. A method, the method comprising receiving an updated software application, wherein the size of the updated software application is a first size and generating a plurality of small probe, wherein the size of each of the small probe data is a second size, wherein the second size is less than the first size. Conducting a first performance test on the plurality of small probe data and calculating an estimated elapsed time for a performance test on the updated software application. Conducting the performance test on the updated software application and determining if the updated software is given a PASS or FAIL for the performance test, based in part on the elapsed time of the performance test on the updated software application.
    Type: Grant
    Filed: September 22, 2020
    Date of Patent: February 20, 2024
    Assignee: International Business Machines Corporation
    Inventors: Yao Dong Liu, Jing James Xu, Jiang Bo Kang, Dong Hai Yu, Jun Wang
  • Publication number: 20240054211
    Abstract: Detecting anomalous data by applying a plurality of models to a data set to yield detection results including anomalous data, applying evaluation methods to the detection results for each of the plurality of models, determining a combined score for the detection results according to the evaluation methods, determining a combined score threshold, and defining a set of detected anomalies according to the combined score and the combined score threshold.
    Type: Application
    Filed: August 10, 2022
    Publication date: February 15, 2024
    Inventors: Jing Xu, Xue Ying Zhang, Si Er Han, Jing James Xu, Xiao Ming Ma, Wen Pei Yu
  • Patent number: 11893499
    Abstract: Automated development and training of deep forest models for analyzing data by growing a random forest of decision trees using data, determining Out-of-bag (OOB) predictions for the forest, appending the OOB predictions to the data set, and growing an additional forest using the data set including the appended OOB predictions, and combining the output of the additional forest, then utilizing the model to classify data outside the training data set.
    Type: Grant
    Filed: March 12, 2019
    Date of Patent: February 6, 2024
    Assignee: International Business Machines Corporation
    Inventors: Jing Xu, Rui Wang, Xiao Ming Ma, Ji Hui Yang, Xue Ying Zhang, Jing James Xu, Si Er Han
  • Publication number: 20240012746
    Abstract: Embodiments of the present disclosure relate to a method, system and computer program product for semantic search based on a graph database. In some embodiments, a method is disclosed. According to the method, the user jobs of a user are obtained from a first software product. Based on the user jobs, target test cases are selected from a plurality of test cases associated with the first software product and a second software product. The target test cases are applied to the first software product and the second software product, and in accordance with a determination that a result of applying the target test cases satisfies a predetermined criterion, an instruction is provided to indicate migrating from the first software product to the second software product. In other embodiments, a system and a computer program product are disclosed.
    Type: Application
    Filed: July 7, 2022
    Publication date: January 11, 2024
    Inventors: Lei Gao, Jin Wang, A PENG ZHANG, Kai Li, Jun Wang, Jing James Xu, Rui Wang, Xin Feng Zhu
  • Publication number: 20230394326
    Abstract: Embodiments of the present disclosure relate to a method, system, and computer program product for predictive models. According to the method, a processor may provide a first list including at least one input variable of a predictive model and a second list including a plurality of variables of the predictive model. For each of input variables in the second list, the processor may determine contribution of the input variable to prediction of the predictive model with respect to the at least one input variable in the first list. The processor may update the first list by moving an input variable in the second list into the first list based on the determined contribution of the plurality of input variables. The processor may render one or more of input variables in the updated first list based on an order of the input variables in the updated first list.
    Type: Application
    Filed: June 1, 2022
    Publication date: December 7, 2023
    Inventors: Si Er Han, Xue Ying Zhang, Xiao Ming Ma, Wen Pei Yu, Jing Xu, Jing James Xu, Rui Wang
  • Publication number: 20230367689
    Abstract: Disclosed are a computer-implemented method, a system and a computer program product for model exploration. Model feature importance of each model of a plurality of models can be obtained, the plurality of models can be grouped into a plurality of model clusters based on the model feature importance of each model, and the model feature importance can be presented by box-plot or confidence interval.
    Type: Application
    Filed: May 15, 2022
    Publication date: November 16, 2023
    Inventors: Jing Xu, Xue Ying Zhang, Si Er Han, Jing James Xu, Xiao Ming Ma, Jun Wang, Wen Pei Yu