Patents by Inventor Jing James Xu
Jing James Xu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250117443Abstract: A computer-implemented method for performing data difference evaluation is provided. Aspects include obtaining a first data set and a second data set, creating a first plurality of feature vectors by inputting the first data set into each of a plurality of models, and creating a second plurality of feature vectors by inputting the second data set into each of the plurality of models. Aspects also include identifying a mapping between elements of the first plurality of vectors and elements the second plurality of feature vectors created by a same model of the plurality of models, calculating, for each of the plurality of models based at least in part on the mapping, a model distance between the first data set and the second data set, and calculating, based at least in part on the model distances, an ensemble distance between first data set and the second data set.Type: ApplicationFiled: October 9, 2023Publication date: April 10, 2025Inventors: Lei Tian, Han Zhang, Jing James Xu, Xue Ying Zhang, Si Er Han
-
Patent number: 12249012Abstract: A method, computer system, and a computer program product are provided for post-modeling feature evaluation. In one embodiment, at least at least one post model visual output and associated data is obtained that at least includes an individual conditional expectation (ICE) plot and a partial dependence (PDP) plot. Using the associated data and the plots, a Feature Importance (PI) plot is provided. A plurality of features is then determined for each PI, PDP and ICE plots to calculate at least one Interesting Value for each plot. An overall score is also calculated for each plurality of features based on the associated Interesting Values for each PDP, ICE and PI plots. At least one top feature is selected based on said scores. A final plot is then generated at least reflecting the top feature. The final plot combines the PI, PDP and ICE plots together.Type: GrantFiled: November 17, 2022Date of Patent: March 11, 2025Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Xiao Ming Ma, Wen Pei Yu, Jing James Xu, Xue Ying Zhang, Si Er Han, Jing Xu, Jun Wang
-
Publication number: 20240427684Abstract: A computer-implemented method, a system and a computer program product for abnormal point simulation are disclosed. A processor analyzes a plurality of data blocks in first time series data to determine traits of respective data blocks. For the respective data blocks, a processor simulates one or more abnormal points based on the traits of the respective data blocks.Type: ApplicationFiled: June 20, 2023Publication date: December 26, 2024Inventors: Si Er Han, Xiao Ming Ma, Jun Wang, Wen Pei Yu, Xue Ying Zhang, Jing James Xu, Jing Xu
-
Publication number: 20240411783Abstract: A computer-implemented method for treating post-modeling data includes computing, sequentially for each category of a feature, a category importance (CI) value. The CI value is based on a model accuracy change when records of a category being examined are reassigned to a remaining set of categories of the feature according to a cumulative distribution of records among the remaining set of categories of the feature, wherein the remaining set of categories include all categories of the feature, except for the category being examined. A post-modeling category is performed to merge of each category having the CI value less than a CI value threshold.Type: ApplicationFiled: June 12, 2023Publication date: December 12, 2024Inventors: Xue Ying Zhang, Si Er Han, Jing Xu, Xiao Ming Ma, Wen Pei Yu, Jing James Xu, Jun Wang, Ji Hui Yang
-
Patent number: 12153953Abstract: Mechanisms are provided for intelligently identifying an execution environment to execute a computing job. An execution time of the computing job in each execution environment of a plurality of execution environments is predicted by applying a set of existing machine learning models matching execution context information and key parameters of the computing job and execution environment information of the execution environment. The predicted execution time of the machine learning models is aggregated. The aggregated predicted execution times of the computing job are summarized for the plurality of execution environments. Responsive to a selection of an execution environment from the plurality of execution environments based on the summary of the aggregated predicted execution times of the computing job, the computing job is executed in the selected execution environment. Related data during the execution of the computing job in the selected execution environment is collected.Type: GrantFiled: April 8, 2021Date of Patent: November 26, 2024Assignee: International Business Machines CorporationInventors: A Peng Zhang, Lei Gao, Jin Wang, Jing James Xu, Jun Wang, Dong Hai Yu
-
Patent number: 12056622Abstract: A method for identifying influential effects that contribute most to a status change of a target index for goal seeking analysis. The method includes generating a candidate list of significant changed predictors between the normal and abnormal status time periods in collected data, and building a plurality of regression models from the collected data. The method determines a first value (trend value or Pearson correlation value) for each of the significant changed predictors based on whether at least one of the significant changed predictors have a significant change trend using the regression models. The method obtains a second predictor importance value for each of the significant changed predictors from a single model built on all the collected data. The method generates a final predictor value for each of the significant changed predictors by combining the first value with the second predictor importance value for each of the significant changed predictors.Type: GrantFiled: February 3, 2021Date of Patent: August 6, 2024Assignee: International Business Machines CorporationInventors: Jing James Xu, Lei Gao, A Peng Zhang, Rui Wang, Si Er Han, Xiao Ming Ma
-
Publication number: 20240256637Abstract: A computer implemented method manages an ensemble model system to classify records. A number of processor units cluster records into groups of records based on classification predictions generated by base models in the ensemble model system for the records. The number of processor units determines sets of weights for the base models that increase a probability that the base models in the ensemble model system correctly predict the groups of records. Each set of weights in the sets of weights is associated with a group of records in the groups of records.Type: ApplicationFiled: January 27, 2023Publication date: August 1, 2024Inventors: Si Er Han, Xue Ying Zhang, Jing Xu, Jing James Xu, Xiao Ming Ma, Wen Pei Yu, Jun Wang, Ji Hui Yang
-
Patent number: 12014026Abstract: Using a set of menu to key process mappings, historical menu usage data for an application is aggregated into aggregated key process usage data. A set of key process association rules, each comprising a consequent key process given a particular antecedent key process, is generated. From the set of key process association rules and a set of ranked menus by frequency of usage within each key process, a set of model menu recommendations is generated. According to an application usage history, a menu frequency ratio, and a confidence value of a modelled next menu, the set of menu recommendations is scored. A scored menu recommendation having a rank below a threshold rank is pruned from a set of menu items of the application ranked according to their scores. The pruned set of scored menu recommendations is presented for selection instead of the set of menu items.Type: GrantFiled: April 21, 2023Date of Patent: June 18, 2024Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Long Fan, Yang Yang, Ye Fan, Juan Wu, Qi Mao, Jing James Xu
-
Publication number: 20240193830Abstract: In an approach for post-modeling data visualization and analysis, a processor presents a first visualization of a training dataset in a first plot. Responsive to receiving a selection of a data group of the training dataset to analyze, a processor identifies three or fewer key model features of the data group of the training dataset. A processor ascertains a representative record of each key model feature of the three or fewer key model features using a Local Interpretable Model-Agnostic Explanation technique. A processor presents a second visualization of the three or fewer key model features and the representative record of each key model feature in a second plot.Type: ApplicationFiled: December 13, 2022Publication date: June 13, 2024Inventors: Wen Pei Yu, Xiao Ming Ma, Xue Ying Zhang, Si Er Han, Jing James Xu, Jing Xu, Jun Wang
-
Publication number: 20240169614Abstract: A method, computer system, and a computer program product are provided for post-modeling feature evaluation. In one embodiment, at least at least one post model visual output and associated data is obtained that at least includes an individual conditional expectation (ICE) plot and a partial dependence (PDP) plot. Using the associated data and the plots, a Feature Importance (PI) plot is provided. A plurality of features is then determined for each PI, PDP and ICE plots to calculate at least one Interesting Value for each plot. An overall score is also calculated for each plurality of features based on the associated Interesting Values for each PDP, ICE and PI plots. At least one top feature is selected based on said scores. A final plot is then generated at least reflecting the top feature. The final plot combines the PI, PDP and ICE plots together.Type: ApplicationFiled: November 17, 2022Publication date: May 23, 2024Inventors: Xiao Ming Ma, Wen Pei Yu, Jing James Xu, Xue Ying Zhang, Si Er Han, Jing Xu, Jun Wang
-
Patent number: 11971796Abstract: An approach is provided in which the approach builds a combination model that includes a normal status model and an abnormal status model. The normal status model is built from a set of time-sequenced normal status records and the abnormal status model is built from a set of time-sequenced abnormal status records. The approach computes a set of time-sequenced coefficient combination values of the normal status model and the abnormal status model based on applying a set of fitting coefficient characteristics to the normal status model and the abnormal status model. The approach performs goal seek analysis on a system using the combination model and the set of time-sequenced coefficient combination values.Type: GrantFiled: May 18, 2021Date of Patent: April 30, 2024Assignee: International Business Machines CorporationInventors: Xiao Ming Ma, Si Er Han, Lei Gao, A Peng Zhang, Chun Lei Xu, Rui Wang, Jing James Xu
-
Patent number: 11966340Abstract: To automate time series forecasting machine learning pipeline generation, a data allocation size of time series data may be determined based on one or more characteristics of a time series data set. The time series data may be allocated for use by candidate machine learning pipelines based on the data allocation size. Features for the time series data may be determined and cached by the candidate machine learning pipelines. Predictions of each of the candidate machine learning pipelines using at least the one or more features may be evaluated. A ranked list of machine learning pipelines may be automatically generated from the candidate machine learning pipelines for time series forecasting based upon evaluating predictions of each of the one or more candidate machine learning pipelines.Type: GrantFiled: March 15, 2022Date of Patent: April 23, 2024Assignee: International Business Machines CorporationInventors: Long Vu, Bei Chen, Xuan-Hong Dang, Peter Daniel Kirchner, Syed Yousaf Shah, Dhavalkumar C. Patel, Si Er Han, Ji Hui Yang, Jun Wang, Jing James Xu, Dakuo Wang, Gregory Bramble, Horst Cornelius Samulowitz, Saket K. Sathe, Wesley M. Gifford, Petros Zerfos
-
Patent number: 11947449Abstract: Embodiments of the present disclosure relate to a method, system and computer program product for semantic search based on a graph database. In some embodiments, a method is disclosed. According to the method, the user jobs of a user are obtained from a first software product. Based on the user jobs, target test cases are selected from a plurality of test cases associated with the first software product and a second software product. The target test cases are applied to the first software product and the second software product, and in accordance with a determination that a result of applying the target test cases satisfies a predetermined criterion, an instruction is provided to indicate migrating from the first software product to the second software product. In other embodiments, a system and a computer program product are disclosed.Type: GrantFiled: July 7, 2022Date of Patent: April 2, 2024Assignee: International Business Machines CorporationInventors: Lei Gao, Jin Wang, A Peng Zhang, Kai Li, Jun Wang, Jing James Xu, Rui Wang, Xin Feng Zhu
-
Patent number: 11928156Abstract: Obtain, at a computing device, a segment of computer code. With a classification module of a machine learning system executing on the computing device, determine a required annotation category for the segment of computer code. With an annotation generation module of the machine learning system executing on the computing device, generate a natural language annotation of the segment of computer code based on the segment of computer code and the required annotation category. Provide the natural language annotation to a user interface for display adjacent the segment of computer code.Type: GrantFiled: November 3, 2020Date of Patent: March 12, 2024Assignee: International Business Machines CorporationInventors: Dakuo Wang, Lingfei Wu, Xuye Liu, Yi Wang, Chuang Gan, Jing Xu, Xue Ying Zhang, Jun Wang, Jing James Xu
-
Patent number: 11907099Abstract: Embodiments of the present invention disclose a method, computer program product, and system for estimating the results of a performance test on an updated software application. A method, the method comprising receiving an updated software application, wherein the size of the updated software application is a first size and generating a plurality of small probe, wherein the size of each of the small probe data is a second size, wherein the second size is less than the first size. Conducting a first performance test on the plurality of small probe data and calculating an estimated elapsed time for a performance test on the updated software application. Conducting the performance test on the updated software application and determining if the updated software is given a PASS or FAIL for the performance test, based in part on the elapsed time of the performance test on the updated software application.Type: GrantFiled: September 22, 2020Date of Patent: February 20, 2024Assignee: International Business Machines CorporationInventors: Yao Dong Liu, Jing James Xu, Jiang Bo Kang, Dong Hai Yu, Jun Wang
-
Publication number: 20240054211Abstract: Detecting anomalous data by applying a plurality of models to a data set to yield detection results including anomalous data, applying evaluation methods to the detection results for each of the plurality of models, determining a combined score for the detection results according to the evaluation methods, determining a combined score threshold, and defining a set of detected anomalies according to the combined score and the combined score threshold.Type: ApplicationFiled: August 10, 2022Publication date: February 15, 2024Inventors: Jing Xu, Xue Ying Zhang, Si Er Han, Jing James Xu, Xiao Ming Ma, Wen Pei Yu
-
Patent number: 11893499Abstract: Automated development and training of deep forest models for analyzing data by growing a random forest of decision trees using data, determining Out-of-bag (OOB) predictions for the forest, appending the OOB predictions to the data set, and growing an additional forest using the data set including the appended OOB predictions, and combining the output of the additional forest, then utilizing the model to classify data outside the training data set.Type: GrantFiled: March 12, 2019Date of Patent: February 6, 2024Assignee: International Business Machines CorporationInventors: Jing Xu, Rui Wang, Xiao Ming Ma, Ji Hui Yang, Xue Ying Zhang, Jing James Xu, Si Er Han
-
Publication number: 20240012746Abstract: Embodiments of the present disclosure relate to a method, system and computer program product for semantic search based on a graph database. In some embodiments, a method is disclosed. According to the method, the user jobs of a user are obtained from a first software product. Based on the user jobs, target test cases are selected from a plurality of test cases associated with the first software product and a second software product. The target test cases are applied to the first software product and the second software product, and in accordance with a determination that a result of applying the target test cases satisfies a predetermined criterion, an instruction is provided to indicate migrating from the first software product to the second software product. In other embodiments, a system and a computer program product are disclosed.Type: ApplicationFiled: July 7, 2022Publication date: January 11, 2024Inventors: Lei Gao, Jin Wang, A PENG ZHANG, Kai Li, Jun Wang, Jing James Xu, Rui Wang, Xin Feng Zhu
-
Publication number: 20230394326Abstract: Embodiments of the present disclosure relate to a method, system, and computer program product for predictive models. According to the method, a processor may provide a first list including at least one input variable of a predictive model and a second list including a plurality of variables of the predictive model. For each of input variables in the second list, the processor may determine contribution of the input variable to prediction of the predictive model with respect to the at least one input variable in the first list. The processor may update the first list by moving an input variable in the second list into the first list based on the determined contribution of the plurality of input variables. The processor may render one or more of input variables in the updated first list based on an order of the input variables in the updated first list.Type: ApplicationFiled: June 1, 2022Publication date: December 7, 2023Inventors: Si Er Han, Xue Ying Zhang, Xiao Ming Ma, Wen Pei Yu, Jing Xu, Jing James Xu, Rui Wang
-
Publication number: 20230367689Abstract: Disclosed are a computer-implemented method, a system and a computer program product for model exploration. Model feature importance of each model of a plurality of models can be obtained, the plurality of models can be grouped into a plurality of model clusters based on the model feature importance of each model, and the model feature importance can be presented by box-plot or confidence interval.Type: ApplicationFiled: May 15, 2022Publication date: November 16, 2023Inventors: Jing Xu, Xue Ying Zhang, Si Er Han, Jing James Xu, Xiao Ming Ma, Jun Wang, Wen Pei Yu