Patents by Inventor Dhavalkumar C Patel

Dhavalkumar C Patel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Visualization of a model selection process in an automated model selection system

Patent number: 11688111

Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.

Type: Grant

Filed: July 29, 2020

Date of Patent: June 27, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
Automated machine learning pipeline generation

Patent number: 11620582

Abstract: Techniques regarding one or more automated machine learning processes that analyze time series data are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a time series analysis component that selects a machine learning pipeline for meta transfer learning on time series data by sequentially allocating subsets of training data from the time series data amongst a plurality of machine learning pipeline candidates.

Type: Grant

Filed: July 29, 2020

Date of Patent: April 4, 2023

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Bei Chen, Long Vu, Syed Yousaf Shah, Xuan-Hong Dang, Peter Daniel Kirchner, Si Er Han, Ji Hui Yang, Jun Wang, Jing James Xu, Dakuo Wang, Dhavalkumar C. Patel, Gregory Bramble, Horst Cornelius Samulowitz, Saket Sathe, Chuang Gan
ASSOCIATING DISTURBANCE EVENTS TO ACCIDENTS OR TICKETS

Publication number: 20230048378

Abstract: Methods and systems to provide a form of probabilistic labeling to associate an outage with a disturbance, which could itself be either known based on the available data or unknown. In the latter case, labeling is especially challenging, as it necessitates the discovery of the disturbance. One approach incorporates a statistical change-point analysis to time-series events that correspond to service tickets in the relevant geographic sub-regions. The method is calibrated to separate the regular periods from the environmental disturbance periods, under the assumption that disturbances significantly increase the rate of loss-causing events. To obtain the probability that a given loss-causing event is related to an environmental disturbance, the method leverages the difference between the rate of events expected in the absence of any disturbances (baseline) and the rate of actually observed events. In the analysis, the local disturbances are identified and estimators of their duration and magnitude are provided.

Type: Application

Filed: August 12, 2021

Publication date: February 16, 2023

Inventors: Emmanuel Yashchin, Nianjun Zhou, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Arun Kwangil Iyengar, Shrey Shrivastava
AUTOMATED TIME SERIES FORECASTING PIPELINE GENERATION

Publication number: 20220327058

Abstract: To automate time series forecasting machine learning pipeline generation, a data allocation size of time series data may be determined based on one or more characteristics of a time series data set. The time series data may be allocated for use by candidate machine learning pipelines based on the data allocation size. Features for the time series data may be determined and cached by the candidate machine learning pipelines. Predictions of each of the candidate machine learning pipelines using at least the one or more features may be evaluated. A ranked list of machine learning pipelines may be automatically generated from the candidate machine learning pipelines for time series forecasting based upon evaluating predictions of each of the one or more candidate machine learning pipelines.

Type: Application

Filed: March 15, 2022

Publication date: October 13, 2022

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Long VU, Bei CHEN, Xuan-Hong DANG, Peter Daniel KIRCHNER, Syed Yousaf SHAH, Dhavalkumar C. PATEL, Si Er HAN, Ji Hui YANG, Jun WANG, Jing James XU, Dakuo WANG, Gregory BRAMBLE, Horst Cornelius SAMULOWITZ, Saket K. SATHE, Wesley M. GIFFORD, Petros ZERFOS
INTERACTIVE MACHINE LEARNING OPTIMIZATION

Publication number: 20220309391

Abstract: Methods, computer program products, and systems are presented. The method, computer program products, and systems can include, for instance: examining an enterprise dataset, the enterprise dataset defined by enterprise collected data; selecting one or more synthetic dataset in dependence on the examining, the one or more synthetic dataset including data other than data collected by the enterprise; training a set of predictive models using data of the one or more synthetic dataset to provide a set of trained predictive models; testing the set of trained predictive models with use of holdout data of the one or more synthetic dataset; and presenting prompting data on a displayed user interface of a developer user in dependence on result data resulting from the testing, the prompting data prompting the developer user to direct action with respect to one or more model of the set of predictive models.

Type: Application

Filed: March 29, 2021

Publication date: September 29, 2022

Inventors: Dhavalkumar C. PATEL, Si Er HAN, Jiang Bo KANG
AUTOMATED TIME SERIES FORECASTING PIPELINE RANKING

Publication number: 20220261598

Abstract: To rank time series forecasting in machine learning pipelines, time series data may be incrementally allocated from a time series data set for testing by candidate machine learning pipelines based on seasonality or a degree of temporal dependence of the time series data. Intermediate evaluation scores may be provided by each of the candidate machine learning pipelines following each time series data allocation. One or more machine learning pipelines may be automatically selected from a ranked list of the one or more candidate machine learning pipelines based on a projected learning curve generated from the intermediate evaluation scores.

Type: Application

Filed: October 26, 2021

Publication date: August 18, 2022

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Bei CHEN, Long VU, Dhavalkumar C. PATEL, Syed Yousaf SHAH, Gregory BRAMBLE, Peter Daniel KIRCHNER, Horst Cornelius SAMULOWITZ, Xuan-Hong DANG, Petros ZERFOS
FEDERATED LEARNING FOR MULTI-LABEL CLASSIFICATION MODEL FOR OIL PUMP MANAGEMENT

Publication number: 20220188775

Abstract: A computer implemented federated learning method of predicting failure of assets includes generating a local model at a local site for each of the cohorts and training the local model on local data for each of the cohorts for each failure type. The local model is shared with a central database. A global model is created based on an aggregation of a plurality of the local models from a plurality of the local sites. At each of the plurality of local sites, one of the global model and the local model is chosen for each of the cohorts. The chosen model operates on local data to predict failure of the assets. The utilized features include partitioning features of the assets into static features, semi-static features, and dynamic features, and forming cohorts of the assets based on the static features and the semi-static features.

Type: Application

Filed: December 15, 2020

Publication date: June 16, 2022

Inventors: Nianjun Zhou, Dhavalkumar C. Patel, Anuradha Bhamidipaty
DYNAMIC AND CONTINUOUS COMPOSITION OF FEATURES EXTRACTION AND LEARNING OPERATION TOOL FOR EPISODIC INDUSTRIAL PROCESS

Publication number: 20220172002

Abstract: A computer implemented method of preparing process data for use in an artificial intelligence (AI) model includes collecting and storing raw data as episodic data for each episode of a process. An episode data generator assigns an episode identifier each set of episodic data. The raw data per episode is transformed into a standardized episodic data format that is usable by the AI model. Metrics are assigned to the episodic data and the episodic data is aggregated in an episode store. The data in the episode store is used by a feature extraction and learning module to extract and rank features.

Type: Application

Filed: December 1, 2020

Publication date: June 2, 2022

Inventors: Shrey Shrivastava, Dhavalkumar C. Patel, Jayant R. Kalagnanam, Chandrasekhara K. Reddy
SCALABLE DISCOVERY OF LEADERS FROM DYNAMIC COMBINATORIAL SEARCH SPACE USING INCREMENTAL PIPELINE GROWTH APPROACH

Publication number: 20220138616

Abstract: A computer implemented method includes generating a pipeline graph having a plurality of layers, each of the plurality of layers having one or more machine learning components for performing a predictive modeling task. A plurality of pipelines are operated through the pipeline graph on a training dataset to determine a respective plurality of results. Each of the plurality of pipelines are distinct paths through selected ones of the one or more machine learning components at each of the plurality of layers. The plurality of results are compared to known results based on a user-defined metric to output one or more leader pipelines.

Type: Application

Filed: October 30, 2020

Publication date: May 5, 2022

Inventors: Dhavalkumar C. Patel, Shrey Shrivastava, Jayant R. Kalagnanam, Stuart Siegel, Wesley M. Gifford, Chandrasekhara K. Reddy
Efficient real-time data quality analysis

Patent number: 11263103

Abstract: Embodiments of the invention are directed a computer-implemented method for efficiently assessing data quality metrics. A non-limiting example of the computer-implemented method includes receiving, using a processor, a plurality of updates to data points in a data stream. The processor is further used to provide a plurality of data quality metrics (DQMs), and to maintain information on how much the plurality of DQMs are changing over time. The processor also maintains information on computational overhead for the plurality of DQMs, and also updates data quality information based on the maintained information.

Type: Grant

Filed: July 31, 2020

Date of Patent: March 1, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Arun Kwangil Iyengar, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Shrey Shrivastava, Nianjun Zhou
AUTOMATED MACHINE LEARNING PIPELINE GENERATION

Publication number: 20220036246

Abstract: Techniques regarding one or more automated machine learning processes that analyze time series data are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a time series analysis component that selects a machine learning pipeline for meta transfer learning on time series data by sequentially allocating subsets of training data from the time series data amongst a plurality of machine learning pipeline candidates.

Type: Application

Filed: July 29, 2020

Publication date: February 3, 2022

Inventors: Bei Chen, Long VU, Syed Yousaf Shah, Xuan-Hong Dang, Peter Daniel Kirchner, Si Er Han, Ji Hui Yang, Jun Wang, Jing James Xu, Dakuo Wang, Dhavalkumar C. Patel, Gregory Bramble, Horst Cornelius Samulowitz, Saket Sathe, Chuang Gan
TECHNOLOGY FOR OPTIMIZING ARTIFICIAL INTELLIGENCE PIPELINES

Publication number: 20220036232

Abstract: Machine logic to change steps included in and/or parameters/parameter value used in artificial intelligence (“AI”) pipelines. For example, the machine logic may control what types of data (for example, sensor data) are received by the AI pipeline and/or have the data is culled in the pipeline prior to application of a machine learning and/or artificial intelligence algorithm.

Type: Application

Filed: July 29, 2020

Publication date: February 3, 2022

Inventors: Dhavalkumar C. Patel, Shrey Shrivastava, Wesley M. Gifford
EFFICIENT REAL-TIME DATA QUALITY ANALYSIS

Publication number: 20220035721

Abstract: Embodiments of the invention are directed a computer-implemented method for efficiently assessing data quality metrics. A non-limiting example of the computer-implemented method includes receiving, using a processor, a plurality of updates to data points in a data stream. The processor is further used to provide a plurality of data quality metrics (DQMs), and to maintain information on how much the plurality of DQMs are changing over time. The processor also maintains information on computational overhead for the plurality of DQMs, and also updates data quality information based on the maintained information.

Type: Application

Filed: July 31, 2020

Publication date: February 3, 2022

Inventors: Arun Kwangil Iyengar, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Shrey Shrivastava, Nianjun Zhou
VISUALIZATION OF A MODEL SELECTION PROCESS IN AN AUTOMATED MODEL SELECTION SYSTEM

Publication number: 20220036610

Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.

Type: Application

Filed: July 29, 2020

Publication date: February 3, 2022

Inventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
DETERMINING AND SELECTING PREDICTION MODELS OVER MULTIPLE POINTS IN TIME

Publication number: 20220012641

Abstract: Techniques for generating model ensembles are provided. A plurality of models trained to generate predictions at each of a plurality of intervals is received. A respective prediction accuracy of each respective model of the plurality of models is determined for a first interval of the plurality of intervals by processing labeled evaluation data using the respective model. Additionally, a model ensemble specifying one or more of the plurality of models for each of the plurality of intervals is generated, comprising selecting, for the first interval, a first model of the plurality of models based on (i) the respective prediction accuracies and (ii) at least one non-error metric.

Type: Application

Filed: July 9, 2020

Publication date: January 13, 2022

Inventors: Arun Kwangil IYENGAR, Jeffrey Owen KEPHART, Dhavalkumar C. PATEL, Dung Tien PHAN, Chandrasekhara K. REDDY
MODEL FIDELITY MONITORING AND REGENERATION FOR MANUFACTURING PROCESS DECISION SUPPORT

Publication number: 20220011760

Abstract: Techniques for model fidelity monitoring and regeneration for manufacturing process decision support are described herein. Aspects of the invention include determining that an output of a regression model corresponding to a current time period of decision support for a manufacturing process is not within a predefined range of a historical process dataset, wherein the regression model was constructed based on the historical process dataset, and performing an accuracy and fidelity analysis on the regression model based on process data from the manufacturing process corresponding to a previous time period. Based on a result of the accuracy and fidelity analysis being below a threshold, a mismatch of the regression model as compared to the manufacturing process is determined. Based on determining the mismatch, a temporary regression model corresponding to the manufacturing process is generated, and decision support for the manufacturing process is performed based on the temporary regression model.

Type: Application

Filed: July 8, 2020

Publication date: January 13, 2022

Inventors: Nianjun Zhou, Dharmashankar Subramanian, Dhavalkumar C. Patel, Anuradha Bhamidipaty
DETERMINING AND SELECTING PREDICTION MODELS OVER MULTIPLE POINTS IN TIME USING TEST DATA

Publication number: 20220012640

Abstract: Techniques for model evaluation and selection are provided. A plurality of models trained to generate predictions at each of a plurality of intervals is received, and a plurality of model ensembles, each specifying one or more of the plurality of models for each of the plurality of intervals, is generated. A test data set is received, where the test data set includes values for at least a first interval of the plurality of intervals and does not include values for at least a second interval of the plurality of intervals. A first model ensemble, of the plurality of model ensembles, is selected based on processing the test data set using each of the plurality of model ensembles.

Type: Application

Filed: July 9, 2020

Publication date: January 13, 2022

Inventors: Arun Kwangil IYENGAR, Jeffrey Owen KEPHART, Dhavalkumar C. PATEL, Dung Tien PHAN, Chandrasekhara K. REDDY
Real-time data quality analysis

Patent number: 11204851

Abstract: Embodiments of the invention are directed a computer-implemented method for assessing data quality. A non-limiting example of the computer-implemented method includes using a processor to receive a plurality of updates to data points in a data stream. The processor is further used to compute instances of a data quality metric (DQM) from the data points in the data stream. The instances of the DQM are configured to differentiate the data points in the data stream by time and assign a higher weight to the instances of the DQM computed from more recent data points in the data stream. The instances of the DQM are continuously updated as more of the data points are received by the processor while limiting cycles of the processor consumed by updating the instances of the DQM to a threshold.

Type: Grant

Filed: July 31, 2020

Date of Patent: December 21, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Arun Kwangil Iyengar, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Shrey Shrivastava, Nianjun Zhou
EFFICIENT TECHNIQUES FOR DETERMINING THE BEST DATA IMPUTATION ALGORITHMS

Publication number: 20210357781

Abstract: A processing system, a computer program product, and a method for efficiently determining a best imputation algorithm from a plurality of imputation algorithms A method includes: providing a plurality of imputation algorithms; providing a time parameter tmax to limit an amount of time spent determining a best imputation algorithm; maintaining past information i on accuracy and execution time for at least one of the imputation algorithms; using said information i to compute a utility score for each of the at least one the imputation algorithms; and testing imputation algorithms and associated parameters in an order based on said utility scores.

Type: Application

Filed: May 15, 2020

Publication date: November 18, 2021

Inventors: Arun Kwangil IYENGAR, Dhavalkumar C. PATEL
DETERMINING THE BEST DATA IMPUTATION ALGORITHMS

Publication number: 20210357794

Abstract: A processing system, a computer program product, and a method for determining a best imputation algorithm from a plurality of imputation algorithms A method includes: providing a plurality of imputation algorithms; defining a data analytics task in which at least one step of the data analytics task includes determining at least one missing data value by imputation; executing the data analytics task multiple times wherein each execution of the data analytics task uses a data imputation algorithm of the plurality of data imputation algorithms to determine at least one missing data value; determining an error for each execution of the data analytics task; and selecting an imputation algorithm which results in a least error for the data analytics task.

Type: Application

Filed: May 15, 2020

Publication date: November 18, 2021

Inventors: Arun Kwangil IYENGAR, Dhavalkumar C. PATEL

prev 1 2 3 next