Patents by Inventor Dhavalkumar C Patel
Dhavalkumar C Patel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11688111Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.Type: GrantFiled: July 29, 2020Date of Patent: June 27, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
-
Patent number: 11620582Abstract: Techniques regarding one or more automated machine learning processes that analyze time series data are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a time series analysis component that selects a machine learning pipeline for meta transfer learning on time series data by sequentially allocating subsets of training data from the time series data amongst a plurality of machine learning pipeline candidates.Type: GrantFiled: July 29, 2020Date of Patent: April 4, 2023Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Bei Chen, Long Vu, Syed Yousaf Shah, Xuan-Hong Dang, Peter Daniel Kirchner, Si Er Han, Ji Hui Yang, Jun Wang, Jing James Xu, Dakuo Wang, Dhavalkumar C. Patel, Gregory Bramble, Horst Cornelius Samulowitz, Saket Sathe, Chuang Gan
-
Publication number: 20230048378Abstract: Methods and systems to provide a form of probabilistic labeling to associate an outage with a disturbance, which could itself be either known based on the available data or unknown. In the latter case, labeling is especially challenging, as it necessitates the discovery of the disturbance. One approach incorporates a statistical change-point analysis to time-series events that correspond to service tickets in the relevant geographic sub-regions. The method is calibrated to separate the regular periods from the environmental disturbance periods, under the assumption that disturbances significantly increase the rate of loss-causing events. To obtain the probability that a given loss-causing event is related to an environmental disturbance, the method leverages the difference between the rate of events expected in the absence of any disturbances (baseline) and the rate of actually observed events. In the analysis, the local disturbances are identified and estimators of their duration and magnitude are provided.Type: ApplicationFiled: August 12, 2021Publication date: February 16, 2023Inventors: Emmanuel Yashchin, Nianjun Zhou, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Arun Kwangil Iyengar, Shrey Shrivastava
-
Publication number: 20220327058Abstract: To automate time series forecasting machine learning pipeline generation, a data allocation size of time series data may be determined based on one or more characteristics of a time series data set. The time series data may be allocated for use by candidate machine learning pipelines based on the data allocation size. Features for the time series data may be determined and cached by the candidate machine learning pipelines. Predictions of each of the candidate machine learning pipelines using at least the one or more features may be evaluated. A ranked list of machine learning pipelines may be automatically generated from the candidate machine learning pipelines for time series forecasting based upon evaluating predictions of each of the one or more candidate machine learning pipelines.Type: ApplicationFiled: March 15, 2022Publication date: October 13, 2022Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Long VU, Bei CHEN, Xuan-Hong DANG, Peter Daniel KIRCHNER, Syed Yousaf SHAH, Dhavalkumar C. PATEL, Si Er HAN, Ji Hui YANG, Jun WANG, Jing James XU, Dakuo WANG, Gregory BRAMBLE, Horst Cornelius SAMULOWITZ, Saket K. SATHE, Wesley M. GIFFORD, Petros ZERFOS
-
Publication number: 20220309391Abstract: Methods, computer program products, and systems are presented. The method, computer program products, and systems can include, for instance: examining an enterprise dataset, the enterprise dataset defined by enterprise collected data; selecting one or more synthetic dataset in dependence on the examining, the one or more synthetic dataset including data other than data collected by the enterprise; training a set of predictive models using data of the one or more synthetic dataset to provide a set of trained predictive models; testing the set of trained predictive models with use of holdout data of the one or more synthetic dataset; and presenting prompting data on a displayed user interface of a developer user in dependence on result data resulting from the testing, the prompting data prompting the developer user to direct action with respect to one or more model of the set of predictive models.Type: ApplicationFiled: March 29, 2021Publication date: September 29, 2022Inventors: Dhavalkumar C. PATEL, Si Er HAN, Jiang Bo KANG
-
Publication number: 20220261598Abstract: To rank time series forecasting in machine learning pipelines, time series data may be incrementally allocated from a time series data set for testing by candidate machine learning pipelines based on seasonality or a degree of temporal dependence of the time series data. Intermediate evaluation scores may be provided by each of the candidate machine learning pipelines following each time series data allocation. One or more machine learning pipelines may be automatically selected from a ranked list of the one or more candidate machine learning pipelines based on a projected learning curve generated from the intermediate evaluation scores.Type: ApplicationFiled: October 26, 2021Publication date: August 18, 2022Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Bei CHEN, Long VU, Dhavalkumar C. PATEL, Syed Yousaf SHAH, Gregory BRAMBLE, Peter Daniel KIRCHNER, Horst Cornelius SAMULOWITZ, Xuan-Hong DANG, Petros ZERFOS
-
Publication number: 20220188775Abstract: A computer implemented federated learning method of predicting failure of assets includes generating a local model at a local site for each of the cohorts and training the local model on local data for each of the cohorts for each failure type. The local model is shared with a central database. A global model is created based on an aggregation of a plurality of the local models from a plurality of the local sites. At each of the plurality of local sites, one of the global model and the local model is chosen for each of the cohorts. The chosen model operates on local data to predict failure of the assets. The utilized features include partitioning features of the assets into static features, semi-static features, and dynamic features, and forming cohorts of the assets based on the static features and the semi-static features.Type: ApplicationFiled: December 15, 2020Publication date: June 16, 2022Inventors: Nianjun Zhou, Dhavalkumar C. Patel, Anuradha Bhamidipaty
-
Publication number: 20220172002Abstract: A computer implemented method of preparing process data for use in an artificial intelligence (AI) model includes collecting and storing raw data as episodic data for each episode of a process. An episode data generator assigns an episode identifier each set of episodic data. The raw data per episode is transformed into a standardized episodic data format that is usable by the AI model. Metrics are assigned to the episodic data and the episodic data is aggregated in an episode store. The data in the episode store is used by a feature extraction and learning module to extract and rank features.Type: ApplicationFiled: December 1, 2020Publication date: June 2, 2022Inventors: Shrey Shrivastava, Dhavalkumar C. Patel, Jayant R. Kalagnanam, Chandrasekhara K. Reddy
-
Publication number: 20220138616Abstract: A computer implemented method includes generating a pipeline graph having a plurality of layers, each of the plurality of layers having one or more machine learning components for performing a predictive modeling task. A plurality of pipelines are operated through the pipeline graph on a training dataset to determine a respective plurality of results. Each of the plurality of pipelines are distinct paths through selected ones of the one or more machine learning components at each of the plurality of layers. The plurality of results are compared to known results based on a user-defined metric to output one or more leader pipelines.Type: ApplicationFiled: October 30, 2020Publication date: May 5, 2022Inventors: Dhavalkumar C. Patel, Shrey Shrivastava, Jayant R. Kalagnanam, Stuart Siegel, Wesley M. Gifford, Chandrasekhara K. Reddy
-
Patent number: 11263103Abstract: Embodiments of the invention are directed a computer-implemented method for efficiently assessing data quality metrics. A non-limiting example of the computer-implemented method includes receiving, using a processor, a plurality of updates to data points in a data stream. The processor is further used to provide a plurality of data quality metrics (DQMs), and to maintain information on how much the plurality of DQMs are changing over time. The processor also maintains information on computational overhead for the plurality of DQMs, and also updates data quality information based on the maintained information.Type: GrantFiled: July 31, 2020Date of Patent: March 1, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Arun Kwangil Iyengar, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Shrey Shrivastava, Nianjun Zhou
-
Publication number: 20220036246Abstract: Techniques regarding one or more automated machine learning processes that analyze time series data are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a time series analysis component that selects a machine learning pipeline for meta transfer learning on time series data by sequentially allocating subsets of training data from the time series data amongst a plurality of machine learning pipeline candidates.Type: ApplicationFiled: July 29, 2020Publication date: February 3, 2022Inventors: Bei Chen, Long VU, Syed Yousaf Shah, Xuan-Hong Dang, Peter Daniel Kirchner, Si Er Han, Ji Hui Yang, Jun Wang, Jing James Xu, Dakuo Wang, Dhavalkumar C. Patel, Gregory Bramble, Horst Cornelius Samulowitz, Saket Sathe, Chuang Gan
-
Publication number: 20220036610Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.Type: ApplicationFiled: July 29, 2020Publication date: February 3, 2022Inventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
-
Publication number: 20220035721Abstract: Embodiments of the invention are directed a computer-implemented method for efficiently assessing data quality metrics. A non-limiting example of the computer-implemented method includes receiving, using a processor, a plurality of updates to data points in a data stream. The processor is further used to provide a plurality of data quality metrics (DQMs), and to maintain information on how much the plurality of DQMs are changing over time. The processor also maintains information on computational overhead for the plurality of DQMs, and also updates data quality information based on the maintained information.Type: ApplicationFiled: July 31, 2020Publication date: February 3, 2022Inventors: Arun Kwangil Iyengar, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Shrey Shrivastava, Nianjun Zhou
-
Publication number: 20220036232Abstract: Machine logic to change steps included in and/or parameters/parameter value used in artificial intelligence (“AI”) pipelines. For example, the machine logic may control what types of data (for example, sensor data) are received by the AI pipeline and/or have the data is culled in the pipeline prior to application of a machine learning and/or artificial intelligence algorithm.Type: ApplicationFiled: July 29, 2020Publication date: February 3, 2022Inventors: Dhavalkumar C. Patel, Shrey Shrivastava, Wesley M. Gifford
-
Publication number: 20220012640Abstract: Techniques for model evaluation and selection are provided. A plurality of models trained to generate predictions at each of a plurality of intervals is received, and a plurality of model ensembles, each specifying one or more of the plurality of models for each of the plurality of intervals, is generated. A test data set is received, where the test data set includes values for at least a first interval of the plurality of intervals and does not include values for at least a second interval of the plurality of intervals. A first model ensemble, of the plurality of model ensembles, is selected based on processing the test data set using each of the plurality of model ensembles.Type: ApplicationFiled: July 9, 2020Publication date: January 13, 2022Inventors: Arun Kwangil IYENGAR, Jeffrey Owen KEPHART, Dhavalkumar C. PATEL, Dung Tien PHAN, Chandrasekhara K. REDDY
-
Publication number: 20220011760Abstract: Techniques for model fidelity monitoring and regeneration for manufacturing process decision support are described herein. Aspects of the invention include determining that an output of a regression model corresponding to a current time period of decision support for a manufacturing process is not within a predefined range of a historical process dataset, wherein the regression model was constructed based on the historical process dataset, and performing an accuracy and fidelity analysis on the regression model based on process data from the manufacturing process corresponding to a previous time period. Based on a result of the accuracy and fidelity analysis being below a threshold, a mismatch of the regression model as compared to the manufacturing process is determined. Based on determining the mismatch, a temporary regression model corresponding to the manufacturing process is generated, and decision support for the manufacturing process is performed based on the temporary regression model.Type: ApplicationFiled: July 8, 2020Publication date: January 13, 2022Inventors: Nianjun Zhou, Dharmashankar Subramanian, Dhavalkumar C. Patel, Anuradha Bhamidipaty
-
Publication number: 20220012641Abstract: Techniques for generating model ensembles are provided. A plurality of models trained to generate predictions at each of a plurality of intervals is received. A respective prediction accuracy of each respective model of the plurality of models is determined for a first interval of the plurality of intervals by processing labeled evaluation data using the respective model. Additionally, a model ensemble specifying one or more of the plurality of models for each of the plurality of intervals is generated, comprising selecting, for the first interval, a first model of the plurality of models based on (i) the respective prediction accuracies and (ii) at least one non-error metric.Type: ApplicationFiled: July 9, 2020Publication date: January 13, 2022Inventors: Arun Kwangil IYENGAR, Jeffrey Owen KEPHART, Dhavalkumar C. PATEL, Dung Tien PHAN, Chandrasekhara K. REDDY
-
Patent number: 11204851Abstract: Embodiments of the invention are directed a computer-implemented method for assessing data quality. A non-limiting example of the computer-implemented method includes using a processor to receive a plurality of updates to data points in a data stream. The processor is further used to compute instances of a data quality metric (DQM) from the data points in the data stream. The instances of the DQM are configured to differentiate the data points in the data stream by time and assign a higher weight to the instances of the DQM computed from more recent data points in the data stream. The instances of the DQM are continuously updated as more of the data points are received by the processor while limiting cycles of the processor consumed by updating the instances of the DQM to a threshold.Type: GrantFiled: July 31, 2020Date of Patent: December 21, 2021Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Arun Kwangil Iyengar, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Shrey Shrivastava, Nianjun Zhou
-
Publication number: 20210357781Abstract: A processing system, a computer program product, and a method for efficiently determining a best imputation algorithm from a plurality of imputation algorithms A method includes: providing a plurality of imputation algorithms; providing a time parameter tmax to limit an amount of time spent determining a best imputation algorithm; maintaining past information i on accuracy and execution time for at least one of the imputation algorithms; using said information i to compute a utility score for each of the at least one the imputation algorithms; and testing imputation algorithms and associated parameters in an order based on said utility scores.Type: ApplicationFiled: May 15, 2020Publication date: November 18, 2021Inventors: Arun Kwangil IYENGAR, Dhavalkumar C. PATEL
-
Publication number: 20210357794Abstract: A processing system, a computer program product, and a method for determining a best imputation algorithm from a plurality of imputation algorithms A method includes: providing a plurality of imputation algorithms; defining a data analytics task in which at least one step of the data analytics task includes determining at least one missing data value by imputation; executing the data analytics task multiple times wherein each execution of the data analytics task uses a data imputation algorithm of the plurality of data imputation algorithms to determine at least one missing data value; determining an error for each execution of the data analytics task; and selecting an imputation algorithm which results in a least error for the data analytics task.Type: ApplicationFiled: May 15, 2020Publication date: November 18, 2021Inventors: Arun Kwangil IYENGAR, Dhavalkumar C. PATEL