Patents by Inventor Dhavalkumar C Patel

Dhavalkumar C Patel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11966340
    Abstract: To automate time series forecasting machine learning pipeline generation, a data allocation size of time series data may be determined based on one or more characteristics of a time series data set. The time series data may be allocated for use by candidate machine learning pipelines based on the data allocation size. Features for the time series data may be determined and cached by the candidate machine learning pipelines. Predictions of each of the candidate machine learning pipelines using at least the one or more features may be evaluated. A ranked list of machine learning pipelines may be automatically generated from the candidate machine learning pipelines for time series forecasting based upon evaluating predictions of each of the one or more candidate machine learning pipelines.
    Type: Grant
    Filed: March 15, 2022
    Date of Patent: April 23, 2024
    Assignee: International Business Machines Corporation
    Inventors: Long Vu, Bei Chen, Xuan-Hong Dang, Peter Daniel Kirchner, Syed Yousaf Shah, Dhavalkumar C. Patel, Si Er Han, Ji Hui Yang, Jun Wang, Jing James Xu, Dakuo Wang, Gregory Bramble, Horst Cornelius Samulowitz, Saket K. Sathe, Wesley M. Gifford, Petros Zerfos
  • Publication number: 20240095001
    Abstract: Using exported data of a machine learning model and a model training environment specification, a resource usage specification and a code module usage specification of the model are identified. A code module installation specification is determined from a code module requirements specification and a target execution environment specification. The code modules specified by the code module installation specification are caused to be installed in the target execution environment. Using data of the updated target execution environment, the updated target execution environment is validated for execution of the model. Execution of the model in the updated target execution environment is simulated. The model is deployed in the updated target execution environment responsive to the simulating being successful.
    Type: Application
    Filed: September 19, 2022
    Publication date: March 21, 2024
    Applicant: International Business Machines Corporation
    Inventor: Dhavalkumar C. Patel
  • Publication number: 20230297876
    Abstract: Selecting a time-series forecasting pipeline by receiving target variable time-series data and exogenous variable time-series data, generating a regular forecasting pipeline comprising a model according to the target variable time-series data, generating an exogenous forecasting pipeline comprising a model according to the target variable time-series data and the exogenous variable time-series data, evaluating the regular forecasting pipeline and the exogenous forecasting pipeline, selecting a pipeline according to the evaluation, and providing the selected pipeline.
    Type: Application
    Filed: March 17, 2022
    Publication date: September 21, 2023
    Inventors: Xuan-Hong Dang, SYED YOUSAF SHAH, Dhavalkumar C. Patel, Wesley M. Gifford, Petros ZERFOS
  • Publication number: 20230281364
    Abstract: A system and method for learning a predictive function that can automatically learn different operating modes for a multi-modal system and predict the number of operating states for a multi-modal system and additionally the detailed structure for each state. Once learned, the predictive function (model) can be used to determine a mode of a new sample (an asset). Based on the determined components that maximize a log likelihood function, a mode of the new sample is detected into the model via dependency graphs. One aspect includes enforcing a lower bound for the number of sample points to form an operational mode for an asset. While a mode relates to sample points which maximizes like log-likelihood, an ability is provided to remove artifact modes due to noisy data by considering a sufficient sample data condition and maximizing log-likelihood. Domain knowledge can be incorporated into the model via dependency graphs.
    Type: Application
    Filed: March 4, 2022
    Publication date: September 7, 2023
    Inventors: Dzung Tien Phan, Robert Jeffrey Baseman, Dhavalkumar C. Patel, Fateh A. Tipu
  • Patent number: 11688111
    Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.
    Type: Grant
    Filed: July 29, 2020
    Date of Patent: June 27, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
  • Patent number: 11620582
    Abstract: Techniques regarding one or more automated machine learning processes that analyze time series data are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a time series analysis component that selects a machine learning pipeline for meta transfer learning on time series data by sequentially allocating subsets of training data from the time series data amongst a plurality of machine learning pipeline candidates.
    Type: Grant
    Filed: July 29, 2020
    Date of Patent: April 4, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Bei Chen, Long Vu, Syed Yousaf Shah, Xuan-Hong Dang, Peter Daniel Kirchner, Si Er Han, Ji Hui Yang, Jun Wang, Jing James Xu, Dakuo Wang, Dhavalkumar C. Patel, Gregory Bramble, Horst Cornelius Samulowitz, Saket Sathe, Chuang Gan
  • Publication number: 20230048378
    Abstract: Methods and systems to provide a form of probabilistic labeling to associate an outage with a disturbance, which could itself be either known based on the available data or unknown. In the latter case, labeling is especially challenging, as it necessitates the discovery of the disturbance. One approach incorporates a statistical change-point analysis to time-series events that correspond to service tickets in the relevant geographic sub-regions. The method is calibrated to separate the regular periods from the environmental disturbance periods, under the assumption that disturbances significantly increase the rate of loss-causing events. To obtain the probability that a given loss-causing event is related to an environmental disturbance, the method leverages the difference between the rate of events expected in the absence of any disturbances (baseline) and the rate of actually observed events. In the analysis, the local disturbances are identified and estimators of their duration and magnitude are provided.
    Type: Application
    Filed: August 12, 2021
    Publication date: February 16, 2023
    Inventors: Emmanuel Yashchin, Nianjun Zhou, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Arun Kwangil Iyengar, Shrey Shrivastava
  • Publication number: 20220327058
    Abstract: To automate time series forecasting machine learning pipeline generation, a data allocation size of time series data may be determined based on one or more characteristics of a time series data set. The time series data may be allocated for use by candidate machine learning pipelines based on the data allocation size. Features for the time series data may be determined and cached by the candidate machine learning pipelines. Predictions of each of the candidate machine learning pipelines using at least the one or more features may be evaluated. A ranked list of machine learning pipelines may be automatically generated from the candidate machine learning pipelines for time series forecasting based upon evaluating predictions of each of the one or more candidate machine learning pipelines.
    Type: Application
    Filed: March 15, 2022
    Publication date: October 13, 2022
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Long VU, Bei CHEN, Xuan-Hong DANG, Peter Daniel KIRCHNER, Syed Yousaf SHAH, Dhavalkumar C. PATEL, Si Er HAN, Ji Hui YANG, Jun WANG, Jing James XU, Dakuo WANG, Gregory BRAMBLE, Horst Cornelius SAMULOWITZ, Saket K. SATHE, Wesley M. GIFFORD, Petros ZERFOS
  • Publication number: 20220309391
    Abstract: Methods, computer program products, and systems are presented. The method, computer program products, and systems can include, for instance: examining an enterprise dataset, the enterprise dataset defined by enterprise collected data; selecting one or more synthetic dataset in dependence on the examining, the one or more synthetic dataset including data other than data collected by the enterprise; training a set of predictive models using data of the one or more synthetic dataset to provide a set of trained predictive models; testing the set of trained predictive models with use of holdout data of the one or more synthetic dataset; and presenting prompting data on a displayed user interface of a developer user in dependence on result data resulting from the testing, the prompting data prompting the developer user to direct action with respect to one or more model of the set of predictive models.
    Type: Application
    Filed: March 29, 2021
    Publication date: September 29, 2022
    Inventors: Dhavalkumar C. PATEL, Si Er HAN, Jiang Bo KANG
  • Publication number: 20220261598
    Abstract: To rank time series forecasting in machine learning pipelines, time series data may be incrementally allocated from a time series data set for testing by candidate machine learning pipelines based on seasonality or a degree of temporal dependence of the time series data. Intermediate evaluation scores may be provided by each of the candidate machine learning pipelines following each time series data allocation. One or more machine learning pipelines may be automatically selected from a ranked list of the one or more candidate machine learning pipelines based on a projected learning curve generated from the intermediate evaluation scores.
    Type: Application
    Filed: October 26, 2021
    Publication date: August 18, 2022
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Bei CHEN, Long VU, Dhavalkumar C. PATEL, Syed Yousaf SHAH, Gregory BRAMBLE, Peter Daniel KIRCHNER, Horst Cornelius SAMULOWITZ, Xuan-Hong DANG, Petros ZERFOS
  • Publication number: 20220188775
    Abstract: A computer implemented federated learning method of predicting failure of assets includes generating a local model at a local site for each of the cohorts and training the local model on local data for each of the cohorts for each failure type. The local model is shared with a central database. A global model is created based on an aggregation of a plurality of the local models from a plurality of the local sites. At each of the plurality of local sites, one of the global model and the local model is chosen for each of the cohorts. The chosen model operates on local data to predict failure of the assets. The utilized features include partitioning features of the assets into static features, semi-static features, and dynamic features, and forming cohorts of the assets based on the static features and the semi-static features.
    Type: Application
    Filed: December 15, 2020
    Publication date: June 16, 2022
    Inventors: Nianjun Zhou, Dhavalkumar C. Patel, Anuradha Bhamidipaty
  • Publication number: 20220172002
    Abstract: A computer implemented method of preparing process data for use in an artificial intelligence (AI) model includes collecting and storing raw data as episodic data for each episode of a process. An episode data generator assigns an episode identifier each set of episodic data. The raw data per episode is transformed into a standardized episodic data format that is usable by the AI model. Metrics are assigned to the episodic data and the episodic data is aggregated in an episode store. The data in the episode store is used by a feature extraction and learning module to extract and rank features.
    Type: Application
    Filed: December 1, 2020
    Publication date: June 2, 2022
    Inventors: Shrey Shrivastava, Dhavalkumar C. Patel, Jayant R. Kalagnanam, Chandrasekhara K. Reddy
  • Publication number: 20220138616
    Abstract: A computer implemented method includes generating a pipeline graph having a plurality of layers, each of the plurality of layers having one or more machine learning components for performing a predictive modeling task. A plurality of pipelines are operated through the pipeline graph on a training dataset to determine a respective plurality of results. Each of the plurality of pipelines are distinct paths through selected ones of the one or more machine learning components at each of the plurality of layers. The plurality of results are compared to known results based on a user-defined metric to output one or more leader pipelines.
    Type: Application
    Filed: October 30, 2020
    Publication date: May 5, 2022
    Inventors: Dhavalkumar C. Patel, Shrey Shrivastava, Jayant R. Kalagnanam, Stuart Siegel, Wesley M. Gifford, Chandrasekhara K. Reddy
  • Patent number: 11263103
    Abstract: Embodiments of the invention are directed a computer-implemented method for efficiently assessing data quality metrics. A non-limiting example of the computer-implemented method includes receiving, using a processor, a plurality of updates to data points in a data stream. The processor is further used to provide a plurality of data quality metrics (DQMs), and to maintain information on how much the plurality of DQMs are changing over time. The processor also maintains information on computational overhead for the plurality of DQMs, and also updates data quality information based on the maintained information.
    Type: Grant
    Filed: July 31, 2020
    Date of Patent: March 1, 2022
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Arun Kwangil Iyengar, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Shrey Shrivastava, Nianjun Zhou
  • Publication number: 20220036232
    Abstract: Machine logic to change steps included in and/or parameters/parameter value used in artificial intelligence (“AI”) pipelines. For example, the machine logic may control what types of data (for example, sensor data) are received by the AI pipeline and/or have the data is culled in the pipeline prior to application of a machine learning and/or artificial intelligence algorithm.
    Type: Application
    Filed: July 29, 2020
    Publication date: February 3, 2022
    Inventors: Dhavalkumar C. Patel, Shrey Shrivastava, Wesley M. Gifford
  • Publication number: 20220036610
    Abstract: Systems, computer-implemented methods, and computer program products to facilitate visualization of a model selection process are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an interaction backend handler component that obtains one or more assessment metrics of a model pipeline candidate. The computer executable components can further comprise a visualization render component that renders a progress visualization of the model pipeline candidate based on the one or more assessment metrics.
    Type: Application
    Filed: July 29, 2020
    Publication date: February 3, 2022
    Inventors: Dakuo Wang, Bei Chen, Ji Hui Yang, Abel Valente, Arunima Chaudhary, Chuang Gan, John Dillon Eversman, Voranouth Supadulya, Daniel Karl I. Weidele, Jun Wang, Jing James Xu, Dhavalkumar C. Patel, Long Vu, Syed Yousaf Shah, Si Er Han
  • Publication number: 20220035721
    Abstract: Embodiments of the invention are directed a computer-implemented method for efficiently assessing data quality metrics. A non-limiting example of the computer-implemented method includes receiving, using a processor, a plurality of updates to data points in a data stream. The processor is further used to provide a plurality of data quality metrics (DQMs), and to maintain information on how much the plurality of DQMs are changing over time. The processor also maintains information on computational overhead for the plurality of DQMs, and also updates data quality information based on the maintained information.
    Type: Application
    Filed: July 31, 2020
    Publication date: February 3, 2022
    Inventors: Arun Kwangil Iyengar, Anuradha Bhamidipaty, Dhavalkumar C. Patel, Shrey Shrivastava, Nianjun Zhou
  • Publication number: 20220036246
    Abstract: Techniques regarding one or more automated machine learning processes that analyze time series data are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a time series analysis component that selects a machine learning pipeline for meta transfer learning on time series data by sequentially allocating subsets of training data from the time series data amongst a plurality of machine learning pipeline candidates.
    Type: Application
    Filed: July 29, 2020
    Publication date: February 3, 2022
    Inventors: Bei Chen, Long VU, Syed Yousaf Shah, Xuan-Hong Dang, Peter Daniel Kirchner, Si Er Han, Ji Hui Yang, Jun Wang, Jing James Xu, Dakuo Wang, Dhavalkumar C. Patel, Gregory Bramble, Horst Cornelius Samulowitz, Saket Sathe, Chuang Gan
  • Publication number: 20220012640
    Abstract: Techniques for model evaluation and selection are provided. A plurality of models trained to generate predictions at each of a plurality of intervals is received, and a plurality of model ensembles, each specifying one or more of the plurality of models for each of the plurality of intervals, is generated. A test data set is received, where the test data set includes values for at least a first interval of the plurality of intervals and does not include values for at least a second interval of the plurality of intervals. A first model ensemble, of the plurality of model ensembles, is selected based on processing the test data set using each of the plurality of model ensembles.
    Type: Application
    Filed: July 9, 2020
    Publication date: January 13, 2022
    Inventors: Arun Kwangil IYENGAR, Jeffrey Owen KEPHART, Dhavalkumar C. PATEL, Dung Tien PHAN, Chandrasekhara K. REDDY
  • Publication number: 20220012641
    Abstract: Techniques for generating model ensembles are provided. A plurality of models trained to generate predictions at each of a plurality of intervals is received. A respective prediction accuracy of each respective model of the plurality of models is determined for a first interval of the plurality of intervals by processing labeled evaluation data using the respective model. Additionally, a model ensemble specifying one or more of the plurality of models for each of the plurality of intervals is generated, comprising selecting, for the first interval, a first model of the plurality of models based on (i) the respective prediction accuracies and (ii) at least one non-error metric.
    Type: Application
    Filed: July 9, 2020
    Publication date: January 13, 2022
    Inventors: Arun Kwangil IYENGAR, Jeffrey Owen KEPHART, Dhavalkumar C. PATEL, Dung Tien PHAN, Chandrasekhara K. REDDY