Patents by Inventor Nipun Agarwal

Nipun Agarwal has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11615438
    Abstract: A method, apparatus, and computer program product are disclosed for self-service design, scheduling, and delivery of user-defined reports regarding promotions. The method includes receiving, from a user device, a report type and report delivery information. Based on the report type, relevant data regarding the one or more promotions is collected, using which a report is generated. The method then outputs the generated report based on the report delivery information. Optionally, analytical insights, such as trends within the data, sample size, suitability of control data, and indications of statistical significance, are generated and included in the report. A corresponding apparatus and computer program product are also provided.
    Type: Grant
    Filed: September 1, 2020
    Date of Patent: March 28, 2023
    Assignee: GROUPON, INC.
    Inventors: Nipun Agarwal, Sudeep Srivastava, Isaac Kim
  • Patent number: 11615265
    Abstract: The present invention relates to dimensionality reduction for machine learning (ML) models. Herein are techniques that individually rank features and combine features based on their rank to achieve an optimal combination of features that may accelerate training and/or inferencing, prevent overfitting, and/or provide insights into somewhat mysterious datasets. In an embodiment, a computer ranks features of datasets of a training corpus. For each dataset and for each landmark percentage, a target ML model is configured to receive only a highest ranking landmark percentage of features, and a landmark accuracy achieved by training the ML model with the dataset is measured. Based on the landmark accuracies and meta-features values of the dataset, a respective training tuple is generated for each dataset. Based on all of the training tuples, a regressor is trained to predict an optimal amount of features for training the target ML model.
    Type: Grant
    Filed: August 21, 2019
    Date of Patent: March 28, 2023
    Assignee: Oracle International Corporation
    Inventors: Tomas Karnagel, Sam Idicula, Hesam Fathi Moghadam, Nipun Agarwal
  • Publication number: 20230091708
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for recommending contextually relevant promotions to consumers in order to facilitate their discovery of promotions that they are likely to purchase from a promotion and marketing service.
    Type: Application
    Filed: September 27, 2022
    Publication date: March 23, 2023
    Inventors: Feili HOU, Vyomkesh TRIPATHI, Nipun AGARWAL, Rajesh Girish PAREKH
  • Publication number: 20230084410
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for recommending contextually relevant promotions to consumers in order to facilitate their discovery of promotions that they are likely to purchase from a promotion and marketing service.
    Type: Application
    Filed: September 26, 2022
    Publication date: March 16, 2023
    Inventors: Feili HOU, Vyomkesh TRIPATHI, Nipun AGARWAL, Rajesh Girish PAREKH
  • Patent number: 11579951
    Abstract: Techniques are described herein for predicting disk drive failure using a machine learning model. The framework involves receiving disk drive sensor attributes as training data, preprocessing the training data to select a set of enhanced feature sequences, and using the enhanced feature sequences to train a machine learning model to predict disk drive failures from disk drive sensor monitoring data. Prior to the training phase, the RNN LSTM model is tuned using a set of predefined hyper-parameters. The preprocessing, which is performed during the training and evaluation phase as well as later during the prediction phase, involves using predefined values for a set of parameters to generate the set of enhanced sequences from raw sensor reading. The enhanced feature sequences are generated to maintain a desired healthy/failed disk ratio, and only use samples leading up to a last-valid-time sample in order to honor a pre-specified heads-up-period alert requirement.
    Type: Grant
    Filed: September 27, 2018
    Date of Patent: February 14, 2023
    Assignee: Oracle International Corporation
    Inventors: Onur Kocberber, Felix Schmidt, Arun Raghavan, Nipun Agarwal, Sam Idicula, Guang-Tong Zhou, Nitin Kunal
  • Patent number: 11567937
    Abstract: Embodiments implement a prediction-driven, rather than a trial-driven, approach to automate database configuration parameter tuning for a database workload. This approach uses machine learning (ML) models to test performance metrics resulting from application of particular database parameters to a database workload, and does not require live trials on the DBMS managing the workload. Specifically, automatic configuration (AC) ML models are trained, using a training corpus that includes information from workloads being run by DBMSs, to predict performance metrics based on workload features and configuration parameter values. The trained AC-ML models predict performance metrics resulting from applying particular configuration parameter values to a given database workload being automatically tuned. Based on correlating changes to configuration parameter values with changes in predicted performance metrics, an optimization algorithm is used to converge to an optimal set of configuration parameters.
    Type: Grant
    Filed: May 12, 2021
    Date of Patent: January 31, 2023
    Assignee: Oracle International Corporation
    Inventors: Sam Idicula, Tomas Karnagel, Jian Wen, Seema Sundara, Nipun Agarwal, Mayur Bency
  • Publication number: 20230022884
    Abstract: Techniques are described herein for prediction of an buffer pool size (BPS). Before performing BPS prediction, gathered data are used to determine whether a target workload is in a steady state. Historical utilization data gathered while the workload is in a steady state are used to predict object-specific BPS components for database objects, accessed by the target workload, that are identified for BPS analysis based on shares of the total disk I/O requests, for the workload, that are attributed to the respective objects. Preference of analysis is given to objects that are associated with larger shares of disk I/O activity. An object-specific BPS component is determined based on a coverage function that returns a percentage of the database object size (on disk) that should be available in the buffer pool for that database object. The percentage is determined using either a heuristic-based or a machine learning-based approach.
    Type: Application
    Filed: July 20, 2021
    Publication date: January 26, 2023
    Inventors: Peyman Faizian, Mayur Bency, Onur Kocberber, Seema Sundara, Nipun Agarwal
  • Patent number: 11562178
    Abstract: According to an embodiment, a method includes generating a first dataset sample from a dataset, calculating a first validation score for the first dataset sample and a machine learning model, and determining whether a difference in validation score between the first validation score and a second validation score satisfies a first criteria. If the difference in validation score does not satisfy the first criteria, the method includes generating a second dataset sample from the dataset. If the difference in validation score does satisfy the first criteria, the method includes updating a convergence value and determining whether the updated convergence value satisfies a second criteria. If the updated convergence value satisfies the second criteria, the method includes returning the first dataset sample. If the updated convergence value does not satisfy the second criteria, the method includes generating the second dataset sample from the dataset.
    Type: Grant
    Filed: December 17, 2019
    Date of Patent: January 24, 2023
    Assignee: Oracle International Corporation
    Inventors: Jingxiao Cai, Sandeep Agrawal, Sam Idicula, Venkatanathan Varadarajan, Anatoly Yakovlev, Nipun Agarwal
  • Patent number: 11544494
    Abstract: Techniques are provided for selection of machine learning algorithms based on performance predictions by trained algorithm-specific regressors. In an embodiment, a computer derives meta-feature values from an inference dataset by, for each meta-feature, deriving a respective meta-feature value from the inference dataset. For each trainable algorithm and each regression meta-model that is respectively associated with the algorithm, a respective score is calculated by invoking the meta-model based on at least one of: a respective subset of meta-feature values, and/or hyperparameter values of a respective subset of hyperparameters of the algorithm. The algorithm(s) are selected based on the respective scores. Based on the inference dataset, the selected algorithm(s) may be invoked to obtain a result. In an embodiment, the trained regressors are distinctly configured artificial neural networks. In an embodiment, the trained regressors are contained within algorithm-specific ensembles.
    Type: Grant
    Filed: January 30, 2018
    Date of Patent: January 3, 2023
    Assignee: Oracle International Corporation
    Inventors: Sandeep Agrawal, Sam Idicula, Venkatanathan Varadarajan, Nipun Agarwal
  • Patent number: 11544630
    Abstract: The present invention relates to dimensionality reduction for machine learning (ML) models. Herein are techniques that individually rank features and combine features based on their rank to achieve an optimal combination of features that may accelerate training and/or inferencing, prevent overfitting, and/or provide insights into somewhat mysterious datasets. In an embodiment, a computer calculates, for each feature of a training dataset, a relevance score based on: a relevance scoring function, and statistics of values, of the feature, that occur in the training dataset. A rank based on relevance scores of the features is calculated for each feature. A sequence of distinct subsets of the features, based on the ranks of the features, is generated. For each distinct subset of the sequence of distinct feature subsets, a fitness score is generated based on training a machine learning (ML) model that is configured for the distinct subset.
    Type: Grant
    Filed: May 20, 2019
    Date of Patent: January 3, 2023
    Assignee: Oracle International Corporation
    Inventors: Tomas Karnagel, Sam Idicula, Nipun Agarwal
  • Patent number: 11531915
    Abstract: Herein are techniques to generate candidate rulesets for machine learning (ML) explainability (MLX) for black-box ML models. In an embodiment, an ML model generates classifications that each associates a distinct example with a label. A decision tree that, based on the classifications, contains tree nodes is received or generated. Each node contains label(s), a condition that identifies a feature of examples, and a split value for the feature. When a node has child nodes, the feature and the split value that are identified by the condition of the node are set to maximize information gain of the child nodes. Candidate rules are generated by traversing the tree. Each rule is built from a combination of nodes in a tree traversal path. Each rule contains a condition of at least one node and is assigned to a rule level. Candidate rules are subsequently optimized into an optimal ruleset for actual use.
    Type: Grant
    Filed: March 20, 2019
    Date of Patent: December 20, 2022
    Assignee: Oracle International Corporation
    Inventors: Tayler Hetherington, Zahra Zohrevand, Onur Kocberber, Karoon Rashedi Nia, Sam Idicula, Nipun Agarwal
  • Publication number: 20220398629
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for recommending contextually relevant promotions to consumers in order to facilitate their discovery of promotions that they are likely to purchase from a promotion and marketing service.
    Type: Application
    Filed: May 2, 2022
    Publication date: December 15, 2022
    Inventors: Feili HOU, Vyomkesh TRIPATHI, Nipun AGARWAL, Rajesh Girish PAREKH
  • Publication number: 20220391935
    Abstract: A method, apparatus and computer program product are provided for generation of analytic data. An example embodiment includes a method for identifying trending items based on electronically generated velocity metrics derived from transaction data.
    Type: Application
    Filed: June 16, 2022
    Publication date: December 8, 2022
    Inventors: Nipun Agarwal, Rajesh Parekh
  • Patent number: 11520834
    Abstract: Techniques are described for generating an approximate frequency histogram using a series of Bloom filters (BF). For example, to estimate the f1 and f2 cardinalities in a dataset, an ordered chain of three BFs is established (“BF1”, “BF2”, and “BF3”). An insertion operation is performed for each datum in the dataset, whereby the BFs are tested in order (starting at BF1) for the datum. If the datum is represented in a currently-tested BF, the subsequent BF in the chain is tested for the datum. If the datum is not represented in the currently-tested BF, the datum is added to the BF, a counter for the BF is incremented, and the insertion operation for the current datum ends. To estimate the cardinality of f1-values in the dataset, the BF2-counter is subtracted from the BF1-counter. Similarly, to estimate the cardinality of f2-values in the dataset, the BF3-counter is subtracted from the BF2-counter.
    Type: Grant
    Filed: July 28, 2021
    Date of Patent: December 6, 2022
    Assignee: Oracle International Corporation
    Inventors: Tomas Karnagel, Suratna Budalakoti, Onur Kocberber, Nipun Agarwal, Alan Wood
  • Patent number: 11514697
    Abstract: Herein is a probabilistic indexing technique for searching semi-structured text documents in columnar storage formats such as Parquet, using columnar input/output (I/O) avoidance, and needing minimal storage overhead. In an embodiment, a computer associates columns with text strings that occur in semi-structured documents. Text words that occur in the text strings are detected. Respectively for each text word, a bitmap, of a plurality of bitmaps, that contains a respective bit for each column is generated. Based on at least one of the bitmaps, some of the columns or some of the semi-structured documents are accessed.
    Type: Grant
    Filed: July 15, 2020
    Date of Patent: November 29, 2022
    Assignee: Oracle International Corporation
    Inventors: Jian Wen, Hamed Ahmadi, Sanjay Jinturkar, Nipun Agarwal, Lijian Wan, Shrikumar Hariharasubrahmanian
  • Publication number: 20220377582
    Abstract: Systems, methods, and non-transitory computer-readable storage media are provided for predicting the likelihood or probability of a subscriber of a service to cancel or not renew a subscription. A method, according to one implementation, includes a step of receiving data pertaining to aspects of a service that is provided by a service provider to a subscriber in accordance with a subscription. The data may include one or more impact factors each having a positive, neutral, or negative influence on the likelihood of subscriber churn. The method also includes a step of using the one or more impact factors to predict the likelihood that the subscriber will cancel the subscription.
    Type: Application
    Filed: August 2, 2022
    Publication date: November 24, 2022
    Inventors: Yusuke Sakamoto, Muhammad Ali Valliani, Nipun Agarwal, Sachin Vasudeva
  • Publication number: 20220366297
    Abstract: In an embodiment, a computer hosts a machine learning (ML) model that infers a particular inference for a particular tuple that is based on many features. For each feature, and for each of many original tuples, the computer: a) randomly selects many perturbed values from original values of the feature in the original tuples, b) generates perturbed tuples that are based on the original tuple and a respective perturbed value, c) causes the ML model to infer a respective perturbed inference for each perturbed tuple, and d) measures a respective difference between each perturbed inference of the perturbed tuples and the particular inference. For each feature, a respective importance of the feature is calculated based on the differences measured for the feature. Feature importances may be used to rank features by influence and/or generate a local ML explainability (MLX) explanation.
    Type: Application
    Filed: May 13, 2021
    Publication date: November 17, 2022
    Inventors: Yasha Pushak, Zahra Zohrevand, Tayler Hetherington, Karoon Rashedi Nia, Sanjay Jinturkar, Nipun Agarwal
  • Patent number: 11494806
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for recommending contextually relevant promotions to consumers in order to facilitate their discovery of promotions that they are likely to purchase from a promotion and marketing service.
    Type: Grant
    Filed: June 29, 2020
    Date of Patent: November 8, 2022
    Assignee: Groupon, Inc.
    Inventors: Feili Hou, Vyomkesh Tripathi, Nipun Agarwal, Rajesh Girish Parekh
  • Patent number: 11488205
    Abstract: In general, embodiments of the present invention provide systems, methods and computer readable media for recommending contextually relevant promotions to consumers in order to facilitate their discovery of promotions that they are likely to purchase from a promotion and marketing service.
    Type: Grant
    Filed: April 22, 2015
    Date of Patent: November 1, 2022
    Assignee: Groupon, Inc.
    Inventors: Feili Hou, Vyomkesh Tripathi, Nipun Agarwal, Rajesh Girish Parekh
  • Publication number: 20220335255
    Abstract: In an embodiment, a computer assigns a respective probability distribution to each of many features that include a first feature and a second feature that are assigned different probability distributions. For each original tuple that are based on the features, a machine learning (ML) model infers a respective original inference. For each feature, and for each original tuple, the computer: a) generates perturbed values based on the probability distribution of the feature, b) generates perturbed tuples that are based on the original tuple and a respective perturbed value, c) causes the ML model to infer a respective perturbed inference for each perturbed tuple, and d) measures a respective difference between each perturbed inference and the original inference. A respective importance of each feature is calculated based on the differences measured for the feature. Feature importances may be used to rank features by influence and/or generate a global or local ML explainability (MLX) explanation.
    Type: Application
    Filed: April 16, 2021
    Publication date: October 20, 2022
    Inventors: ZAHRA ZOHREVAND, YASHA PUSHAK, TAYLER HETHERINGTON, KAROON RASHEDI NIA, SANJAY JINTURKAR, NIPUN AGARWAL