Patents by Inventor Nipun Agarwal

Nipun Agarwal has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Methods for substituting a semi-join operator with alternative execution strategies and selection methods to choose the most efficient solution under different plans

Patent number: 11468064

Abstract: The manner in which tables are joined can affect the outcome of the query and database performance. Example types of join operations include semi-join and inner-join. The techniques described herein are approaches that may be used to substitute a semi-join operator with an inner-join operator and may be used to transform and optimize representations of queries.

Type: Grant

Filed: April 25, 2019

Date of Patent: October 11, 2022

Assignee: Oracle International Corporation

Inventors: Pit Fender, Benjamin Schlegel, Matthias Brantner, Cagri Balkesen, Nipun Agarwal
SPARSE ENSEMBLING OF UNSUPERVISED MODELS

Publication number: 20220318684

Abstract: Techniques are provided for sparse ensembling of unsupervised machine learning models. In an embodiment, the proposed architecture is composed of multiple unsupervised machine learning models that each produce a score as output and a gating network that analyzes the inputs and outputs of the unsupervised machine learning models to select an optimal ensemble of unsupervised machine learning models. The gating network is trained to choose a minimal number of the multiple unsupervised machine learning models whose scores are combined to create a final score that matches or closely resembles a final score that is computed using all the scores of the multiple unsupervised machine learning models.

Type: Application

Filed: April 2, 2021

Publication date: October 6, 2022

Inventors: SAEID ALLAHDADIAN, AMIN SUZANI, MILOS VASIC, MATTEO CASSERINI, ANDREW BROWNSWORD, FELIX SCHMIDT, NIPUN AGARWAL
EFFICIENT AND ACCURATE REGIONAL EXPLANATION TECHNIQUE FOR NLP MODELS

Publication number: 20220309360

Abstract: Herein are techniques for topic modeling and content perturbation that provide machine learning (ML) explainability (MLX) for natural language processing (NLP). A computer hosts an ML model that infers an original inference for each of many text documents that contain many distinct terms. To each text document (TD) is assigned, based on terms in the TD, a topic that contains a subset of the distinct terms. In a perturbed copy of each TD, a perturbed subset of the distinct terms is replaced. For the perturbed copy of each TD, the ML model infers a perturbed inference. For TDs of a topic, the computer detects that a difference between original inferences of the TDs of the topic and perturbed inferences of the TDs of the topic exceeds a threshold. Based on terms in the TDs of the topic, the topic is replaced with multiple, finer-grained new topics. After sufficient topic modeling, a regional explanation of the ML model is generated.

Type: Application

Filed: March 25, 2021

Publication date: September 29, 2022

Inventors: Zahra Zohrevand, Tayler Hetherington, Karoon Rashedi Nia, Yasha Pushak, Sanjay Jinturkar, Nipun Agarwal
High availability and automated recovery in scale-out distributed database system

Patent number: 11455219

Abstract: Herein are acceleration techniques for resuming offloaded execution by replacing a failed computer with a hot spare computer. In an embodiment, a distributed system configures a DBMS, a set of participating computers, and a set of spare computers. The DBMS receives a query of a database. From the query, an offload query plan is generated for distributed execution. The DBMS sends the offload query plan and a respective portion of the database to each participating computer. The distributed system detects that a participating computer failed after the offload query plan was sent. Responsively, the DBMS sends the same offload query plan and same respective portion of the database of the failed computer to a replacement computer from the spare computers. Despite the computer failure, the DBMS receives results of successful distributed execution of the offload query plan that include a result from the replacement computer.

Type: Grant

Filed: October 22, 2020

Date of Patent: September 27, 2022

Assignee: Oracle International Corporation

Inventors: Krishna Kantikiran Pasupuleti, Boris Klots, Vijayakrishnan Nagarajan, Anantha Kiran Kandukuri, Nipun Agarwal
Anomaly detection in SS7 control network using reconstructive neural networks

Patent number: 11451670

Abstract: Herein are machine learning (ML) techniques for unsupervised training with a corpus of signaling system 7 (SS7) messages having a diversity of called and calling parties, operation codes (opcodes) and transaction types, numbering plans and nature of address indicators, and mobile country codes and network codes. In an embodiment, a computer stores SS7 messages that are not labeled as anomalous or non-anomalous. Each SS7 message contains an opcode and other fields. For each SS7 message, the opcode of the SS7 message is stored into a respective feature vector (FV) of many FVs that are based on respective unlabeled SS7 messages. The FVs contain many distinct opcodes. Based on the FVs that contain many distinct opcodes and that are based on respective unlabeled SS7 messages, an ML model such as a reconstructive model such as an autoencoder is unsupervised trained to detect an anomalous SS7 message.

Type: Grant

Filed: December 16, 2020

Date of Patent: September 20, 2022

Assignee: Oracle International Corporation

Inventors: Hamed Ahmadi, Ali Moharrer, Venkatanathan Varadarajan, Vaseem Akram, Nishesh Rai, Reema Hingorani, Sanjay Jinturkar, Nipun Agarwal
Kernel subsampling for an accelerated tree similarity computation

Patent number: 11449517

Abstract: Approaches herein relate to machine learning for detection of anomalous logic syntax. Herein is acceleration for comparison of parse trees such as suspicious database queries. In an embodiment, a computer identifies subtrees in each of many trees. A respective subset of participating subtrees is selected in each tree. A respective root node of each participating subtree should directly have a child node that is a leaf and/or should have a degree that exceeds a branching threshold such as one. For each pairing of a respective first tree with a respective second tree, based on a count of subtree matches between the participating subset of subtrees in the first tree and the participating subset of subtrees in the second tree, a respective tree similarity score is calculated. A machine learning model inferences based on the tree similarity scores of the many trees. In an embodiment, each tree similarity score is a convolution kernel.

Type: Grant

Filed: December 22, 2020

Date of Patent: September 20, 2022

Assignee: Oracle International Corporation

Inventors: Arno Schneuwly, Nikola Milojkovic, Felix Schmidt, Nipun Agarwal
MACHINE LEARNING-BASED DNS REQUEST STRING REPRESENTATION WITH HASH REPLACEMENT

Publication number: 20220294757

Abstract: Techniques are described herein for using machine learning to learn vector representations of DNS requests such that the resulting embeddings represent the semantics of the DNS requests as a whole. Techniques described herein perform pre-processing of tokenized DNS request strings in which hashes, which are long and relatively random strings of characters, are detected in DNS request strings and each detected hash token is replaced with a placeholder token. A vectorizing ML model is trained using the pre-processed training dataset in which hash tokens have been replaced. Embeddings for the DNS tokens are derived from an intermediate layer of the vectorizing ML model. The encoding application creates final vector representations for each DNS request string by generating a weighted summation of the embeddings of all of the tokens in the DNS request string. Because of hash replacement, the resulting DNS request embeddings reflect semantics of the hashes as a group.

Type: Application

Filed: March 10, 2021

Publication date: September 15, 2022

Inventors: Renata Khasanova, Felix Schmidt, Stuart Wray, Craig Schelp, Nipun Agarwal, Matteo Casserini
MULTI-STAGE FEATURE EXTRACTION FOR EFFECTIVE ML-BASED ANOMALY DETECTION ON STRUCTURED LOG DATA

Publication number: 20220292304

Abstract: Herein are feature extraction mechanisms that receive parsed log messages as inputs and transform them into numerical feature vectors for machine learning models (MLMs). In an embodiment, a computer extracts fields from a log message. Each field specifies a name, a text value, and a type. For each field, a field transformer for the field is dynamically selected based the field's name and/or the field's type. The field transformer converts the field's text value into a value of the field's type. A feature encoder for the value of the field's type is dynamically selected based on the field's type and/or a range of the field's values that occur in a training corpus of an MLM. From the feature encoder, an encoding of the value of the field's typed is stored into a feature vector. Based on the MLM and the feature vector, the log message is detected as anomalous or not.

Type: Application

Filed: March 12, 2021

Publication date: September 15, 2022

Inventors: AMIN SUZANI, SAEID ALLAHDADIAN, MILOS VASIC, MATTEO CASSERINI, HAMED AHMADI, FELIX SCHMIDT, ANDREW BROWNSWORD, NIPUN AGARWAL
Detection of parallel operating Wi-Fi networks

Publication number: 20220279429

Abstract: Systems and methods of detecting a parallel Wi-Fi network or a parallel Wi-Fi access point include operating a new Wi-Fi network at a location; analyzing Wi-Fi at the location; determining whether there is a parallel Wi-Fi network or a parallel Wi-Fi access point operating at the location with the new Wi-Fi network based on the analyzing; and, responsive to determining there is the parallel Wi-Fi network or the parallel Wi-Fi access point at the location, performing one or more of i) causing resolution of the parallel Wi-Fi network or the parallel Wi-Fi access point and ii) providing a notification to a user associated with the location.

Type: Application

Filed: May 16, 2022

Publication date: September 1, 2022

Inventors: Nipun Agarwal, William J. McFarland, Yoseph Malkin, Na Hyun Ha, Adam R. Hotchkiss, Sandeep Eyyuni
Predicting machine learning or deep learning model training time

Patent number: 11429895

Abstract: Herein are techniques for exploring hyperparameters of a machine learning model (MLM) and to train a regressor to predict a time needed to train the MLM based on a hyperparameter configuration and a dataset. In an embodiment that is deployed in production inferencing mode, for each landmark configuration, each containing values for hyperparameters of a MLM, a computer configures the MLM based on the landmark configuration and measures time spent training the MLM on a dataset. An already trained regressor predicts time needed to train the MLM based on a proposed configuration of the MLM, dataset meta-feature values, and training durations and hyperparameter values of landmark configurations of the MLM. When instead in training mode, a regressor in training ingests a training corpus of MLM performance history to learn, by reinforcement, to predict a training time for the MLM for new datasets and/or new hyperparameter configurations.

Type: Grant

Filed: April 15, 2019

Date of Patent: August 30, 2022

Assignee: Oracle International Corporation

Inventors: Anatoly Yakovlev, Venkatanathan Varadarajan, Sandeep Agrawal, Hesam Fathi Moghadam, Sam Idicula, Nipun Agarwal
Hybrid declarative query compiler and optimizer framework

Patent number: 11423022

Abstract: Techniques are described herein for building a framework for declarative query compilation using both rule-based and cost-based approaches for database management. The framework involves constructing and using: a set of rule-based properties tables that contain optimization parameters for both logical and physical optimization, a recursive algorithm to form candidate physical query plans that is based on the rule based tables, and a cost model for estimating the cost of a generated physical query plan that is used with the rule based properties tables to prune inferior query plans.

Type: Grant

Filed: June 25, 2018

Date of Patent: August 23, 2022

Assignee: Oracle International Corporation

Inventors: Jian Wen, Sam Idicula, Nitin Kunal, Farhan Tauheed, Seema Sundara, Nipun Agarwal, Indu Bhagat
Out of band server utilization estimation and server workload characterization for datacenter resource optimization and forecasting

Patent number: 11423327

Abstract: Techniques are described herein for estimating CPU, memory, and I/O utilization for a workload via out-of-band sensor readings using a machine learning model. The framework involves receiving sensor data associated with executing benchmark applications, obtaining ground truth utilization values for the benchmarks, preprocessing the training data to select a set of enhanced sequences, and using the enhanced sequences to train a random forest model to estimate CPU, memory, and I/O utilization given sensor monitoring data. Prior to the training phase, a machine learning model is trained using a set of predefined hyper-parameters. The trained models are used to generate estimations for CPU, memory, and I/O utilizations values. The utilization values are used with workload context information to assess the deployment and generate one or more recommendations for machine types that will best serve the workload in terms of system utilization.

Type: Grant

Filed: October 10, 2018

Date of Patent: August 23, 2022

Assignee: Oracle International Corporation

Inventors: Onur Kocberber, Felix Schmidt, Craig Schelp, Andrew Brownsword, Nipun Agarwal
FAST, APPROXIMATE CONDITIONAL DISTRIBUTION SAMPLING

Publication number: 20220261400

Abstract: Techniques are described for fast approximate conditional sampling by randomly sampling a dataset and then performing a nearest neighbor search on the pre-sampled dataset to reduce the data over which the nearest neighbor search must be performed and, according to an embodiment, to effectively reduce the number of nearest neighbors that are to be found within the random sample. Furthermore, KD-Tree-based stratified sampling is used to generate a representative sample of a dataset. KD-Tree-based stratified sampling may be used to identify the random sample for fast approximate conditional sampling, which reduces variance in the resulting data sample. As such, using KD-Tree-based stratified sampling to generate the random sample for fast approximate conditional sampling ensures that any nearest neighbor selected, for a target data instance, from the random sample is likely to be among the nearest neighbors of the target data instance within the unsampled dataset.

Type: Application

Filed: February 18, 2021

Publication date: August 18, 2022

Inventors: Yasha Pushak, Tayler Hetherington, Karoon Rashedi Nia, Zahra Zohrevand, Sanjay Jinturkar, Nipun Agarwal
EXTRACTION FROM TREES AT SCALE

Publication number: 20220261228

Abstract: Herein are machine learning (ML) feature processing and analytic techniques to detect anomalies in parse trees of logic statements, database queries, logic scripts, compilation units of general-purpose programing language, extensible markup language (XML), JavaScript object notation (JSON), and document object models (DOM). In an embodiment, a computer identifies an operational trace that contains multiple parse trees. Values of explicit features are generated from a single respective parse tree of the multiple parse trees of the operational trace. Values of implicit features are generated from more than one respective parse tree of the multiple parse trees of the operational trace. The explicit and implicit features are stored into a same feature vector. With the feature vector as input, an ML model detects whether or not the operational trace is anomalous, based on the explicit features of each parse tree of the operational trace and the implicit features of multiple parse trees of the operational trace.

Type: Application

Filed: February 12, 2021

Publication date: August 18, 2022

Inventors: Arno Schneuwly, Nikola Milojkovic, Felix Schmidt, Nipun Agarwal
Method and apparatus for identifying trending promotions

Patent number: 11403656

Abstract: A method, apparatus and computer program product are provided for generation of analytic data. An example embodiment includes a method for identifying trending items based on electronically generated velocity metrics derived from transaction data.

Type: Grant

Filed: August 28, 2015

Date of Patent: August 2, 2022

Assignee: GROUPON, INC.

Inventors: Nipun Agarwal, Rajesh Parekh
METHOD AND APPARATUS FOR PROVIDING PROMOTION RECOMMENDATIONS

Publication number: 20220237654

Abstract: The present disclosure relates to methods, systems, and apparatuses for providing electronic communications to client devices based on clustering and filtering candidates for inclusion in the electronic communications based on programmatically generated correlation metrics and thresholds associated with the clustering.

Type: Application

Filed: April 11, 2022

Publication date: July 28, 2022

Inventors: Nipun Agarwal, Rajesh Girish Parekh, Ying Chen
GLOBAL, MODEL-AGNOSTIC MACHINE LEARNING EXPLANATION TECHNIQUE FOR TEXTUAL DATA

Publication number: 20220229983

Abstract: A model-agnostic global explainer for textual data processing (NLP) machine learning (ML) models, “NLP-MLX”, is described herein. NLP-MLX explains global behavior of arbitrary NLP ML models by identifying globally-important tokens within a textual dataset containing text data. NLP-MLX accommodates any arbitrary combination of training dataset pre-processing operations used by the NLP ML model. NLP-MLX includes four main stages. A Text Analysis stage converts text in documents of a target dataset into tokens. A Token Extraction stage uses pre-processing techniques to efficiently pre-filter the complete list of tokens into a smaller set of candidate important tokens. A Perturbation Generation stage perturbs tokens within documents of the dataset to help evaluate the effect of different tokens, and combinations of tokens, on the model's predictions.

Type: Application

Filed: January 11, 2021

Publication date: July 21, 2022

Inventors: Zahra Zohrevand, Tayler Hetherington, Karoon Rashedi Nia, Yasha Pushak, Sanjay Jinturkar, Nipun Agarwal
METHOD AND APPARATUS FOR PROVIDING PROMOTION RECOMMENDATIONS

Publication number: 20220215430

Abstract: The present disclosure relates to methods, systems, and apparatuses for providing promotion recommendations using a promotion and marketing service. Some aspects may provide a method for providing a promotion recommendation framework. The method includes receiving, via a network interface, a promotion recommendation inquiry from a component of a promotion and marketing service, the promotion recommendation inquiry including electronic identification data identifying at least one of a consumer or a consumer characteristic. The method also includes identifying, via processing circuitry, promotion transaction information associated with the electronic identification data. The promotion transaction information includes electronic data identifying at least one transaction performed using the promotion and marketing service.

Type: Application

Filed: November 11, 2021

Publication date: July 7, 2022

Inventors: Nipun Agarwal, Rajesh Girish Parekh, Ying Chen
Intelligent monitoring systems and methods for Wi-Fi Metric-Based ISP Outage Detection for Cloud Based Wi-Fi Networks

Publication number: 20220217550

Abstract: System and methods include obtaining data, over the Internet, associated with a plurality of Wi-Fi networks each Wi-Fi network having one or more access points and each Wi-Fi network being associated with a customer of one or more service providers; aggregating and filtering the data; analyzing the aggregated and filtered data for the network condition of each of the plurality of customers of one or more service providers; determining an internet service provider (ISP) outage based on a plurality of factors; and performing one of a plurality of resolution or notification actions.

Type: Application

Filed: March 22, 2022

Publication date: July 7, 2022

Inventors: Yusuke Sakamoto, Yoseph Malkin, Sachin Vasudeva, Nipun Agarwal
Efficient adjustment of spin-locking parameter values

Patent number: 11379456

Abstract: Systems and methods for adjusting parameters for a spin-lock implementation of concurrency control are described herein. In an embodiment, a system continuously retrieves, from a resource management system, one or more state values defining a state of the resource management system. Based on the one or more state values, the system determines that the resource management system has reached a steady state and, in response adjusts a plurality of parameters for spin-locking performed by said resource management system to identify optimal values for the plurality of parameters. After adjusting the plurality of parameters, the system detects, based on one or more current state values, a workload change in the resource management system and, in response, readjusts the plurality of parameters for spin-locking performed by said resource management system to identify new optimal values for the parameters.

Type: Grant

Filed: October 1, 2020

Date of Patent: July 5, 2022

Assignee: ORACLE INTERNATIONAL CORPORATION

Inventors: Onur Kocberber, Mayur Bency, Marc Jolles, Seema Sundara, Nipun Agarwal

prev 1 2 3 4 5 6 7 8 9 … next