Patents by Inventor Sandeep R. Agrawal

Sandeep R. Agrawal has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

AUTOMATED SELECTION OF EMBEDDING AND GENERATIVE MODELS WITH VECTOR STORE

Publication number: 20250094777

Abstract: The present disclosure relates to LLM orchestration with vector store generation. An embeddings model may be selected to generate an embedding for a digital artifact. Metadata for the digital artifact may also be generated and stored in a vector store in association with the embedding. A user query may be received and categorized. One of a plurality of machine learning models may be selected based on the categorization of the user query. A prompt may be generated based at least in part on the user query, and the selected machine learning model may generate a response to the user query based at least in part on the prompt.

Type: Application

Filed: August 30, 2024

Publication date: March 20, 2025

Inventors: Anatoly Yakovlev, Sandeep R. Agrawal, Karoon Rashedi Nia, Ridha Chahed, Sanjay Jinturkar, Nipun Agarwal
PRIVACY-PROTECTIVE KNOWLEDGE SHARING USING A HIERARCHICAL VECTOR STORE

Publication number: 20250094787

Abstract: Disclosed herein are various approaches for sharing knowledge within and between organizations while protecting sensitive data. A machine learning model may be trained using training prompts querying a vector store to prevent unauthorized user disclosure of data derived from the vector store. A prompt may be received and a response to the prompt may be generated using the machine learning model based at least in part on the vector store.

Type: Application

Filed: August 19, 2024

Publication date: March 20, 2025

Inventors: Karoon Rashedi Nia, Anatoly Yakovlev, Sandeep R. Agrawal, Ridha Chahed, Sanjay Jinturkar, Nipun Agarwal
TUNING-FREE UNSUPERVISED ANOMALY DETECTION BASED ON DISTANCE TO NEAREST NORMAL POINT

Publication number: 20240281455

Abstract: Disclosed is an improved approach to implement anomaly detection, where an ensemble detection mechanism is provided. An improvement is provided for the KNN algorithm where scaling is applied to permit efficient detection of multiple categories of anomalies. Further extensions are used to optimize local anomaly detection.

Type: Application

Filed: February 16, 2024

Publication date: August 22, 2024

Applicant: Oracle International Corporation

Inventors: Youssef Mohamed Saied, Mohamed Ridha Chahed, Anatoly Yakovlev, Sandeep R. Agrawal, Sanjay Jinturkar, Nipun Agarwal
ADAPTIVE SAMPLING TO COMPUTE GLOBAL FEATURE EXPLANATIONS WITH SHAPLEY VALUES

Publication number: 20240086763

Abstract: Techniques for computing global feature explanations using adaptive sampling are provided. In one technique, first and second samples from an dataset are identified. A first set of feature importance values (FIVs) is generated based on the first sample and a machine-learned model. A second set of FIVs is generated based on the second sample and the model. If a result of a comparison between the first and second FIV sets does not satisfy criteria, then: (i) an aggregated set is generated based on the last two FIV sets; (ii) a new sample that is double the size of a previous sample is identified from the dataset; (iii) a current FIV set is generated based on the new sample and the model; (iv) determine whether a result of a comparison between the current and aggregated FIV sets satisfies criteria; repeating (i)-(iv) until the result of the last comparison satisfies the criteria.

Type: Application

Filed: September 14, 2022

Publication date: March 14, 2024

Inventors: Jeremy Plassmann, Anatoly Yakovlev, Sandeep R. Agrawal, Ali Moharrer, Sanjay Jinturkar, Nipun Agarwal
ONE-PASS APPROACH TO AUTOMATED TIMESERIES FORECASTING

Publication number: 20230153394

Abstract: Herein are timeseries preprocessing, model selection, and hyperparameter tuning techniques for forecasting development based on temporal statistics of a timeseries and a single feed-forward pass through a machine learning (ML) pipeline. In an embodiment, a computer hosts and operates the ML pipeline that automatically measures temporal statistic(s) of a timeseries. ML algorithm selection, cross validation, and hyperparameters tuning is based on the temporal statistics of the timeseries. The result from the ML pipeline is a rigorously trained and production ready ML model that is validated to have increased accuracy for multiple prediction horizons. Based on the temporal statistics, efficiency is achieved by asymmetry of investment of computer resources in the tuning and training of the most promising ML algorithm(s). Compared to other approaches, this ML pipeline produces a more accurate ML model for a given amount of computer resources and consumes fewer computer resources to achieve a given accuracy.

Type: Application

Filed: November 17, 2021

Publication date: May 18, 2023

Inventors: Ritesh Ahuja, Anatoly Yakovlev, Venkatanathan Varadarajan, Sandeep R. Agrawal, Hesam Fathi Moghadam, Sanjay Jinturkar, Nipun Agarwal
MEMORY USAGE PREDICTION FOR MACHINE LEARNING AND DEEP LEARNING MODELS

Publication number: 20220043681

Abstract: Herein, a computer receives a new training dataset for a target ML model. Proven or unproven respective values of hyperparameters of the target ML model are selected. An already-trained ML metamodel predicts an amount of memory that the target ML model will need, when configured with the respective values of the hyperparameters, to train with the new training dataset. In an embodiment, supervised training of the ML metamodel is as follows. The ML metamodel receives feature vectors that each contains distinct details of a respective past training of the target ML model of many and varied trainings of the target ML model. Those distinct details of each past training includes: respective values of the hyperparameters, and respective values of metafeatures of a respective training dataset of many training datasets. Each feature vector is labeled with a respective amount of memory that the target ML model needed during the respective past training.

Type: Application

Filed: August 4, 2020

Publication date: February 10, 2022

Inventors: Ali Moharrer, Sandeep R. Agrawal, Venkatanathan Varadarajan, Sanjay Jinturkar, Nipun Agarwal
FAST, PREDICTIVE, AND ITERATION-FREE AUTOMATED MACHINE LEARNING PIPELINE

Publication number: 20210390466

Abstract: A proxy-based automatic non-iterative machine learning (PANI-ML) pipeline is described, which predicts machine learning model configuration performance and outputs an automatically-configured machine learning model for a target training dataset. Techniques described herein use one or more proxy models—which implement a variety of machine learning algorithms and are pre-configured with tuned hyperparameters—to estimate relative performance of machine learning model configuration parameters at various stages of the PANI-ML pipeline. The PANI-ML pipeline implements a radically new approach of rapidly narrowing the search space for machine learning model configuration parameters by performing algorithm selection followed by algorithm-specific adaptive data reduction (i.e., row- and/or feature-wise dataset sampling), and then hyperparameter tuning.

Type: Application

Filed: October 30, 2020

Publication date: December 16, 2021

Inventors: Venkatanathan Varadarajan, Sandeep R. Agrawal, Hesam Fathi Moghadam, Anatoly Yakovlev, Ali Moharrer, Jingxiao Cai, Sanjay Jinturkar, Nipun Agarwal, Sam Idicula, Nikan Chavoshi
Matrix multiplication at memory bandwidth

Patent number: 10521225

Abstract: Techniques related to matrix multiplication at memory bandwidth are disclosed. Computing device(s) perform multiplication of a first matrix with a second matrix to generate a third matrix. A first register stores contiguous element values of the first matrix. Furthermore, a second register stores a first set of contiguous element values of the second matrix, and a third register stores a second set of contiguous element values of the second matrix. The first set and the second set correspond to a first row and a second row, respectively, of the second matrix. The first row and the second row are contiguous rows. A single instruction is executed to cause at least a partial computation of contiguous element values of the third matrix. The single instruction causes multiplication of element values stored in the first register with element values stored in the second and third registers and grouped accumulation of the products.

Type: Grant

Filed: June 29, 2017

Date of Patent: December 31, 2019

Assignee: Oracle International Corporation

Inventors: Arun Raghavan, Sandeep R. Agrawal, Sam Idicula, Nipun Agarwal
Memory management for sparse matrix multiplication

Patent number: 10452744

Abstract: Techniques related to memory management for sparse matrix multiplication are disclosed. Computing device(s) may perform a method for multiplying a row of a first sparse matrix with a second sparse matrix to generate a product matrix row. A compressed representation of the second sparse matrix is stored in main memory. The compressed representation comprises a values array that stores non-zero value(s). Tile(s) corresponding to row(s) of second sparse matrix are loaded into scratchpad memory. The tile(s) comprise set(s) of non-zero value(s) of the values array. A particular partition of an uncompressed representation of the product matrix row is generated in the scratchpad memory. The particular partition corresponds to a partition of the second sparse matrix comprising non-zero value(s) included in the tile(s). When a particular tile is determined to comprise non-zero value(s) that are required to generate the particular partition, the particular tile is loaded into the scratchpad memory.

Type: Grant

Filed: March 27, 2017

Date of Patent: October 22, 2019

Assignee: Oracle International Corporation

Inventors: Sandeep R. Agrawal, Sam Idicula, Nipun Agarwal
MATRIX MULTIPLICATION AT MEMORY BANDWIDTH

Publication number: 20190004794

Abstract: Techniques related to matrix multiplication at memory bandwidth are disclosed. Computing device(s) perform multiplication of a first matrix with a second matrix to generate a third matrix. A first register stores contiguous element values of the first matrix. Furthermore, a second register stores a first set of contiguous element values of the second matrix, and a third register stores a second set of contiguous element values of the second matrix. The first set and the second set correspond to a first row and a second row, respectively, of the second matrix. The first row and the second row are contiguous rows. A single instruction is executed to cause at least a partial computation of contiguous element values of the third matrix. The single instruction causes multiplication of element values stored in the first register with element values stored in the second and third registers and grouped accumulation of the products.

Type: Application

Filed: June 29, 2017

Publication date: January 3, 2019

Inventors: Arun Raghavan, Sandeep R. Agrawal, Sam Idicula, Nipun Agarwal
MEMORY MANAGEMENT FOR SPARSE MATRIX MULTIPLICATION

Publication number: 20180275909

Abstract: Techniques related to memory management for sparse matrix multiplication are disclosed. Computing device(s) may perform a method for multiplying a row of a first sparse matrix with a second sparse matrix to generate a product matrix row. A compressed representation of the second sparse matrix is stored in main memory. The compressed representation comprises a values array that stores non-zero value(s). Tile(s) corresponding to row(s) of second sparse matrix are loaded into scratchpad memory. The tile(s) comprise set(s) of non-zero value(s) of the values array. A particular partition of an uncompressed representation of the product matrix row is generated in the scratchpad memory. The particular partition corresponds to a partition of the second sparse matrix comprising non-zero value(s) included in the tile(s). When a particular tile is determined to comprise non-zero value(s) that are required to generate the particular partition, the particular tile is loaded into the scratchpad memory.

Type: Application

Filed: March 27, 2017

Publication date: September 27, 2018

Inventors: Sandeep R. Agrawal, Sam Idicula, Nipun Agarwal