Patents by Inventor Madhav Datt

Madhav Datt has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Automatic generation of training and testing data for machine-learning models

Patent number: 12242469

Abstract: Provided are computing systems, methods, and platforms for generating training and testing data for machine-learning models. The operations can include receiving signal extraction information that has instructions to query a data store. Additionally, the operations can include accessing, using Structured Query Language (SQL) code generated based on the signal extraction information, raw data from the data store. Moreover, the operations can include processing the raw data using signal configuration information to generate a plurality of signals. The signal configuration information can have instructions on how to generate the plurality of signals from the raw data. Furthermore, the operations can include joining, using SQL code, the plurality of signals with a first label source to generate training data and testing data. Subsequently, the operations can include processing the training data and the testing data to generate the input data. The input data being an ingestible for a machine-learning pipeline.

Type: Grant

Filed: December 16, 2022

Date of Patent: March 4, 2025

Assignee: GOOGLE LLC

Inventors: Madhav Datt, Sukriti Ramesh
Automated, Constraints-Dependent Machine Learning Model Thresholding Mechanisms

Publication number: 20240232589

Abstract: Provided are computing systems, methods, and platforms for a discrete-valued output classification. The operations can include obtaining a candidate threshold value for a first slice in a plurality of data slices. Additionally, the operations can include calculating, using a candidate machine-learned model and the candidate threshold value, a first performance value associated with a first risk tolerance value. Moreover, the operations can include determining, based on the first performance value, that a safeguard criterion for the first slice has not been satisfied. In response to the determination that the safeguard criterion for the first slice has not been satisfied, the operations can include performing a tradeoff logic operation to determine the final threshold value. Subsequently, the operations can include determining, using the candidate machine-learned model, whether input data is authentic based on the final threshold value.

Type: Application

Filed: December 12, 2022

Publication date: July 11, 2024

Inventors: Madhav Datt, Surabhi Choudhary, Nikhil Shirish Ketkar
Automatic Generation of Training and Testing Data for Machine-Learning Models

Publication number: 20240232177

Abstract: Provided are computing systems, methods, and platforms for generating training and testing data for machine-learning models. The operations can include receiving signal extraction information that has instructions to query a data store. Additionally, the operations can include accessing, using Structured Query Language (SQL) code generated based on the signal extraction information, raw data from the data store. Moreover, the operations can include processing the raw data using signal configuration information to generate a plurality of signals. The signal configuration information can have instructions on how to generate the plurality of signals from the raw data. Furthermore, the operations can include joining, using SQL code, the plurality of signals with a first label source to generate training data and testing data. Subsequently, the operations can include processing the training data and the testing data to generate the input data. The input data being an ingestible for a machine-learning pipeline.

Type: Application

Filed: December 16, 2022

Publication date: July 11, 2024

Inventors: Madhav Datt, Sukriti Ramesh
Automated, Constraints-Dependent Machine Learning Model Thresholding Mechanisms

Publication number: 20240135152

Abstract: Provided are computing systems, methods, and platforms for a discrete-valued output classification. The operations can include obtaining a candidate threshold value for a first slice in a plurality of data slices. Additionally, the operations can include calculating, using a candidate machine-learned model and the candidate threshold value, a first performance value associated with a first risk tolerance value. Moreover, the operations can include determining, based on the first performance value, that a safeguard criterion for the first slice has not been satisfied. In response to the determination that the safeguard criterion for the first slice has not been satisfied, the operations can include performing a tradeoff logic operation to determine the final threshold value. Subsequently, the operations can include determining, using the candidate machine-learned model, whether input data is authentic based on the final threshold value.

Type: Application

Filed: December 12, 2022

Publication date: April 25, 2024

Inventors: Madhav Datt, Surabhi Choudhary, Nikhil Shirish Ketkar
Automatic Generation of Training and Testing Data for Machine-Learning Models

Publication number: 20240134846

Abstract: Provided are computing systems, methods, and platforms for generating training and testing data for machine-learning models. The operations can include receiving signal extraction information that has instructions to query a data store. Additionally, the operations can include accessing, using Structured Query Language (SQL) code generated based on the signal extraction information, raw data from the data store. Moreover, the operations can include processing the raw data using signal configuration information to generate a plurality of signals. The signal configuration information can have instructions on how to generate the plurality of signals from the raw data. Furthermore, the operations can include joining, using SQL code, the plurality of signals with a first label source to generate training data and testing data. Subsequently, the operations can include processing the training data and the testing data to generate the input data. The input data being an ingestible for a machine-learning pipeline.

Type: Application

Filed: December 16, 2022

Publication date: April 25, 2024

Inventors: Madhav Datt, Sukriti Ramesh
Model-Agnostic System for Automatic Investigation of the Impact of New Features on Performance of Machine Learning Models

Publication number: 20240104429

Abstract: Provided are computing systems, methods, and platforms that automatically investigate and analyze the impact of new features or signals on the performance of a machine learning model by producing a ranked list of the most impactful features from input of a set of candidate features. In particular, one example computing system can import a training dataset associated with a user. The computing system can train a machine learning model for the training dataset and generate baseline metrics for the machine learning model. Correlations between features or signals in the training dataset can be identified and the features or signals can be grouped into clusters based on the correlations. The computing system can determine the importance of each cluster and each feature or signal. A ranked list of signals and their importances can be exported in decreasing order of machine learning model performance lift based on cluster importance and signal importance.

Type: Application

Filed: December 13, 2022

Publication date: March 28, 2024

Inventors: Madhav Datt, Nikhil Shirish Ketkar, Arooshi Verma
Automated thresholding of binary classification ML models

Publication number: 20230267314

Abstract: Methods and systems are provided for generating, for respective mutually exclusive classes of model inputs, separate output thresholds that can be applied to the continuous-valued output of a neural network or other machine learning model in order to classify inputs in a class-sensitive manner. Such classes could be related to operational or other constraints with respect to the classifier outputs that vary across the classes of inputs. Thus, the machine learning model can be improved by using training data from all of the available classes while allowing the end performance of the model plus threshold classifier to be separately set for each input class. These automated methods for class-specific threshold setting also provide improvements with respect to accuracy, time, and cost. Also provided are methods and systems for per-class calibration of model outputs.

Type: Application

Filed: September 14, 2022

Publication date: August 24, 2023

Inventors: Madhav Datt, Prakhar Gupta

Automatic generation of training and testing data for machine-learning models

Automated, Constraints-Dependent Machine Learning Model Thresholding Mechanisms

Automatic Generation of Training and Testing Data for Machine-Learning Models

Automated, Constraints-Dependent Machine Learning Model Thresholding Mechanisms

Automatic Generation of Training and Testing Data for Machine-Learning Models

Model-Agnostic System for Automatic Investigation of the Impact of New Features on Performance of Machine Learning Models

Automated thresholding of binary classification ML models