Patents by Inventor Harsha Prasad Nori

Harsha Prasad Nori has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Remote validation of machine-learning models for data imbalance

Patent number: 11537941

Abstract: A method and system for providing data imbalance detection and validation for a trained a ML model includes receiving a request to perform data imbalance detection on the trained ML model, identifying a feature of a dataset associated with the trained model for which data imbalance detection is to be performed, receiving access to the dataset, receiving access to the trained ML model, examining at least one of the dataset or outcome data generated by the trained ML model to determine a distribution of the feature or a distribution of the outcome data, and determining if the trained ML model exhibits data imbalance based at least in part on the distribution of the feature or the distribution of the outcome data.

Type: Grant

Filed: May 28, 2019

Date of Patent: December 27, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Christopher Lee Weider, Ruth Kikin-Gil, Harsha Prasad Nori
Method and system of performing data imbalance detection and correction in training a machine-learning model

Patent number: 11526701

Abstract: A method and system for performing semi or fully automatic data imbalance detection and correction in training a machine-learning (ML) model includes receiving a request to train the ML model, receiving access to a dataset for use in training the ML model, identifying a feature of the dataset for which data imbalance detection is to be performed, examining the dataset to determine a distribution of the feature across the dataset, determining if the distribution of the feature across the dataset indicates data imbalance, upon determining that the distribution of the feature across the dataset indicates data imbalance, identifying a desired distribution for the identified feature, selecting a subset of the dataset that corresponds with the selected feature and the desired distribution, and using the subset to train the ML model.

Type: Grant

Filed: May 28, 2019

Date of Patent: December 13, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Christopher Lee Weider, Ruth Kikin-Gil, Harsha Prasad Nori
Method and system of detecting data imbalance in a dataset used in machine-learning

Patent number: 11521115

Abstract: A method and system for detecting imbalanced distribution of data that may signal bias in a dataset associated with training a machine-learning (ML) model includes receiving a request to perform data imbalance detection on the dataset associated with training the machine-learning (ML) model, identifying a feature of the dataset for which data imbalance detection is to be performed and examining the dataset to determine a distribution of the feature across the dataset. The result of the determination may then be presented in a user interface element to help identify data imbalance in the dataset.

Type: Grant

Filed: May 28, 2019

Date of Patent: December 6, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Christopher Lee Weider, Ruth Kikin-Gil, Harsha Prasad Nori
Remote testing analysis for software optimization based on client-side local differential privacy-based data

Patent number: 10902149

Abstract: Methods, systems, apparatuses, and computer-readable storage medium are described herein for remotely analyzing testing results based on LDP-based data obtained from client devices in order to determine an effect of a software application with respect to its features and/or the population in which the application is tested. The analysis is based on a series of statistical computations for conducting hypothesis tests to compare population means, while ensuring LDP for each user. For example, an LDP scheme is used on the client-side that privatizes a measured value corresponding to a usage of a resource of the client. A data collector receives the privatized data from two sets of populations. Each population's clients have a software application that may differ in terms of features or user group. The privatized data received from each population is analyzed to determine an effect of the difference between the software applications of the different populations.

Type: Grant

Filed: March 22, 2018

Date of Patent: January 26, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Bolin Ding, Harsha Prasad Nori, Paul Luo Li, Joshua Stanley Allen
Method and System of Performing Data Imbalance Detection and Correction in Training a Machine-Learning Model

Publication number: 20200380310

Abstract: A method and system for performing semi or fully automatic data imbalance detection and correction in training a machine-learning (ML) model includes receiving a request to train the ML model, receiving access to a dataset for use in training the ML model, identifying a feature of the dataset for which data imbalance detection is to be performed, examining the dataset to determine a distribution of the feature across the dataset, determining if the distribution of the feature across the dataset indicates data imbalance, upon determining that the distribution of the feature across the dataset indicates data imbalance, identifying a desired distribution for the identified feature, selecting a subset of the dataset that corresponds with the selected feature and the desired distribution, and using the subset to train the ML model.

Type: Application

Filed: May 28, 2019

Publication date: December 3, 2020

Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Christopher Lee WEIDER, Ruth KIKIN-GIL, Harsha Prasad NORI
Method and System of Correcting Data Imbalance in a Dataset Used in Machine-Learning

Publication number: 20200380309

Abstract: A method and system for correcting imbalanced distribution of data that may signal bias in a dataset associated with training a machine-learning (ML) model includes receiving a request to perform a data imbalance correction on a dataset associated with training a machine-learning (ML) model, identifying a feature of the dataset for which data imbalance correction is to be performed, identifying a desired distribution for the identified feature, selecting a subset of the dataset that corresponds with the selected feature and the desired distribution, and using the subset to train a ML model.

Type: Application

Filed: May 28, 2019

Publication date: December 3, 2020

Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Christopher Lee WEIDER, Ruth KIKIN-GIL, Harsha Prasad NORI
Method and System of Detecting Data Imbalance in a Dataset Used in Machine-Learning

Publication number: 20200380399

Abstract: A method and system for detecting imbalanced distribution of data that may signal bias in a dataset associated with training a machine-learning (ML) model includes receiving a request to perform data imbalance detection on the dataset associated with training the machine-learning (ML) model, identifying a feature of the dataset for which data imbalance detection is to be performed and examining the dataset to determine a distribution of the feature across the dataset. The result of the determination may then be presented in a user interface element to help identify data imbalance in the dataset.

Type: Application

Filed: May 28, 2019

Publication date: December 3, 2020

Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Christopher Lee WEIDER, Ruth KIKIN-GIL, Harsha Prasad NORI
Remote Validation of Machine-Learning Models for Data Imbalance

Publication number: 20200380398

Abstract: A method and system for providing data imbalance detection and validation for a trained a ML model includes receiving a request to perform data imbalance detection on the trained ML model, identifying a feature of a dataset associated with the trained model for which data imbalance detection is to be performed, receiving access to the dataset, receiving access to the trained ML model, examining at least one of the dataset or outcome data generated by the trained ML model to determine a distribution of the feature or a distribution of the outcome data, and determining if the trained ML model exhibits data imbalance based at least in part on the distribution of the feature or the distribution of the outcome data.

Type: Application

Filed: May 28, 2019

Publication date: December 3, 2020

Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Christopher Lee WEIDER, Ruth KIKIN-GIL, Harsha Prasad NORI
REMOTE TESTING ANALYSIS FOR SOFTWARE OPTIMIZATION BASED ON CLIENT-SIDE LOCAL DIFFERENTIAL PRIVACY-BASED DATA

Publication number: 20190236306

Abstract: Methods, systems, apparatuses, and computer-readable storage medium are described herein for remotely analyzing testing results based on LDP-based data obtained from client devices in order to determine an effect of a software application with respect to its features and/or the population in which the application is tested. The analysis is based on a series of statistical computations for conducting hypothesis tests to compare population means, while ensuring LDP for each user. For example, an LDP scheme is used on the client-side that privatizes a measured value corresponding to a usage of a resource of the client. A data collector receives the privatized data from two sets of populations. Each population's clients have a software application that may differ in terms of features or user group. The privatized data received from each population is analyzed to determine an effect of the difference between the software applications of the different populations.

Type: Application

Filed: March 22, 2018

Publication date: August 1, 2019

Inventors: Bolin Ding, Harsha Prasad Nori, Paul Luo Li, Joshua Stanley Allen