Patents by Inventor Harsha Prasad Nori

Harsha Prasad Nori has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11537941
    Abstract: A method and system for providing data imbalance detection and validation for a trained a ML model includes receiving a request to perform data imbalance detection on the trained ML model, identifying a feature of a dataset associated with the trained model for which data imbalance detection is to be performed, receiving access to the dataset, receiving access to the trained ML model, examining at least one of the dataset or outcome data generated by the trained ML model to determine a distribution of the feature or a distribution of the outcome data, and determining if the trained ML model exhibits data imbalance based at least in part on the distribution of the feature or the distribution of the outcome data.
    Type: Grant
    Filed: May 28, 2019
    Date of Patent: December 27, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Christopher Lee Weider, Ruth Kikin-Gil, Harsha Prasad Nori
  • Patent number: 11526701
    Abstract: A method and system for performing semi or fully automatic data imbalance detection and correction in training a machine-learning (ML) model includes receiving a request to train the ML model, receiving access to a dataset for use in training the ML model, identifying a feature of the dataset for which data imbalance detection is to be performed, examining the dataset to determine a distribution of the feature across the dataset, determining if the distribution of the feature across the dataset indicates data imbalance, upon determining that the distribution of the feature across the dataset indicates data imbalance, identifying a desired distribution for the identified feature, selecting a subset of the dataset that corresponds with the selected feature and the desired distribution, and using the subset to train the ML model.
    Type: Grant
    Filed: May 28, 2019
    Date of Patent: December 13, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Christopher Lee Weider, Ruth Kikin-Gil, Harsha Prasad Nori
  • Patent number: 11521115
    Abstract: A method and system for detecting imbalanced distribution of data that may signal bias in a dataset associated with training a machine-learning (ML) model includes receiving a request to perform data imbalance detection on the dataset associated with training the machine-learning (ML) model, identifying a feature of the dataset for which data imbalance detection is to be performed and examining the dataset to determine a distribution of the feature across the dataset. The result of the determination may then be presented in a user interface element to help identify data imbalance in the dataset.
    Type: Grant
    Filed: May 28, 2019
    Date of Patent: December 6, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Christopher Lee Weider, Ruth Kikin-Gil, Harsha Prasad Nori
  • Patent number: 10902149
    Abstract: Methods, systems, apparatuses, and computer-readable storage medium are described herein for remotely analyzing testing results based on LDP-based data obtained from client devices in order to determine an effect of a software application with respect to its features and/or the population in which the application is tested. The analysis is based on a series of statistical computations for conducting hypothesis tests to compare population means, while ensuring LDP for each user. For example, an LDP scheme is used on the client-side that privatizes a measured value corresponding to a usage of a resource of the client. A data collector receives the privatized data from two sets of populations. Each population's clients have a software application that may differ in terms of features or user group. The privatized data received from each population is analyzed to determine an effect of the difference between the software applications of the different populations.
    Type: Grant
    Filed: March 22, 2018
    Date of Patent: January 26, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Bolin Ding, Harsha Prasad Nori, Paul Luo Li, Joshua Stanley Allen
  • Publication number: 20200380310
    Abstract: A method and system for performing semi or fully automatic data imbalance detection and correction in training a machine-learning (ML) model includes receiving a request to train the ML model, receiving access to a dataset for use in training the ML model, identifying a feature of the dataset for which data imbalance detection is to be performed, examining the dataset to determine a distribution of the feature across the dataset, determining if the distribution of the feature across the dataset indicates data imbalance, upon determining that the distribution of the feature across the dataset indicates data imbalance, identifying a desired distribution for the identified feature, selecting a subset of the dataset that corresponds with the selected feature and the desired distribution, and using the subset to train the ML model.
    Type: Application
    Filed: May 28, 2019
    Publication date: December 3, 2020
    Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Christopher Lee WEIDER, Ruth KIKIN-GIL, Harsha Prasad NORI
  • Publication number: 20200380309
    Abstract: A method and system for correcting imbalanced distribution of data that may signal bias in a dataset associated with training a machine-learning (ML) model includes receiving a request to perform a data imbalance correction on a dataset associated with training a machine-learning (ML) model, identifying a feature of the dataset for which data imbalance correction is to be performed, identifying a desired distribution for the identified feature, selecting a subset of the dataset that corresponds with the selected feature and the desired distribution, and using the subset to train a ML model.
    Type: Application
    Filed: May 28, 2019
    Publication date: December 3, 2020
    Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Christopher Lee WEIDER, Ruth KIKIN-GIL, Harsha Prasad NORI
  • Publication number: 20200380399
    Abstract: A method and system for detecting imbalanced distribution of data that may signal bias in a dataset associated with training a machine-learning (ML) model includes receiving a request to perform data imbalance detection on the dataset associated with training the machine-learning (ML) model, identifying a feature of the dataset for which data imbalance detection is to be performed and examining the dataset to determine a distribution of the feature across the dataset. The result of the determination may then be presented in a user interface element to help identify data imbalance in the dataset.
    Type: Application
    Filed: May 28, 2019
    Publication date: December 3, 2020
    Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Christopher Lee WEIDER, Ruth KIKIN-GIL, Harsha Prasad NORI
  • Publication number: 20200380398
    Abstract: A method and system for providing data imbalance detection and validation for a trained a ML model includes receiving a request to perform data imbalance detection on the trained ML model, identifying a feature of a dataset associated with the trained model for which data imbalance detection is to be performed, receiving access to the dataset, receiving access to the trained ML model, examining at least one of the dataset or outcome data generated by the trained ML model to determine a distribution of the feature or a distribution of the outcome data, and determining if the trained ML model exhibits data imbalance based at least in part on the distribution of the feature or the distribution of the outcome data.
    Type: Application
    Filed: May 28, 2019
    Publication date: December 3, 2020
    Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Christopher Lee WEIDER, Ruth KIKIN-GIL, Harsha Prasad NORI
  • Publication number: 20190236306
    Abstract: Methods, systems, apparatuses, and computer-readable storage medium are described herein for remotely analyzing testing results based on LDP-based data obtained from client devices in order to determine an effect of a software application with respect to its features and/or the population in which the application is tested. The analysis is based on a series of statistical computations for conducting hypothesis tests to compare population means, while ensuring LDP for each user. For example, an LDP scheme is used on the client-side that privatizes a measured value corresponding to a usage of a resource of the client. A data collector receives the privatized data from two sets of populations. Each population's clients have a software application that may differ in terms of features or user group. The privatized data received from each population is analyzed to determine an effect of the difference between the software applications of the different populations.
    Type: Application
    Filed: March 22, 2018
    Publication date: August 1, 2019
    Inventors: Bolin Ding, Harsha Prasad Nori, Paul Luo Li, Joshua Stanley Allen