Patents by Inventor Akshay Krishnamurthy

Akshay Krishnamurthy has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240005356
    Abstract: Off-policy evaluation of a new “target” policy is performed using historical data gathered based on a previous “logging” policy to estimate the performance of the target policy. An estimator may be used, wherein either a quality-based estimator or a quality-agnostic estimator is used to weight the difference between an observed reward in the historical data and an estimated reward generated by the target policy. A quality-agnostic estimator may be used to evaluate an importance weight according to a threshold. In such examples, when the importance weight exceeds the threshold, the quality-agnostic estimator clips the importance weight at the threshold, thereby providing an fixed upper bound irrespective of the quality of the reward predictor. In other examples, a quality-based estimator is used, in which an upper bound incorporates the quality of the reward predictor in order to modify an importance weight used by the estimator.
    Type: Application
    Filed: September 14, 2023
    Publication date: January 4, 2024
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Miroslav DUDIK, Akshay KRISHNAMURTHY, Maria DIMAKOPOULOU, Yi SU
  • Patent number: 11798029
    Abstract: Off-policy evaluation of a new “target” policy is performed using historical data gathered based on a previous “logging” policy to estimate the performance of the target policy. An estimator may be used, wherein either a quality-based estimator or a quality-agnostic estimator is used to weight the difference between an observed reward in the historical data and an estimated reward generated by the target policy. A quality-agnostic estimator may be used to evaluate an importance weight according to a threshold. In such examples, when the importance weight exceeds the threshold, the quality-agnostic estimator clips the importance weight at the threshold, thereby providing an fixed upper bound irrespective of the quality of the reward predictor. In other examples, a quality-based estimator is used, in which an upper bound incorporates the quality of the reward predictor in order to modify an importance weight used by the estimator.
    Type: Grant
    Filed: October 18, 2019
    Date of Patent: October 24, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Miroslav Dudik, Akshay Krishnamurthy, Maria Dimakopoulou, Yi Su
  • Publication number: 20200394473
    Abstract: Off-policy evaluation of a new “target” policy is performed using historical data gathered based on a previous “logging” policy to estimate the performance of the target policy. An estimator may be used, wherein either a quality-based estimator or a quality-agnostic estimator is used to weight the difference between an observed reward in the historical data and an estimated reward generated by the target policy. A quality-agnostic estimator may be used to evaluate an importance weight according to a threshold. In such examples, when the importance weight exceeds the threshold, the quality-agnostic estimator clips the importance weight at the threshold, thereby providing an fixed upper bound irrespective of the quality of the reward predictor. In other examples, a quality-based estimator is used, in which an upper bound incorporates the quality of the reward predictor in order to modify an importance weight used by the estimator.
    Type: Application
    Filed: October 18, 2019
    Publication date: December 17, 2020
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Miroslav DUDIK, Akshay KRISHNAMURTHY, Maria DIMAKOPOULOU, Yi SU
  • Publication number: 20170308535
    Abstract: A computing device can determine a decomposition of data of actions of a first session based at least in part on a first computational model associating the actions of the first session with corresponding state values of the first session. The computing device can determine a second computational model based at least in part on the decomposition and an operation template. The computing device can receive a query via the communications interface, the query associated with the second session. The computing device can determine a state value of the second session based at least in part on the query. The computing device can operate the second computational model to determine at least one response associated with the query based at least in part on the state value of the second session. The computing device can provide an indication of the at least one response via the communications interface.
    Type: Application
    Filed: April 22, 2016
    Publication date: October 26, 2017
    Inventors: Alekh Agarwal, Miroslav Dudik, Akshay Krishnamurthy, John Langford, Adith Swaminathan