Patents by Inventor Qingyun Wu

Qingyun Wu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230259829
    Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for utilizing offline models to warm start online bandit learner models. For example, the disclosed system can determine relevant offline models for an environment based on reward estimate differences between the offline models and the online model. The disclosed system can then utilize the relevant offline models (if any) to select an arm for the environment. The disclosed system can update the online model based on observed rewards for the selected arm. Additionally, the disclosed system can also use entropy reduction of arms to determine the utility of the arms in differentiating relevant and irrelevant offline models. For example, the disclosed system can select an arm based on a combination of the entropy reduction of the arm and the reward estimate for the arm and use the observed reward to update an observation history.
    Type: Application
    Filed: April 25, 2023
    Publication date: August 17, 2023
    Inventors: Georgios Theocharous, Zheng Wen, Yasin Abbasi Yadkori, Qingyun Wu
  • Patent number: 11669768
    Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for utilizing offline models to warm start online bandit learner models. For example, the disclosed system can determine relevant offline models for an environment based on reward estimate differences between the offline models and the online model. The disclosed system can then utilize the relevant offline models (if any) to select an arm for the environment. The disclosed system can update the online model based on observed rewards for the selected arm. Additionally, the disclosed system can also use entropy reduction of arms to determine the utility of the arms in differentiating relevant and irrelevant offline models. For example, the disclosed system can select an arm based on a combination of the entropy reduction of the arm and the reward estimate for the arm and use the observed reward to update an observation history.
    Type: Grant
    Filed: September 26, 2019
    Date of Patent: June 6, 2023
    Assignee: Adobe Inc.
    Inventors: Georgios Theocharous, Zheng Wen, Yasin Abbasi Yadkori, Qingyun Wu
  • Publication number: 20220374781
    Abstract: Systems and methods for tuning hyperparameters for a machine learning model using a challenger champion model are described. A set of challenger configurations are generated based on a hyperparameter for tuning and a subset of the set of challenger configurations are scheduled for evaluation based on a loss function. A loss value derived from the loss function for the challenger configurations is compared to a loss value derived from the loss function for a champion configuration, and the champion configuration is replaced with the challenger configuration based on the comparison of the loss value derived from the loss function for the challenger configuration and the loss value derived from the loss function for the champion configuration. When the champion is replaced, a new set of challenger configurations is generated based on the new champion configuration.
    Type: Application
    Filed: May 4, 2021
    Publication date: November 24, 2022
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Chi WANG, John Carol LANGFORD, Qingyun WU, Paul S. MINEIRO, Marco ROSSI
  • Publication number: 20210097350
    Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for utilizing offline models to warm start online bandit learner models. For example, the disclosed system can determine relevant offline models for an environment based on reward estimate differences between the offline models and the online model. The disclosed system can then utilize the relevant offline models (if any) to select an arm for the environment. The disclosed system can update the online model based on observed rewards for the selected arm. Additionally, the disclosed system can also use entropy reduction of arms to determine the utility of the arms in differentiating relevant and irrelevant offline models. For example, the disclosed system can select an arm based on a combination of the entropy reduction of the arm and the reward estimate for the arm and use the observed reward to update an observation history.
    Type: Application
    Filed: September 26, 2019
    Publication date: April 1, 2021
    Inventors: Georgios Theocharous, Zheng Wen, Yasin Abbasi Yadkori, Qingyun Wu
  • Patent number: D803375
    Type: Grant
    Filed: September 21, 2016
    Date of Patent: November 21, 2017
    Assignee: Radha Beauty Products LLC
    Inventors: Qingyun Wu, Junlin Liang
  • Patent number: D973933
    Type: Grant
    Filed: January 5, 2022
    Date of Patent: December 27, 2022
    Inventor: Qingyun Wu
  • Patent number: D981022
    Type: Grant
    Filed: January 5, 2022
    Date of Patent: March 14, 2023
    Inventor: Qingyun Wu