Patents by Inventor Qingyun Wu

Qingyun Wu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

WARM STARTING AN ONLINE BANDIT LEARNER MODEL UTILIZING RELEVANT OFFLINE MODELS

Publication number: 20230259829

Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for utilizing offline models to warm start online bandit learner models. For example, the disclosed system can determine relevant offline models for an environment based on reward estimate differences between the offline models and the online model. The disclosed system can then utilize the relevant offline models (if any) to select an arm for the environment. The disclosed system can update the online model based on observed rewards for the selected arm. Additionally, the disclosed system can also use entropy reduction of arms to determine the utility of the arms in differentiating relevant and irrelevant offline models. For example, the disclosed system can select an arm based on a combination of the entropy reduction of the arm and the reward estimate for the arm and use the observed reward to update an observation history.

Type: Application

Filed: April 25, 2023

Publication date: August 17, 2023

Inventors: Georgios Theocharous, Zheng Wen, Yasin Abbasi Yadkori, Qingyun Wu
Utilizing relevant offline models to warm start an online bandit learner model

Patent number: 11669768

Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for utilizing offline models to warm start online bandit learner models. For example, the disclosed system can determine relevant offline models for an environment based on reward estimate differences between the offline models and the online model. The disclosed system can then utilize the relevant offline models (if any) to select an arm for the environment. The disclosed system can update the online model based on observed rewards for the selected arm. Additionally, the disclosed system can also use entropy reduction of arms to determine the utility of the arms in differentiating relevant and irrelevant offline models. For example, the disclosed system can select an arm based on a combination of the entropy reduction of the arm and the reward estimate for the arm and use the observed reward to update an observation history.

Type: Grant

Filed: September 26, 2019

Date of Patent: June 6, 2023

Assignee: Adobe Inc.

Inventors: Georgios Theocharous, Zheng Wen, Yasin Abbasi Yadkori, Qingyun Wu
SYSTEM AND METHOD FOR AUTOMATIC HYPERPARAMETER SELECTION FOR ONLINE LEARNING

Publication number: 20220374781

Abstract: Systems and methods for tuning hyperparameters for a machine learning model using a challenger champion model are described. A set of challenger configurations are generated based on a hyperparameter for tuning and a subset of the set of challenger configurations are scheduled for evaluation based on a loss function. A loss value derived from the loss function for the challenger configurations is compared to a loss value derived from the loss function for a champion configuration, and the champion configuration is replaced with the challenger configuration based on the comparison of the loss value derived from the loss function for the challenger configuration and the loss value derived from the loss function for the champion configuration. When the champion is replaced, a new set of challenger configurations is generated based on the new champion configuration.

Type: Application

Filed: May 4, 2021

Publication date: November 24, 2022

Applicant: Microsoft Technology Licensing, LLC

Inventors: Chi WANG, John Carol LANGFORD, Qingyun WU, Paul S. MINEIRO, Marco ROSSI
UTILIZING RELEVANT OFFLINE MODELS TO WARM START AN ONLINE BANDIT LEARNER MODEL

Publication number: 20210097350

Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for utilizing offline models to warm start online bandit learner models. For example, the disclosed system can determine relevant offline models for an environment based on reward estimate differences between the offline models and the online model. The disclosed system can then utilize the relevant offline models (if any) to select an arm for the environment. The disclosed system can update the online model based on observed rewards for the selected arm. Additionally, the disclosed system can also use entropy reduction of arms to determine the utility of the arms in differentiating relevant and irrelevant offline models. For example, the disclosed system can select an arm based on a combination of the entropy reduction of the arm and the reward estimate for the arm and use the observed reward to update an observation history.

Type: Application

Filed: September 26, 2019

Publication date: April 1, 2021

Inventors: Georgios Theocharous, Zheng Wen, Yasin Abbasi Yadkori, Qingyun Wu
Aromatherapy diffuser

Patent number: D803375

Type: Grant

Filed: September 21, 2016

Date of Patent: November 21, 2017

Assignee: Radha Beauty Products LLC

Inventors: Qingyun Wu, Junlin Liang
Flashlight

Patent number: D973933

Type: Grant

Filed: January 5, 2022

Date of Patent: December 27, 2022

Inventor: Qingyun Wu
Flashlight

Patent number: D981022

Type: Grant

Filed: January 5, 2022

Date of Patent: March 14, 2023

Inventor: Qingyun Wu

WARM STARTING AN ONLINE BANDIT LEARNER MODEL UTILIZING RELEVANT OFFLINE MODELS

Utilizing relevant offline models to warm start an online bandit learner model

SYSTEM AND METHOD FOR AUTOMATIC HYPERPARAMETER SELECTION FOR ONLINE LEARNING

UTILIZING RELEVANT OFFLINE MODELS TO WARM START AN ONLINE BANDIT LEARNER MODEL

Aromatherapy diffuser

Flashlight

Flashlight