Patents by Inventor Satinder Singh Baveja

Satinder Singh Baveja has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

META-LEARNED EVOLUTIONARY STRATEGIES OPTIMIZER

Publication number: 20240127071

Abstract: There is provided a computer-implemented method for updating a search distribution of an evolutionary strategies optimizer using an optimizer neural network comprising one or more attention blocks. The method comprises receiving a plurality of candidate solutions, one or more parameters defining the search distribution that the plurality of candidate solutions are sampled from, and fitness score data indicating a fitness of each respective candidate solution of the plurality of candidate solutions. The method further comprises processing, by the one or more attention neural network blocks, the fitness score data using an attention mechanism to generate respective recombination weights corresponding to each respective candidate solution. The method further comprises updating the one or more parameters defining the search distribution based upon the recombination weights applied to the plurality of candidate solutions.

Type: Application

Filed: September 27, 2023

Publication date: April 18, 2024

Inventors: Robert Tjarko Lange, Tom Schaul, Yutian Chen, Tom Ben Zion Zahavy, Valentin Clement Dalibard, Christopher Yenchuan Lu, Satinder Singh Baveja, Johan Sebastian Flennerhag
AGENT CONTROL THROUGH IN-CONTEXT REINFORCEMENT LEARNING

Publication number: 20240104379

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using an action selection neural network that performs in-context reinforcement learning when controlling an agent on a new task.

Type: Application

Filed: September 28, 2023

Publication date: March 28, 2024

Inventors: Michael Laskin, Volodymyr Mnih, Luyu Wang, Satinder Singh Baveja
NEURAL NETWORK REINFORCEMENT LEARNING WITH DIVERSE POLICIES

Publication number: 20240104389

Abstract: In one aspect there is provided a method for training a neural network system by reinforcement learning. The neural network system may be configured to receive an input observation characterizing a state of an environment interacted with by an agent and to select and output an action in accordance with a policy aiming to satisfy an objective. The method may comprise obtaining a policy set comprising one or more policies for satisfying the objective and determining a new policy based on the one or more policies. The determining may include one or more optimization steps that aim to maximize a diversity of the new policy relative to the policy set under the condition that the new policy satisfies a minimum performance criterion based on an expected return that would be obtained by following the new policy.

Type: Application

Filed: February 4, 2022

Publication date: March 28, 2024

Inventors: Tom Ben Zion Zahavy, Brendan Timothy O'Donoghue, Andre da Motta Salles Barreto, Johan Sebastian Flennerhag, Volodymyr Mnih, Satinder Singh Baveja
LEARNING OPTIONS FOR ACTION SELECTION WITH META-GRADIENTS IN MULTI-TASK REINFORCEMENT LEARNING

Publication number: 20230144995

Abstract: A reinforcement learning system, method, and computer program code for controlling an agent to perform a plurality of tasks while interacting with an environment. The system learns options, where an option comprises a sequence of primitive actions performed by the agent under control of an option policy neural network. In implementations the system discovers options which are useful for multiple different tasks by meta-learning rewards for training the option policy neural network whilst the agent is interacting with the environment.

Type: Application

Filed: June 7, 2021

Publication date: May 11, 2023

Inventors: Vivek Veeriah Jeya Veeraiah, Tom Ben Zion Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado Philip van Hasselt, David Silver, Satinder Singh Baveja
REINFORCEMENT LEARNING USING META-LEARNED INTRINSIC REWARDS

Publication number: 20210089910

Abstract: There is described methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a reinforcement learning system. The reinforcement learning system comprises an agent configured to perform actions based upon a policy and an intrinsic reward system configured to generate intrinsic reward values for the agent based upon the actions taken by the agent. The method comprises training the reinforcement learning system based upon a plurality of tasks. The training comprises updating the agent's policy based upon the intrinsic reward values generated by the intrinsic reward system and updating the intrinsic reward system based upon an extrinsic reward value obtained based upon the task being performed by the agent. The training further comprises re-initializing the agent's policy when an expiration criterion associated with the agent is met.

Type: Application

Filed: September 25, 2020

Publication date: March 25, 2021

Inventors: Zeyu Zheng, Junhyuk Oh, Satinder Singh Baveja
Mood monitoring of bipolar disorder using speech analysis

Patent number: 9685174

Abstract: A system that monitors and assesses the moods of subjects with neurological disorders, like bipolar disorder, by analyzing normal conversational speech to identify speech data that is then analyzed through an automated speech data classifier. The classifier may be based on a vector, separator, hyperplane, decision boundary, or other set of rules to classify one or more mood states of a subject. The system classifier is used to assess current mood state, predicted instability, and/or a change in future mood state, in particular for subjects with bipolar disorder.

Type: Grant

Filed: May 1, 2015

Date of Patent: June 20, 2017

Assignee: THE REGENTS OF THE UNIVERSITY OF MICHIGAN

Inventors: Zahi N. Karam, Satinder Singh Baveja, Melvin Mcinnis, Emily Mower Provost
MOOD MONITORING OF BIPOLAR DISORDER USING SPEECH ANALYSIS

Publication number: 20150318002

Abstract: A system that monitors and assesses the moods of subjects with neurological disorders, like bipolar disorder, by analyzing normal conversational speech to identify speech data that is then analyzed through an automated speech data classifier. The classifier may be based on a vector, separator, hyperplane, decision boundary, or other set of rules to classify one or more mood states of a subject. The system classifier is used to assess current mood state, predicted instability, and/or a change in future mood state, in particular for subjects with bipolar disorder.

Type: Application

Filed: May 1, 2015

Publication date: November 5, 2015

Inventors: Zahi N. Karam, Satinder Singh Baveja, Melvin Mcinnis

META-LEARNED EVOLUTIONARY STRATEGIES OPTIMIZER

AGENT CONTROL THROUGH IN-CONTEXT REINFORCEMENT LEARNING

NEURAL NETWORK REINFORCEMENT LEARNING WITH DIVERSE POLICIES

LEARNING OPTIONS FOR ACTION SELECTION WITH META-GRADIENTS IN MULTI-TASK REINFORCEMENT LEARNING

REINFORCEMENT LEARNING USING META-LEARNED INTRINSIC REWARDS

Mood monitoring of bipolar disorder using speech analysis

MOOD MONITORING OF BIPOLAR DISORDER USING SPEECH ANALYSIS