Patents by Inventor Joshua Merel

Joshua Merel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

HIERARCHICAL LATENT MIXTURE POLICIES FOR AGENT CONTROL

Publication number: 20240403652

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents. In particular, an agent can be controlled using a hierarchical controller that includes a high-level controller neural network, a mid-level controller neural network, and a low-level controller neural network.

Type: Application

Filed: October 5, 2022

Publication date: December 5, 2024

Inventors: Dushyant ` Rao, Fereshteh Sadeghi, Leonard Hasenclever, Markus Wulfmeier, Martina Zambelli, Giulia Vezzani, Dhruva Tirumala Bukkapatnam, Yusuf Aytar, Joshua Merel, Nicolas Manfred Otto Heess, Raia Thais Hadsell
REINFORCEMENT AND IMITATION LEARNING FOR A TASK

Publication number: 20230330848

Abstract: A neural network control system for controlling an agent to perform a task in a real-world environment, operates based on both image data and proprioceptive data describing the configuration of the agent. The training of the control system includes both imitation learning, using datasets generated from previous performances of the task, and reinforcement learning, based on rewards calculated from control data output by the control system.

Type: Application

Filed: April 25, 2023

Publication date: October 19, 2023

Inventors: Saran Tunyasuvunakool, Yuke Zhu, Joshua Merel, János Kramár, Ziyu Wang, Nicolas Manfred Otto Heess
Learning motor primitives and training a machine learning system using a linear-feedback-stabilized policy

Patent number: 11714996

Abstract: A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning system using a linear-feedback-stabilized policy. The linear-feedback-stabilized policy may be based on the state-action Jacobian. Also a neural network system for representing a space of probabilistic motor primitives, implemented by one or more computers. The neural network system comprises an encoder configured to generate latent variables based on a plurality of inputs, each input comprising a plurality of frames, and a decoder configured to generate an action based on one or more of the latent variables and a state.

Type: Grant

Filed: July 25, 2022

Date of Patent: August 1, 2023

Assignee: DeepMind Technologies Limited

Inventors: Leonard Hasenclever, Vu Pham, Joshua Merel, Alexandre Galashov
LEARNING MOTOR PRIMITIVES AND TRAINING A MACHINE LEARNING SYSTEM USING A LINEAR-FEEDBACK-STABILIZED POLICY

Publication number: 20220374686

Abstract: A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning system using a linear-feedback-stabilized policy. The linear-feedback-stabilized policy may be based on the state-action Jacobian. Also a neural network system for representing a space of probabilistic motor primitives, implemented by one or more computers. The neural network system comprises an encoder configured to generate latent variables based on a plurality of inputs, each input comprising a plurality of frames, and a decoder configured to generate an action based on one or more of the latent variables and a state.

Type: Application

Filed: July 25, 2022

Publication date: November 24, 2022

Inventors: Leonard Hasenclever, Vu Pham, Joshua Merel, Alexandre Galashov
Learning motor primitives and training a machine learning system using a linear-feedback-stabilized policy

Patent number: 11403513

Abstract: A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning system using a linear-feedback-stabilized policy. The linear-feedback-stabilized policy may be based on the state-action Jacobian. Also a neural network system for representing a space of probabilistic motor primitives, implemented by one or more computers. The neural network system comprises an encoder configured to generate latent variables based on a plurality of inputs, each input comprising a plurality of frames, and a decoder configured to generate an action based on one or more of the latent variables and a state.

Type: Grant

Filed: September 27, 2019

Date of Patent: August 2, 2022

Assignee: DeepMind Technologies Limited

Inventors: Leonard Hasenclever, Vu Pham, Joshua Merel, Alexandre Galashov
Adaptive system for deriving control signals from measurements of neuromuscular activity

Patent number: 11000211

Abstract: Methods and apparatus for adapting a control mapping associating sensor signals with control signals for controlling an operation of a device. The method comprises obtaining first state information for an operation of the device, providing the first state information as input to an intention model associated with an operation of the device and obtaining corresponding first intention model output, providing a plurality of neuromuscular signals recorded from a user and/or signals derived from the neuromuscular signals as inputs to a first control mapping and obtaining corresponding first control mapping output, and updating the first control mapping using the inputs provided to the first control mapping and the first intention model output to obtain a second control mapping.

Type: Grant

Filed: July 25, 2017

Date of Patent: May 11, 2021

Assignee: Facebook Technologies, LLC

Inventors: Patrick Kaifosh, Timothy Machado, Thomas Reardon, Erik Schomburg, Joshua Merel, Steven Demers
GRAPH NEURAL NETWORKS REPRESENTING PHYSICAL SYSTEMS

Publication number: 20210049467

Abstract: A graph neural network system implementing a learnable physics engine for understanding and controlling a physical system. The physical system is considered to be composed of bodies coupled by joints and is represented by static and dynamic graphs. A graph processing neural network processes an input graph e.g. the static and dynamic graphs, to provide an output graph, e.g. a predicted dynamic graph. The graph processing neural network is differentiable and may be used for control and/or reinforcement learning. The trained graph neural network system can be applied to physical systems with similar but new graph structures (zero-shot learning).

Type: Application

Filed: April 12, 2019

Publication date: February 18, 2021

Inventors: Martin Riedmiller, Raia Thais Hadsell, Peter William Battaglia, Joshua Merel, Jost Tobias Springenberg, Alvaro Sanchez, Nicolas Manfred Otto Heess
LEARNING MOTOR PRIMITIVES AND TRAINING A MACHINE LEARNING SYSTEM USING A LINEAR-FEEDBACK-STABILIZED POLICY

Publication number: 20200104685

Abstract: A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning system using a linear-feedback-stabilized policy. The linear-feedback-stabilized policy may be based on the state-action Jacobian. Also a neural network system for representing a space of probabilistic motor primitives, implemented by one or more computers. The neural network system comprises an encoder configured to generate latent variables based on a plurality of inputs, each input comprising a plurality of frames, and a decoder configured to generate an action based on one or more of the latent variables and a state.

Type: Application

Filed: September 27, 2019

Publication date: April 2, 2020

Inventors: Leonard Hasenclever, Vu Pham, Joshua Merel, Alexandre Galashov
DATA EFFICIENT IMITATION OF DIVERSE BEHAVIORS

Publication number: 20200090042

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network used to select actions to be performed by an agent interacting with an environment. One of the methods includes: obtaining data identifying a set of trajectories, each trajectory comprising a set of observations characterizing a set of states of the environment and corresponding actions performed by another agent in response to the states; obtaining data identifying an encoder that maps the observations onto embeddings for use in determining a set of imitation trajectories; determining, for each trajectory, a corresponding embedding by applying the encoder to the trajectory; determining a set of imitation trajectories by applying a policy defined by the neural network to the embedding for each trajectory; and adjusting parameters of the neural network based on the set of trajectories, the set of imitation trajectories and the embeddings.

Type: Application

Filed: November 19, 2019

Publication date: March 19, 2020

Inventors: Gregory Duncan Wayne, Joshua Merel, Ziyu Wang, Nicolas Manfred Otto Heess, Joao Ferdinando Gomes de Freitas, Scott Ellison Reed
REINFORCEMENT AND IMITATION LEARNING FOR A TASK

Publication number: 20190126472

Abstract: A neural network control system for controlling an agent to perform a task in a real-world environment, operates based on both image data and proprioceptive data describing the configuration of the agent. The training of the control system includes both imitation learning, using datasets generated from previous performances of the task, and reinforcement learning, based on rewards calculated from control data output by the control system.

Type: Application

Filed: October 29, 2018

Publication date: May 2, 2019

Inventors: Saran Tunyasuvunakool, Yuke Zhu, Joshua Merel, Janos Kramar, Ziyu Wang, Nicolas Manfred Otto Heess
ADAPTIVE SYSTEM FOR DERIVING CONTROL SIGNALS FROM MEASUREMENTS OF NEUROMUSCULAR ACTIVITY

Publication number: 20180020951

Abstract: Methods and apparatus for adapting a control mapping associating sensor signals with control signals for controlling an operation of a device. The method comprises obtaining first state information for an operation of the device, providing the first state information as input to an intention model associated with an operation of the device and obtaining corresponding first intention model output, providing a plurality of neuromuscular signals recorded from a user and/or signals derived from the neuromuscular signals as inputs to a first control mapping and obtaining corresponding first control mapping output, and updating the first control mapping using the inputs provided to the first control mapping and the first intention model output to obtain a second control mapping.

Type: Application

Filed: July 25, 2017

Publication date: January 25, 2018

Inventors: Patrick Kaifosh, Timothy Machado, Thomas Reardon, Erik Schomburg, Joshua Merel, Steven Demers