Patents by Inventor Joshua Merel

Joshua Merel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230330848
    Abstract: A neural network control system for controlling an agent to perform a task in a real-world environment, operates based on both image data and proprioceptive data describing the configuration of the agent. The training of the control system includes both imitation learning, using datasets generated from previous performances of the task, and reinforcement learning, based on rewards calculated from control data output by the control system.
    Type: Application
    Filed: April 25, 2023
    Publication date: October 19, 2023
    Inventors: Saran Tunyasuvunakool, Yuke Zhu, Joshua Merel, János Kramár, Ziyu Wang, Nicolas Manfred Otto Heess
  • Patent number: 11714996
    Abstract: A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning system using a linear-feedback-stabilized policy. The linear-feedback-stabilized policy may be based on the state-action Jacobian. Also a neural network system for representing a space of probabilistic motor primitives, implemented by one or more computers. The neural network system comprises an encoder configured to generate latent variables based on a plurality of inputs, each input comprising a plurality of frames, and a decoder configured to generate an action based on one or more of the latent variables and a state.
    Type: Grant
    Filed: July 25, 2022
    Date of Patent: August 1, 2023
    Assignee: DeepMind Technologies Limited
    Inventors: Leonard Hasenclever, Vu Pham, Joshua Merel, Alexandre Galashov
  • Publication number: 20220374686
    Abstract: A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning system using a linear-feedback-stabilized policy. The linear-feedback-stabilized policy may be based on the state-action Jacobian. Also a neural network system for representing a space of probabilistic motor primitives, implemented by one or more computers. The neural network system comprises an encoder configured to generate latent variables based on a plurality of inputs, each input comprising a plurality of frames, and a decoder configured to generate an action based on one or more of the latent variables and a state.
    Type: Application
    Filed: July 25, 2022
    Publication date: November 24, 2022
    Inventors: Leonard Hasenclever, Vu Pham, Joshua Merel, Alexandre Galashov
  • Patent number: 11403513
    Abstract: A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning system using a linear-feedback-stabilized policy. The linear-feedback-stabilized policy may be based on the state-action Jacobian. Also a neural network system for representing a space of probabilistic motor primitives, implemented by one or more computers. The neural network system comprises an encoder configured to generate latent variables based on a plurality of inputs, each input comprising a plurality of frames, and a decoder configured to generate an action based on one or more of the latent variables and a state.
    Type: Grant
    Filed: September 27, 2019
    Date of Patent: August 2, 2022
    Assignee: DeepMind Technologies Limited
    Inventors: Leonard Hasenclever, Vu Pham, Joshua Merel, Alexandre Galashov
  • Patent number: 11000211
    Abstract: Methods and apparatus for adapting a control mapping associating sensor signals with control signals for controlling an operation of a device. The method comprises obtaining first state information for an operation of the device, providing the first state information as input to an intention model associated with an operation of the device and obtaining corresponding first intention model output, providing a plurality of neuromuscular signals recorded from a user and/or signals derived from the neuromuscular signals as inputs to a first control mapping and obtaining corresponding first control mapping output, and updating the first control mapping using the inputs provided to the first control mapping and the first intention model output to obtain a second control mapping.
    Type: Grant
    Filed: July 25, 2017
    Date of Patent: May 11, 2021
    Assignee: Facebook Technologies, LLC
    Inventors: Patrick Kaifosh, Timothy Machado, Thomas Reardon, Erik Schomburg, Joshua Merel, Steven Demers
  • Publication number: 20210049467
    Abstract: A graph neural network system implementing a learnable physics engine for understanding and controlling a physical system. The physical system is considered to be composed of bodies coupled by joints and is represented by static and dynamic graphs. A graph processing neural network processes an input graph e.g. the static and dynamic graphs, to provide an output graph, e.g. a predicted dynamic graph. The graph processing neural network is differentiable and may be used for control and/or reinforcement learning. The trained graph neural network system can be applied to physical systems with similar but new graph structures (zero-shot learning).
    Type: Application
    Filed: April 12, 2019
    Publication date: February 18, 2021
    Inventors: Martin Riedmiller, Raia Thais Hadsell, Peter William Battaglia, Joshua Merel, Jost Tobias Springenberg, Alvaro Sanchez, Nicolas Manfred Otto Heess
  • Publication number: 20200104685
    Abstract: A computer-implemented method of training a student machine learning system comprises receiving data indicating execution of an expert, determining one or more actions performed by the expert during the execution and a corresponding state-action Jacobian, and training the student machine learning system using a linear-feedback-stabilized policy. The linear-feedback-stabilized policy may be based on the state-action Jacobian. Also a neural network system for representing a space of probabilistic motor primitives, implemented by one or more computers. The neural network system comprises an encoder configured to generate latent variables based on a plurality of inputs, each input comprising a plurality of frames, and a decoder configured to generate an action based on one or more of the latent variables and a state.
    Type: Application
    Filed: September 27, 2019
    Publication date: April 2, 2020
    Inventors: Leonard Hasenclever, Vu Pham, Joshua Merel, Alexandre Galashov
  • Publication number: 20200090042
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network used to select actions to be performed by an agent interacting with an environment. One of the methods includes: obtaining data identifying a set of trajectories, each trajectory comprising a set of observations characterizing a set of states of the environment and corresponding actions performed by another agent in response to the states; obtaining data identifying an encoder that maps the observations onto embeddings for use in determining a set of imitation trajectories; determining, for each trajectory, a corresponding embedding by applying the encoder to the trajectory; determining a set of imitation trajectories by applying a policy defined by the neural network to the embedding for each trajectory; and adjusting parameters of the neural network based on the set of trajectories, the set of imitation trajectories and the embeddings.
    Type: Application
    Filed: November 19, 2019
    Publication date: March 19, 2020
    Inventors: Gregory Duncan Wayne, Joshua Merel, Ziyu Wang, Nicolas Manfred Otto Heess, Joao Ferdinando Gomes de Freitas, Scott Ellison Reed
  • Publication number: 20190126472
    Abstract: A neural network control system for controlling an agent to perform a task in a real-world environment, operates based on both image data and proprioceptive data describing the configuration of the agent. The training of the control system includes both imitation learning, using datasets generated from previous performances of the task, and reinforcement learning, based on rewards calculated from control data output by the control system.
    Type: Application
    Filed: October 29, 2018
    Publication date: May 2, 2019
    Inventors: Saran Tunyasuvunakool, Yuke Zhu, Joshua Merel, Janos Kramar, Ziyu Wang, Nicolas Manfred Otto Heess
  • Publication number: 20180020951
    Abstract: Methods and apparatus for adapting a control mapping associating sensor signals with control signals for controlling an operation of a device. The method comprises obtaining first state information for an operation of the device, providing the first state information as input to an intention model associated with an operation of the device and obtaining corresponding first intention model output, providing a plurality of neuromuscular signals recorded from a user and/or signals derived from the neuromuscular signals as inputs to a first control mapping and obtaining corresponding first control mapping output, and updating the first control mapping using the inputs provided to the first control mapping and the first intention model output to obtain a second control mapping.
    Type: Application
    Filed: July 25, 2017
    Publication date: January 25, 2018
    Inventors: Patrick Kaifosh, Timothy Machado, Thomas Reardon, Erik Schomburg, Joshua Merel, Steven Demers