Patents by Inventor Timothy LILLICRAP

Timothy LILLICRAP has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

Publication number: 20240131695

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Application

Filed: December 1, 2023

Publication date: April 25, 2024

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap
Deep reinforcement learning for robotic manipulation

Patent number: 11897133

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Grant

Filed: August 1, 2022

Date of Patent: February 13, 2024

Assignee: GOOGLE LLC

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap
Deep reinforcement learning for robotic manipulation

Patent number: 11845183

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Grant

Filed: August 1, 2022

Date of Patent: December 19, 2023

Assignee: GOOGLE LLC

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap
DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

Publication number: 20220388159

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Application

Filed: August 1, 2022

Publication date: December 8, 2022

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap
Deep reinforcement learning for robotic manipulation

Patent number: 11400587

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Grant

Filed: September 14, 2017

Date of Patent: August 2, 2022

Assignee: GOOGLE LLC

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap
DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

Publication number: 20190232488

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Application

Filed: September 14, 2017

Publication date: August 1, 2019

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap
METHOD OF TRAINING A NEURAL NETWORK

Publication number: 20160162781

Abstract: A method of training a neural network having at least an input layer, an output layer and a hidden layer, and a weight matrix encoding connection weights between two of the layers, the method comprising the steps of (a) providing an input to the input layer, the input having an associated expected output, (b) receiving a generated output at the output layer, (c) generating an error vector from the difference between the generated output and expected output, (d) generating a change matrix, the change matrix being the product of a random weight matrix and the error vector, and (e) modifying the weight matrix in accordance with the change matrix.

Type: Application

Filed: July 25, 2014

Publication date: June 9, 2016

Inventors: Timothy LILLICRAP, Colin Akerman, Douglas TWEED, Daniel COWNDEN

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

Deep reinforcement learning for robotic manipulation

Deep reinforcement learning for robotic manipulation

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

Deep reinforcement learning for robotic manipulation

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

METHOD OF TRAINING A NEURAL NETWORK