Patents by Inventor Ethan Holly

Ethan Holly has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

Publication number: 20240131695

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Application

Filed: December 1, 2023

Publication date: April 25, 2024

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap

Deep reinforcement learning for robotic manipulation

Patent number: 11897133

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Grant

Filed: August 1, 2022

Date of Patent: February 13, 2024

Assignee: GOOGLE LLC

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap

Deep reinforcement learning for robotic manipulation

Patent number: 11845183

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Grant

Filed: August 1, 2022

Date of Patent: December 19, 2023

Assignee: GOOGLE LLC

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

Publication number: 20220388159

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Application

Filed: August 1, 2022

Publication date: December 8, 2022

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap

Deep reinforcement learning for robotic manipulation

Patent number: 11400587

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Grant

Filed: September 14, 2017

Date of Patent: August 2, 2022

Assignee: GOOGLE LLC

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

Publication number: 20210237266

Abstract: Using large-scale reinforcement learning to train a policy model that can be utilized by a robot in performing a robotic task in which the robot interacts with one or more environmental objects. In various implementations, off-policy deep reinforcement learning is used to train the policy model, and the off-policy deep reinforcement learning is based on self-supervised data collection. The policy model can be a neural network model. Implementations of the reinforcement learning utilized in training the neural network model utilize a continuous-action variant of Q-learning. Through techniques disclosed herein, implementations can learn policies that generalize effectively to previously unseen objects, previously unseen environments, etc.

Type: Application

Filed: June 14, 2019

Publication date: August 5, 2021

Inventors: Dmitry Kalashnikov, Alexander Irpan, Peter Pastor Sampedro, Julian Ibarz, Alexander Herzog, Eric Jang, Deirdre Quillen, Ethan Holly, Sergey Levine
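
The abstract for publication 20210237266 mentions a continuous-action variant of Q-learning but does not say how the maximization over actions is carried out. The sketch below is only an illustration under an assumption: it uses a cross-entropy-method search as one possible stand-in for that maximization. The names `cem_argmax`, `td_target`, and `toy_q`, the one-dimensional action, and all numeric settings are hypothetical and are not taken from the filing.

```python
import random
import statistics


def toy_q(state, action):
    """Hypothetical Q-function whose best action is 0.5 * state."""
    return -(action - 0.5 * state) ** 2


def cem_argmax(q_fn, state, rng, iters=5, samples=64, elites=6,
               low=-1.0, high=1.0):
    """Approximate argmax over a continuous action by sampling (assumed CEM)."""
    mean, std = 0.0, 1.0
    for _ in range(iters):
        # Sample candidate actions and clip them to the allowed range.
        actions = [min(high, max(low, rng.gauss(mean, std)))
                   for _ in range(samples)]
        # Keep the candidates with the highest Q-values.
        best = sorted(actions, key=lambda a: q_fn(state, a),
                      reverse=True)[:elites]
        # Refit the sampling distribution to the elite candidates.
        mean, std = statistics.fmean(best), statistics.pstdev(best)
    return mean


def td_target(q_fn, reward, next_state, rng, gamma=0.9):
    """Q-learning target r + gamma * max_a Q(s', a), with the max approximated."""
    best_action = cem_argmax(q_fn, next_state, rng)
    return reward + gamma * q_fn(next_state, best_action)


if __name__ == "__main__":
    rng = random.Random(0)
    print(cem_argmax(toy_q, 0.8, rng))      # close to 0.4 for this toy Q
    print(td_target(toy_q, 1.0, 0.8, rng))  # close to 1.0 for this toy Q
```

In a real system the Q-function would be a trained neural network and the action would be a vector; the sampling loop would stay structurally the same.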

DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

Publication number: 20190232488

Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

Type: Application

Filed: September 14, 2017

Publication date: August 1, 2019

Inventors: Sergey Levine, Ethan Holly, Shixiang Gu, Timothy Lillicrap
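
The abstract shared by the Levine, Holly, Gu, and Lillicrap filings above describes a data flow: several robots collect experience at the same time, each fetches the current policy parameters before an episode, and a trainer updates those parameters from batches of collected experience. The following is a minimal sketch of that flow only, assuming threads in place of robots, a single scalar "gain" in place of a policy network, and a toy task. Every name (`ParameterServer`, `ReplayBuffer`, `robot_worker`, `trainer`, `run_training`) is hypothetical, and the update rule is a simple elite-regression placeholder, not the method claimed in the patents.

```python
import random
import threading
import time
from collections import deque


class ParameterServer:
    """Holds the current policy parameters (here one float) and a version."""

    def __init__(self, gain=0.0):
        self._lock = threading.Lock()
        self._gain = gain
        self._version = 0

    def get(self):
        with self._lock:
            return self._gain, self._version

    def set(self, gain):
        with self._lock:
            self._gain = gain
            self._version += 1

    @property
    def version(self):
        with self._lock:
            return self._version


class ReplayBuffer:
    """Thread-safe store of (state, action, reward, param_version) tuples."""

    def __init__(self, capacity=10_000):
        self._lock = threading.Lock()
        self._items = deque(maxlen=capacity)

    def add(self, item):
        with self._lock:
            self._items.append(item)

    def snapshot(self):
        with self._lock:
            return list(self._items)


def robot_worker(server, buffer, episodes, steps, rng):
    for _ in range(episodes):
        # Fetch the current parameters before each episode.
        gain, version = server.get()
        for _ in range(steps):
            state = rng.uniform(-1.0, 1.0)
            action = gain * state + rng.gauss(0.0, 0.5)  # exploration noise
            reward = -(state + action) ** 2  # toy task: cancel the state
            buffer.add((state, action, reward, version))


def trainer(server, buffer, num_updates, batch_size, rng, lr=0.5):
    done = 0
    while done < num_updates:
        data = buffer.snapshot()
        if len(data) < batch_size:  # wait until enough experience exists
            time.sleep(0.001)
            continue
        batch = rng.sample(data, batch_size)
        # Placeholder update: regress actions on states for the best half.
        elite = sorted(batch, key=lambda e: e[2], reverse=True)[: batch_size // 2]
        num = sum(s * a for s, a, _, _ in elite)
        den = sum(s * s for s, _, _, _ in elite) or 1.0
        gain, _ = server.get()
        server.set(gain + lr * (num / den - gain))
        done += 1


def run_training(num_robots=4, episodes_per_robot=20, steps_per_episode=5,
                 num_updates=50, batch_size=32, seed=0):
    # Total experience must be at least batch_size, or the trainer never starts.
    server, buffer = ParameterServer(), ReplayBuffer()
    threads = [
        threading.Thread(target=robot_worker,
                         args=(server, buffer, episodes_per_robot,
                               steps_per_episode, random.Random(seed + i)))
        for i in range(num_robots)
    ]
    threads.append(threading.Thread(
        target=trainer,
        args=(server, buffer, num_updates, batch_size, random.Random(seed - 1))))
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return server, buffer
```

Calling `run_training()` returns the parameter server and the buffer; each stored tuple records the parameter version that was in effect for its episode, which makes it easy to see that collection and training interleave.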
