Patents by Inventor Ali Hamid Yahya Valdovinos

Ali Hamid Yahya Valdovinos has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Control policies for collective robot learning

Patent number: 11188821

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, of training a global policy neural network. One of the methods includes initializing an instance of the robotic task for multiple local workers, generating a trajectory of state-action pairs by selecting actions to be performed by the robotic agent while performing the instance of the robotic task, optimizing a local policy controller on the trajectory, generating an optimized trajectory using the optimized local controller, and storing the optimized trajectory in a replay memory associated with the local worker. The method includes sampling, for multiple global workers, an optimized trajectory from one of one or more replay memories associated with the global worker, and training the replica of the global policy neural network maintained by the global worker on the sampled optimized trajectory to determine delta values for the parameters of the global policy neural network.

Type: Grant

Filed: September 15, 2017

Date of Patent: November 30, 2021

Assignee: X Development LLC

Inventors: Mrinal Kalakrishnan, Ali Hamid Yahya Valdovinos, Adrian Ling Hin Li, Yevgen Chebotar, Sergey Vladimir Levine
Control policies for robotic agents

Patent number: 10960539

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, of training a global policy neural network. One of the methods includes initializing a plurality of instances of the robotic task. For each instance of the robotic task, the method includes generating a trajectory of state-action pairs by selecting actions to be performed by the robotic agent while performing the instance of the robotic task in accordance with current values of the parameters of the global policy neural network, and optimizing a local policy controller that is specific to the instance on the trajectory of state-action pairs for the instance. The method further includes generating training data for the global policy neural network using the local policy controllers, and training the global policy neural network on the training data to adjust the current values of the parameters of the global policy neural network.

Type: Grant

Filed: September 15, 2017

Date of Patent: March 30, 2021

Assignee: X Development LLC

Inventors: Mrinal Kalakrishnan, Ali Hamid Yahya Valdovinos, Adrian Ling Hin Li, Yevgen Chebotar, Sergey Vladimir Levine

Control policies for collective robot learning

Control policies for robotic agents