Patents by Inventor Marlos Cholodovskis Machado

Marlos Cholodovskis Machado has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

CONTRASTIVE BEHAVIORAL SIMILARITY EMBEDDINGS FOR GENERALIZATION IN REINFORCEMENT LEARNING

Publication number: 20230102544

Abstract: Approaches are described for training an action selection neural network system for use in controlling an agent interacting with an environment to perform a task, using a contrastive loss function based on a policy similarity metric. In one aspect, a method includes: obtaining a first observation of a first training environment; obtaining a plurality of second observations of a second training environment; for each second observation, determining a respective policy similarity metric between the second observation and the first observation; processing the first observation and the second observations using the representation neural network to generate a first representation of the first training observation and a respective second representation of each second training observation; and training the representation neural network on a contrastive loss function computed using the policy similarity metrics and the first and second representations.

Type: Application

Filed: September 28, 2021

Publication date: March 30, 2023

Inventors: Rishabh Agarwal, Marlos Cholodovskis Machado, Pablo Samuel Castro Rivadeneira, Marc Gendron-Bellemare
Systems and Methods for Navigating Aerial Vehicles Using Deep Reinforcement Learning

Publication number: 20210123741

Abstract: The technology relates to navigating aerial vehicles using deep reinforcement learning techniques to generate flight policies. A computing system may include a simulator configured to produce simulations of a flight of the aerial vehicle in a region of an atmosphere, a replay buffer configured to store frames of the simulations, and a learning module having a deep reinforcement learning architecture configured to, by a reinforcement learning algorithm, process an input of a set of frames, and output a neural network encoding a learned flight policy. A meta-learning system may include stacks of learning systems, a coordinator configured to provide an instruction to the learning systems that includes a parameter and a start time, and an evaluation server configured to evaluate resulting rewards from learned flight policies generated by the learning systems.

Type: Application

Filed: October 29, 2019

Publication date: April 29, 2021

Applicant: LOON LLC

Inventors: Salvatore J. Candido, Jun Gong, Marc Gendron-Bellemare, Marlos Cholodovskis Machado

CONTRASTIVE BEHAVIORAL SIMILARITY EMBEDDINGS FOR GENERALIZATION IN REINFORCEMENT LEARNING

Systems and Methods for Navigating Aerial Vehicles Using Deep Reinforcement Learning