Patents by Inventor Siddhant Jayakumar

Siddhant Jayakumar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Machine learning systems with memory based parameter adaptation for learning fast and slower

Patent number: 12242947

Abstract: There is described herein a computer-implemented method of processing an input data item. The method comprises processing the input data item using a parametric model to generate output data, wherein the parametric model comprises a first sub-model and a second sub-model. The processing comprises processing, by the first sub-model, the input data to generate a query data item, retrieving, from a memory storing data point-value pairs, at least one data point-value pair based upon the query data item and modifying weights of the second sub-model based upon the retrieved at least one data point-value pair. The output data is then generated based upon the modified second sub-model.

Type: Grant

Filed: October 29, 2018

Date of Patent: March 4, 2025

Assignee: DeepMind Technologies Limited

Inventors: Pablo Sprechmann, Siddhant Jayakumar, Jack William Rae, Alexander Pritzel, Adrià Puigdomènech Badia, Oriol Vinyals, Razvan Pascanu, Charles Blundell
Selecting actions by reverting to previous learned action selection policies

Patent number: 11423300

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a system output using a remembered value of a neural network hidden state. In one aspect, a system comprises an external memory that maintains context experience tuples respectively comprising: (i) a key embedding of context data, and (ii) a value of a hidden state of a neural network at the respective previous time step. The neural network is configured to receive a system input and a remembered value of the hidden state of the neural network and to generate a system output. The system comprises a memory interface subsystem that is configured to determine a key embedding for current context data, determine a remembered value of the hidden state of the neural network based on the key embedding, and provide the remembered value of the hidden state as an input to the neural network.

Type: Grant

Filed: February 8, 2019

Date of Patent: August 23, 2022

Assignee: DeepMind Technologies Limited

Inventors: Samuel Ritter, Xiao Jing Wang, Siddhant Jayakumar, Razvan Pascanu, Charles Blundell, Matthew Botvinick
Reinforcement learning using agent curricula

Patent number: 11113605

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning using agent curricula. One of the methods includes maintaining data specifying plurality of candidate agent policy neural networks; initializing mixing data that assigns a respective weight to each of the candidate agent policy neural networks; training the candidate agent policy neural networks using a reinforcement learning technique to generate combined action selection policies that result in improved performance on a reinforcement learning task; and during the training, repeatedly adjusting the weights in the mixing data to favor higher-performing candidate agent policy neural networks.

Type: Grant

Filed: May 20, 2019

Date of Patent: September 7, 2021

Assignee: DeepMind Technologies Limited

Inventors: Wojciech Czarnecki, Siddhant Jayakumar
MACHINE LEARNING SYSTEMS WITH MEMORY BASED PARAMETER ADAPTATION FOR LEARNING FAST AND SLOWER

Publication number: 20200285940

Abstract: There is described herein a computer-implemented method of processing an input data item. The method comprises processing the input data item using a parametric model to generate output data, wherein the parametric model comprises a first sub-model and a second sub-model. The processing comprises processing, by the first sub-model, the input data to generate a query data item, retrieving, from a memory storing data point-value pairs, at least one data point-value pair based upon the query data item and modifying weights of the second sub-model based upon the retrieved at least one data point-value pair. The output data is then generated based upon the modified second sub-model.

Type: Application

Filed: October 29, 2018

Publication date: September 10, 2020

Inventors: Pablo Sprechmann, Siddhant Jayakumar, Jack William Rae, Alexander Pritzel, Adrià Puigdomènech Badia, Oriol Vinyals, Razvan Pascanu, Charles Blundell
REINFORCEMENT LEARNING USING AGENT CURRICULA

Publication number: 20190354867

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning using agent curricula. One of the methods includes maintaining data specifying plurality of candidate agent policy neural networks; initializing mixing data that assigns a respective weight to each of the candidate agent policy neural networks; training the candidate agent policy neural networks using a reinforcement learning technique to generate combined action selection policies that result in improved performance on a reinforcement learning task; and during the training, repeatedly adjusting the weights in the mixing data to favor higher-performing candidate agent policy neural networks.

Type: Application

Filed: May 20, 2019

Publication date: November 21, 2019

Inventors: Wojciech Czarnecki, Siddhant Jayakumar

Machine learning systems with memory based parameter adaptation for learning fast and slower

Selecting actions by reverting to previous learned action selection policies

Reinforcement learning using agent curricula

MACHINE LEARNING SYSTEMS WITH MEMORY BASED PARAMETER ADAPTATION FOR LEARNING FAST AND SLOWER

REINFORCEMENT LEARNING USING AGENT CURRICULA