Patents by Inventor Simon Osindero

Simon Osindero has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

REINFORCEMENT LEARNING FOR ACTIVE SEQUENCE PROCESSING

Publication number: 20250148774

Abstract: A system that is configured to receive a sequence of task inputs and to perform a machine learning task is described. An RL neural network is configured to: generate, for each task input of the sequence, a respective decision that determines whether to encode the task input or to skip the task input, and provide the respective decision of each task input to the task neural network. The task neural network is configured to: receive the sequence of task inputs, receive, from the RL neural network, for each task input of the sequence, a respective decision, process each of the un-skipped task inputs in the sequence of task inputs to generate a respective accumulated feature for the un-skipped task input, and generate a machine learning task output for the machine learning task based on the last accumulated feature generated for the last un-skipped task input in the sequence.

Type: Application

Filed: November 19, 2024

Publication date: May 8, 2025

Inventors: Viorica Patraucean, Bilal Piot, Joao Carreira, Volodymyr Mnih, Simon Osindero
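
Illustrative sketch (not taken from the filing): the encode-or-skip flow in the abstract above can be pictured with two plain functions. The names `rl_decisions`, `task_network`, `threshold_policy` and `running_sum` are hypothetical, and a fixed threshold rule stands in for the trained RL neural network, so this is only a toy reading of the data flow.

```python
from typing import Callable, List, Optional, Sequence


def rl_decisions(inputs: Sequence[int], policy: Callable[[int], bool]) -> List[bool]:
    """Stand-in for the RL network: one encode (True) / skip (False) decision per input."""
    return [policy(x) for x in inputs]


def task_network(
    inputs: Sequence[int],
    decisions: Sequence[bool],
    accumulate: Callable[[Optional[int], int], int],
) -> Optional[int]:
    """Stand-in for the task network: fold each un-skipped input into an
    accumulated feature and return the last one (None if everything was skipped)."""
    feature: Optional[int] = None
    for x, encode in zip(inputs, decisions):
        if encode:
            feature = accumulate(feature, x)
    return feature


def threshold_policy(x: int) -> bool:
    # Hypothetical fixed rule; the abstract describes a learned RL policy instead.
    return abs(x) > 5


def running_sum(prev: Optional[int], x: int) -> int:
    # Hypothetical accumulator; a real task network would update learned features.
    return x if prev is None else prev + x


task_inputs = [1, 9, -2, 15, 3]
decisions = rl_decisions(task_inputs, threshold_policy)
output = task_network(task_inputs, decisions, running_sum)
```

Only the inputs 9 and 15 are encoded here, and the task output is derived from the feature accumulated at the last un-skipped input.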

Energy-based associative memory neural networks

Patent number: 12277487

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing associative memory. In one aspect a system comprises an associative memory neural network to process an input to generate an output that defines an energy corresponding to the input. A reading subsystem retrieves stored information from the associative memory neural network. The reading subsystem performs operations including receiving a given, i.e. query, input and retrieving a data element from the associative memory neural network that is associated with the given input. The retrieving is performed by iteratively adjusting the given input using the associative memory neural network.

Type: Grant

Filed: May 19, 2020

Date of Patent: April 15, 2025

Assignee: DeepMind Technologies Limited

Inventors: Sergey Bartunov, Jack William Rae, Timothy Paul Lillicrap, Simon Osindero
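
Illustrative sketch (not taken from the filing): the abstract above describes retrieval by iteratively adjusting a query input using a network whose output defines an energy. The toy below assumes a hand-written log-sum-exp energy over two stored patterns and plain gradient descent with finite differences. The energy form, the step size and the function names are all assumptions made for illustration.

```python
import math
from typing import Callable, List, Sequence


def energy(x: Sequence[float], patterns: Sequence[Sequence[float]], beta: float = 4.0) -> float:
    """Toy energy that is low near any stored pattern (assumed form, not from the patent)."""
    terms = [-beta * sum((a - b) ** 2 for a, b in zip(x, p)) for p in patterns]
    m = max(terms)  # subtract the max for a numerically stable log-sum-exp
    return -(m + math.log(sum(math.exp(t - m) for t in terms))) / beta


def retrieve(
    query: Sequence[float],
    energy_fn: Callable[[List[float]], float],
    steps: int = 200,
    lr: float = 0.1,
    eps: float = 1e-5,
) -> List[float]:
    """Iteratively adjust the query downhill on the energy surface."""
    x = list(query)
    for _ in range(steps):
        grad = []
        for i in range(len(x)):
            hi = x[:i] + [x[i] + eps] + x[i + 1:]
            lo = x[:i] + [x[i] - eps] + x[i + 1:]
            grad.append((energy_fn(hi) - energy_fn(lo)) / (2 * eps))  # central difference
        x = [xi - lr * g for xi, g in zip(x, grad)]
    return x


patterns = [[1.0, 1.0], [-1.0, -1.0]]
query = [0.6, 0.9]
retrieved = retrieve(query, lambda x: energy(x, patterns))
```

Starting from a corrupted query near the first pattern, the descent settles on that pattern, which is the sense in which the stored element is "associated with" the query in this toy setting.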

Reinforcement learning for active sequence processing

Patent number: 12175737

Abstract: A system that is configured to receive a sequence of task inputs and to perform a machine learning task is described. The system includes a reinforcement learning (RL) neural network and a task neural network. The RL neural network is configured to: generate, for each task input of the sequence of task inputs, a respective decision that determines whether to encode the task input or to skip the task input, and provide the respective decision of each task input to the task neural network.

Type: Grant

Filed: November 13, 2020

Date of Patent: December 24, 2024

Assignee: DeepMind Technologies Limited

Inventors: Viorica Patraucean, Bilal Piot, Joao Carreira, Volodymyr Mnih, Simon Osindero

RETRIEVAL AUGMENTED REINFORCEMENT LEARNING

Publication number: 20240320506

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling a reinforcement learning agent in an environment to perform a task using a retrieval-augmented action selection process.

Type: Application

Filed: October 5, 2022

Publication date: September 26, 2024

Inventors: Anirudh Goyal, Andrea Banino, Abram Luke Friesen, Theophane Guillaume Weber, Adrià Puigdomènech Badia, Nan Ke, Simon Osindero, Timothy Paul Lillicrap, Charles Blundell

Modulating agent behavior to optimize learning progress

Patent number: 12061964

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent. One of the methods includes sampling a behavior modulation in accordance with a current probability distribution; for each of one or more time steps: processing an input comprising an observation characterizing a current state of the environment at the time step using an action selection neural network to generate a respective action score for each action in a set of possible actions that can be performed by the agent; modifying the action scores using the sampled behavior modulation; and selecting the action to be performed by the agent at the time step based on the modified action scores; determining a fitness measure corresponding to the sampled behavior modulation; and updating the current probability distribution over the set of possible behavior modulations using the fitness measure corresponding to the behavior modulation.

Type: Grant

Filed: September 25, 2020

Date of Patent: August 13, 2024

Assignee: DeepMind Technologies Limited

Inventors: Tom Schaul, Diana Luiza Borsa, Fengning Ding, David Szepesvari, Georg Ostrovski, Simon Osindero, William Clinton Dabney
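
Illustrative sketch (not taken from the filing): the loop in the abstract above (sample a modulation, modify the action scores, act, measure fitness, update the distribution) can be mimicked with a two-armed toy. Everything concrete here is an assumption: the modulation is a bias added to one action's score, selection is greedy, fitness is the episode's total reward, and the distribution is a softmax over running fitness estimates.

```python
import math
import random
from typing import List


def softmax(values: List[float]) -> List[float]:
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def modify_scores(scores: List[float], bias: float) -> List[float]:
    """Apply the sampled modulation: here, a bias added to the score of action 1."""
    return [scores[0], scores[1] + bias]


def run_episode(bias: float, steps: int = 3) -> float:
    """Return the fitness (total reward) obtained under one sampled modulation."""
    fitness = 0.0
    for _ in range(steps):
        scores = [1.0, 0.5]  # stand-in for the action selection network's output
        modified = modify_scores(scores, bias)
        action = max(range(len(modified)), key=modified.__getitem__)
        fitness += 1.0 if action == 0 else 0.0  # toy reward: only action 0 pays
    return fitness


modulations = [0.0, 2.0]   # hypothetical set of possible behavior modulations
preferences = [0.0, 0.0]   # running fitness estimate per modulation
rng = random.Random(0)
for _ in range(50):
    probs = softmax(preferences)
    idx = rng.choices(range(len(modulations)), weights=probs)[0]
    fitness = run_episode(modulations[idx])
    preferences[idx] += 0.1 * (fitness - preferences[idx])
probs = softmax(preferences)
```

In this toy the zero-bias modulation always earns the higher fitness, so the updated distribution never favors the biased one.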

Parallel video processing systems

Patent number: 11967150

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for parallel processing of video frames using neural networks. One of the methods includes receiving a video sequence comprising a respective video frame at each of a plurality of time steps; and processing the video sequence using a video processing neural network to generate a video processing output for the video sequence, wherein the video processing neural network includes a sequence of network components, wherein the network components comprise a plurality of layer blocks each comprising one or more neural network layers, wherein each component is active for a respective subset of the plurality of time steps, and wherein each layer block is configured to, at each time step at which the layer block is active, receive an input generated at a previous time step and to process the input to generate a block output.

Type: Grant

Filed: February 13, 2023

Date of Patent: April 23, 2024

Assignee: DeepMind Technologies Limited

Inventors: Simon Osindero, Joao Carreira, Viorica Patraucean, Andrew Zisserman
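
Illustrative sketch (not taken from the filing): one way to read "receive an input generated at a previous time step" in the abstract above is as a pipeline in which every block consumes what its predecessor produced one step earlier. The sketch assumes integer "frames", simple arithmetic blocks, and a first block that reads the current frame directly. It does not model blocks that are active only on a subset of time steps.

```python
from typing import Callable, List, Optional, Sequence


def run_pipeline(
    frames: Sequence[int],
    blocks: Sequence[Callable[[int], int]],
) -> List[Optional[int]]:
    """Return the last block's output at each time step (None while the pipeline fills)."""
    prev: List[Optional[int]] = [None] * len(blocks)  # block outputs from the previous step
    outputs: List[Optional[int]] = []
    for frame in frames:
        current: List[Optional[int]] = []
        for i, block in enumerate(blocks):
            # Block 0 reads the new frame; block i reads block i-1's previous-step output.
            source = frame if i == 0 else prev[i - 1]
            current.append(None if source is None else block(source))
        prev = current
        outputs.append(current[-1])
    return outputs


blocks = [lambda x: x + 1, lambda x: x * 2, lambda x: x - 3]
outputs = run_pipeline([1, 2, 3, 4], blocks)
```

Because each block depends only on values from the previous step, all blocks within one time step are independent of each other and could be evaluated in parallel. The cost is a latency of one step per block before the first output appears.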

Training neural networks using synthetic gradients

Patent number: 11715009

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network including a first subnetwork followed by a second subnetwork on training inputs by optimizing an objective function. In one aspect, a method includes processing a training input using the neural network to generate a training model output, including processing a subnetwork input for the training input using the first subnetwork to generate a subnetwork activation for the training input in accordance with current values of parameters of the first subnetwork, and providing the subnetwork activation as input to the second subnetwork; determining a synthetic gradient of the objective function for the first subnetwork by processing the subnetwork activation using a synthetic gradient model in accordance with current values of parameters of the synthetic gradient model; and updating the current values of the parameters of the first subnetwork using the synthetic gradient.

Type: Grant

Filed: May 19, 2017

Date of Patent: August 1, 2023

Assignee: DeepMind Technologies Limited

Inventors: Oriol Vinyals, Alexander Benjamin Graves, Wojciech Czarnecki, Koray Kavukcuoglu, Simon Osindero, Maxwell Elliot Jaderberg
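
Illustrative sketch (not taken from the filing): the update described in the abstract above can be shown with scalar "subnetworks". The first subnetwork is assumed to be a single weight, and the synthetic gradient model is assumed to be a linear function of the activation. Both choices, and all names, are hypothetical. How the synthetic gradient model itself is trained is left out.

```python
from typing import Tuple


def first_subnetwork(x: float, w1: float) -> float:
    """Toy first subnetwork: a single multiplicative weight."""
    return w1 * x


def synthetic_gradient(h: float, sg_params: Tuple[float, float]) -> float:
    """Toy synthetic gradient model: a linear estimate of dLoss/dh from h alone."""
    a, b = sg_params
    return a * h + b


def update_first_subnetwork(
    x: float, w1: float, sg_params: Tuple[float, float], lr: float
) -> float:
    h = first_subnetwork(x, w1)               # subnetwork activation
    g_hat = synthetic_gradient(h, sg_params)  # estimated gradient w.r.t. the activation
    # Chain rule with dh/dw1 = x; no backward pass through the second subnetwork is needed.
    return w1 - lr * g_hat * x


new_w1 = update_first_subnetwork(x=2.0, w1=0.5, sg_params=(1.0, -3.0), lr=0.25)
```

The point of the construction is that the first subnetwork can be updated as soon as its activation is available, without waiting for the true gradient from the second subnetwork.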

PARALLEL VIDEO PROCESSING SYSTEMS

Publication number: 20230186625

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for parallel processing of video frames using neural networks. One of the methods includes receiving a video sequence comprising a respective video frame at each of a plurality of time steps; and processing the video sequence using a video processing neural network to generate a video processing output for the video sequence, wherein the video processing neural network includes a sequence of network components, wherein the network components comprise a plurality of layer blocks each comprising one or more neural network layers, wherein each component is active for a respective subset of the plurality of time steps, and wherein each layer block is configured to, at each time step at which the layer block is active, receive an input generated at a previous time step and to process the input to generate a block output.

Type: Application

Filed: February 13, 2023

Publication date: June 15, 2023

Inventors: Simon Osindero, Joao Carreira, Viorica Patraucean, Andrew Zisserman

SYSTEM AND METHOD FOR TRAINING A SPARSE NEURAL NETWORK WHILST MAINTAINING SPARSITY

Publication number: 20230124177

Abstract: A computer-implemented method of training a neural network. The method comprises repeatedly determining a forward-pass set of network parameters by selecting a first sub-set of parameters of the neural network and setting all other parameters of the forward-pass set of network parameters to zero. The method then processes a training data item using the neural network in accordance with the forward-pass set of network parameters to generate a neural network output, determines a value of an objective function from the neural network output and the training data item, selects a second sub-set of network parameters, determines a backward-pass set of network parameters comprising the first and second sub-sets of parameters, and updates parameters corresponding to the backward-pass set of network parameters using a gradient estimate determined from the value of the objective function.

Type: Application

Filed: June 4, 2021

Publication date: April 20, 2023

Inventors: Siddhant Madhu Jayakumar, Razvan Pascanu, Jack William Rae, Simon Osindero, Erich Konrad Elsen
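
Illustrative sketch (not taken from the filing): the abstract above distinguishes a forward-pass parameter set from a larger backward-pass set. The toy below assumes that both sub-sets are chosen by weight magnitude, that the model is a linear map with a squared-error objective, and that the gradient is taken with respect to the dense weights at the sparse forward output. None of these choices is stated in the abstract.

```python
from typing import List, Set


def top_indices(params: List[float], k: int) -> Set[int]:
    """Indices of the k largest-magnitude parameters (assumed selection rule)."""
    order = sorted(range(len(params)), key=lambda i: abs(params[i]), reverse=True)
    return set(order[:k])


def forward_params(params: List[float], forward_set: Set[int]) -> List[float]:
    """Forward-pass parameters: everything outside the first sub-set is zeroed."""
    return [p if i in forward_set else 0.0 for i, p in enumerate(params)]


def sparse_update(
    params: List[float], grads: List[float], backward_set: Set[int], lr: float
) -> List[float]:
    """Update only the parameters in the backward-pass set."""
    return [p - lr * g if i in backward_set else p
            for i, (p, g) in enumerate(zip(params, grads))]


params = [0.5, -2.0, 0.25, 1.0]
x, target = [1.0, 1.0, 1.0, 1.0], 0.0

forward_set = top_indices(params, 2)               # first sub-set
second_set = top_indices(params, 3) - forward_set  # second sub-set (next largest)
backward_set = forward_set | second_set

masked = forward_params(params, forward_set)
y = sum(w * xi for w, xi in zip(masked, x))        # sparse forward pass
grads = [2.0 * (y - target) * xi for xi in x]      # d(y - target)^2 / dw_i
updated = sparse_update(params, grads, backward_set, lr=0.25)
```

Parameters in the second sub-set receive updates even though they were zero in the forward pass, which lets them grow into the forward set later. The parameter outside both sub-sets is left untouched.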

ACTION SELECTION FOR REINFORCEMENT LEARNING USING A MANAGER NEURAL NETWORK THAT GENERATES GOAL VECTORS DEFINING AGENT OBJECTIVES

Publication number: 20230090824

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a system configured to select actions to be performed by an agent that interacts with an environment. The system comprises a manager neural network subsystem and a worker neural network subsystem. The manager subsystem is configured to, at each of the multiple time steps, generate a final goal vector for the time step. The worker subsystem is configured to, at each of multiple time steps, use the final goal vector generated by the manager subsystem to generate a respective action score for each action in a predetermined set of actions.

Type: Application

Filed: November 30, 2022

Publication date: March 23, 2023

Inventors: Simon Osindero, Koray Kavukcuoglu, Alexander Vezhnevets
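
Illustrative sketch (not taken from the filing): the manager/worker split in the abstract above can be pictured as a manager that emits a goal vector and a worker that scores actions against it. The sketch assumes a goal equal to the unit direction from the current state to a target, and action scores given by the dot product of the goal with hand-written action embeddings. Both rules and all names are hypothetical stand-ins for learned networks.

```python
import math
from typing import Dict, List


def manager_goal(state: List[float], target: List[float]) -> List[float]:
    """Toy manager: the goal vector is the unit direction toward a target state."""
    diff = [t - s for s, t in zip(state, target)]
    norm = math.sqrt(sum(d * d for d in diff)) or 1.0  # avoid division by zero
    return [d / norm for d in diff]


def worker_scores(goal: List[float], action_embeddings: Dict[str, List[float]]) -> Dict[str, float]:
    """Toy worker: score each action by how well its embedding aligns with the goal."""
    return {name: sum(g * e for g, e in zip(goal, emb))
            for name, emb in action_embeddings.items()}


action_embeddings = {
    "right": [1.0, 0.0],
    "up": [0.0, 1.0],
    "left": [-1.0, 0.0],
    "down": [0.0, -1.0],
}

goal = manager_goal(state=[0.0, 0.0], target=[3.0, 4.0])
scores = worker_scores(goal, action_embeddings)
best_action = max(scores, key=scores.get)
```

With the target up and to the right, the goal points mostly upward, so "up" receives the highest score in this toy setting.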

Parallel video processing neural networks

Patent number: 11580736

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for parallel processing of video frames using neural networks. One of the methods includes receiving a video sequence comprising a respective video frame at each of a plurality of time steps; and processing the video sequence using a video processing neural network to generate a video processing output for the video sequence, wherein the video processing neural network includes a sequence of network components, wherein the network components comprise a plurality of layer blocks each comprising one or more neural network layers, wherein each component is active for a respective subset of the plurality of time steps, and wherein each layer block is configured to, at each time step at which the layer block is active, receive an input generated at a previous time step and to process the input to generate a block output.

Type: Grant

Filed: January 7, 2019

Date of Patent: February 14, 2023

Assignee: DeepMind Technologies Limited

Inventors: Simon Osindero, Joao Carreira, Viorica Patraucean, Andrew Zisserman

Action selection for reinforcement learning using a manager neural network that generates goal vectors defining agent objectives

Patent number: 11537887

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a system configured to select actions to be performed by an agent that interacts with an environment. The system comprises a manager neural network subsystem and a worker neural network subsystem. The manager subsystem is configured to, at each of the multiple time steps, generate a final goal vector for the time step. The worker subsystem is configured to, at each of multiple time steps, use the final goal vector generated by the manager subsystem to generate a respective action score for each action in a predetermined set of actions.

Type: Grant

Filed: May 5, 2020

Date of Patent: December 27, 2022

Assignee: DeepMind Technologies Limited

Inventors: Simon Osindero, Koray Kavukcuoglu, Alexander Vezhnevets

REINFORCEMENT LEARNING FOR ACTIVE SEQUENCE PROCESSING

Publication number: 20220392206

Abstract: A system that is configured to receive a sequence of task inputs and to perform a machine learning task is described. The system includes a reinforcement learning (RL) neural network and a task neural network. The RL neural network is configured to: generate, for each task input of the sequence of task inputs, a respective decision that determines whether to encode the task input or to skip the task input, and provide the respective decision of each task input to the task neural network.

Type: Application

Filed: November 13, 2020

Publication date: December 8, 2022

Inventors: Viorica Patraucean, Bilal Piot, Joao Carreira, Volodymyr Mnih, Simon Osindero

ENERGY-BASED ASSOCIATIVE MEMORY NEURAL NETWORKS

Publication number: 20220180147

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing associative memory. In one aspect a system comprises an associative memory neural network to process an input to generate an output that defines an energy corresponding to the input. A reading subsystem retrieves stored information from the associative memory neural network. The reading subsystem performs operations including receiving a given, i.e. query, input and retrieving a data element from the associative memory neural network that is associated with the given input. The retrieving is performed by iteratively adjusting the given input using the associative memory neural network.

Type: Application

Filed: May 19, 2020

Publication date: June 9, 2022

Inventors: Sergey Bartunov, Jack William Rae, Timothy Paul Lillicrap, Simon Osindero

MODULATING AGENT BEHAVIOR TO OPTIMIZE LEARNING PROGRESS

Publication number: 20210089908

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent. One of the methods includes sampling a behavior modulation in accordance with a current probability distribution; for each of one or more time steps: processing an input comprising an observation characterizing a current state of the environment at the time step using an action selection neural network to generate a respective action score for each action in a set of possible actions that can be performed by the agent; modifying the action scores using the sampled behavior modulation; and selecting the action to be performed by the agent at the time step based on the modified action scores; determining a fitness measure corresponding to the sampled behavior modulation; and updating the current probability distribution over the set of possible behavior modulations using the fitness measure corresponding to the behavior modulation.

Type: Application

Filed: September 25, 2020

Publication date: March 25, 2021

Inventors: Tom Schaul, Diana Luiza Borsa, Fengning Ding, David Szepesvari, Georg Ostrovski, Simon Osindero, William Clinton Dabney

PARALLEL VIDEO PROCESSING NEURAL NETWORKS

Publication number: 20210027064

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for parallel processing of video frames using neural networks. One of the methods includes receiving a video sequence comprising a respective video frame at each of a plurality of time steps; and processing the video sequence using a video processing neural network to generate a video processing output for the video sequence, wherein the video processing neural network includes a sequence of network components, wherein the network components comprise a plurality of layer blocks each comprising one or more neural network layers, wherein each component is active for a respective subset of the plurality of time steps, and wherein each layer block is configured to, at each time step at which the layer block is active, receive an input generated at a previous time step and to process the input to generate a block output.

Type: Application

Filed: January 7, 2019

Publication date: January 28, 2021

Inventors: Simon Osindero, Joao Carreira, Viorica Patraucean, Andrew Zisserman

TRAINING NEURAL NETWORKS USING SYNTHETIC GRADIENTS

Publication number: 20200320396

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network including a first subnetwork followed by a second subnetwork on training inputs by optimizing an objective function. In one aspect, a method includes processing a training input using the neural network to generate a training model output, including processing a subnetwork input for the training input using the first subnetwork to generate a subnetwork activation for the training input in accordance with current values of parameters of the first subnetwork, and providing the subnetwork activation as input to the second subnetwork; determining a synthetic gradient of the objective function for the first subnetwork by processing the subnetwork activation using a synthetic gradient model in accordance with current values of parameters of the synthetic gradient model; and updating the current values of the parameters of the first subnetwork using the synthetic gradient.

Type: Application

Filed: May 19, 2017

Publication date: October 8, 2020

Applicant: DeepMind Technologies Limited

Inventors: Oriol Vinyals, Alexander Benjamin Graves, Wojciech Czarnecki, Koray Kavukcuoglu, Simon Osindero, Maxwell Elliot Jaderberg

ACTION SELECTION FOR REINFORCEMENT LEARNING USING NEURAL NETWORKS

Publication number: 20200265313

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a system configured to select actions to be performed by an agent that interacts with an environment. The system comprises a manager neural network subsystem and a worker neural network subsystem. The manager subsystem is configured to, at each of the multiple time steps, generate a final goal vector for the time step. The worker subsystem is configured to, at each of multiple time steps, use the final goal vector generated by the manager subsystem to generate a respective action score for each action in a predetermined set of actions.

Type: Application

Filed: May 5, 2020

Publication date: August 20, 2020

Inventors: Simon Osindero, Koray Kavukcuoglu, Alexander Vezhnevets

Selective content insertion into areas of media objects

Patent number: 10706889

Abstract: One or more computing devices, systems, and/or methods for selective content insertion into areas of media objects are provided. For example, a media object (e.g., an image or video) is selected for composition with content, such as where a message, interactive content, a hyperlink, or other types of content is overlaid or embedded into the media object to create a composite media object. The content is added into an area of the media object that is selectively identified to reduce occlusion and/or improve visual cohesiveness between the content and the media object (e.g., added to an area with a similar or complementary color, having an adequate size with sparse amounts of visual features such as a soccer player, a ball, or other entity, etc.). In this way, the content may be added into the area of the media object to create a composite media object to provide to users.

Type: Grant

Filed: July 7, 2016

Date of Patent: July 7, 2020

Assignee: Oath Inc.

Inventors: Bart Thomée, Ioannis Kalantidis, Clayton Ellis Mellina, Simon Osindero
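
Illustrative sketch (not taken from the filing): choosing an insertion area "with sparse amounts of visual features", as the abstract above puts it, can be approximated by scanning a feature-density grid for the window with the lowest total. The grid values, the window size and the function name are hypothetical. Color similarity, which the abstract also mentions, is not modeled.

```python
from typing import List, Optional, Tuple


def least_occluding_area(
    saliency: List[List[int]], height: int, width: int
) -> Optional[Tuple[int, int, int]]:
    """Return (cost, row, col) of the height-by-width window covering the fewest features."""
    best: Optional[Tuple[int, int, int]] = None
    for r in range(len(saliency) - height + 1):
        for c in range(len(saliency[0]) - width + 1):
            cost = sum(saliency[r + i][c + j]
                       for i in range(height) for j in range(width))
            if best is None or cost < best[0]:
                best = (cost, r, c)
    return best


# Hypothetical feature-density map: high values mark busy regions (e.g. a player or ball).
saliency = [
    [9, 9, 0, 0],
    [9, 9, 0, 0],
    [1, 1, 5, 5],
]
area = least_occluding_area(saliency, 2, 2)
```

Here the empty top-right corner is selected, so overlaid content would not cover the busy region on the left.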

Action selection for reinforcement learning using neural networks

Patent number: 10679126

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a system configured to select actions to be performed by an agent that interacts with an environment. The system comprises a manager neural network subsystem and a worker neural network subsystem. The manager subsystem is configured to, at each of the multiple time steps, generate a final goal vector for the time step. The worker subsystem is configured to, at each of multiple time steps, use the final goal vector generated by the manager subsystem to generate a respective action score for each action in a predetermined set of actions.

Type: Grant

Filed: July 15, 2019

Date of Patent: June 9, 2020

Assignee: DeepMind Technologies Limited

Inventors: Simon Osindero, Koray Kavukcuoglu, Alexander Vezhnevets
