Patents Assigned to DeepMind Technologies

Reinforcement learning using meta-learned intrinsic rewards

Patent number: 12293283

Abstract: There is described methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a reinforcement learning system. The reinforcement learning system comprises an agent configured to perform actions based upon a policy and an intrinsic reward system configured to generate intrinsic reward values for the agent based upon the actions taken by the agent. The method comprises training the reinforcement learning system based upon a plurality of tasks. The training comprises updating the agent's policy based upon the intrinsic reward values generated by the intrinsic reward system and updating the intrinsic reward system based upon an extrinsic reward value obtained based upon the task being performed by the agent. The training further comprises re-initializing the agent's policy when an expiration criterion associated with the agent is met.

Type: Grant

Filed: September 25, 2020

Date of Patent: May 6, 2025

Assignee: DeepMind Technologies Limited

Inventors: Zeyu Zheng, Junhyuk Oh, Satinder Singh Baveja
Beam search decoding with forward-looking scores

Patent number: 12287795

Abstract: Methods and systems for beam search decoding. One of the methods includes initializing beam data specifying a set of k candidate output sequences and a respective total score for each of the candidate output sequences; updating the beam data at each of a plurality of decoding steps, comprising, at each decoding step: generating a score distribution that comprises a respective score for each token in the vocabulary; identifying a plurality of expanded sequences; generating, for each expanded sequence, a respective backwards-looking score; generating, for each expanded sequence, a respective forward-looking score; computing, for each expanded sequence, a respective total score from the respective forward-looking score for the expanded sequence and the respective backwards-looking score for the expanded sequence; and updating the set of k candidate output sequences using the respective total scores for the expanded sequences.

Type: Grant

Filed: December 29, 2023

Date of Patent: April 29, 2025

Assignee: DeepMind Technologies Limited

Inventors: Domenic Joseph Donato, Christopher James Dyer, Rémi Leblond
Generating audio data using unaligned text inputs with an adversarial network

Patent number: 12288547

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a generative neural network to convert conditioning text inputs to audio outputs. The generative neural network includes an alignment neural network that is configured to receive a generative input that includes the conditioning text input and to process the generative input to generate an aligned conditioning sequence that comprises a respective feature representation at each of a plurality of first time steps and that is temporally aligned with the audio output.

Type: Grant

Filed: June 4, 2021

Date of Patent: April 29, 2025

Assignee: DeepMind Technologies Limited

Inventors: Jeffrey Donahue, Karen Simonyan, Sander Etienne Lea Dieleman, Mikolaj Binkowski, Erich Konrad Elsen
Reinforcement learning using distributed prioritized replay

Patent number: 12277497

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training an action selection neural network used to select actions to be performed by an agent interacting with an environment. One of the systems includes (i) a plurality of actor computing units, in which each of the actor computing units is configured to maintain a respective replica of the action selection neural network and to perform a plurality of actor operations, and (ii) one or more learner computing units, in which each of the one or more learner computing units is configured to perform a plurality of learner operations.

Type: Grant

Filed: April 6, 2023

Date of Patent: April 15, 2025

Assignee: DeepMind Technologies Limited

Inventors: David Budden, Gabriel Barth-Maron, John Quan, Daniel George Horgan
Energy-based associative memory neural networks

Patent number: 12277487

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing associative memory. In one aspect a system comprises an associative memory neural network to process an input to generate an output that defines an energy corresponding to the input. A reading subsystem retrieves stored information from the associative memory neural network. The reading subsystem performs operations including receiving a given, i.e. query, input and retrieving a data element from the associative memory neural network that is associated with the given input. The retrieving is performed by iteratively adjusting the given input using the associative memory neural network.

Type: Grant

Filed: May 19, 2020

Date of Patent: April 15, 2025

Assignee: DeepMind Technologies Limited

Inventors: Sergey Bartunov, Jack William Rae, Timothy Paul Lillicrap, Simon Osindero
Selecting action slates using reinforcement learning

Patent number: 12277493

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for selecting action slates using reinforcement learning. One of the methods includes receiving an observation characterizing a current state of an environment; selecting an action slate by processing the observation and a plurality of candidate action slates using a deep neural network, wherein each candidate action slate comprises a respective plurality of actions from the set of actions, and wherein the deep neural network is configured to, for each of the candidate action slates, process the observation and the actions in the candidate action slate to generate a slate Q value for the candidate action slate that is an estimate of a long-term reward resulting from the candidate action slate being provided to the action selector in response to the observation; and providing the selected action slate to an action selector in response to the observation.

Type: Grant

Filed: May 18, 2020

Date of Patent: April 15, 2025

Assignee: DeepMind Technologies Limited

Inventor: Peter Goran Sunehag
Generative adversarial networks with temporal and spatial discriminators for efficient video generation

Patent number: 12277672

Abstract: The present disclosure proposes the use of a dual discriminator network that comprises a temporal discriminator network for discriminating based on temporal features of a series of images and a spatial discriminator network for discriminating based on spatial features of individual images. The training methods described herein provide improvements in computational efficiency. This is achieved by applying the spatial discriminator network to a set of one or more images that have reduced temporal resolution and applying the temporal discriminator network to a set of images that have reduced spatial resolution. This allows each of the discriminator networks to be applied more efficiently in order to produce a discriminator score for use in training the generator, whilst maintaining accuracy of the discriminator network. In addition, this allows a generator network to be trained to more accurately generate sequences of images, through the use of the improved discriminator.

Type: Grant

Filed: May 22, 2020

Date of Patent: April 15, 2025

Assignee: DeepMind Technologies Limited

Inventors: Aidan Clark, Jeffrey Donahue, Karen Simonyan
Training machine learning models by determining update rules using neural networks

Patent number: 12271823

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for training machine learning models. One method includes obtaining a machine learning model, wherein the machine learning model comprises one or more model parameters, and the machine learning model is trained using gradient descent techniques to optimize an objective function; determining an update rule for the model parameters using a recurrent neural network (RNN); and applying a determined update rule for a final time step in a sequence of multiple time steps to the model parameters.

Type: Grant

Filed: March 8, 2023

Date of Patent: April 8, 2025

Assignee: DeepMind Technologies Limited

Inventors: Misha Man Ray Denil, Tom Schaul, Marcin Andrychowicz, Joao Ferdinando Gomes de Freitas, Sergio Gomez Colmenarejo, Matthew William Hoffman, David Benjamin Pfau
Generating images using neural networks

Patent number: 12267518

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating images using neural networks. One of the methods includes generating the output image pixel by pixel from a sequence of pixels taken from the output image, comprising, for each pixel in the output image, generating a respective score distribution over a discrete set of possible color values for each of the plurality of color channels.

Type: Grant

Filed: January 8, 2024

Date of Patent: April 1, 2025

Assignee: DeepMind Technologies Limited

Inventors: Aaron Gerard Antonius van den Oord, Nal Emmerich Kalchbrenner, Karen Simonyan
Action selection based on environment observations and textual instructions

Patent number: 12265795

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for selecting actions to be performed by an agent interacting with an environment. In one aspect, a system includes a language encoder model that is configured to receive a text string in a particular natural language, and process the text string to generate a text embedding of the text string. The system includes an observation encoder neural network that is configured to receive an observation characterizing a state of the environment, and process the observation to generate an observation embedding of the observation. The system includes a subsystem that is configured to obtain a current text embedding of a current text string and a current observation embedding of a current observation. The subsystem is configured to select an action to be performed by the agent in response to the current observation.

Type: Grant

Filed: April 29, 2024

Date of Patent: April 1, 2025

Assignee: DeepMind Technologies Limited

Inventors: Karl Moritz Hermann, Philip Blunsom, Felix George Hill
Neural programming

Patent number: 12260334

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for neural programming. One of the methods includes processing a current neural network input using a core recurrent neural network to generate a neural network output; determining, from the neural network output, whether or not to end a currently invoked program and to return to a calling program from the set of programs; determining, from the neural network output, a next program to be called; determining, from the neural network output, contents of arguments to the next program to be called; receiving a representation of a current state of the environment; and generating a next neural network input from an embedding for the next program to be called and the representation of the current state of the environment.

Type: Grant

Filed: October 30, 2023

Date of Patent: March 25, 2025

Assignee: DeepMind Technologies Limited

Inventors: Scott Ellison Reed, Joao Ferdinando Gomes de Freitas
Action classification in video clips using attention-based neural networks

Patent number: 12254693

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for classifying actions in a video. One of the methods obtaining a feature representation of a video clip; obtaining data specifying a plurality of candidate agent bounding boxes in the key video frame; and for each candidate agent bounding box: processing the feature representation through an action transformer neural network.

Type: Grant

Filed: October 2, 2023

Date of Patent: March 18, 2025

Assignee: DeepMind Technologies Limited

Inventors: Joao Carreira, Carl Doersch, Andrew Zisserman
Training a neural network using outputs of a corruption neural network

Patent number: 12254678

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for processing a network input using a trained neural network with network parameters to generate an output for a machine learning task. The training includes: receiving a set of training examples each including a training network input and a reference output; for each training iteration, generating a corrupted network input for each training network input using a corruption neural network; updating perturbation parameters of the corruption neural network using a first objective function based on the corrupted network inputs; generating an updated corrupted network input for each training network input based on the updated perturbation parameters; and generating a network output for each updated corrupted network input using the neural network; for each training example, updating the network parameters using a second objective function based on the network output and the reference output.

Type: Grant

Filed: April 1, 2022

Date of Patent: March 18, 2025

Assignee: DeepMind Technologies Limited

Inventors: Dan-Andrei Calian, Sven Adrian Gowal, Timothy Arthur Mann, András György
Antisymmetric neural networks

Patent number: 12248861

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing inputs using antisymmetric neural networks.

Type: Grant

Filed: September 3, 2020

Date of Patent: March 11, 2025

Assignee: DeepMind Technologies Limited

Inventors: David Benjamin Pfau, James Spencer, Alexander Graeme de Garis Matthews
Machine learning systems with memory based parameter adaptation for learning fast and slower

Patent number: 12242947

Abstract: There is described herein a computer-implemented method of processing an input data item. The method comprises processing the input data item using a parametric model to generate output data, wherein the parametric model comprises a first sub-model and a second sub-model. The processing comprises processing, by the first sub-model, the input data to generate a query data item, retrieving, from a memory storing data point-value pairs, at least one data point-value pair based upon the query data item and modifying weights of the second sub-model based upon the retrieved at least one data point-value pair. The output data is then generated based upon the modified second sub-model.

Type: Grant

Filed: October 29, 2018

Date of Patent: March 4, 2025

Assignee: DeepMind Technologies Limited

Inventors: Pablo Sprechmann, Siddhant Jayakumar, Jack William Rae, Alexander Pritzel, Adrià Puigdomènech Badia, Oriol Vinyals, Razvan Pascanu, Charles Blundell
Adaptive visual speech recognition

Patent number: 12211488

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing video data using an adaptive visual speech recognition model. One of the methods includes receiving a video that includes a plurality of video frames that depict a first speaker: obtaining a first embedding characterizing the first speaker; and processing a first input comprising (i) the video and (ii) the first embedding using a visual speech recognition neural network having a plurality of parameters, wherein the visual speech recognition neural network is configured to process the video and the first embedding in accordance with trained values of the parameters to generate a speech recognition output that defines a sequence of one or more words being spoken by the first speaker in the video.

Type: Grant

Filed: June 15, 2022

Date of Patent: January 28, 2025

Assignee: DeepMind Technologies Limited

Inventors: Ioannis Alexandros Assael, Brendan Shillingford, Joao Ferdinando Gomes de Freitas
Speech synthesis utilizing audio waveform difference signal(s)

Patent number: 12211484

Abstract: Techniques are disclosed that enable generation of an audio waveform representing synthesized speech based on a difference signal determined using an autoregressive model. Various implementations include using a distribution of the difference signal values to represent sounds found in human speech with a higher level of granularity than sounds not frequently found in human speech. Additional or alternative implementations include using one or more speakers of a client device to render the generated audio waveform.

Type: Grant

Filed: January 19, 2024

Date of Patent: January 28, 2025

Assignee: DeepMind Technologies Limited

Inventors: Luis Carlos Cobo Rus, Nal Kalchbrenner, Erich Elsen, Chenjie Gu
Distributional reinforcement learning using quantile function neural networks

Patent number: 12205032

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for selecting an action to be performed by a reinforcement learning agent interacting with an environment. In one aspect, a method comprises: receiving a current observation; for each action of a plurality of actions: randomly sampling one or more probability values; for each probability value: processing the action, the current observation, and the probability value using a quantile function network to generate an estimated quantile value for the probability value with respect to a probability distribution over possible returns that would result from the agent performing the action in response to the current observation; determining a measure of central tendency of the one or more estimated quantile values; and selecting an action to be performed by the agent in response to the current observation using the measures of central tendency for the actions.

Type: Grant

Filed: December 15, 2023

Date of Patent: January 21, 2025

Assignee: DeepMind Technologies Limited

Inventors: Georg Ostrovski, William Clinton Dabney
Predicting properties of materials from physical material structures

Patent number: 12190236

Abstract: Methods, computer systems, and apparatus, including computer programs encoded on computer storage media, for predicting one or more properties of a material. One of the methods includes maintaining data specifying a set of known materials each having a respective known physical structure; receiving data specifying a new material; identifying a plurality of known materials in the set of known materials that are similar to the new material; determining a predicted embedding of the new material from at least respective embeddings corresponding to each of the similar known materials; and processing the predicted embedding of the new material using an experimental prediction neural network to predict one or more properties of the new material.

Type: Grant

Filed: April 26, 2021

Date of Patent: January 7, 2025

Assignee: DeepMind Technologies Limited

Inventors: Annette Ada Nkechinyere Obika, Tian Xie, Victor Constant Bapst, Alexander Lloyd Gaunt, James Kirkpatrick
Training action selection neural networks using off-policy actor critic reinforcement learning and stochastic dueling neural networks

Patent number: 12190223

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an action selection neural network. One of the methods includes maintaining a replay memory that stores trajectories generated as a result of interaction of an agent with an environment; and training an action selection neural network having policy parameters on the trajectories in the replay memory, wherein training the action selection neural network comprises: sampling a trajectory from the replay memory; and adjusting current values of the policy parameters by training the action selection neural network on the trajectory using an off-policy actor critic reinforcement learning technique.

Type: Grant

Filed: May 28, 2020

Date of Patent: January 7, 2025

Assignee: DeepMind Technologies Limited

Inventors: Ziyu Wang, Nicolas Manfred Otto Heess, Victor Constant Bapst

1 2 3 4 5 … next