Patents by Inventor Andrea Tacchetti

Andrea Tacchetti has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

TRAINING A POLICY NEURAL NETWORK FOR CONTROLLING AN AGENT USING BEST RESPONSE POLICY ITERATION

Publication number: 20220261635

Abstract: Methods, systems and apparatus, including computer programs encoded on computer storage media, for training a policy neural network by repeatedly updating the policy neural network at each of a plurality of training iterations. One of the methods includes generating training data for the training iteration by controlling the agent in accordance with an improved policy that selects actions in response to input state representations. A best response computation is performed using (i) a candidate policy generated from respective policy neural networks as of one or more preceding iterations and (ii) a candidate value neural network. The candidate value neural network is configured to generate a value output that is an estimate of a value of the environment being in the state characterized by a state representation to complete a particular task. The policy neural network is updated by training the policy neural network on the training data.

Type: Application

Filed: January 7, 2022

Publication date: August 18, 2022

Inventors: Thomas William Anthony, Thomas Edward Eccles, Andrea Tacchetti, János Kramár, Ian Michael Gemp, Thomas Chalmers Hudson, Nicolas Pierre Mickaël Porcel, Marc Lanctot, Julien Perolat, Richard Everett, Thore Kurt Hartwig Graepel, Yoram Bachrach

Neural network architecture for efficient resource allocation

Patent number: 11250475

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for efficiently allocating resources among participants. Methods can include receiving valuation data specifying, for each of a plurality of entities, a respective valuation for each of a plurality of resource subsets, each resource subset comprising a different combination of one or more resources of a plurality of resources. After receiving valuation data, assigning each resource in the plurality of resources to a respective entity of the plurality of entities based on the valuations and generating, for each particular entity, a respective input representation that is derived from valuations of every other entity in the plurality of entities other than the particular entity. The input representation for each particular entity is processed using a neural network to generate a rule for the particular entity and a payment based on the rule output for the entities.

Type: Grant

Filed: July 1, 2020

Date of Patent: February 15, 2022

Assignee: DeepMind Technologies Limited

Inventors: Andrea Tacchetti, Daniel Joseph Strouse, Marta Garnelo Abellanas, Thore Kurt Hartwig Graepel, Yoram Bachrach
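
The flow described in the abstract above (assign resources from the valuations, build each entity's input from the other entities' valuations only, then derive a payment) can be illustrated with a small sketch. This is not the patented method: the neural network is replaced here by a hand-written stand-in that computes a Clarke pivot term, a classical technique the abstract does not name, and every function name and the example data are hypothetical. The brute-force search is only practical for a handful of resources.

```python
from itertools import product


def best_allocation(valuations, resources):
    """Assign every resource to some entity so that total valuation is maximal.

    valuations: {entity: {frozenset_of_resources: value}}; missing bundles count as 0.
    Returns ({entity: frozenset_of_resources}, total_value).
    """
    entities = sorted(valuations)
    best, best_welfare = None, float("-inf")
    # Enumerate every way of giving each resource to exactly one entity.
    for owners in product(entities, repeat=len(resources)):
        bundles = {
            e: frozenset(r for r, o in zip(resources, owners) if o == e)
            for e in entities
        }
        welfare = sum(valuations[e].get(bundles[e], 0.0) for e in entities)
        if welfare > best_welfare:
            best, best_welfare = bundles, welfare
    return best, best_welfare


def others_representation(valuations, entity):
    """Input for one entity, derived only from the other entities' valuations."""
    return {e: v for e, v in valuations.items() if e != entity}


def stand_in_rule(others, resources):
    """ASSUMPTION: hand-written placeholder for the neural network.

    Returns the best total value the other entities could reach on their own
    (the Clarke pivot term).
    """
    if not others:
        return 0.0
    _, welfare = best_allocation(others, resources)
    return welfare


def allocate_and_price(valuations, resources):
    """Return the allocation and a payment per entity based on the rule output."""
    allocation, _ = best_allocation(valuations, resources)
    payments = {}
    for entity in valuations:
        others = others_representation(valuations, entity)
        rule_output = stand_in_rule(others, resources)
        # Value the other entities actually receive under the chosen allocation.
        realized = sum(others[o].get(allocation[o], 0.0) for o in others)
        payments[entity] = rule_output - realized
    return allocation, payments


# Hypothetical example: two entities bidding on two resources.
example_valuations = {
    "alice": {frozenset("a"): 3, frozenset("b"): 1, frozenset("ab"): 5},
    "bob": {frozenset("a"): 2, frozenset("b"): 4, frozenset("ab"): 5},
}
allocation, payments = allocate_and_price(example_valuations, ["a", "b"])
```

Because each entity's rule input excludes its own valuations, its payment in this sketch depends on its own report only through the chosen allocation, which is the property that makes this style of payment rule attractive for resource allocation.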

NEURAL NETWORK ARCHITECTURE FOR EFFICIENT RESOURCE ALLOCATION

Publication number: 20220005079

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for efficiently allocating resources among participants. Methods can include receiving valuation data specifying, for each of a plurality of entities, a respective valuation for each of a plurality of resource subsets, each resource subset comprising a different combination of one or more resources of a plurality of resources. After receiving valuation data, assigning each resource in the plurality of resources to a respective entity of the plurality of entities based on the valuations and generating, for each particular entity, a respective input representation that is derived from valuations of every other entity in the plurality of entities other than the particular entity. The input representation for each particular entity is processed using a neural network to generate a rule for the particular entity and a payment based on the rule output for the entities.

Type: Application

Filed: July 1, 2020

Publication date: January 6, 2022

Inventors: Andrea Tacchetti, Daniel Joseph Strouse, Marta Garnelo Abellanas, Thore Kurt Hartwig Graepel, Yoram Bachrach

GRAPH NEURAL NETWORK SYSTEMS FOR BEHAVIOR PREDICTION AND REINFORCEMENT LEARNING IN MULTPLE AGENT ENVIRONMENTS

Publication number: 20210192358

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for predicting the actions of, or influences on, agents in environments with multiple agents, in particular for reinforcement learning. In one aspect, a relational forward model (RFM) system receives agent data representing agent actions for each of multiple agents and implements: an encoder graph neural network subsystem to process the agent data as graph data to provide encoded graph data, a recurrent graph neural network subsystem to process the encoded graph data to provide processed graph data, a decoder graph neural network subsystem to decode the processed graph data to provide decoded graph data and an output to provide representation data for node and/or edge attributes of the decoded graph data relating to a predicted action of one or more of the agents. A reinforcement learning system includes the RFM system.

Type: Application

Filed: May 20, 2019

Publication date: June 24, 2021

Inventors: Hasuk Song, Andrea Tacchetti, Peter William Battaglia, Vinicius Zambaldi
