Patents by Inventor Andrea Tacchetti

Andrea Tacchetti has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220261635
    Abstract: Methods, systems and apparatus, including computer programs encoded on computer storage media, for training a policy neural network by repeatedly updating the policy neural network at each of a plurality of training iterations. One of the methods includes generating training data for the training iteration by controlling the agent in accordance with an improved policy that selects actions in response to input state representations. A best response computation is performed using (i) a candidate policy generated from respective policy neural networks as of one or more preceding iterations and (ii) a candidate value neural network. The candidate value neural network is configured to generate a value output that is an estimate of a value of the environment being in the state characterized by a state representation to complete a particular task. The policy neural network is updated by training the policy neural network on the training data.
    Type: Application
    Filed: January 7, 2022
    Publication date: August 18, 2022
    Inventors: Thomas William Anthony, Thomas Edward Eccles, Andrea Tacchetti, János Kramár, Ian Michael Gemp, Thomas Chalmers Hudson, Nicolas Pierre Mickaël Porcel, Marc Lanctot, Julien Perolat, Richard Everett, Thore Kurt Hartwig Graepel, Yoram Bachrach
  • Patent number: 11250475
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for efficiently allocating resources among participants. Methods can include receiving valuation data specifying, for each of a plurality of entities, a respective valuation for each of a plurality of resource subsets, each resource subset comprising a different combination of one or more resources of a plurality of resources. After receiving valuation data, assigning each resource in the plurality of resources to a respective entity of the plurality of entities based on the valuations and generating, for each particular entity, a respective input representation that is derived from valuations of every other entity in the plurality of entities other than the particular entity. The input representation for each particular entity is processed using a neural network to generate a rule for the particular entity and a payment based on the rule output for the entities.
    Type: Grant
    Filed: July 1, 2020
    Date of Patent: February 15, 2022
    Assignee: DeepMind Technologies Limited
    Inventors: Andrea Tacchetti, Daniel Joseph Strouse, Marta Garnelo Abellanas, Thore Kurt Hartwig Graepel, Yoram Bachrach
  • Publication number: 20220005079
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for efficiently allocating resources among participants. Methods can include receiving valuation data specifying, for each of a plurality of entities, a respective valuation for each of a plurality of resource subsets, each resource subset comprising a different combination of one or more resources of a plurality of resources. After receiving valuation data, assigning each resource in the plurality of resources to a respective entity of the plurality of entities based on the valuations and generating, for each particular entity, a respective input representation that is derived from valuations of every other entity in the plurality of entities other than the particular entity. The input representation for each particular entity is processed using a neural network to generate a rule for the particular entity and a payment based on the rule output for the entities.
    Type: Application
    Filed: July 1, 2020
    Publication date: January 6, 2022
    Inventors: Andrea Tacchetti, Daniel Joseph Strouse, Marta Garnelo Abellanas, Thore Kurt Hartwig Graepel, Yoram Bachrach
  • Publication number: 20210192358
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for predicting the actions of, or influences on, agents in environments with multiple agents, in particular for reinforcement learning. In one aspect, a relational forward model (RFM) system receives agent data representing agent actions for each of multiple agents and implements: an encoder graph neural network subsystem to process the agent data as graph data to provide encoded graph data, a recurrent graph neural network subsystem to process the encoded graph data to provide processed graph data, a decoder graph neural network subsystem to decode the processed graph data to provide decoded graph data and an output to provide representation data for node and/or edge attributes of the decoded graph data relating to a predicted action of one or more of the agents. A reinforcement learning system includes the RFM system.
    Type: Application
    Filed: May 20, 2019
    Publication date: June 24, 2021
    Inventors: Hasuk Song, Andrea Tacchetti, Peter William Battaglia, Vinicius Zambaldi