Patents by Inventor Andrea Gesmundo

Andrea Gesmundo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11915120
    Abstract: Systems and methods for flexible parameter sharing for multi-task learning are provided. A training method can include obtaining a test input, selecting a particular task from one or more tasks, and training a multi-task machine-learned model for the particular task by performing a forward pass using the test input and one or more connection probability matrices to generate a sample distribution of test outputs, training the components of the machine-learned model based at least in part on the sample distribution, and performing a backwards pass to train a connection probability matrix of the multi-task machine-learned model using a straight-through Gumbel-softmax approximation.
    Type: Grant
    Filed: March 17, 2020
    Date of Patent: February 27, 2024
    Assignee: GOOGLE LLC
    Inventors: Effrosyni Kokiopoulou, Krzysztof Stanislaw Maziarz, Andrea Gesmundo, Luciano Sbaiz, Gábor Bartók, Jesse Berent
  • Patent number: 11900222
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for providing a machine learning model that is trained to perform a machine learning task. In one aspect, a method comprises receiving a request to train a machine learning model on a set of training examples; determining a set of one or more meta-data values characterizing the set of training examples; using a mapping function to map the set of meta-data values characterizing the set of training examples to data identifying a particular machine learning model architecture; selecting, using the particular machine learning model architecture, a final machine learning model architecture for performing the machine learning task; and training a machine learning model having the final machine learning model architecture on the set of training examples.
    Type: Grant
    Filed: March 15, 2019
    Date of Patent: February 13, 2024
    Assignee: Google LLC
    Inventors: Jyrki A. Alakuijala, Quentin Lascombes de Laroussilhe, Andrey Khorlin, Jeremiah Joseph Harmsen, Andrea Gesmundo
  • Publication number: 20230376755
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network system to perform multiple machine learning tasks.
    Type: Application
    Filed: May 19, 2023
    Publication date: November 23, 2023
    Inventors: Andrea Gesmundo, Jeffrey Adgate Dean
  • Publication number: 20230281193
    Abstract: Systems, methods, and computer readable media related to generating query variants for a submitted query. In many implementations, the query variants are generated utilizing a generative model. A generative model is productive, in that it can be utilized to actively generate a variant of a query based on application of tokens of the query to the generative model, and optionally based on application of additional input features to the generative model.
    Type: Application
    Filed: May 12, 2023
    Publication date: September 7, 2023
    Inventors: Jyrki Alakuijala, Christian Buck, Jannis Bulian, Massimiliano Ciaramita, Wojciech Gajewski, Andrea Gesmundo, Neil Houlsby, Wei Wang
  • Patent number: 11663201
    Abstract: Systems, methods, and computer readable media related to generating query variants for a submitted query. In many implementations, the query variants are generated utilizing a generative model. A generative model is productive, in that it can be utilized to actively generate a variant of a query based on application of tokens of the query to the generative model, and optionally based on application of additional input features to the generative model.
    Type: Grant
    Filed: April 27, 2018
    Date of Patent: May 30, 2023
    Assignee: GOOGLE LLC
    Inventors: Jyrki Alakuijala, Christian Buck, Jannis Bulian, Massimiliano Ciaramita, Wojciech Gajewski, Andrea Gesmundo, Neil Houlsby, Wei Wang
  • Patent number: 11544536
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating neural network architectures. One of the methods includes receiving a request to determine an architecture for a task neural network; maintaining data specifying a plurality of candidate architectures for the task neural network; repeatedly performing operations comprising: selecting one or more candidate architectures in the maintained data to be modified; generating a new candidate architecture from the selected candidate architecture by, for each hyperparameter in the set of hyperparameters, selecting the value for the hyperparameter for the new candidate architecture; and adding data specifying the new candidate architecture to the maintained data; and selecting, as the final architecture for the task neural network, one of the candidate architectures specified in the maintained data.
    Type: Grant
    Filed: September 27, 2019
    Date of Patent: January 3, 2023
    Assignee: Google LLC
    Inventor: Andrea Gesmundo
  • Publication number: 20220121906
    Abstract: A method of determining a final architecture for a task neural network for performing a target machine learning task is described. The target machine learning task is associated with a target training dataset.
    Type: Application
    Filed: January 30, 2020
    Publication date: April 21, 2022
    Inventors: EFFROSYNI KOKIOPOULOU, ANJA HAUTH, LUCIANO SBAIZ, ANDREA GESMUNDO, GABOR BARTOK, JESSE BERENT
  • Publication number: 20220092416
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining neural network architectures. One of the methods includes receiving training data for training a task neural network to perform a particular machine learning task; and selecting, from a space of possible architectures, an architecture for the task neural network, wherein the space of possible architectures is represented as a graph of nodes connected by edges, each node in the graph representing a decision point in selecting the architecture and each edge in the graph representing an action.
    Type: Application
    Filed: December 27, 2019
    Publication date: March 24, 2022
    Inventors: Neil Matthew Tinmouth Houlsby, Quentin Lascombes de Laroussilhe, Stanislaw Kamil Jastrzebski, Andrea Gesmundo
  • Publication number: 20210232895
    Abstract: Systems and methods for flexible parameter sharing for multi-task learning are provided. A training method can include obtaining a test input, selecting a particular task from one or more tasks, and training a multi-task machine-learned model for the particular task by performing a forward pass using the test input and one or more connection probability matrices to generate a sample distribution of test outputs, training the components of the machine-learned model based at least in part on the sample distribution, and performing a backwards pass to train a connection probability matrix of the multi-task machine-learned model using a straight-through Gumbel-softmax approximation.
    Type: Application
    Filed: March 17, 2020
    Publication date: July 29, 2021
    Inventors: Effrosyni Kokiopoulou, Krzysztof Stanislaw Maziarz, Andrea Gesmundo, Luciano Sbaiz, Gábor Bartók, Jesse Berent
  • Publication number: 20200142888
    Abstract: Systems, methods, and computer readable media related to generating query variants for a submitted query. In many implementations, the query variants are generated utilizing a generative model. A generative model is productive, in that it can be utilized to actively generate a variant of a query based on application of tokens of the query to the generative model, and optionally based on application of additional input features to the generative model.
    Type: Application
    Filed: April 27, 2018
    Publication date: May 7, 2020
    Inventors: Jyrki Alakuijala, Christian Buck, Jannis Bulian, Massimiliano Ciaramita, Wojciech Gajewski, Andrea Gesmundo, Neil Houlsby, Wei Wang
  • Publication number: 20200104687
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating neural network architectures. One of the methods includes receiving a request to determine an architecture for a task neural network; maintaining data specifying a plurality of candidate architectures for the task neural network; repeatedly performing operations comprising: selecting one or more candidate architectures in the maintained data to be modified; generating a new candidate architecture from the selected candidate architecture by, for each hyperparameter in the set of hyperparameters, selecting the value for the hyperparameter for the new candidate architecture; and adding data specifying the new candidate architecture to the maintained data; and selecting, as the final architecture for the task neural network, one of the candidate architectures specified in the maintained data.
    Type: Application
    Filed: September 27, 2019
    Publication date: April 2, 2020
    Inventor: Andrea Gesmundo