Patents by Inventor Ilya Tolstikhin

Ilya Tolstikhin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220375211
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using mixer neural networks. One of the methods includes obtaining one or more images comprising a plurality of pixels; determining, for each image of the one or more images, a plurality of image patches of the image, wherein each image patch comprises a different subset of the pixels of the image; processing, for each image of the one or more images, the corresponding plurality of image patches to generate an input sequence comprising a respective input element at each of a plurality of input positions, wherein a plurality of the input elements correspond to respective different image patches; and processing the input sequences using a neural network to generate a network output that characterizes the one or more images, wherein the neural network comprises one or more mixer neural network layers.
    Type: Application
    Filed: May 5, 2022
    Publication date: November 24, 2022
    Inventors: Ilya Tolstikhin, Neil Matthew Tinmouth Houlsby, Alexander Kolesnikov, Lucas Klaus Beyer, Alexey Dosovitskiy, Mario Lucic, Xiaohua Zhai, Thomas Unterthiner, Daniel M. Keysers, Jakob D. Uszkoreit, Yin Ching Jessica Yung, Andreas Steiner
  • Publication number: 20210256422
    Abstract: Provided are systems and methods for predicting machine learning model performance from the model parameter values, including for use in making improved decisions with regard to early stopping of training procedures. As one example, the present disclosure discusses the prediction of the accuracy (e.g., relative to a defined task and testing dataset such as a computer vision task) of trained neural networks (e.g., convolutional neural networks (CNNs)), using only the parameter values (e.g., the values of the network's weights) as inputs. As such, one example aspect of the present disclosure is directed to computing systems that include and use a machine-learned performance prediction model that has been trained to predict performance values of machine-learned models based on their parameter values (e.g., weight values and/or hyperparameter values).
    Type: Application
    Filed: February 17, 2021
    Publication date: August 19, 2021
    Inventors: Thomas Unterthiner, Daniel Martin Keysers, Sylvain Gelly, Olivier Jean Andre Bousquet, Ilya Tolstikhin