Patents by Inventor Gamaleldin Elsayed

Gamaleldin Elsayed has filed for patents to protect the following inventions. This listing includes both pending patent applications and patents already granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220383628
    Abstract: A method includes obtaining first feature vectors and second feature vectors representing contents of a first and second image frame, respectively, of an input video. The method may also include generating, based on the first feature vectors, first slot vectors, where each slot vector represents attributes of a corresponding entity as represented in the first image frame, and generating, based on the first slot vectors, predicted slot vectors including a corresponding predicted slot vector that represents a transition of the attributes of the corresponding entity from the first to the second image frame. The method may additionally include generating, based on the predicted slot vectors and the second feature vectors, second slot vectors including a corresponding slot vector that represents the attributes of the corresponding entity as represented in the second image frame, and determining an output based on the predicted slot vectors or the second slot vectors.
    Type: Application
    Filed: April 21, 2022
    Publication date: December 1, 2022
    Inventors: Thomas Kipf, Gamaleldin Elsayed, Aravindh Mahendran, Austin Charles Stone, Sara Sabour Rouh Aghdam, Georg Heigold, Rico Jonschkowski, Alexey Dosovitskiy, Klaus Greff
    (An illustrative sketch of the predict-and-correct slot update summarized in this abstract appears after this list.)
  • Patent number: 11475277
    Abstract: Generally, the present disclosure is directed to novel machine-learned classification models that operate with hard attention to make discrete attention actions. The present disclosure also provides a self-supervised pre-training procedure that initializes the model to a state with more frequent rewards. Given only the ground truth classification labels for a set of training inputs (e.g., images), the proposed models are able to learn a policy over discrete attention locations that identifies certain portions of the input (e.g., patches of the images) that are relevant to the classification. In such fashion, the models are able to provide high accuracy classifications while also providing an explicit and interpretable basis for the decision.
    Type: Grant
    Filed: May 13, 2020
    Date of Patent: October 18, 2022
    Assignee: GOOGLE LLC
    Inventors: Gamaleldin Elsayed, Simon Kornblith, Quoc V. Le
    (An illustrative sketch of the hard-attention classification summarized in this abstract appears after this list.)
  • Publication number: 20210248472
    Abstract: The present disclosure provides a neural network including one or more layers with relaxed spatial invariance. Each of the one or more layers can be configured to receive a respective layer input. Each of the one or more layers can be configured to convolve a plurality of different kernels against the respective layer input to generate a plurality of intermediate outputs, each of the plurality of intermediate outputs having a plurality of portions. Each of the one or more layers can be configured to apply, for each of the plurality of intermediate outputs, a respective plurality of weights respectively associated with the plurality of portions to generate a respective weighted output. Each of the one or more layers can be configured to generate a respective layer output based on the weighted outputs.
    Type: Application
    Filed: December 14, 2020
    Publication date: August 12, 2021
    Inventors: Gamaleldin Elsayed, Prajit Ramachandran, Jon Shlens, Simon Kornblith
    (An illustrative sketch of the relaxed-spatial-invariance layer summarized in this abstract appears after this list.)
  • Publication number: 20200364540
    Abstract: Generally, the present disclosure is directed to novel machine-learned classification models that operate with hard attention to make discrete attention actions. The present disclosure also provides a self-supervised pre-training procedure that initializes the model to a state with more frequent rewards. Given only the ground truth classification labels for a set of training inputs (e.g., images), the proposed models are able to learn a policy over discrete attention locations that identifies certain portions of the input (e.g., patches of the images) that are relevant to the classification. In such fashion, the models are able to provide high accuracy classifications while also providing an explicit and interpretable basis for the decision.
    Type: Application
    Filed: May 13, 2020
    Publication date: November 19, 2020
    Inventors: Gamaleldin Elsayed, Simon Kornblith, Quoc V. Le
    (This application shares its abstract, filing date, and inventors with granted patent 11475277 above; the hard-attention sketch after this list applies to both entries.)
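
Publication 20220383628 above describes a predict-and-correct loop over per-entity slot vectors across video frames. The following is a minimal NumPy sketch of that loop under simplifying assumptions: random projections stand in for learned encoder, corrector, and transition weights, and the function names, shapes, and single-step update are illustrative choices, not details taken from the patent.

    import numpy as np

    def softmax(x, axis=-1):
        x = x - x.max(axis=axis, keepdims=True)
        e = np.exp(x)
        return e / e.sum(axis=axis, keepdims=True)

    def correct(slots, features, w_q, w_k, w_v):
        # "Corrector": slots compete for the frame's feature vectors via
        # attention, and each slot is updated from the features it wins.
        q = slots @ w_q                                          # (num_slots, d)
        k = features @ w_k                                       # (num_feats, d)
        v = features @ w_v                                       # (num_feats, d)
        attn = softmax(q @ k.T / np.sqrt(q.shape[-1]), axis=0)   # competition across slots
        attn = attn / (attn.sum(axis=1, keepdims=True) + 1e-8)   # weighted mean per slot
        return attn @ v                                          # updated slot vectors

    def predict(slots, w_t):
        # "Predictor": a toy transition model carrying each slot's
        # attributes from one frame to the next.
        return np.tanh(slots @ w_t)

    rng = np.random.default_rng(0)
    d, num_slots, num_feats = 32, 4, 64
    w_q, w_k, w_v, w_t = (0.1 * rng.normal(size=(d, d)) for _ in range(4))

    feats_1 = rng.normal(size=(num_feats, d))              # stand-in for frame-1 feature vectors
    feats_2 = rng.normal(size=(num_feats, d))              # stand-in for frame-2 feature vectors

    slots_0 = rng.normal(size=(num_slots, d))              # initial slots
    slots_1 = correct(slots_0, feats_1, w_q, w_k, w_v)     # first slot vectors (frame 1)
    slots_pred = predict(slots_1, w_t)                     # predicted slot vectors for frame 2
    slots_2 = correct(slots_pred, feats_2, w_q, w_k, w_v)  # second slot vectors (frame 2)
    print(slots_2.shape)                                   # (4, 32)

In this sketch an output could be read off either the predicted slots or the corrected second-frame slots, mirroring the abstract's final step; a real model would repeat the loop over all frames of the input video.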
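
Granted patent 11475277 (and the matching application 20200364540) describes classification models that take discrete hard-attention actions over locations of the input. The sketch below is a hedged illustration only: it splits an image into patches, scores each patch as a candidate attention location, keeps the top-k patches as a discrete action, and classifies from the selected patches. The patch size, top-k rule, random weights, and ten-class output are assumptions made for the example, not details from the patent.

    import numpy as np

    def softmax(x, axis=-1):
        x = x - x.max(axis=axis, keepdims=True)
        e = np.exp(x)
        return e / e.sum(axis=axis, keepdims=True)

    def extract_patches(image, patch=8):
        # Split the image into a grid of non-overlapping patches: each patch
        # is one discrete location the attention policy may choose.
        h, w = image.shape
        grid = []
        for i in range(0, h, patch):
            for j in range(0, w, patch):
                grid.append(image[i:i + patch, j:j + patch].reshape(-1))
        return np.stack(grid)                        # (num_locations, patch*patch)

    def hard_attention_classify(image, w_score, w_cls, k=3):
        patches = extract_patches(image)
        scores = patches @ w_score                   # one relevance score per location
        chosen = np.argsort(scores)[-k:]             # discrete action: keep the top-k locations
        # The selection above is not differentiable; per the abstract, such a
        # discrete policy is learned from classification labels, aided by a
        # self-supervised pre-training step. Here the weights are random.
        logits = patches[chosen].mean(axis=0) @ w_cls
        return chosen, softmax(logits)

    rng = np.random.default_rng(0)
    image = rng.normal(size=(32, 32))                # toy single-channel "image"
    w_score = 0.1 * rng.normal(size=(64,))           # 8*8 = 64 values per patch
    w_cls = 0.1 * rng.normal(size=(64, 10))          # 10 hypothetical classes
    locations, probs = hard_attention_classify(image, w_score, w_cls)
    print(locations, probs.shape)                    # indices of attended patches, (10,)

The returned patch indices make the basis of the decision explicit and interpretable, which is the property the abstract emphasizes; everything else about this toy selection rule is an assumption of the sketch.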
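
Publication 20210248472 describes layers with relaxed spatial invariance: several different kernels are convolved against the layer input, and spatially varying weights associated with portions of each intermediate output combine them into the layer output. Below is a minimal single-channel NumPy sketch; combining the intermediates with a per-position softmax weighting, and all of the sizes, are illustrative assumptions rather than the patent's specific construction.

    import numpy as np

    def conv2d_valid(x, kernel):
        # Plain 2-D cross-correlation (single channel, 'valid' padding).
        kh, kw = kernel.shape
        h, w = x.shape
        out = np.zeros((h - kh + 1, w - kw + 1))
        for i in range(out.shape[0]):
            for j in range(out.shape[1]):
                out[i, j] = np.sum(x[i:i + kh, j:j + kw] * kernel)
        return out

    def relaxed_invariance_layer(x, kernels, portion_weights):
        # 1. Convolve each of the K different kernels against the layer
        #    input, giving K intermediate outputs.
        intermediates = np.stack([conv2d_valid(x, k) for k in kernels])  # (K, H', W')
        # 2. Weight every intermediate output with its own spatially varying
        #    map (one weight per output portion) and sum: different positions
        #    can favour different kernels, relaxing strict spatial invariance.
        weighted = intermediates * portion_weights                       # (K, H', W')
        return weighted.sum(axis=0)                                      # (H', W')

    rng = np.random.default_rng(0)
    x = rng.normal(size=(16, 16))                        # toy single-channel input
    K, ksize = 4, 3
    kernels = 0.1 * rng.normal(size=(K, ksize, ksize))
    out_h = out_w = 16 - ksize + 1
    # Per-portion weights: a softmax across the K kernels at each position.
    logits = rng.normal(size=(K, out_h, out_w))
    portion_weights = np.exp(logits) / np.exp(logits).sum(axis=0, keepdims=True)
    out = relaxed_invariance_layer(x, kernels, portion_weights)
    print(out.shape)                                     # (14, 14)

With a single kernel and uniform weights this reduces to an ordinary convolution, which is one way to see how the per-portion weights "relax" the usual spatial invariance of a convolutional layer.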