Patents by Inventor Geoffrey E. Hinton
Geoffrey E. Hinton has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12242818Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for sequence modeling. One of the methods includes receiving an input sequence having a plurality of input positions; determining a plurality of blocks of consecutive input positions; processing the input sequence using a neural network to generate a latent alignment, comprising, at each of a plurality of input time steps: receiving a partial latent alignment from a previous input time step; selecting an input position in each block, wherein the token at the selected input position of the partial latent alignment in each block is a mask token; and processing the partial latent alignment and the input sequence using the neural network to generate a new latent alignment, wherein the new latent alignment comprises, at the selected input position in each block, an output token or a blank token; and generating, using the latent alignment, an output sequence.Type: GrantFiled: February 8, 2021Date of Patent: March 4, 2025Assignee: Google LLCInventors: William Chan, Chitwan Saharia, Geoffrey E. Hinton, Mohammad Norouzi, Navdeep Jaitly
-
Publication number: 20250053786Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a network output of high dimensional data comprising one or more output tokens. In one aspect, a system comprises a neural network system configured to initialize an analog bit representation of the network output comprising a set of continuous numeric values for each of the output tokens. The neural network system generates an updated analog bit representation that comprises a set of updated continuous numeric values. At each of a plurality of update iterations, the neural network system processes a diffusion input comprising the analog bit representation using a diffusion machine learning model to update the analog bit representation.Type: ApplicationFiled: August 7, 2023Publication date: February 13, 2025Inventors: Ting Chen, Ruixiang Zhang, Geoffrey E. Hinton
-
Publication number: 20240346298Abstract: A parallel convolutional neural network is provided. The CNN is implemented by a plurality of convolutional neural networks each on a respective processing node. Each CNN has a plurality of layers. A subset of the layers are interconnected between processing nodes such that activations are fed forward across nodes. The remaining subset is not so interconnected.Type: ApplicationFiled: February 9, 2024Publication date: October 17, 2024Inventors: Alexander Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton
-
Patent number: 12067758Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for detecting objects in images. One of the methods includes obtaining an input image; processing the input image to generate predicted part feature data, the predicted part feature data comprising, for each of a plurality of possible object parts: a part presence probability representing a likelihood that the possible object part is depicted in the input image, a predicted pose of the possible object part in the input image given that the possible object part is depicted in the input image, and an object part feature vector characterizing the depiction of the possible object part given that the possible object part is depicted in the input image; and processing the predicted part feature data for the plurality of possible object parts to generate an object detection output that identifies one or more objects depicted in the input image.Type: GrantFiled: May 22, 2020Date of Patent: August 20, 2024Assignee: Google LLCInventors: Adam Roman Kosiorek, Geoffrey E. Hinton, Sara Sabour Rouh Aghdam, Yee Whye Teh
-
Patent number: 11978268Abstract: Methods, systems, and apparatus including computer programs encoded on a computer storage medium, for generating convex decomposition of objects using neural network models. One of the methods includes receiving an input that depicts an object. The input is processed using a neural network to generate an output that defines a convex representation of the object. The output includes, for each of a plurality of convex elements, respective parameters that define a position of the convex element in the convex representation of the object.Type: GrantFiled: November 18, 2022Date of Patent: May 7, 2024Assignee: Google LLCInventors: Boyang Deng, Kyle Genova, Soroosh Yazdani, Sofien Bouaziz, Geoffrey E. Hinton, Andrea Tagliasacchi
-
Publication number: 20240144109Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a distilled machine learning model. One of the methods includes training a cumbersome machine learning model, wherein the cumbersome machine learning model is configured to receive an input and generate a respective score for each of a plurality of classes; and training a distilled machine learning model on a plurality of training inputs, wherein the distilled machine learning model is also configured to receive inputs and generate scores for the plurality of classes, comprising: processing each training input using the cumbersome machine learning model to generate a cumbersome target soft output for the training input; and training the distilled machine learning model to, for each of the training inputs, generate a soft output that matches the cumbersome target soft output for the training input.Type: ApplicationFiled: December 28, 2023Publication date: May 2, 2024Inventors: Oriol Vinyals, Jeffrey Adgate Dean, Geoffrey E. Hinton
-
Patent number: 11941867Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a classification neural network.Type: GrantFiled: January 22, 2020Date of Patent: March 26, 2024Assignee: Google LLCInventors: Geoffrey E. Hinton, Nicholas Myles Wisener Frosst, Nicolas Guy Robert Papernot
-
Patent number: 11928577Abstract: A parallel convolutional neural network is provided. The CNN is implemented by a plurality of convolutional neural networks each on a respective processing node. Each CNN has a plurality of layers. A subset of the layers are interconnected between processing nodes such that activations are fed forward across nodes. The remaining subset is not so interconnected.Type: GrantFiled: April 27, 2020Date of Patent: March 12, 2024Assignee: Google LLCInventors: Alexander Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton
-
Patent number: 11900232Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a distilled machine learning model. One of the methods includes training a cumbersome machine learning model, wherein the cumbersome machine learning model is configured to receive an input and generate a respective score for each of a plurality of classes; and training a distilled machine learning model on a plurality of training inputs, wherein the distilled machine learning model is also configured to receive inputs and generate scores for the plurality of classes, comprising: processing each training input using the cumbersome machine learning model to generate a cumbersome target soft output for the training input; and training the distilled machine learning model to, for each of the training inputs, generate a soft output that matches the cumbersome target soft output for the training input.Type: GrantFiled: July 13, 2022Date of Patent: February 13, 2024Assignee: Google LLCInventors: Oriol Vinyals, Jeffrey Adgate Dean, Geoffrey E. Hinton
-
Patent number: 11829882Abstract: A system for training a neural network. A switch is linked to feature detectors in at least some of the layers of the neural network. For each training case, the switch randomly selectively disables each of the feature detectors in accordance with a preconfigured probability. The weights from each training case are then normalized for applying the neural network to test data.Type: GrantFiled: April 9, 2021Date of Patent: November 28, 2023Assignee: Google LLCInventors: Geoffrey E. Hinton, Alexander Krizhevsky, Ilya Sutskever, Nitish Srivastava
-
Patent number: 11694060Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a neural network that is configured to receive a network input and to generate a network output for the network input. The neural network comprises a plurality of layers arranged in a sequence, including a plurality of capsule layers. Each particular capsule in a particular capsule layer is configured to receive respective inputs including: (i) outputs generated by capsules of a previous capsule layer that is before the particular capsule layer in the sequence, and (ii) final routing factors between capsules of the previous capsule layer and the particular capsule, wherein the final routing factors are generated by a routing subsystem. Each particular capsule in the particular capsule layer is configured to determine a particular capsule output based on the received inputs, wherein the particular capsule output is of dimension greater than one.Type: GrantFiled: October 4, 2022Date of Patent: July 4, 2023Assignee: Google LLCInventors: Geoffrey E. Hinton, Nicholas Myles Wisener Frosst, Sara Sabour Rouh Aghdam
-
Publication number: 20230078756Abstract: Methods, systems, and apparatus including computer programs encoded on a computer storage medium, for generating convex decomposition of objects using neural network models. One of the methods includes receiving an input that depicts an object. The input is processed using a neural network to generate an output that defines a convex representation of the object. The output includes, for each of a plurality of convex elements, respective parameters that define a position of the convex element in the convex representation of the object.Type: ApplicationFiled: November 18, 2022Publication date: March 16, 2023Inventors: Boyang Deng, Kyle Genova, Soroosh Yazdani, Sofien Bouaziz, Geoffrey E. Hinton, Andrea Tagliasacchi
-
Publication number: 20230075716Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for sequence modeling. One of the methods includes receiving an input sequence having a plurality of input positions; determining a plurality of blocks of consecutive input positions; processing the input sequence using a neural network to generate a latent alignment, comprising, at each of a plurality of input time steps: receiving a partial latent alignment from a previous input time step; selecting an input position in each block, wherein the token at the selected input position of the partial latent alignment in each block is a mask token; and processing the partial latent alignment and the input sequence using the neural network to generate a new latent alignment, wherein the new latent alignment comprises, at the selected input position in each block, an output token or a blank token; and generating, using the latent alignment, an output sequence.Type: ApplicationFiled: February 8, 2021Publication date: March 9, 2023Inventors: William Chan, Chitwan Saharia, Geoffrey E. Hinton, Mohammad Norouzi, Navdeep Jaitly
-
Publication number: 20230027069Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a neural network that is configured to receive a network input and to generate a network output for the network input. The neural network comprises a plurality of layers arranged in a sequence, including a plurality of capsule layers. Each particular capsule in a particular capsule layer is configured to receive respective inputs including: (i) outputs generated by capsules of a previous capsule layer that is before the particular capsule layer in the sequence, and (ii) final routing factors between capsules of the previous capsule layer and the particular capsule, wherein the final routing factors are generated by a routing subsystem. Each particular capsule in the particular capsule layer is configured to determine a particular capsule output based on the received inputs, wherein the particular capsule output is of dimension greater than one.Type: ApplicationFiled: October 4, 2022Publication date: January 26, 2023Inventors: Geoffrey E. Hinton, Nicholas Myles Wisener Frosst, Sara Sabour Rouh Aghdam
-
Patent number: 11508167Abstract: Methods, systems, and apparatus including computer programs encoded on a computer storage medium, for generating convex decomposition of objects using neural network models. One of the methods includes receiving an input that depicts an object. The input is processed using a neural network to generate an output that defines a convex representation of the object. The output includes, for each of a plurality of convex elements, respective parameters that define a position of the convex element in the convex representation of the object.Type: GrantFiled: April 13, 2020Date of Patent: November 22, 2022Assignee: Google LLCInventors: Boyang Deng, Kyle Genova, Soroosh Yazdani, Sofien Bouaziz, Geoffrey E. Hinton, Andrea Tagliasacchi
-
Patent number: 11494609Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a neural network that is configured to receive a network input and to generate a network output for the network input. The neural network comprises a plurality of layers arranged in a sequence, including a plurality of capsule layers. Each particular capsule in a particular capsule layer is configured to receive respective inputs including: (i) outputs generated by capsules of a previous capsule layer that is before the particular capsule layer in the sequence, and (ii) final routing factors between capsules of the previous capsule layer and the particular capsule, wherein the final routing factors are generated by a routing subsystem. Each particular capsule in the particular capsule layer is configured to determine a particular capsule output based on the received inputs, wherein the particular capsule output is of dimension greater than one.Type: GrantFiled: December 15, 2017Date of Patent: November 8, 2022Assignee: Google LLCInventors: Geoffrey E. Hinton, Nicholas Myles Wisener Frosst, Sara Sabour Rouh Aghdam
-
Publication number: 20220351091Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a distilled machine learning model. One of the methods includes training a cumbersome machine learning model, wherein the cumbersome machine learning model is configured to receive an input and generate a respective score for each of a plurality of classes; and training a distilled machine learning model on a plurality of training inputs, wherein the distilled machine learning model is also configured to receive inputs and generate scores for the plurality of classes, comprising: processing each training input using the cumbersome machine learning model to generate a cumbersome target soft output for the training input; and training the distilled machine learning model to, for each of the training inputs, generate a soft output that matches the cumbersome target soft output for the training input.Type: ApplicationFiled: July 13, 2022Publication date: November 3, 2022Inventors: Oriol Vinyals, Jeffrey Adgate Dean, Geoffrey E. Hinton
-
Patent number: 11423337Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a distilled machine learning model. One of the methods includes training a cumbersome machine learning model, wherein the cumbersome machine learning model is configured to receive an input and generate a respective score for each of a plurality of classes; and training a distilled machine learning model on a plurality of training inputs, wherein the distilled machine learning model is also configured to receive inputs and generate scores for the plurality of classes, comprising: processing each training input using the cumbersome machine learning model to generate a cumbersome target soft output for the training input; and training the distilled machine learning model to, for each of the training inputs, generate a soft output that matches the cumbersome target soft output for the training input.Type: GrantFiled: April 7, 2020Date of Patent: August 23, 2022Assignee: Google LLCInventors: Oriol Vinyals, Jeffrey Adgate Dean, Geoffrey E. Hinton
-
Publication number: 20220230425Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for detecting objects in images. One of the methods includes obtaining an input image; processing the input image to generate predicted part feature data, the predicted part feature data comprising, for each of a plurality of possible object parts: a part presence probability representing a likelihood that the possible object part is depicted in the input image, a predicted pose of the possible object part in the input image given that the possible object part is depicted in the input image, and an object part feature vector characterizing the depiction of the possible object part given that the possible object part is depicted in the input image; and processing the predicted part feature data for the plurality of possible object parts to generate an object detection output that identifies one or more objects depicted in the input image.Type: ApplicationFiled: May 22, 2020Publication date: July 21, 2022Inventors: Adam Roman Kosiorek, Geoffrey E. Hinton, Sara Sabour Rouh Aghdam, Yee Whye Teh
-
Publication number: 20220101624Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a classification neural network.Type: ApplicationFiled: January 22, 2020Publication date: March 31, 2022Inventors: Geoffrey E. Hinton, Nicholas Myles Wisener Frosst, Nicolas Guy Robert Papernot