Patents by Inventor Ilya Sutskever
Ilya Sutskever has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11928577
Abstract: A parallel convolutional neural network (CNN) is provided. The CNN is implemented by a plurality of convolutional neural networks, each on a respective processing node. Each CNN has a plurality of layers. A subset of the layers is interconnected between processing nodes such that activations are fed forward across nodes; the remaining layers are not so interconnected.
Type: Grant
Filed: April 27, 2020
Date of Patent: March 12, 2024
Assignee: Google LLC
Inventors: Alexander Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton
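The cross-node layer pattern described in the abstract can be sketched in miniature. In the toy below (pure Python; all names hypothetical), dense ReLU layers stand in for the patent's convolutional layers: two "columns", one per processing node, each see only their own activations except at the designated interconnected layers, where activations from both columns are concatenated and fed forward.

```python
import random

def dense(inp, weights):
    # One fully connected ReLU layer, standing in for a conv layer.
    return [max(0.0, sum(w * x for w, x in zip(row, inp))) for row in weights]

def make_weights(n_out, n_in, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]

def parallel_columns(x, n_layers=4, width=8, interconnect={2}, seed=0):
    """Run two 'columns' (one per processing node). At layers listed in
    `interconnect`, each column sees the concatenated activations of both
    columns; elsewhere each column sees only its own."""
    rng = random.Random(seed)
    acts = [x[:], x[:]]  # activations of column 0 and column 1
    for layer in range(n_layers):
        new_acts = []
        for col in range(2):
            if layer in interconnect:
                inp = acts[0] + acts[1]  # cross-node feed-forward
            else:
                inp = acts[col]          # local only
            w = make_weights(width, len(inp), rng)
            new_acts.append(dense(inp, w))
        acts = new_acts
    return acts
```

Keeping most layers local is what limits communication between nodes to the few interconnected layers.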
-
Publication number: 20240020096
Abstract: Disclosed herein are methods, systems, and computer-readable media for generating computer code based on natural language input. In an embodiment, a method may comprise one or more of: receiving a docstring representing natural language text specifying a digital programming result; generating, using a trained machine learning model and based on the docstring, computer code samples configured to produce respective candidate results; causing the computer code samples to be executed; identifying, based on the executing, a computer code sample configured to produce a particular candidate result associated with the digital programming result; and performing at least one of outputting the identified computer code sample via a user interface, compiling it, transmitting it to a recipient device, storing it, or re-executing it.
Type: Application
Filed: May 23, 2023
Publication date: January 18, 2024
Applicant: OpenAI Opco, LLC
Inventors: Mark Chen, Jerry Tworek, Ilya Sutskever, Wojciech Zaremba, Heewoo Jun, Henrique Ponde de Oliveira Pinto
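The sample-execute-filter loop in this abstract can be sketched as follows. This is a minimal illustration, not OpenAI's implementation: the candidate strings are hypothetical stand-ins for model samples, and each is assumed to define a `solve` function whose return value is checked against the desired result.

```python
def select_code_sample(candidates, expected):
    """Execute each candidate code sample and return the first one whose
    result matches the expected (desired) result."""
    for src in candidates:
        scope = {}
        try:
            exec(src, scope)  # cause the code sample to be executed
            if scope["solve"]() == expected:
                return src    # identified sample; could now be output,
                              # compiled, stored, or re-executed
        except Exception:
            continue          # discard samples that fail to run
    return None

# Hypothetical candidates, as if sampled for the docstring
# "return the sum of the first four positive integers":
candidates = [
    "def solve():\n    return sum(range(4))",     # buggy: sums 0..3
    "def solve():\n    return sum(range(1, 5))",  # correct
]
best = select_code_sample(candidates, expected=10)
```

Executing candidates and keeping one that produces the associated result is what distinguishes this from simply emitting the model's first sample.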
-
Publication number: 20240020116
Abstract: Disclosed herein are methods, systems, and computer-readable media for generating natural language based on computer code input. In an embodiment, a method may comprise one or more of: accessing a docstring generation model configured to generate docstrings from computer code; receiving one or more computer code samples; generating, using the docstring generation model and based on the received computer code samples, one or more candidate docstrings representing natural language text, each associated with at least a portion of the computer code samples; identifying at least one candidate docstring that conveys the intent of that portion of the code; and outputting, via a user interface, the identified docstring with the corresponding portion of the computer code samples.
Type: Application
Filed: May 23, 2023
Publication date: January 18, 2024
Applicant: OpenAI Opco, LLC
Inventors: Mark Chen, Jerry Tworek, Ilya Sutskever, Wojciech Zaremba, Heewoo Jun, Henrique Ponde de Oliveira Pinto
-
Patent number: 11829882
Abstract: A system for training a neural network. A switch is linked to feature detectors in at least some of the layers of the neural network. For each training case, the switch randomly disables each of the feature detectors in accordance with a preconfigured probability. The weights are then normalized before the neural network is applied to test data.
Type: Grant
Filed: April 9, 2021
Date of Patent: November 28, 2023
Assignee: Google LLC
Inventors: Geoffrey E. Hinton, Alexander Krizhevsky, Ilya Sutskever, Nitish Srivastava
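The training-time "switch" and test-time weight normalization described here correspond to what is now called dropout. A minimal sketch, with hypothetical function names:

```python
import random

def apply_switch(activations, p_drop, rng):
    """Training: randomly disable each feature detector with probability
    p_drop, independently for each training case."""
    return [0.0 if rng.random() < p_drop else a for a in activations]

def normalize_weights(weights, p_drop):
    """Test: scale outgoing weights by the keep probability so expected
    activations match those seen during training."""
    return [w * (1.0 - p_drop) for w in weights]
```

At p_drop = 0.5, roughly half the detectors are silenced on each training case, and the test-time weights are halved.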
-
Patent number: 11790216
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for predicting the likelihoods of conditions being satisfied using recurrent neural networks. One of the systems is configured to process a temporal sequence comprising a respective input at each of a plurality of time steps, and comprises: one or more recurrent neural network layers; and one or more logistic regression nodes, where each logistic regression node corresponds to a respective condition from a predetermined set of conditions and is configured to, for each time step, receive the network's internal state for that time step and process it in accordance with the current values of the node's parameters to generate a future condition score for the corresponding condition.
Type: Grant
Filed: July 27, 2020
Date of Patent: October 17, 2023
Assignee: Google LLC
Inventors: Gregory Sean Corrado, Ilya Sutskever, Jeffrey Adgate Dean
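The per-time-step scoring can be sketched as follows, with a toy tanh update standing in for the patent's recurrent layers (all weights and names are hypothetical):

```python
import math

def rnn_step(state, x, w_s=0.5, w_x=0.5):
    # Toy recurrent update standing in for the recurrent layers.
    return [math.tanh(w_s * s + w_x * x) for s in state]

def condition_scores(state, node_params):
    """Each logistic regression node maps the internal state at a time step
    to a future-condition score in (0, 1) via its own weights and bias."""
    scores = []
    for weights, bias in node_params:
        z = sum(w * s for w, s in zip(weights, state)) + bias
        scores.append(1.0 / (1.0 + math.exp(-z)))
    return scores

# One score per condition at every time step of the temporal sequence:
nodes = [([1.0, 1.0, 1.0], 0.0), ([-1.0, 0.5, 0.0], 0.2)]
state = [0.0, 0.0, 0.0]
for x in [1.0, -0.5, 2.0]:
    state = rnn_step(state, x)
    scores = condition_scores(state, nodes)
```

Because the nodes read the shared recurrent state, adding a condition costs only one extra sigmoid head, not a new sequence model.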
-
Publication number: 20220284266
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for computing Q values for actions to be performed by an agent interacting with an environment, where the actions are drawn from a continuous action space. In one aspect, a system includes: a value subnetwork configured to receive an observation characterizing the current state of the environment and process it to generate a value estimate; a policy subnetwork configured to process the same observation to generate an ideal point in the continuous action space; and a subsystem configured to receive a particular point in the action space representing a particular action, generate an advantage estimate for that action, and generate a Q value estimating the expected return from the agent performing that action when the environment is in the current state.
Type: Application
Filed: March 25, 2022
Publication date: September 8, 2022
Inventors: Shixiang Gu, Timothy Paul Lillicrap, Ilya Sutskever, Sergey Vladimir Levine
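The decomposition in this abstract can be sketched as Q(s, a) = V(s) + A(s, a), with the advantage peaking at the policy subnetwork's "ideal point". The quadratic fall-off below is one common choice (as in normalized advantage functions); the abstract itself leaves the advantage's exact form open, and the toy subnetworks are hypothetical stand-ins:

```python
def q_value(observation, action, value_net, policy_net, scale=1.0):
    """Q(s, a) = V(s) + A(s, a): the advantage is zero at the ideal point
    and falls off quadratically with distance from it."""
    v = value_net(observation)        # value estimate for the current state
    ideal = policy_net(observation)   # ideal point in the action space
    advantage = -scale * sum((a - m) ** 2 for a, m in zip(action, ideal))
    return v + advantage

# Toy subnetworks (stand-ins for the trained value/policy subnetworks):
value_net = lambda obs: 1.0
policy_net = lambda obs: [0.0, 0.0]
q_ideal = q_value(None, [0.0, 0.0], value_net, policy_net)
q_other = q_value(None, [1.0, 1.0], value_net, policy_net)
```

With this structure, the Q-maximizing action is the policy subnetwork's output by construction, which is what makes the scheme tractable in a continuous action space.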
-
Publication number: 20220101082
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating representations of input sequences. One of the methods includes: obtaining an input sequence comprising a plurality of inputs arranged according to an input order; processing the input sequence using a first long short-term memory (LSTM) neural network to convert it into an alternative representation; and processing that alternative representation using a second LSTM neural network to generate a target sequence comprising a plurality of outputs arranged according to an output order.
Type: Application
Filed: December 10, 2021
Publication date: March 31, 2022
Inventors: Oriol Vinyals, Quoc V. Le, Ilya Sutskever
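The two-network pipeline reads as an encoder-decoder. In the toy sketch below, single-scalar tanh updates stand in for the LSTMs, so it illustrates only the data flow (sequence in, fixed-size representation, sequence out), not the gating:

```python
import math

def encode(inputs, state=0.0, w=0.7):
    """First recurrent net: fold the input sequence, in input order, into a
    fixed-size alternative representation (a scalar here; an LSTM state in
    the patent)."""
    for x in inputs:
        state = math.tanh(w * state + x)
    return state

def decode(state, n_outputs, w=0.7):
    """Second recurrent net: unroll from the representation to emit the
    target sequence, feeding each output back in."""
    outputs = []
    y = 0.0
    for _ in range(n_outputs):
        state = math.tanh(w * state + y)
        y = state  # toy output = new state
        outputs.append(y)
    return outputs
```

The representation produced by `encode` is the only channel between the two networks, so its capacity bounds what the decoder can reconstruct.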
-
Patent number: 11288568
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for computing Q values for actions to be performed by an agent interacting with an environment, where the actions are drawn from a continuous action space. In one aspect, a system includes: a value subnetwork configured to receive an observation characterizing the current state of the environment and process it to generate a value estimate; a policy subnetwork configured to process the same observation to generate an ideal point in the continuous action space; and a subsystem configured to receive a particular point in the action space representing a particular action, generate an advantage estimate for that action, and generate a Q value estimating the expected return from the agent performing that action when the environment is in the current state.
Type: Grant
Filed: February 9, 2017
Date of Patent: March 29, 2022
Assignee: Google LLC
Inventors: Shixiang Gu, Timothy Paul Lillicrap, Ilya Sutskever, Sergey Vladimir Levine
-
Patent number: 11222252
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating representations of input sequences. One of the methods includes: obtaining an input sequence comprising a plurality of inputs arranged according to an input order; processing the input sequence using a first long short-term memory (LSTM) neural network to convert it into an alternative representation; and processing that alternative representation using a second LSTM neural network to generate a target sequence comprising a plurality of outputs arranged according to an output order.
Type: Grant
Filed: December 6, 2018
Date of Patent: January 11, 2022
Assignee: Google LLC
Inventors: Oriol Vinyals, Quoc V. Le, Ilya Sutskever
-
Patent number: 11195521
Abstract: A system can be configured to perform tasks such as converting recorded speech to a sequence of phonemes that represent the speech, converting an input sequence of graphemes into a target sequence of phonemes, translating an input sequence of words in one language into a corresponding sequence of words in another language, or predicting a target sequence of words that follow an input sequence of words in a language (e.g., a language model). In a speech recognizer, the RNN system may be used to convert speech to a target sequence of phonemes in real time, so that a transcription of the speech can be generated and presented to a user even before the user has finished uttering the entire speech input.
Type: Grant
Filed: February 4, 2020
Date of Patent: December 7, 2021
Assignee: Google LLC
Inventors: Navdeep Jaitly, Quoc V. Le, Oriol Vinyals, Samuel Bengio, Ilya Sutskever
-
Patent number: 11080594
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for augmenting neural networks with an external memory using reinforcement learning. One of the methods includes: providing an output derived from the system-output portion of the neural network output as a system output in the sequence of system outputs; selecting, from the reinforcement-learning portion of the neural network output, a memory access process from a predetermined set of processes for accessing the external memory; writing and reading data at locations in the external memory in accordance with the selected access process, using the differentiable portion of the neural network output; and combining the data read from the external memory with the next system input to generate the next neural network input.
Type: Grant
Filed: December 30, 2016
Date of Patent: August 3, 2021
Assignee: DeepMind Technologies Limited
Inventors: Ilya Sutskever, Ivo Danihelka, Alexander Benjamin Graves, Gregory Duncan Wayne, Wojciech Zaremba
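One step of the memory-augmented loop might look like the following toy, under heavy simplifying assumptions (a flat list as the external memory, scalar inputs, and a controller output that already names the chosen access process):

```python
def memory_step(memory, next_input, controller_out):
    """Apply the selected memory access process, then combine any data read
    with the next system input to form the next network input."""
    process, addr, value = controller_out
    if process == "write":
        memory[addr] = value
        read = 0.0
    elif process == "read":
        read = memory[addr]
    else:
        raise ValueError("unknown access process")
    return next_input + read

memory = [0.0] * 4
x1 = memory_step(memory, 1.0, ("write", 2, 5.0))  # store 5.0 at slot 2
x2 = memory_step(memory, 1.0, ("read", 2, None))  # read it back
```

In the patent, choosing among the access processes is the non-differentiable decision trained with reinforcement learning, while the actual reads and writes use the differentiable portion of the network output.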
-
Publication number: 20210224659
Abstract: A system for training a neural network. A switch is linked to feature detectors in at least some of the layers of the neural network. For each training case, the switch randomly disables each of the feature detectors in accordance with a preconfigured probability. The weights are then normalized before the neural network is applied to test data.
Type: Application
Filed: April 9, 2021
Publication date: July 22, 2021
Inventors: Geoffrey E. Hinton, Alexander Krizhevsky, Ilya Sutskever, Nitish Srivastava
-
Patent number: 10977557
Abstract: A system for training a neural network. A switch is linked to feature detectors in at least some of the layers of the neural network. For each training case, the switch randomly disables each of the feature detectors in accordance with a preconfigured probability. The weights are then normalized before the neural network is applied to test data.
Type: Grant
Filed: July 26, 2019
Date of Patent: April 13, 2021
Assignee: Google LLC
Inventors: Geoffrey E. Hinton, Alexander Krizhevsky, Ilya Sutskever, Nitish Srivastava
-
Patent number: 10977547
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for implementing a convolutional gated recurrent neural network (CGRN). In one of the systems, the CGRN is configured to maintain a state that is a tensor with dimensions x by y by m, where x, y, and m are each greater than one, and, for each of a plurality of time steps, to update the currently maintained state by processing it through a plurality of convolutional gates.
Type: Grant
Filed: November 11, 2016
Date of Patent: April 13, 2021
Assignee: Google LLC
Inventors: Lukasz Mieczyslaw Kaiser, Ilya Sutskever
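A toy version of one update step on the x-by-y-by-m state is sketched below. A fixed 3x3 neighborhood mean stands in for the learned convolution, and a single scalar weight drives each gate; the real CGRN learns its convolutional gate kernels.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def cgrn_step(state, w_g=1.0, w_c=1.0):
    """One gated update of an x-by-y grid of m-vectors: a gate decides, per
    cell and channel, how much of a convolved candidate replaces the old
    state value."""
    x, y = len(state), len(state[0])
    def conv(i, j, k):
        # Mean over the (clipped) 3x3 spatial neighborhood, channel k.
        vals = [state[i2][j2][k]
                for i2 in range(max(0, i - 1), min(x, i + 2))
                for j2 in range(max(0, j - 1), min(y, j + 2))]
        return sum(vals) / len(vals)
    new = []
    for i in range(x):
        row = []
        for j in range(y):
            cell = []
            for k in range(len(state[i][j])):
                c = conv(i, j, k)
                gate = sigmoid(w_g * c)    # update gate
                cand = math.tanh(w_c * c)  # candidate value
                cell.append(gate * cand + (1 - gate) * state[i][j][k])
            row.append(cell)
        new.append(row)
    return new
```

Repeating this step over many time steps lets information propagate spatially across the grid, one neighborhood per step.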
-
Patent number: 10963779
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing operations using data from a data source. In one aspect, a neural network system includes a controller neural network configured to receive a controller input for a time step and process it, together with a representation of a system input, to generate an operation score distribution that assigns a respective score to each operation and a data score distribution that assigns a respective score to data in the data source. The system can also include an operation subsystem configured to perform the operations to generate operation outputs, at least one of which is performed on data in the data source, and to combine the operation outputs in accordance with the operation score distribution and the data score distribution to generate an output for the time step.
Type: Grant
Filed: November 11, 2016
Date of Patent: March 30, 2021
Assignee: Google LLC
Inventors: Quoc V. Le, Ilya Sutskever, Arvind Neelakantan
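The soft combination over a predetermined operation set can be sketched as a score-weighted mixture. The toy below uses three illustrative operations and a single data column; the patent's data score distribution would additionally weight which data each operation reads.

```python
def soft_op_output(column, op_scores):
    """Weight each operation's output by its score from the operation score
    distribution; a one-hot distribution recovers a hard choice."""
    outputs = [
        float(sum(column)),   # "sum" operation
        float(len(column)),   # "count" operation
        float(max(column)),   # "max" operation
    ]
    assert abs(sum(op_scores) - 1.0) < 1e-9
    return sum(s * o for s, o in zip(op_scores, outputs))
```

Because the mixture is differentiable in the scores, the controller can be trained end-to-end even though the underlying operations are discrete.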
-
Patent number: 10936828
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for neural translation systems with rare word processing. One of the methods trains a neural network translation system, for a source language and a target language, to track the source-sentence origin of unknown words in target sentences. It includes: deriving alignment data from a parallel corpus, the alignment data identifying aligned source and target words in each pair of source- and target-language sentences; annotating the sentences in the parallel corpus according to the alignment data and a rare word model to generate a training dataset of paired source- and target-language sentences; and training a neural network translation model on that dataset.
Type: Grant
Filed: November 16, 2018
Date of Patent: March 2, 2021
Assignee: Google LLC
Inventors: Quoc V. Le, Minh-Thang Luong, Ilya Sutskever, Oriol Vinyals, Wojciech Zaremba
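One simple rare word model consistent with this description annotates each out-of-vocabulary target word with the relative position of its aligned source word, so the source can be recovered after decoding by copying or dictionary lookup. A toy sketch (the patent covers other annotation schemes; names here are hypothetical):

```python
def annotate_target(target, alignment, vocab):
    """Replace out-of-vocabulary target words with positional unk tokens
    recording the offset to the aligned source word (from alignment data)."""
    annotated = []
    for t_idx, word in enumerate(target):
        if word in vocab:
            annotated.append(word)
        else:
            s_idx = alignment.get(t_idx)  # aligned source position, if any
            if s_idx is None:
                annotated.append("<unk:?>")
            else:
                annotated.append(f"<unk:{s_idx - t_idx:+d}>")
    return annotated

vocab = {"the", "tower", "is", "tall"}
target = ["the", "Eiffel", "tower", "is", "tall"]
alignment = {1: 2}  # target word 1 aligns to source word 2
annotated = annotate_target(target, alignment, vocab)
```

Training on such annotated sentence pairs teaches the model to emit the positional tokens itself, which is how it "tracks the source" of unknown words.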
-
Publication number: 20210019604
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for predicting the likelihoods of conditions being satisfied using recurrent neural networks. One of the systems is configured to process a temporal sequence comprising a respective input at each of a plurality of time steps, and comprises: one or more recurrent neural network layers; and one or more logistic regression nodes, where each logistic regression node corresponds to a respective condition from a predetermined set of conditions and is configured to, for each time step, receive the network's internal state for that time step and process it in accordance with the current values of the node's parameters to generate a future condition score for the corresponding condition.
Type: Application
Filed: July 27, 2020
Publication date: January 21, 2021
Inventors: Gregory Sean Corrado, Ilya Sutskever, Jeffrey Adgate Dean
-
Patent number: 10867242
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a value neural network configured to receive an observation characterizing a state of an environment being interacted with by an agent, and to process the observation in accordance with the value network's parameters to generate a value score. One of the systems performs operations that include: training a supervised learning policy neural network; initializing the parameters of a reinforcement learning policy neural network, which has the same architecture as the supervised learning policy network, to the trained values of the supervised learning policy network's parameters; training the reinforcement learning policy neural network on second training data; and training the value neural network to generate a value score for a state of the environment that represents the predicted long-term reward resulting from the environment being in that state.
Type: Grant
Filed: September 29, 2016
Date of Patent: December 15, 2020
Assignee: DeepMind Technologies Limited
Inventors: Thore Kurt Hartwig Graepel, Shih-Chieh Huang, David Silver, Arthur Clement Guez, Laurent Sifre, Ilya Sutskever, Christopher Maddison
-
Publication number: 20200327391
Abstract: A parallel convolutional neural network (CNN) is provided. The CNN is implemented by a plurality of convolutional neural networks, each on a respective processing node. Each CNN has a plurality of layers. A subset of the layers is interconnected between processing nodes such that activations are fed forward across nodes; the remaining layers are not so interconnected.
Type: Application
Filed: April 27, 2020
Publication date: October 15, 2020
Inventors: Alexander Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton
-
Publication number: 20200251099
Abstract: A system can be configured to perform tasks such as converting recorded speech to a sequence of phonemes that represent the speech, converting an input sequence of graphemes into a target sequence of phonemes, translating an input sequence of words in one language into a corresponding sequence of words in another language, or predicting a target sequence of words that follow an input sequence of words in a language (e.g., a language model). In a speech recognizer, the RNN system may be used to convert speech to a target sequence of phonemes in real time, so that a transcription of the speech can be generated and presented to a user even before the user has finished uttering the entire speech input.
Type: Application
Filed: February 4, 2020
Publication date: August 6, 2020
Inventors: Navdeep Jaitly, Quoc V. Le, Oriol Vinyals, Samuel Bengio, Ilya Sutskever