Patents by Inventor Volodymyr Mnih

Volodymyr Mnih has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Image processing with recurrent attention

Patent number: 10223617

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using recurrent attention. One of the methods includes determining a location in the first image; extracting a glimpse from the first image using the location; generating a glimpse representation of the extracted glimpse; processing the glimpse representation using a recurrent neural network to update a current internal state of the recurrent neural network to generate a new internal state; processing the new internal state to select a location in a next image in the image sequence after the first image; and processing the new internal state to select an action from a predetermined set of possible actions.

Type: Grant

Filed: June 4, 2015

Date of Patent: March 5, 2019

Assignee: DeepMind Technologies Limited

Inventors: Volodymyr Mnih, Koray Kavukcuoglu
ASYNCHRONOUS DEEP REINFORCEMENT LEARNING

Publication number: 20180260708

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for asynchronous deep reinforcement learning. One of the systems includes a plurality of workers, wherein each worker is configured to operate independently of each other worker, and wherein each worker is associated with a respective actor that interacts with a respective replica of the environment during the training of the deep neural network.

Type: Application

Filed: May 11, 2018

Publication date: September 13, 2018

Inventors: Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, Timothy James Alexander Harley, David Silver, Koray Kavukcuoglu
METHODS AND APPARATUS FOR REINFORCEMENT LEARNING

Publication number: 20170278018

Abstract: We describe a method of reinforcement learning for a subject system having multiple states and actions to move from one state to the next. Training data is generated by operating on the system with a succession of actions and used to train a second neural network. Target values for training the second neural network are derived from a first neural network which is generated by copying weights of the second neural network at intervals.

Type: Application

Filed: June 9, 2017

Publication date: September 28, 2017

Inventors: Volodymyr Mnih, Koray Kavukcuoglu
SYSTEM AND METHOD FOR LABELLING AERIAL IMAGES

Publication number: 20170228871

Abstract: A system and method for labelling aerial images. A neural network generates predicted map data. The parameters of the neural network are trained by optimizing an objective function which compensates for noise in the map images. The function compensates both omission noise and registration noise.

Type: Application

Filed: April 26, 2017

Publication date: August 10, 2017

Inventors: Volodymyr Mnih, Geoffrey E. Hinton
System and method for labelling aerial images

Patent number: 9704068

Abstract: A system and method for labelling aerial images. A neural network generates predicted map data. The parameters of the neural network are trained by optimizing an objective function which compensates for noise in the map images. The function compensates both omission noise and registration noise.

Type: Grant

Filed: June 21, 2013

Date of Patent: July 11, 2017

Assignee: Google Inc.

Inventors: Volodymyr Mnih, Geoffrey E. Hinton
Methods and apparatus for reinforcement learning

Patent number: 9679258

Abstract: We describe a method of reinforcement learning for a subject system having multiple states and actions to move from one state to the next. Training data is generated by operating on the system with a succession of actions and used to train a second neural network. Target values for training the second neural network are derived from a first neural network which is generated by copying weights of the second neural network at intervals.

Type: Grant

Filed: December 5, 2013

Date of Patent: June 13, 2017

Assignee: Google Inc.

Inventors: Volodymyr Mnih, Koray Kavukcuoglu
ASYNCHRONOUS DEEP REINFORCEMENT LEARNING

Publication number: 20170140270

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for asynchronous deep reinforcement learning. One of the systems includes a plurality of workers, wherein each worker is configured to operate independently of each other worker, and wherein each worker is associated with a respective actor that interacts with a respective replica of the environment during the training of the deep neural network.

Type: Application

Filed: November 11, 2016

Publication date: May 18, 2017

Inventors: Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, Timothy James Alexander Harley, David Silver, Koray Kavukcuoglu
DISTRIBUTED TRAINING OF REINFORCEMENT LEARNING SYSTEMS

Publication number: 20160232445

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for distributed training of reinforcement learning systems. One of the methods includes receiving, by a learner, current values of the parameters of the Q network from a parameter server, wherein each learner maintains a respective learner Q network replica and a respective target Q network replica; updating, by the learner, the parameters of the learner Q network replica maintained by the learner using the current values; selecting, by the learner, an experience tuple from a respective replay memory; computing, by the learner, a gradient from the experience tuple using the learner Q network replica maintained by the learner and the target Q network replica maintained by the learner; and providing, by the learner, the computed gradient to the parameter server.

Type: Application

Filed: February 4, 2016

Publication date: August 11, 2016

Inventors: Praveen Deepak Srinivasan, Rory Fearon, Cagdas Alcicek, Arun Sarath Nair, Samuel Blackwell, Vedavyas Panneershelvam, Alessandro De Maria, Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Mustafa Suleyman
METHODS AND APPARATUS FOR REINFORCEMENT LEARNING

Publication number: 20150100530

Abstract: We describe a method of reinforcement learning for a subject system having multiple states and actions to move from one state to the next. Training data is generated by operating on the system with a succession of actions and used to train a second neural network. Target values for training the second neural network are derived from a first neural network which is generated by copying weights of the second neural network at intervals.

Type: Application

Filed: December 5, 2013

Publication date: April 9, 2015

Inventors: Volodymyr MNIH, Koray KAVUKCUOGLU
SYSTEM AND METHOD FOR LABELLING AERIAL IMAGES

Publication number: 20130343641

Abstract: A system and method for labelling aerial images. A neural network generates predicted map data. The parameters of the neural network are trained by optimizing an objective function which compensates for noise in the map images. The function compensates both omission noise and registration noise.

Type: Application

Filed: June 21, 2013

Publication date: December 26, 2013

Inventors: Volodymyr Mnih, Geoffrey E. Hinton

prev 1 2 3