Patents by Inventor Volodymyr Mnih

Volodymyr Mnih has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10223617
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing images using recurrent attention. One of the methods includes determining a location in the first image; extracting a glimpse from the first image using the location; generating a glimpse representation of the extracted glimpse; processing the glimpse representation using a recurrent neural network to update a current internal state of the recurrent neural network to generate a new internal state; processing the new internal state to select a location in a next image in the image sequence after the first image; and processing the new internal state to select an action from a predetermined set of possible actions.
    Type: Grant
    Filed: June 4, 2015
    Date of Patent: March 5, 2019
    Assignee: DeepMind Technologies Limited
    Inventors: Volodymyr Mnih, Koray Kavukcuoglu
  • Publication number: 20180260708
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for asynchronous deep reinforcement learning. One of the systems includes a plurality of workers, wherein each worker is configured to operate independently of each other worker, and wherein each worker is associated with a respective actor that interacts with a respective replica of the environment during the training of the deep neural network.
    Type: Application
    Filed: May 11, 2018
    Publication date: September 13, 2018
    Inventors: Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, Timothy James Alexander Harley, David Silver, Koray Kavukcuoglu
  • Publication number: 20170278018
    Abstract: We describe a method of reinforcement learning for a subject system having multiple states and actions to move from one state to the next. Training data is generated by operating on the system with a succession of actions and used to train a second neural network. Target values for training the second neural network are derived from a first neural network which is generated by copying weights of the second neural network at intervals.
    Type: Application
    Filed: June 9, 2017
    Publication date: September 28, 2017
    Inventors: Volodymyr Mnih, Koray Kavukcuoglu
  • Publication number: 20170228871
    Abstract: A system and method for labelling aerial images. A neural network generates predicted map data. The parameters of the neural network are trained by optimizing an objective function which compensates for noise in the map images. The function compensates both omission noise and registration noise.
    Type: Application
    Filed: April 26, 2017
    Publication date: August 10, 2017
    Inventors: Volodymyr Mnih, Geoffrey E. Hinton
  • Patent number: 9704068
    Abstract: A system and method for labelling aerial images. A neural network generates predicted map data. The parameters of the neural network are trained by optimizing an objective function which compensates for noise in the map images. The function compensates both omission noise and registration noise.
    Type: Grant
    Filed: June 21, 2013
    Date of Patent: July 11, 2017
    Assignee: Google Inc.
    Inventors: Volodymyr Mnih, Geoffrey E. Hinton
  • Patent number: 9679258
    Abstract: We describe a method of reinforcement learning for a subject system having multiple states and actions to move from one state to the next. Training data is generated by operating on the system with a succession of actions and used to train a second neural network. Target values for training the second neural network are derived from a first neural network which is generated by copying weights of the second neural network at intervals.
    Type: Grant
    Filed: December 5, 2013
    Date of Patent: June 13, 2017
    Assignee: Google Inc.
    Inventors: Volodymyr Mnih, Koray Kavukcuoglu
  • Publication number: 20170140270
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for asynchronous deep reinforcement learning. One of the systems includes a plurality of workers, wherein each worker is configured to operate independently of each other worker, and wherein each worker is associated with a respective actor that interacts with a respective replica of the environment during the training of the deep neural network.
    Type: Application
    Filed: November 11, 2016
    Publication date: May 18, 2017
    Inventors: Volodymyr Mnih, Adrià Puigdomènech Badia, Alexander Benjamin Graves, Timothy James Alexander Harley, David Silver, Koray Kavukcuoglu
  • Publication number: 20160232445
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for distributed training of reinforcement learning systems. One of the methods includes receiving, by a learner, current values of the parameters of the Q network from a parameter server, wherein each learner maintains a respective learner Q network replica and a respective target Q network replica; updating, by the learner, the parameters of the learner Q network replica maintained by the learner using the current values; selecting, by the learner, an experience tuple from a respective replay memory; computing, by the learner, a gradient from the experience tuple using the learner Q network replica maintained by the learner and the target Q network replica maintained by the learner; and providing, by the learner, the computed gradient to the parameter server.
    Type: Application
    Filed: February 4, 2016
    Publication date: August 11, 2016
    Inventors: Praveen Deepak Srinivasan, Rory Fearon, Cagdas Alcicek, Arun Sarath Nair, Samuel Blackwell, Vedavyas Panneershelvam, Alessandro De Maria, Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Mustafa Suleyman
  • Publication number: 20150100530
    Abstract: We describe a method of reinforcement learning for a subject system having multiple states and actions to move from one state to the next. Training data is generated by operating on the system with a succession of actions and used to train a second neural network. Target values for training the second neural network are derived from a first neural network which is generated by copying weights of the second neural network at intervals.
    Type: Application
    Filed: December 5, 2013
    Publication date: April 9, 2015
    Inventors: Volodymyr MNIH, Koray KAVUKCUOGLU
  • Publication number: 20130343641
    Abstract: A system and method for labelling aerial images. A neural network generates predicted map data. The parameters of the neural network are trained by optimizing an objective function which compensates for noise in the map images. The function compensates both omission noise and registration noise.
    Type: Application
    Filed: June 21, 2013
    Publication date: December 26, 2013
    Inventors: Volodymyr Mnih, Geoffrey E. Hinton