Patents by Inventor Sébastien Henri
Sébastien Henri has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11328183Abstract: A neural network system is proposed. The neural network can be trained by model-based reinforcement learning to select actions to be performed by an agent interacting with an environment, to perform a task in an attempt to achieve a specified result. The system may comprise at least one imagination core which receives a current observation characterizing a current state of the environment, and optionally historical observations, and which includes a model of the environment. The imagination core may be configured to output trajectory data in response to the current observation, and/or historical observations. The trajectory data comprising a sequence of future features of the environment imagined by the imagination core. The system may also include a rollout encoder to encode the features, and an output stage to receive data derived from the rollout embedding and to output action policy data for identifying an action based on the current observation.Type: GrantFiled: September 14, 2020Date of Patent: May 10, 2022Assignee: DeepMind Technologies LimitedInventors: Daniel Pieter Wierstra, Yujia Li, Razvan Pascanu, Peter William Battaglia, Theophane Guillaume Weber, Lars Buesing, David Paul Reichert, Arthur Clement Guez, Danilo Jimenez Rezende, Adrià Puigdomènech Badia, Oriol Vinyals, Nicolas Manfred Otto Heess, Sebastien Henri Andre Racaniere
-
Patent number: 11200482Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for environment simulation. In one aspect, a system comprises a recurrent neural network configured to, at each of a plurality of time steps, receive a preceding action for a preceding time step, update a preceding initial hidden state of the recurrent neural network from the preceding time step using the preceding action, update a preceding cell state of the recurrent neural network from the preceding time step using at least the initial hidden state for the time step, and determine a final hidden state for the time step using the cell state for the time step. The system further comprises a decoder neural network configured to receive the final hidden state for the time step and process the final hidden state to generate a predicted observation characterizing a predicted state of the environment at the time step.Type: GrantFiled: June 5, 2020Date of Patent: December 14, 2021Assignee: DeepMind Technologies LimitedInventors: Daniel Pieter Wierstra, Shakir Mohamed, Silvia Chiappa, Sebastien Henri Andre Racaniere
-
Publication number: 20210089834Abstract: A neural network system is proposed to select actions to be performed by an agent interacting with an environment to perform a task in an attempt to achieve a specified result. The system may include a controller to receive state data and context data, and to output action data. The system may also include an imagination module to receive the state and action data, and to output consequent state data. The system may also include a manager to receive the state data and the context data, and to output route data which defines whether the system is to execute an action or to imagine. The system may also include a memory to store the context data.Type: ApplicationFiled: December 7, 2020Publication date: March 25, 2021Inventors: Daniel Pieter Wierstra, Yujia Li, Razvan Pascanu, Peter William Battaglia, Theophane Guillaume Weber, Lars Buesing, David Paul Reichert, Oriol Vinyals, Nicolas Manfred Otto Heess, Sebastien Henri Andre Racaniere
-
Publication number: 20210073594Abstract: A neural network system is proposed. The neural network can be trained by model-based reinforcement learning to select actions to be performed by an agent interacting with an environment, to perform a task in an attempt to achieve a specified result. The system may comprise at least one imagination core which receives a current observation characterizing a current state of the environment, and optionally historical observations, and which includes a model of the environment. The imagination core may be configured to output trajectory data in response to the current observation, and/or historical observations. The trajectory data comprising a sequence of future features of the environment imagined by the imagination core. The system may also include a rollout encoder to encode the features, and an output stage to receive data derived from the rollout embedding and to output action policy data for identifying an action based on the current observation.Type: ApplicationFiled: September 14, 2020Publication date: March 11, 2021Inventors: Daniel Pieter Wierstra, Yujia Li, Razvan Pascanu, Peter William Battaglia, Theophane Guillaume Weber, Lars Buesing, David Paul Reichert, Arthur Clement Guez, Danilo Jimenez Rezende, Adrià Puigdomènech Badia, Oriol Vinyals, Nicolas Manfred Otto Heess, Sebastien Henri Andre Racaniere
-
Patent number: 10860895Abstract: A neural network system is proposed to select actions to be performed by an agent interacting with an environment to perform a task in an attempt to achieve a specified result. The system may include a controller to receive state data and context data, and to output action data. The system may also include an imagination module to receive the state and action data, and to output consequent state data. The system may also include a manager to receive the state data and the context data, and to output route data which defines whether the system is to execute an action or to imagine. The system may also include a memory to store the context data.Type: GrantFiled: November 19, 2019Date of Patent: December 8, 2020Assignee: DeepMind Technologies LimitedInventors: Daniel Pieter Wierstra, Yujia Li, Razvan Pascanu, Peter William Battaglia, Theophane Guillaume Weber, Lars Buesing, David Paul Reichert, Oriol Vinyals, Nicolas Manfred Otto Heess, Sebastien Henri Andre Racaniere
-
Publication number: 20200342289Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for environment simulation. In one aspect, a system comprises a recurrent neural network configured to, at each of a plurality of time steps, receive a preceding action for a preceding time step, update a preceding initial hidden state of the recurrent neural network from the preceding time step using the preceding action, update a preceding cell state of the recurrent neural network from the preceding time step using at least the initial hidden state for the time step, and determine a final hidden state for the time step using the cell state for the time step. The system further comprises a decoder neural network configured to receive the final hidden state for the time step and process the final hidden state to generate a predicted observation characterizing a predicted state of the environment at the time step.Type: ApplicationFiled: June 5, 2020Publication date: October 29, 2020Inventors: Daniel Pieter Wierstra, Shakir Mohamed, Silvia Chiappa, Sebastien Henri Andre Racaniere
-
Patent number: 10776670Abstract: A neural network system is proposed. The neural network can be trained by model-based reinforcement learning to select actions to be performed by an agent interacting with an environment, to perform a task in an attempt to achieve a specified result. The system may comprise at least one imagination core which receives a current observation characterizing a current state of the environment, and optionally historical observations, and which includes a model of the environment. The imagination core may be configured to output trajectory data in response to the current observation, and/or historical observations. The trajectory data comprising a sequence of future features of the environment imagined by the imagination core. The system may also include a rollout encoder to encode the features, and an output stage to receive data derived from the rollout embedding and to output action policy data for identifying an action based on the current observation.Type: GrantFiled: November 19, 2019Date of Patent: September 15, 2020Assignee: DeepMind Technologies LimitedInventors: Daniel Pieter Wierstra, Yujia Li, Razvan Pascanu, Peter William Battaglia, Theophane Guillaume Weber, Lars Buesing, David Paul Reichert, Arthur Clement Guez, Danilo Jimenez Rezende, Adrià Puigdomènech Badia, Oriol Vinyals, Nicolas Manfred Otto Heess, Sebastien Henri Andre Racaniere
-
Patent number: 10713559Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for environment simulation. In one aspect, a system comprises a recurrent neural network configured to, at each of a plurality of time steps, receive a preceding action for a preceding time step, update a preceding initial hidden state of the recurrent neural network from the preceding time step using the preceding action, update a preceding cell state of the recurrent neural network from the preceding time step using at least the initial hidden state for the time step, and determine a final hidden state for the time step using the cell state for the time step. The system further comprises a decoder neural network configured to receive the final hidden state for the time step and process the final hidden state to generate a predicted observation characterizing a predicted state of the environment at the time step.Type: GrantFiled: May 3, 2019Date of Patent: July 14, 2020Assignee: DeepMind Technologies LimitedInventors: Daniel Pieter Wierstra, Shakir Mohamed, Silvia Chiappa, Sebastien Henri Andre Racaniere
-
Publication number: 20200090006Abstract: A neural network system is proposed. The neural network can be trained by model-based reinforcement learning to select actions to be performed by an agent interacting with an environment, to perform a task in an attempt to achieve a specified result. The system may comprise at least one imagination core which receives a current observation characterizing a current state of the environment, and optionally historical observations, and which includes a model of the environment. The imagination core may be configured to output trajectory data in response to the current observation, and/or historical observations. The trajectory data comprising a sequence of future features of the environment imagined by the imagination core. The system may also include a rollout encoder to encode the features, and an output stage to receive data derived from the rollout embedding and to output action policy data for identifying an action based on the current observation.Type: ApplicationFiled: November 19, 2019Publication date: March 19, 2020Inventors: Daniel Pieter Wierstra, Yujia Li, Razvan Pascanu, Peter William Battaglia, Theophane Guillaume Weber, Lars Buesing, David Paul Reichert, Arthur Clement Guez, Danilo Jimenez Rezende, Adrià Puigdomènech Badia, Oriol Vinyals, Nicolas Manfred Otto Heess, Sebastien Henri Andre Racaniere
-
Publication number: 20200082227Abstract: A neural network system is proposed to select actions to be performed by an agent interacting with an environment to perform a task in an attempt to achieve a specified result. The system may include a controller to receive state data and context data, and to output action data. The system may also include an imagination module to receive the state and action data, and to output consequent state data. The system may also include a manager to receive the state data and the context data, and to output route data which defines whether the system is to execute an action or to imagine. The system may also include a memory to store the context data.Type: ApplicationFiled: November 19, 2019Publication date: March 12, 2020Inventors: Daniel Pieter Wierstra, Yujia Li, Razvan Pascanu, Peter William Battaglia, Theophane Guillaume Weber, Lars Buesing, David Paul Reichert, Oriol Vinyals, Nicolas Manfred Otto Heess, Sebastien Henri Andre Racaniere
-
Publication number: 20190266475Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for environment simulation. In one aspect, a system comprises a recurrent neural network configured to, at each of a plurality of time steps, receive a preceding action for a preceding time step, update a preceding initial hidden state of the recurrent neural network from the preceding time step using the preceding action, update a preceding cell state of the recurrent neural network from the preceding time step using at least the initial hidden state for the time step, and determine a final hidden state for the time step using the cell state for the time step. The system further comprises a decoder neural network configured to receive the final hidden state for the time step and process the final hidden state to generate a predicted observation characterizing a predicted state of the environment at the time step.Type: ApplicationFiled: May 3, 2019Publication date: August 29, 2019Inventors: Daniel Pieter Wierstra, Shakir Mohamed, Silvia Chiappa, Sebastien Henri Andre Racaniere
-
Patent number: 10129838Abstract: Methods, systems, and devices are described that provide for D2D synchronization. The methods, systems, and/or devices may include tools and techniques that provide for synchronizing a mobile device based on detection of a reliability alarm. A reliability alarm may be used between mobile devices, which is transmitted and/or received on specific D2D resources. Since the resources are reserved for the reliability alarm, a mobile device which was previously isolated from network synchronization will be able to receive the reliability alarm that a reliable synchronization signal is close when it moves within range of a reliable device. Once a reliability alarm is received the mobile device may free other resources to allow it to receive synchronization signals from the reliable devices. The mobile device may then synchronize with the network based on the received synchronization signals and transmit its own reliability alarm for subsequent isolated devices to use.Type: GrantFiled: May 23, 2014Date of Patent: November 13, 2018Assignee: QUALCOMM IncorporatedInventors: Navid Abedini, Nilesh Nilkanth Khude, Saurabha Rangrao Tavildar, Sébastien Henri, Junyi Li, Vincent Douglas Park
-
Patent number: 9888449Abstract: A method, an apparatus, and a computer program product for wireless communication are provided in connection with enabling timing source selection and deselection in a decentralized manner for distributed D2D synchronization in densely populated communications systems. In an example, a communications device, functioning in a non-timing source (TS) mode, is equipped to receive a request for timing information during a synchronization channel. The communications device may further be equipped to determine whether to switch to a TS mode based on a selection utility metric value. In another example, a communications device, functioning in a TS mode, is equipped to transmit a TIB during a synchronization channel, and determine whether to switch to a non-TS mode based on a deselection utility metric value. In the TS mode, the UE is configured to transmit a TIB, while in the non-TS mode the UE is configured not to transmit the TIB.Type: GrantFiled: January 17, 2014Date of Patent: February 6, 2018Assignee: QUALCOMM IncorporatedInventors: Nilesh Nilkanth Khude, Sebastien Henri, Vincent Douglas Park, Junyi Li
-
Patent number: 9467957Abstract: A method, an apparatus, and a computer program product for wireless communication are provided in connection with improving resource allocation for distributed D2D synchronization in densely populated communications systems. In an example, a communications device is equipped to transmit a synchronization signal during a beacon period of a synchronization channel. In an aspect, the synchronization channel may include the beacon period, a paging period, and a TIB period. The communications device may further be equipped to monitor at least one of the beacon period, the paging period, or the TIB period of the synchronization channel for one or more signals from one or more UEs in a D2D network, and determine whether to transmit information during at least one of the beacon period, the paging period, or the TIB period based at least in part on the monitoring.Type: GrantFiled: January 21, 2014Date of Patent: October 11, 2016Assignee: QUALCOMM INCORPORATEDInventors: Nilesh Nilkanth Khude, Sebastien Henri, Vincent Douglas Park, Junyi Li
-
Publication number: 20150341873Abstract: Methods, systems, and devices are described that provide for D2D synchronization. The methods, systems, and/or devices may include tools and techniques that provide for synchronizing a mobile device based on detection of a reliability alarm. A reliability alarm may be used between mobile devices, which is transmitted and/or received on specific D2D resources. Since the resources are reserved for the reliability alarm, a mobile device which was previously isolated from network synchronization will be able to receive the reliability alarm that a reliable synchronization signal is close when it moves within range of a reliable device. Once a reliability alarm is received the mobile device may free other resources to allow it to receive synchronization signals from the reliable devices. The mobile device may then synchronize with the network based on the received synchronization signals and transmit its own reliability alarm for subsequent isolated devices to use.Type: ApplicationFiled: May 23, 2014Publication date: November 26, 2015Applicant: Qualcomm IncorporatedInventors: Navid Abedini, Nilesh Nilkanth Khude, Saurabha Rangrao Tavildar, Sébastien Henri, Junyi Li, Vincent Douglas Park
-
Publication number: 20150208368Abstract: A method, an apparatus, and a computer program product for wireless communication are provided in connection with improving resource allocation for distributed D2D synchronization in densely populated communications systems. In an example, a communications device is equipped to transmit a synchronization signal during a beacon period of a synchronization channel. In an aspect, the synchronization channel may include the beacon period, a paging period, and a TIB period. The communications device may further be equipped to monitor at least one of the beacon period, the paging period, or the TIB period of the synchronization channel for one or more signals from one or more UEs in a D2D network, and determine whether to transmit information during at least one of the beacon period, the paging period, or the TIB period based at least in part on the monitoring.Type: ApplicationFiled: January 21, 2014Publication date: July 23, 2015Applicant: QUALCOMM IncorporatedInventors: Nilesh Nilkanth KHUDE, Sebastien HENRI, Vincent Douglas PARK, Junyi Ll
-
Publication number: 20150208367Abstract: A method, an apparatus, and a computer program product for wireless communication are provided in connection with enabling timing source selection and deselection in a decentralized manner for distributed D2D synchronization in densely populated communications systems. In an example, a communications device, functioning in a non-timing source (TS) mode, is equipped to receive a request for timing information during a synchronization channel. The communications device may further be equipped to determine whether to switch to a TS mode based on a selection utility metric value. In another example, a communications device, functioning in a TS mode, is equipped to transmit a TIB during a synchronization channel, and determine whether to switch to a non-TS mode based on a deselection utility metric value. In the TS mode, the UE is configured to transmit a TIB, while in the non-TS mode the UE is configured not to transmit the TIB.Type: ApplicationFiled: January 17, 2014Publication date: July 23, 2015Applicant: QUALCOMM IncorporatedInventors: Nilesh Nilkanth KHUDE, Sebastien HENRI, Vincent Douglas PARK, Junyi Ll
-
Publication number: 20150078369Abstract: A method, an apparatus, and a computer program product for wireless communication are provided in connection with improving convergence to a common timing structure for devices in a distributed synchronization D2D network. In an example, a communications device is equipped to detect, by a UE, a synchronization signal during a listening slot duration scan of a communication channel. In an aspect, the listening slot duration may be defined based on a first timing structure, and the synchronization signal may be defined based on a second timing structure. The communications device may further be equipped to obtain timing information associated with the second timing structure from the synchronization signal, and determine whether the first timing structure or the second timing structure is a preferred timing structure.Type: ApplicationFiled: September 17, 2013Publication date: March 19, 2015Applicant: QUALCOMM IncorporatedInventors: Nilesh N. KHUDE, Saurabha R. TAVILDAR, Sebastien HENRI, Navid ABEDINI, Junyi LI, Vincent D. PARK
-
Publication number: 20090093994Abstract: A system, method, and computer program for determining a descriptor, comprising calculating a maximum distance for a plurality of points in a sector between each of said plurality of points and an origin; calculating a minimal distance from one of said plurality of points and a target line, wherein said maximum distance is an initial value; computing a plurality of Fourier coefficients from said minimal distances; and defining an invariant descriptor from said Fourier coefficients, and appropriate means and computer-readable instructions.Type: ApplicationFiled: October 9, 2007Publication date: April 9, 2009Inventor: Sebastien Henri Andre Racaniere
-
Patent number: D1001723Type: GrantFiled: September 30, 2021Date of Patent: October 17, 2023Assignee: The Goodyear Tire & Rubber CompanyInventors: Franck Reygrobellet, Jaroslaw Micek, Sebastien Henri Augustin Cauuet, Jean-Noel Marsat, Marta de Conceicao Ferreira Torres