Patents by Inventor Piotr Michal Stanczyk

Piotr Michal Stanczyk has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220343164
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing reinforcement learning with centralized inference and training One of the methods includes receiving, at a current time-step in a plurality of time-steps, a respective observation by an actor for each environment of a plurality of environments; obtaining, for each environment, a respective reward for the actor as a result of the actor performing a respective action at a previous time-step preceding the current time-step; processing, for each environment, the respective observation and respective reward through a policy model; providing, to the actor, the respective policy outputs for each of the plurality of environments; maintaining at a repository and for each environment, a respective sequence of tuples corresponding to the actor; determining that a maintained sequence meets a threshold condition; and in response, training the policy model on the maintained sequence.
    Type: Application
    Filed: September 25, 2020
    Publication date: October 27, 2022
    Inventors: Lasse Espeholt, Ke Wang, Marcin M. Michalski, Piotr Michal Stanczyk, Raphaƫl Marinier