Patents by Inventor Theodore Harris Moskovitz

Theodore Harris Moskovitz has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240265263
    Abstract: A method is described for iteratively training a policy model, such as a neural network, of a computer-implemented action selection system to control an agent interacting with an environment to perform a task subject to one or more constraints. The task has a reward associated with performance of the task. Each constraint limits to a corresponding threshold the expected value of the total of a corresponding constraint function which if the future actions of the agent are chosen according to the policy model, and each constraint is associated with a corresponding multiplier variable. In each iteration, a mixed reward function is generated based on values for the multiplier variables generated in the preceding iteration, and estimates of the rewards and the values of constraint reward functions if the actions are chosen based on the policy model generated in the preceding iteration.
    Type: Application
    Filed: January 26, 2024
    Publication date: August 8, 2024
    Inventors: Theodore Harris Moskovitz, Brendan Timothy O'Donoghue, Tom Ben Zion Zahavy, Johan Sebastian Flennerhag, Vivek Veeriah Jeya Veeraiah, Satinder Singh Baveja