Patents by Inventor Felix Berkenkamp

Felix Berkenkamp has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12246450
    Abstract: A computer-implemented method for for learning a policy. The method includes: recording at least an episode of interactions of the agent with its environment following policy and adding the recorded episode to a set of training data; optimizing a transition dynamics model based on the training data such that the transition dynamics model predicts the next states of the environment depending on the states and actions contained in the training data; optimizing policy parameters based on the training data and the transition dynamics model by optimizing a reward. In the method, the transition dynamics model comprises a first model characterizing the global model and a second model characterizing a correction model, which is configured to correct outputs of the first model.
    Type: Grant
    Filed: March 1, 2022
    Date of Patent: March 11, 2025
    Assignee: Robert Bosch GmbH
    Inventors: Felix Berkenkamp, Lukas Froehlich, Maksym Lefarov, Andreas Doerr
  • Publication number: 20250077726
    Abstract: A method for configuring a technical system. The method includes: detecting reference observations; conditioning reference system models on the reference observations detected for the reference system; detecting observations of results of the technical system to be configured for different values of the configuration parameters; adjusting an a priori model for the relationship between the values of the configuration parameters and the results provided by the technical system to the observations detected for the technical system, wherein the a priori model is formed from a weighted combination of the conditioned reference system models; ascertaining an a posteriori model for the relationship between the values of the configuration parameters and the results provided by the technical system by conditioning the adjusted a priori model on the observations detected for the technical system to be configured; and configuring the technical system using the ascertained a posteriori model.
    Type: Application
    Filed: August 27, 2024
    Publication date: March 6, 2025
    Inventors: Felix Berkenkamp, Kathrin Skubch, Paul Sebastian Baireuther, Petru Tighineanu
  • Patent number: 12194631
    Abstract: A method for controlling a physical system. The method includes training a neural network to output, for a plurality of tasks, a result of the task carried out, in each case in response to the input of a control configuration of the physical system and the input of a value of a task input parameter; ascertaining a control configuration for a further task with the aid of Bayesian optimization, the neural network, parameterized by the task input parameter, being used as a model for the relationship between control configuration and result; and controlling the physical system according to the control configuration to carry out the further task.
    Type: Grant
    Filed: August 27, 2021
    Date of Patent: January 14, 2025
    Assignee: ROBERT BOSCH GMBH
    Inventors: Felix Berkenkamp, Jonathan Spitz, Kathrin Skubch, Lukas Grossberger, Stefan Falkner, Anna Eivazi
  • Publication number: 20240370773
    Abstract: A method for training a machine learning model. The method includes: ascertaining, for each of a plurality of training data elements, a gradient of a target function, wherein the gradient comprises a component for each of a plurality of parameters of the machine learning model; generating an overall gradient by averaging the ascertained gradients component-wise by summing, for each component, the values of the ascertained gradients for this component and by dividing a resulting sum for the component by the number of ascertained gradients for which the component is above a specified threshold value; and adjusting the machine learning model in a direction given by the overall gradient.
    Type: Application
    Filed: April 24, 2024
    Publication date: November 7, 2024
    Inventors: Diane Sabine Staudt, Felix Berkenkamp, Felix Milo Richter, Ilya Kamenshchikov, Valentin Loeffelmann
  • Publication number: 20240311640
    Abstract: A computer-implemented method of learning a policy for an agent. The method includes: receiving an initialized first neural network, in particular a Q-functionor value-function, an initialized second neural network, auxiliary parameters, and the initialized policy; repeating the following steps until a termination condition is fulfilled: sampling a plurality of pairs of states, actions, rewards and new states from a storage. Sampling actions for the current states, and actions for the new sampled states; computing features from a penultimate layer of the first neural network based on the sampled states and actions and updating the second neural network and the auxiliary parameters as well as updating parameters the first neural network using a re-weighted loss.
    Type: Application
    Filed: February 28, 2024
    Publication date: September 19, 2024
    Inventors: Felix Berkenkamp, Gaurav Manek, Jeremy Zieg Kolter, Melrose Roderick
  • Patent number: 12084073
    Abstract: A method and device parameterize a driving dynamics controller of a vehicle, which intervenes in a controlling manner in a driving dynamics of the vehicle. The driving dynamics controller ascertains an action depending on a vehicle state. The method includes providing a model for predicting a vehicle state. The model configured to predict a subsequent vehicle state depending on the vehicle state and the action. At least one data tuple is ascertained including a sequence of vehicle states and respectively associated actions. The vehicle states are ascertained by the driving dynamics controller using the model depending on an ascertained action. The parameters of the driving dynamics controller are changed/adjusted such that a cost function which ascertains costs of the trajectory depending on the vehicle states and on the ascertained actions of the respectively associated vehicle states and is dependent on the parameters of the driving dynamics controller is minimized.
    Type: Grant
    Filed: June 29, 2022
    Date of Patent: September 10, 2024
    Assignee: Robert Bosch GmbH
    Inventors: Andreas Doerr, Felix Berkenkamp, Maksym Lefarov, Valentin Loeffelmann
  • Publication number: 20240198518
    Abstract: A method for training a control policy. The method includes estimating the variance of a value function which associates a state with a value of the state or a pair of state and action with a value of the pair by solving a Bellman uncertainty equation, wherein, for each of multiple states, the reward function of the Bellman uncertainty equation is set to the difference of the total uncertainty about the mean of the value of the subsequent state following the state and the average aleatoric uncertainty of the value of the subsequent state and biasing the control policy in training towards regions for which the estimation gives a higher variance of the value function than for other regions.
    Type: Application
    Filed: November 13, 2023
    Publication date: June 20, 2024
    Inventors: Alessandro Giacomo Bottero, Carlos Enrique Luis Goncalves, Felix Berkenkamp, Jan Peters, Julia Vinogradska
  • Publication number: 20240012368
    Abstract: A computer-implemented control method of constrained controlling of a computer-controlled system. The system is controlled according to a control input, which is safe if a constraint quantity resulting from the controlling of the computer-controlled system exceeds a constraint threshold. A current control input is determined based on previous control inputs and corresponding previous noisy measurements. The computer-controlled system is controlled according to the current control input, thereby obtaining a current noisy measurement of the resulting constraint quantity. The current control input is determined based on a mutual information between a first random variable representing the constraint quantity resulting from the current control input and a second random variable indicating whether a further control input is safe.
    Type: Application
    Filed: June 8, 2023
    Publication date: January 11, 2024
    Inventors: Alessandro Giacomo Bottero, Carlos Enrique Luis Goncalves, Felix Berkenkamp, Jan Peters, Julia Vinogradska
  • Publication number: 20230097371
    Abstract: A system, a device and a method for controlling a physical or chemical process. The method includes: determining a second a posteriori model based on a first a posteriori model that describes the relationship between an input variable and an output variable of a process related to the physical/chemical process, the second a posteriori model describing the relationship between an input variable and an output variable of the physical or chemical process; and controlling the physical or chemical process using the second a posteriori model.
    Type: Application
    Filed: September 22, 2022
    Publication date: March 30, 2023
    Inventors: Petru Tighineanu, Attila Reiss, Felix Berkenkamp, Julia Vinogradska, Kathrin Skubch, Paul Sebastian Baireuther
  • Publication number: 20230100765
    Abstract: A method for estimating input certainty for a neural network using generative modeling. The method includes generating, using an input data, two or more input data and embedding vector combinations and providing, at the neural network, each of the two or more input data and embedding vector combinations. The method also includes receiving, from the neural network, an output value for each input data and embedding vector combination of the two or more input data and embedding vector combinations. The method also includes computing a variance value for the output values of each respective input data and embedding vector combinations and determining a certainty value for the input data based on the variance value.
    Type: Application
    Filed: September 28, 2021
    Publication date: March 30, 2023
    Inventors: Melrose Roderick, Felix Berkenkamp, Fatemeh Sheikholeslami, Jeremy Kolter
  • Publication number: 20230001940
    Abstract: A method and device parameterize a driving dynamics controller of a vehicle, which intervenes in a controlling manner in a driving dynamics of the vehicle. The driving dynamics controller ascertains an action depending on a vehicle state. The method includes providing a model for predicting a vehicle state. The model configured to predict a subsequent vehicle state depending on the vehicle state and the action. At least one data tuple is ascertained including a sequence of vehicle states and respectively associated actions. The vehicle states are ascertained by the driving dynamics controller using the model depending on an ascertained action. The parameters of the driving dynamics controller are changed/adjusted such that a cost function which ascertains costs of the trajectory depending on the vehicle states and on the ascertained actions of the respectively associated vehicle states and is dependent on the parameters of the driving dynamics controller is minimized.
    Type: Application
    Filed: June 29, 2022
    Publication date: January 5, 2023
    Inventors: Andreas Doerr, Felix Berkenkamp, Maksym Lefarov, Valentin Loeffelmann
  • Publication number: 20220297290
    Abstract: A computer-implemented method for for learning a policy. The method includes: recording at least an episode of interactions of the agent with its environment following policy and adding the recorded episode to a set of training data; optimizing a transition dynamics model based on the training data such that the transition dynamics model predicts the next states of the environment depending on the states and actions contained in the training data; optimizing policy parameters based on the training data and the transition dynamics model by optimizing a reward. In the method, the transition dynamics model comprises a first model characterizing the global model and a second model characterizing a correction model, which is configured to correct outputs of the first model.
    Type: Application
    Filed: March 1, 2022
    Publication date: September 22, 2022
    Inventors: Felix Berkenkamp, Lukas Froehlich, Maksym Lefarov, Andreas Doerr
  • Publication number: 20220097227
    Abstract: A method for controlling a physical system. The method includes training a neural network to output, for a plurality of tasks, a result of the task carried out, in each case in response to the input of a control configuration of the physical system and the input of a value of a task input parameter; ascertaining a control configuration for a further task with the aid of Bayesian optimization, the neural network, parameterized by the task input parameter, being used as a model for the relationship between control configuration and result; and controlling the physical system according to the control configuration to carry out the further task.
    Type: Application
    Filed: August 27, 2021
    Publication date: March 31, 2022
    Inventors: Felix Berkenkamp, Jonathan Spitz, Kathrin Skubch, Lukas Grossberger, Stefan Falkner, Anna Eivazi