Patents Assigned to Prowler.io Limited
  • Publication number: 20200364555
    Abstract: There is disclosed a machine learning technique of determining a policy for an agent controlling an entity in a two-entity system. The method comprises assigning a prior policy and a respective rationality to each entity of the two-entity system, each assigned rationality being associated with a permitted divergence of a policy associated with the associated entity from the prior policy p assigned to that entity, and determining the policy to be followed by an agent corresponding to one entity by optimising an objective function F*(s), wherein the objective function F*(s) includes factors dependent on the respective rationalities and prior policies assigned to the two entities. In this way, the policy followed by an agent controlling an entity in a system can be determined taking into account the rationality of another entity within the system.
    Type: Application
    Filed: October 26, 2018
    Publication date: November 19, 2020
    Applicant: Prowler.io Limited
    Inventors: Jordi GRAU-MOYA, Felix LEIBFRIED, Haitham BOU AMMAR
  • Publication number: 20200218999
    Abstract: A reinforcement learning system comprises an environment (having multiple possible states), and agent, and a policy learner.. The agent is arranged to receive state information indicative of a current environment state and generate an action signal dependent on the state information and a policy associated with the agent, where the action signal is operable to cause an environment-state change. The agent is further arranged to generate experience data dependent on the state information and information conveyed by the action signal. The policy learner is configured to process the experience data in order to update the policy associated with the agent. The reinforcement learning system further comprises a probabilistic model arranged to generate, dependent on the current state of the environment, probabilistic data relating to future states of the environment, and the agent is further arranged to generate the action signal in dependence on the probabilistic data.
    Type: Application
    Filed: March 19, 2020
    Publication date: July 9, 2020
    Applicant: Prowler.io Limited
    Inventors: Stefanos ELEFTHERIADIS, James HENSMAN, Sebastian JOHN, Hugh SALIMBENI
  • Publication number: 20200218932
    Abstract: A method includes: receiving training data comprising a plurality of training data items, each training data item labelled under a respective class and comprising a elements arranged in conformity with a structured representation having an associated coordinate system; determining patches of the training data, each patch comprising a subset of the elements of a respective training data item and being associated with a location within the co-ordinate system of the structured representation; and initialising a set of parameters for a Gaussian process. The method further includes iteratively: processing pairs of the determined patches, using a patch response kernel to determine patch response data; determining, using the patch response data, entries of a covariance matrix; and updating the set of parameters in dependence on the determined entries of the covariance matrix.
    Type: Application
    Filed: March 19, 2020
    Publication date: July 9, 2020
    Applicant: Prowler.io Limited
    Inventors: James HENSMAN, Mark VAN DER WILK, Vincent DUTORDOIR