Patents by Inventor Georgios Theocharous

Georgios Theocharous has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10783450
    Abstract: Certain embodiments involve learning user preferences and predicting user behavior based on sequential user behavior data. For example, a system obtains data about a sequence of prior actions taken by multiple users. The system determines a similarity between a prior action taken by the various users and groups the various users into groups or clusters based at least in part on the similarity. The system trains a machine-learning algorithm such that the machine-learning algorithm can be used to predict a subsequent action of a user among the various users based on the various clusters. The system further obtains data about a current action of a new user and determines which of the clusters to associate with the new user based on the new user's current action. The system determines an action to be recommended to the new user based on the cluster associated with the new user. The action can include a series or sequence of actions to be taken by the new user.
    Type: Grant
    Filed: November 10, 2016
    Date of Patent: September 22, 2020
    Assignee: ADOBE INC.
    Inventors: Nikolaos Vlassis, Georgios Theocharous, Mandana Hamidi Haines
  • Publication number: 20200241878
    Abstract: The present disclosure relates to generating proposed digital actions in high-dimensional action spaces for client devices utilizing reinforcement learning models. For example, the disclosed systems can utilize a supervised machine learning model to train a latent representation decoder to determine proposed digital actions based on latent representations. Additionally, the disclosed systems can utilize a latent representation policy gradient model to train a state-based latent representation generation policy to generate latent representations based on the current state of client devices. Subsequently, the disclosed systems can identify the current state of a client device and a plurality of available actions, utilize the state-based latent representation generation policy to generate a latent representation based on the current state, and utilize the latent representation decoder to determine a proposed digital action from the plurality of available actions by analyzing the latent representation.
    Type: Application
    Filed: January 29, 2019
    Publication date: July 30, 2020
    Inventors: Yash Chandak, Georgios Theocharous
  • Patent number: 10558987
    Abstract: Optimizing customer lifetime value (LTV) techniques are described. In one or more implementations, a simulator is configured to derive a prediction model based on data indicative of user interaction online with marketing offers. The prediction model may be produced by automatically classifying variables according to feature types and matching each feature type to a response function that defines how the variable responds to input actions. The classification of variables and/or corresponding response functions per the prediction model may consider dependencies between variables and dependencies between successive states. An evaluator may then be invoked to apply the prediction model to test a proposed marketing strategy offline. Application of the prediction model is designed to predict user response to simulated offers/actions and enable evaluation of marketing strategies with respect to one or more long-term objectives.
    Type: Grant
    Filed: March 12, 2014
    Date of Patent: February 11, 2020
    Assignee: Adobe Inc.
    Inventors: Georgios Theocharous, Assaf Joseph Hallak
  • Publication number: 20200033144
    Abstract: The present disclosure relates to generating and modifying recommended event sequences utilizing a dynamic user preference interface. For example, in one or more embodiments, the system generates a recommended event sequence using a recommendation model trained based on a plurality of historical event sequences. The system then provides, for display via a client device, the recommendation, a plurality of interactive elements for entry of user preferences, and a visual representation of historical event sequences. Upon detecting input of user preferences, the system can modify a reward function of the recommendation model and provide a modified recommended event sequence together with the plurality of interactive elements. In one or more embodiments, as a user enters user preferences, the system additionally modifies the visual representation to display subsets of the plurality of historical event sequences corresponding to the preferences.
    Type: Application
    Filed: July 27, 2018
    Publication date: January 30, 2020
    Inventors: Fan Du, Sana Malik Lee, Georgios Theocharous, Eunyee Koh
  • Patent number: 10430825
    Abstract: A digital medium environment is described to recommend advertisements using ranking functions. A ranking function is configured to compute a score by applying a user context vector associated with a user to individual ranking weight vectors associated with advertisements, and provide the advertisement with the highest score to the user. In order to learn the ranking weight vectors for the ranking function, training data is obtained that includes user interactions with advertisements during previous sessions as well as user context vectors. The ranking weight vectors for the ranking function associated with each advertisement can then be learned by controlling the score generated by the ranking function to be higher for positive interactions than the negative interactions. To do so, the ranking weight vectors may be learned by optimizing an area under the curve ranking loss (AUCL) for the ranking function.
    Type: Grant
    Filed: January 18, 2016
    Date of Patent: October 1, 2019
    Assignee: Adobe Inc.
    Inventors: Sougata Chaudhuri, Georgios Theocharous, Mohammad Ghavamzadeh
  • Publication number: 20190295004
    Abstract: Systems and methods provide a recommendation system for recommending sequential content. The training of a reinforcement learning (RL) agent is bootstrapped from passive data. The RL agent of the sequential recommendations system is trained using the passive data over a number of epochs involving interactions between the sequential recommendation system and user devices. At each epoch, available active data from previous epochs is obtained, and transition probabilities are generated from the passive data and at least one parameter derived from the currently available active data. Recommended content is selected based on a current state and the generated transition probabilities, and the active data is updated from the current epoch based on the recommended content and a resulting new state. A clustering approach can also be employed when deriving parameters from active data to balance model expressiveness and data sparsity.
    Type: Application
    Filed: March 23, 2018
    Publication date: September 26, 2019
    Inventors: Sorathan Chaturapruek, Georgios Theocharous, Kent Andrew Edmonds
  • Publication number: 20190279096
    Abstract: The present disclosure relates to recommending points of interest to a plurality of users based on a type of each user as well as constraints associated with the points of interest. For example, one or more embodiments determine a user type for each user and determine user preferences based on the user type. Additionally, the system can determine resource constraints associated with each point of interest, indicating limitations on the capacity of each associated resource. The system can then provide recommendations to the plurality of users based on the user types and the resource constraints. In particular, the system can recommend points of interest that satisfy the preferences corresponding to each user type subject to the resource constraints of each point of interest. For example, one or more embodiments involve solving a linear program that takes into account user types to obtain recommendation policies subject to the resource constraints.
    Type: Application
    Filed: March 7, 2018
    Publication date: September 12, 2019
    Inventors: Frits de Nijs, Georgios Theocharous
  • Publication number: 20190236410
    Abstract: Systems and methods provide for bootstrapping a sequential recommendation system from passive data. A learning agent of the sequential recommendations system is trained using the passive data over a number of epochs involving interactions between the sequential recommendation system and user devices. At each epoch, available active data from previous epochs is obtained, and transition probabilities are generated from the passive data and at least one parameter derived from the currently available active data. A recommended action is selected given a current state and the generated transition probabilities, and the active data is updated from the current epoch based on the recommended action and a resulting new state. A clustering approach can also be employed when deriving parameters from active data to balance model expressiveness and data sparsity.
    Type: Application
    Filed: February 1, 2018
    Publication date: August 1, 2019
    Inventors: Sorathan Chaturapruek, Georgios Theocharous
  • Publication number: 20180276691
    Abstract: Metric forecasting techniques and systems in a digital medium environment are described that leverage similarity of elements, one to another, in order to generate a forecast value for a metric for a particular element. In one example, training data is received that describes a time series of values of the metric for a plurality of elements. The model is trained to generate the forecast value of the metric, the training using machine learning of a neural network based on the training data. The training includes generating dimensional-transformation data configured to transform the training data into a simplified representation to determine similarity of the plurality of elements, one to another, with respect to the metric over the time series. The training also includes generating model parameters of the neural network based on the simplified representation to generate the forecast value of the metric.
    Type: Application
    Filed: March 21, 2017
    Publication date: September 27, 2018
    Applicant: Adobe Systems Incorporated
    Inventors: Chunyuan Li, Hung Hai Bui, Mohammad Ghavamzadeh, Georgios Theocharous
  • Publication number: 20180211266
    Abstract: Metric forecasting techniques in a digital medium environment are described. A time series interval is identified by an analytics system that is exhibited by input usage data. The input usage data describes values of a metric involved in the provision of the digital content by a service provider system. A determination is then made by the analytics system as to whether historical usage data includes the identified time series interval. A forecast model is then selected by the analytics system from a plurality of forecast models based on a result of the determination and the identified time series interval. Forecast data is then generated by a forecast module of the analytics system. The forecast data is configured to predict at least one value of the metric based on the selected forecast model, a result of the determination, and the input usage data.
    Type: Application
    Filed: January 24, 2017
    Publication date: July 26, 2018
    Applicant: Adobe Systems Incorporated
    Inventors: Georgios Theocharous, Trevor H. Paulsen
  • Publication number: 20180165590
    Abstract: Certain embodiments involve generating personalized recommendations for users by inferring a propensity of each individual user to accept a recommendation. For example, a system generates a personalized user model based on a historical transition matrix that provides state transition probabilities from a general population of users. The probabilities are adjusted based on the propensity for a user to accept a recommendation. The system determines a recommended action for the user to transition between predefined states based on the user model. Once the user has performed an activity that transitions from a current state, the system adjusts a probability distribution for an estimate of the propensity based on whether the activity is the recommended action.
    Type: Application
    Filed: December 9, 2016
    Publication date: June 14, 2018
    Inventors: Nikolaos Vlassis, Georgios Theocharous
  • Publication number: 20180129971
    Abstract: Certain embodiments involve learning user preferences and predicting user behavior based on sequential user behavior data. For example, a system obtains data about a sequence of prior actions taken by multiple users. The system determines a similarity between a prior action taken by the various users and groups the various users into groups or clusters based at least in part on the similarity. The system trains a machine-learning algorithm such that the machine-learning algorithm can be used to predict a subsequent action of a user among the various users based on the various clusters. The system further obtains data about a current action of a new user and determines which of the clusters to associate with the new user based on the new user's current action. The system determines an action to be recommended to the new user based on the cluster associated with the new user. The action can include a series or sequence of actions to be taken by the new user.
    Type: Application
    Filed: November 10, 2016
    Publication date: May 10, 2018
    Inventors: Nikolaos Vlassis, Georgios Theocharous, Mandana Hamidi Haines
  • Publication number: 20170206549
    Abstract: A digital medium environment is described to recommend advertisements using ranking functions. A ranking function is configured to compute a score by applying a user context vector associated with a user to individual ranking weight vectors associated with advertisements, and provide the advertisement with the highest score to the user. In order to learn the ranking weight vectors for the ranking function, training data is obtained that includes user interactions with advertisements during previous sessions as well as user context vectors. The ranking weight vectors for the ranking function associated with each advertisement can then be learned by controlling the score generated by the ranking function to be higher for positive interactions than the negative interactions. To do so, the ranking weight vectors may be learned by optimizing an area under the curve ranking loss (AUCL) for the ranking function.
    Type: Application
    Filed: January 18, 2016
    Publication date: July 20, 2017
    Inventors: Sougata Chaudhuri, Georgios Theocharous, Mohammad Ghavamzadeh
  • Publication number: 20160148246
    Abstract: Risk quantification, policy search, and automated safe policy deployment techniques are described. In one or more implementations, techniques are utilized to determine safety of a policy, such as to express a level of confidence that a new policy will exhibit an increased measure of performance (e.g., interactions or conversions) over a currently deployed policy. In order to make this determination, reinforcement learning and concentration inequalities are utilized, which generate and bound confidence values regarding the measurement of performance of the policy and thus provide a statistical guarantee of this performance. These techniques are usable to quantify risk in deployment of a policy, select a policy for deployment based on estimated performance and a confidence level in this estimate (e.g.
    Type: Application
    Filed: November 24, 2014
    Publication date: May 26, 2016
    Inventors: Philip S. Thomas, Georgios Theocharous, Mohammad Ghavamzadeh
  • Publication number: 20160148250
    Abstract: Risk quantification, policy search, and automated safe policy deployment techniques are described. In one or more implementations, techniques are utilized to determine safety of a policy, such as to express a level of confidence that a new policy will exhibit an increased measure of performance (e.g., interactions or conversions) over a currently deployed policy. In order to make this determination, reinforcement learning and concentration inequalities are utilized, which generate and bound confidence values regarding the measurement of performance of the policy and thus provide a statistical guarantee of this performance. These techniques are usable to quantify risk in deployment of a policy, select a policy for deployment based on estimated performance and a confidence level in this estimate (e.g.
    Type: Application
    Filed: November 24, 2014
    Publication date: May 26, 2016
    Inventors: Philip S. Thomas, Georgios Theocharous, Mohammad Ghavamzadeh
  • Publication number: 20160148251
    Abstract: Risk quantification, policy search, and automated safe policy deployment techniques are described. In one or more implementations, techniques are utilized to determine safety of a policy, such as to express a level of confidence that a new policy will exhibit an increased measure of performance (e.g., interactions or conversions) over a currently deployed policy. In order to make this determination, reinforcement learning and concentration inequalities are utilized, which generate and bound confidence values regarding the measurement of performance of the policy and thus provide a statistical guarantee of this performance. These techniques are usable to quantify risk in deployment of a policy, select a policy for deployment based on estimated performance and a confidence level in this estimate (e.g.
    Type: Application
    Filed: November 24, 2014
    Publication date: May 26, 2016
    Inventors: Philip S. Thomas, Georgios Theocharous, Mohammad Ghavamzadeh
  • Publication number: 20150262205
    Abstract: Optimizing customer lifetime value (LTV) techniques are described. In one or more implementations, a simulator is configured to derive a prediction model based on data indicative of user interaction online with marketing offers. The prediction model may be produced by automatically classifying variables according to feature types and matching each feature type to a response function that defines how the variable responds to input actions. The classification of variables and/or corresponding response functions per the prediction model may consider dependencies between variables and dependencies between successive states. An evaluator may then be invoked to apply the prediction model to test a proposed marketing strategy offline. Application of the prediction model is designed to predict user response to simulated offers/actions and enable evaluation of marketing strategies with respect to one or more long-term objectives.
    Type: Application
    Filed: March 12, 2014
    Publication date: September 17, 2015
    Applicant: Adobe Systems Incorporated
    Inventors: Georgios Theocharous, Assaf Joseph Hallak
  • Publication number: 20150134443
    Abstract: In various example embodiments, a system and method for testing marketing strategies and approximate simulators offline for lifetime value marketing. In example embodiments, real world data, simulated data, and one or more policies that resulted in the simulated data are obtained. Errors between the real world data and the simulated data are determined. Using the determined errors, bounds are determined. Simulators are ranked based on the determined bounds, whereby a lower bound indicates a first simulator providing simulated data closer to the real world data then a second simulator having a higher bound.
    Type: Application
    Filed: November 14, 2013
    Publication date: May 14, 2015
    Applicant: Adobe Systems Incorporated
    Inventors: Assaf Hallak, Georgios Theocharous