Patents by Inventor Joaquin Quinonero Candela

Joaquin Quinonero Candela has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Biasing Selection of Advertisements from an Advertisement Campaign

Publication number: 20150206170

Abstract: For ad campaigns that have multiple advertisements, each associated with an ad creative, which are automatically selected, an online system may bias selection of advertisements away from underestimated advertisements and towards early-selected advertisements with positive user interactions. To increase the likelihood of various advertisements in an ad campaign being evaluated for presentation to users, the online system may: associate a relatively high performance score with each advertisement in an ad campaign, randomly select advertisements from an ad campaign, modify bid amounts associated with advertisements in the ad campaign, or allocate a portion of the ad campaign's budget for allocation across advertisements in the ad campaign. After presenting a threshold number of advertisements in an ad campaign or receiving an instruction from an advertiser, advertisements from the ad campaign may be selected using conventional methods.

Type: Application

Filed: January 17, 2014

Publication date: July 23, 2015

Applicant: Facebook, Inc.

Inventors: Chinmay Deepak Karande, Joaquin Quinonero Candela, Yaron Greif
Parallelization of online learning algorithms

Patent number: 8904149

Abstract: Methods, systems, and media are provided for a dynamic batch strategy utilized in parallelization of online learning algorithms. The dynamic batch strategy provides a merge function on the basis of a threshold level difference between the original model state and an updated model state, rather than according to a constant or pre-determined batch size. The merging includes reading a batch of incoming streaming data, retrieving any missing model beliefs from partner processors, and training on the batch of incoming streaming data. The steps of reading, retrieving, and training are repeated until the measured difference in states exceeds a set threshold level. The measured differences which exceed the threshold level are merged for each of the plurality of processors according to attributes. The merged differences which exceed the threshold level are combined with the original partial model states to obtain an updated global model state.

Type: Grant

Filed: June 24, 2010

Date of Patent: December 2, 2014

Assignee: Microsoft Corporation

Inventors: Taha Bekir Eren, Oleg Isakov, Weizhu Chen, Jeffrey Scott Dunn, Thomas Ivan Borchert, Joaquin Quinonero Candela, Thore Kurt Hartwig Graepel, Ralf Herbrich
Event prediction in dynamic environments

Patent number: 8417650

Abstract: Event prediction in dynamic environments is described. In an embodiment a prediction engine may use the learnt information to predict events in order to control a system such as for internet advertising, email filtering, fraud detection or other applications. In an example one or more variables exists for pre-specified features describing or associated with events and each variable is considered to have an associated weight and time stamp. For example, belief about each weight is represented using a probability distribution and a dynamics process is used to modify the probability distribution in a manner dependent on the time stamp for that weight. For example, the uncertainty about the associated variable's influence on prediction of future events is increased. Examples of different schedules for applying the dynamics process are given.

Type: Grant

Filed: January 27, 2010

Date of Patent: April 9, 2013

Assignee: Microsoft Corporation

Inventors: Thore Graepel, Joaquin Quinonero Candela, Thomas Ivan Borchert, Ralf Herbrich
Computationally efficient probabilistic linear regression

Patent number: 8250003

Abstract: A computationally efficient method of performing probabilistic linear regression is described. In an embodiment, the method involves adding a white noise term to a weighted linear sum of basis functions and then normalizing the combination. This generates a linear model comprising a set of sparse, normalized basis functions and a modulated noise term. When using the linear model to perform linear regression, the modulated noise term increases the variance associated with output values which are distant from any data points.

Type: Grant

Filed: September 12, 2008

Date of Patent: August 21, 2012

Assignee: Microsoft Corporation

Inventors: Joaquin Quinonero Candela, Edward Lloyd Snelson, Olliver Michael Christian Williams
Scalable clustering

Patent number: 8204838

Abstract: A scalable clustering system is described. In an embodiment the clustering system is operable for extremely large scale applications where millions of items having tens of millions of features are clustered. In an embodiment the clustering system uses a probabilistic cluster model which models uncertainty in the data set where the data set may be for example, advertisements which are subscribed to keywords, text documents containing text keywords, images having associated features or other items. In an embodiment the clustering system is used to generate additional features for associating with a given item. For example, additional keywords are suggested which an advertiser may like to subscribe to. The additional features that are generated have associated probability values which may be used to rank those features in some embodiments. User feedback about the generated features is received and used to revise the feature generation process in some examples.

Type: Grant

Filed: April 10, 2009

Date of Patent: June 19, 2012

Assignee: Microsoft Corporation

Inventors: Anton Schwaighofer, Joaquin Quiñonero Candela, Thomas Borchert, Thore Graepel, Ralf Herbrich
Parallelization of Online Learning Algorithms

Publication number: 20110320767

Abstract: Methods, systems, and media are provided for a dynamic batch strategy utilized in parallelization of online learning algorithms. The dynamic batch strategy provides a merge function on the basis of a threshold level difference between the original model state and an updated model state, rather than according to a constant or pre-determined batch size. The merging includes reading a batch of incoming streaming data, retrieving any missing model beliefs from partner processors, and training on the batch of incoming streaming data. The steps of reading, retrieving, and training are repeated until the measured difference in states exceeds a set threshold level. The measured differences which exceed the threshold level are merged for each of the plurality of processors according to attributes. The merged differences which exceed the threshold level are combined with the original partial model states to obtain an updated global model state.

Type: Application

Filed: June 24, 2010

Publication date: December 29, 2011

Applicant: MICROSOFT CORPORATION

Inventors: Taha Bekir Eren, Oleg Isakov, Weizhu Chen, Jeffrey Scott Dunn, Thomas Ivan Borchert, Joaquin Quinonero Candela, Thore Kurt Hartwig Graepel, Ralf Herbrich
Event Prediction in Dynamic Environments

Publication number: 20110184778

Abstract: Event prediction in dynamic environments is described. In an embodiment a prediction engine may use the learnt information to predict events in order to control a system such as for internet advertising, email filtering, fraud detection or other applications. In an example one or more variables exists for pre-specified features describing or associated with events and each variable is considered to have an associated weight and time stamp. For example, belief about each weight is represented using a probability distribution and a dynamics process is used to modify the probability distribution in a manner dependent on the time stamp for that weight. For example, the uncertainty about the associated variable's influence on prediction of future events is increased. Examples of different schedules for applying the dynamics process are given.

Type: Application

Filed: January 27, 2010

Publication date: July 28, 2011

Applicant: Microsoft Corporation

Inventors: Thore Graepel, Joaquin Quinonero Candela, Thomas Ivan Borchert, Ralf Herbrich
Scalable Clustering

Publication number: 20100262568

Abstract: A scalable clustering system is described. In an embodiment the clustering system is operable for extremely large scale applications where millions of items having tens of millions of features are clustered. In an embodiment the clustering system uses a probabilistic cluster model which models uncertainty in the data set where the data set may be for example, advertisements which are subscribed to keywords, text documents containing text keywords, images having associated features or other items. In an embodiment the clustering system is used to generate additional features for associating with a given item. For example, additional keywords are suggested which an advertiser may like to subscribe to. The additional features that are generated have associated probability values which may be used to rank those features in some embodiments. User feedback about the generated features is received and used to revise the feature generation process in some examples.

Type: Application

Filed: April 10, 2009

Publication date: October 14, 2010

Applicant: Microsoft Corporation

Inventors: Anton Schwaighofer, Joaquin Quinonero Candela, Thomas Borchert, Thore Graepel, Ralf Herbrich
Computationally Efficient Probabilistic Linear Regression

Publication number: 20100070435

Abstract: A computationally efficient method of performing probabilistic linear regression is described. In an embodiment, the method involves adding a white noise term to a weighted linear sum of basis functions and then normalizing the combination. This generates a linear model comprising a set of sparse, normalized basis functions and a modulated noise term. When using the linear model to perform linear regression, the modulated noise term increases the variance associated with output values which are distant from any data points.

Type: Application

Filed: September 12, 2008

Publication date: March 18, 2010

Applicant: Microsoft Corporation

Inventors: Joaquin Quinonero Candela, Edward Lloyd Snelson, Oliver Michael Williams
Event Prediction

Publication number: 20090043593

Abstract: There are many situations in which it is desired to predict outcomes of events. In an example, an event prediction system is described which receives variables for a proposed event. The system accesses learnt statistics describing belief about weights associated with the variables and uses the weights to determine probability information that the proposed event will have a specified outcome. The process involves combining the accessed statistics and mapping them into a number representing the probability. In another example, a machine learning process using assumed density filtering is used to learn the statistics from data about observed events. The event prediction system may be used as part of any suitable type of system such as an internet advertising system, an email filtering system, or a fraud detection system.

Type: Application

Filed: August 8, 2007

Publication date: February 12, 2009

Applicant: Microsoft Corporation

Inventors: Ralf Herbrich, Thore Graepel, Onno Zoeter, Joaquin Quinonero Candela, Phillip Trelford