Patents by Inventor Martin Szummer

Martin Szummer has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210073638
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a machine learning model that has been trained through reinforcement learning to select a content item. One of the methods includes receiving first data characterizing a first context in which a first content item may be presented to a first user in a presentation environment; and providing the first data as input to a long-term engagement machine learning model, the model having been trained through reinforcement learning to: receive a plurality of inputs, and process each of the plurality of inputs to generate a respective engagement score for each input that represents a predicted, time-adjusted total number of selections by the respective user of future content items presented to the respective user in the presentation environment if the respective content item is presented in the respective context.
    Type: Application
    Filed: November 16, 2020
    Publication date: March 11, 2021
    Inventors: Benjamin Kenneth Coppin, Mustafa Suleyman, Thomas Chadwick Walters, Timothy Mann, Chia-Yueh Carlton Chu, Martin Szummer, Luis Carlos Cobo Rus, Jean-Francois Crespo
  • Patent number: 10839310
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a machine learning model that has been trained through reinforcement learning to select a content item. One of the methods includes receiving first data characterizing a first context in which a first content item may be presented to a first user in a presentation environment; and providing the first data as input to a long-term engagement machine learning model, the model having been trained through reinforcement learning to: receive a plurality of inputs, and process each of the plurality of inputs to generate a respective engagement score for each input that represents a predicted, time-adjusted total number of selections by the respective user of future content items presented to the respective user in the presentation environment if the respective content item is presented in the respective context.
    Type: Grant
    Filed: July 15, 2016
    Date of Patent: November 17, 2020
    Assignee: Google LLC
    Inventors: Benjamin Kenneth Coppin, Mustafa Suleyman, Thomas Chadwick Walters, Timothy Mann, Chia-Yueh Carlton Chu, Martin Szummer, Luis Carlos Cobo Rus, Jean-Francois Crespo
  • Publication number: 20180018580
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using a machine learning model that has been trained through reinforcement learning to select a content item. One of the methods includes receiving first data characterizing a first context in which a first content item may be presented to a first user in a presentation environment; and providing the first data as input to a long-term engagement machine learning model, the model having been trained through reinforcement learning to: receive a plurality of inputs, and process each of the plurality of inputs to generate a respective engagement score for each input that represents a predicted, time-adjusted total number of selections by the respective user of future content items presented to the respective user in the presentation environment if the respective content item is presented in the respective context.
    Type: Application
    Filed: July 15, 2016
    Publication date: January 18, 2018
    Inventors: Benjamin Kenneth Coppin, Mustafa Suleyman, Thomas Chadwick Walters, Timothy Mann, Chia-Yueh Carlton Chu, Martin Szummer, Luis Carlos Cobo Rus, Jean-Francois Crespo
  • Patent number: 8380723
    Abstract: Inferring query intent in information retrieval is described. In an example reformulations of an initial query by a user are used to create a query neighborhood. In the example, the query neighborhood is used to identify a set of possibly related queries. First and higher order reformulations of the initial query may be used to expand the query neighborhood. In an example precision can be improved by reducing the query neighborhood to more closely related queries for example, two queries can be connected if they are often clicked for the same document. In an example two queries can be connected using a random walk and all pairs of queries that are not connected by a random walk of less than a fixed threshold are removed. The connected queries can be used to form clusters and weights can be applied in order to determine the most likely related queries.
    Type: Grant
    Filed: May 21, 2010
    Date of Patent: February 19, 2013
    Assignee: Microsoft Corporation
    Inventors: Filip Radlinski, Martin Szummer, Nick Craswell
  • Patent number: 8346800
    Abstract: Content-based information retrieval is described. In an example, a query item such as an image, document, email or other item is presented and items with similar content are retrieved from a database of items. In an example, each time a query is presented, a classifier is formed based on that query and using a training set of items. For example, the classifier is formed in real-time and is formed in such a way that a limit on the proportion of the items in the database that will be retrieved is set. In an embodiment, the query item is analyzed to identify tokens in that item and subsets of those tokens are selected to form the classifier. For example, the subsets of tokens are combined using Boolean operators in a manner which is efficient for searching on particular types of database.
    Type: Grant
    Filed: April 2, 2009
    Date of Patent: January 1, 2013
    Assignee: Microsoft Corporation
    Inventors: Martin Szummer, Andrew Fitzgibbon, Lorenzo Torresani
  • Publication number: 20110289063
    Abstract: Inferring query intent in information retrieval is described. In an example reformulations of an initial query by a user are used to create a query neighborhood. In the example, the query neighborhood is used to identify a set of possibly related queries. First and higher order reformulations of the initial query may be used to expand the query neighborhood. In an example precision can be improved by reducing the query neighborhood to more closely related queries for example, two queries can be connected if they are often clicked for the same document. In an example two queries can be connected using a random walk and all pairs of queries that are not connected by a random walk of less than a fixed threshold are removed. The connected queries can be used to form clusters and weights can be applied in order to determine the most likely related queries.
    Type: Application
    Filed: May 21, 2010
    Publication date: November 24, 2011
    Applicant: Microsoft Corporation
    Inventors: Filip Radlinski, Martin Szummer, Nick Craswell
  • Patent number: 8037043
    Abstract: An information retrieval system is described for retrieving a list of documents such as web pages or other items from a document index in response to a user query. In an embodiment a prediction engine is used to predict both explicit relevance information such as judgment labels and implicit relevance information such as click data. In an embodiment the predicted relevance information is applied to a stored utility function that describes user satisfaction with a search session. This produces utility scores for proposed lists of documents. Using the utility scores one of the lists of documents is selected. In this way different sources of relevance information are combined into a single information retrieval system in a principled and effective manner which gives improved performance.
    Type: Grant
    Filed: September 9, 2008
    Date of Patent: October 11, 2011
    Assignee: Microsoft Corporation
    Inventors: Onno Zoeter, Michael J. Taylor, Edward Lloyd Snelson, John P. Guiver, Nicholas Craswell, Martin Szummer
  • Patent number: 7877385
    Abstract: Information retrieval using query-document pair information is described. In an embodiment, a click record is accessed having information about queries and documents where user clicks have been observed for query-document pairs. A click graph is either formed or accessed. This has nodes connected by edges, each node representing any of a document and a query and each edge representing at least one observed click. Given at least one first node in the click graph, a similarity measure is determined between that first node and each of one or more second nodes. The second nodes are then ranked on the basis of the similarity measure results and the ranking is used to retrieve information from the click record.
    Type: Grant
    Filed: September 21, 2007
    Date of Patent: January 25, 2011
    Assignee: Microsoft Corporation
    Inventors: Nicholas Craswell, Martin Szummer
  • Publication number: 20100257202
    Abstract: Content-based information retrieval is described. In an example, a query item such as an image, document, email or other item is presented and items with similar content are retrieved from a database of items. In an example, each time a query is presented, a classifier is formed based on that query and using a training set of items. For example, the classifier is formed in real-time and is formed in such a way that a limit on the proportion of the items in the database that will be retrieved is set. In an embodiment, the query item is analyzed to identify tokens in that item and subsets of those tokens are selected to form the classifier. For example, the subsets of tokens are combined using Boolean operators in a manner which is efficient for searching on particular types of database.
    Type: Application
    Filed: April 2, 2009
    Publication date: October 7, 2010
    Applicant: Microsoft Corporation
    Inventors: Martin Szummer, Andrew Fitzgibbon, Lorenzo Torresani
  • Patent number: 7720773
    Abstract: We set out a graphical model for describing probability distributions over labeled partitions of an undirected graph which are conditioned on observed data. We show how to efficiently perform exact inference in these models, by exploiting the structure of the graph and adapting the sum-product and max-product algorithms. The method can be used for partitioning and labeling hand-drawn ink fragments, image data, speech data and natural language data amongst other types of data elements. A significant performance increase is obtained by labeling and partitioning simultaneously. It is also possible to partition without labeling.
    Type: Grant
    Filed: December 29, 2005
    Date of Patent: May 18, 2010
    Assignee: Microsoft Corporation
    Inventors: Martin Szummer, Philip Cowans
  • Publication number: 20100076949
    Abstract: An information retrieval system is described for retrieving a list of documents such as web pages or other items from a document index in response to a user query. In an embodiment a prediction engine is used to predict both explicit relevance information such as judgment labels and implicit relevance information such as click data. In an embodiment the predicted relevance information is applied to a stored utility function that describes user satisfaction with a search session. This produces utility scores for proposed lists of documents. Using the utility scores one of the lists of documents is selected. In this way different sources of relevance information are combined into a single information retrieval system in a principled and effective manner which gives improved performance.
    Type: Application
    Filed: September 9, 2008
    Publication date: March 25, 2010
    Applicant: Microsoft Corporation
    Inventors: Onno Zoeter, Michael J. Taylor, Edward Lloyd Snelson, John P. Guiver, Nicholas Craswell, Martin Szummer
  • Patent number: 7512273
    Abstract: Digital ink strokes may be fragmented to form a training data set. A neighborhood graph may be formed as a plurality of connected nodes. Relevant features of the training data may be determined in each fragment such as local site features, interaction features, and/or part-label interaction features. Using a conditional random field which may include a hidden random field modeling parameters may be developed to provide a training model to determine a posterior probability of the labels given observed data. In this manner, the training model may be used to predict a label for an observed ink stroke. The modeling parameters may be learned from only a portion of the set of ink strokes in an unsupervised way. For example, many compound objects may include compositional parts. In some cases, appropriate compositional parts may be discovered or inferred during training of the model based on the training data.
    Type: Grant
    Filed: October 21, 2005
    Date of Patent: March 31, 2009
    Assignee: Microsoft Corporation
    Inventor: Martin Szummer
  • Publication number: 20090083222
    Abstract: Information retrieval using query-document pair information is described. In an embodiment, a click record is accessed having information about queries and documents where user clicks have been observed for query-document pairs. A click graph is either formed or accessed. This has nodes connected by edges, each node representing any of a document and a query and each edge representing at least one observed click. Given at least one first node in the click graph, a similarity measure is determined between that first node and each of one or more second nodes. The second nodes are then ranked on the basis of the similarity measure results and the ranking is used to retrieve information from the click record.
    Type: Application
    Filed: September 21, 2007
    Publication date: March 26, 2009
    Applicant: Microsoft Corporation
    Inventors: Nicholas Craswell, Martin Szummer
  • Publication number: 20070156617
    Abstract: We set out a graphical model for describing probability distributions over labeled partitions of an undirected graph which are conditioned on observed data. We show how to efficiently perform exact inference in these models, by exploiting the structure of the graph and adapting the sum-product and max-product algorithms. The method can be used for partitioning and labeling hand-drawn ink fragments, image data, speech data and natural language data amongst other types of data elements. A significant performance increase is obtained by labeling and partitioning simultaneously. It is also possible to partition without labeling.
    Type: Application
    Filed: December 29, 2005
    Publication date: July 5, 2007
    Applicant: Microsoft Corporation
    Inventors: Martin Szummer, Philip Cowans
  • Publication number: 20060115145
    Abstract: A Bayesian approach to training in conditional random fields takes a prior distribution over the modeling parameters of interest. These prior distributions may be used to generate an approximate form of a posterior distribution over the parameters, which may be trained with example or training data. Automatic relevance determination (ARD) may be integrated in the training to automatically select relevant features of the training data. From the trained posterior distribution of the parameters, a posterior distribution over the parameters based on the training data and the prior distributions over parameters may be approximated to form a training model. Using the developed training model, a given image may be evaluated by integrating over the posterior distribution over parameters to obtain a marginal probability distribution over the labels given that observational data.
    Type: Application
    Filed: November 30, 2004
    Publication date: June 1, 2006
    Applicant: Microsoft Corporation
    Inventors: Christopher Bishop, Martin Szummer, Tonatiuh Centeno, Markus Svensen, Yuan Qi
  • Publication number: 20060098871
    Abstract: Digital ink strokes may be fragmented to form a training data set. A neighborhood graph may be formed as a plurality of connected nodes. Relevant features of the training data may be determined in each fragment such as local site features, interaction features, and/or part-label interaction features. Using a conditional random field which may include a hidden random field modeling parameters may be developed to provide a training model to determine a posterior probability of the labels given observed data. In this manner, the training model may be used to predict a label for an observed ink stroke. The modeling parameters may be learned from only a portion of the set of ink strokes in an unsupervised way. For example, many compound objects may include compositional parts. In some cases, appropriate compositional parts may be discovered or inferred during training of the model based on the training data.
    Type: Application
    Filed: October 21, 2005
    Publication date: May 11, 2006
    Applicant: Microsoft Corporation
    Inventor: Martin Szummer