Patents by Inventor Paul Haahr

Paul Haahr has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8965919
    Abstract: A stopword detection component detects stopwords (also stop-phrases) in search queries input to keyword-based information retrieval systems. Potential stopwords are initially identified by comparing the terms in the search query to a list of known stopwords. Context data is then retrieved based on the search query and the identified stopwords. In one implementation, the context data includes documents retrieved from a document index. In another implementation, the context data includes categories relevant to the search query. Sets of retrieved context data are compared to one another to determine if they are substantially similar. If the sets of context data are substantially similar, this fact may be used to infer that the removal of the potential stopword(s) is not material to the search. If the sets of context data are not substantially similar, the potential stopword can be considered material to the search and should not be removed from the query.
    Type: Grant
    Filed: December 30, 2013
    Date of Patent: February 24, 2015
    Assignee: Google Inc.
    Inventors: Simon Tong, Uri Lerner, Amit Singhal, Paul Haahr, Steven Baker
  • Patent number: 8825645
    Abstract: A ranking component ranks documents, such as web pages or web sites, to obtain a ranking score that defines a quality judgment of the document. The ranking score of a particular document is based on the ranking score of the documents which link to it and based on affiliation among the documents.
    Type: Grant
    Filed: January 6, 2012
    Date of Patent: September 2, 2014
    Assignee: Google Inc.
    Inventors: Krishna Bharat, Amit Singhal, Paul Haahr
  • Publication number: 20140149415
    Abstract: A system and method for providing search query refinements are presented. A stored query and a stored document are associated as a logical pairing. A weight is assigned to the logical pairing. The search query is issued and a set of search documents is produced. At least one search document is matched to at least one stored document. The stored query and the assigned weight associated with the matching at least one stored document are retrieved. At least one cluster is formed based on the stored query and the assigned weight associated with the matching at least one stored document. The stored query associated with the matching at least one stored document are scored for the at least one cluster relative to at least one other cluster. At least one such scored search query is suggested as a set of query refinements.
    Type: Application
    Filed: January 31, 2014
    Publication date: May 29, 2014
    Inventors: Paul Haahr, Steven D. Baker
  • Patent number: 8719276
    Abstract: A system includes a ranking component that ranks nodes, such as web sites, to obtain ranking values that define a quality judgment of the nodes. The ranking values are based on links between the nodes and, among other things, deemphasize links between affiliated nodes. Additionally, the amount of rank that any particular node can contribute to another node may be capped at a threshold level, thus tending to prevent some nodes from unduly influencing the ranking values.
    Type: Grant
    Filed: January 4, 2011
    Date of Patent: May 6, 2014
    Assignee: Google Inc.
    Inventors: Paul Haahr, Martin Kaszkiel, Amit Singhal
  • Publication number: 20140052735
    Abstract: Web pages of a Website may be processed to improve search results. For example, information likely to pertain to more than just the Web page it is directly associated with may be identified. One or more other, related, Web pages that such information is likely to pertain to is also identified. The identified information is associated with the identified other Web page(s) and this association is saved in a way to affect a search result score of the Web page(s).
    Type: Application
    Filed: August 15, 2013
    Publication date: February 20, 2014
    Inventors: Daniel Egnor, Paul Haahr, Kevin D. Lacker, John Ogden Lamping, Amitabh K. Singhal, Ke Yang
  • Patent number: 8645407
    Abstract: A system and method for providing search query refinements are presented. A stored query and a stored document are associated as a logical pairing. A weight is assigned to the logical pairing. The search query is issued and a set of search documents is produced. At least one search document is matched to at least one stored document. The stored query and the assigned weight associated with the matching at least one stored document are retrieved. At least one cluster is formed based on the stored query and the assigned weight associated with the matching at least one stored document. The stored query associated with the matching at least one stored document are scored for the at least one cluster relative to at least one other cluster. At least one such scored search query is suggested as a set of query refinements.
    Type: Grant
    Filed: November 4, 2011
    Date of Patent: February 4, 2014
    Assignee: Google Inc.
    Inventors: Paul Haahr, Steven Baker
  • Patent number: 8639690
    Abstract: A system may determine an extent to which a document is selected when the document is included in a set of search results, generate a score for the document based, at least in part, on the extent to which the document is selected when the document is included in a set of search results; and rank the document with regard to at least one other document based, at least in part, on the score.
    Type: Grant
    Filed: April 24, 2012
    Date of Patent: January 28, 2014
    Assignee: Google Inc.
    Inventors: Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Olcan Sercinoglu, Simon Tong
  • Patent number: 8626787
    Abstract: A stopword detection component detects stopwords (also stop-phrases) in search queries input to keyword-based information retrieval systems. Potential stopwords are initially identified by comparing the terms in the search query to a list of known stopwords. Context data is then retrieved based on the search query and the identified stopwords. In one implementation, the context data includes documents retrieved from a document index. In another implementation, the context data includes categories relevant to the search query. Sets of retrieved context data are compared to one another to determine if they are substantially similar. If the sets of context data are substantially similar, this fact may be used to infer that the removal of the potential stopword(s) is not material to the search. If the sets of context data are not substantially similar, the potential stopword can be considered material to the search and should not be removed from the query.
    Type: Grant
    Filed: June 20, 2013
    Date of Patent: January 7, 2014
    Assignee: Google Inc.
    Inventors: Simon Tong, Uri Lerner, Amit Singhal, Paul Haahr, Steven Baker
  • Patent number: 8583675
    Abstract: In general, one aspect of the subject matter described can be embodied in a method that includes, for a first document that is included in first search results responsive to a first user-submitted query, selecting a plurality of previously submitted queries for which the first document was a responsive search result. The method can further include determining whether second documents that are relevant to the previously submitted query have at least a threshold level of diversity in comparison to the first search results, wherein second documents are determined to be relevant to the previously submitted query based on data that is indicative of user behavior. The method can additionally include identifying one or more queries from the selected previously submitted queries to provide as first suggested queries, and providing the one or more identified queries as first suggested queries with the first search results for the first user-submitted query.
    Type: Grant
    Filed: August 30, 2010
    Date of Patent: November 12, 2013
    Assignee: Google Inc.
    Inventors: Paul Haahr, Charles E. Martin
  • Patent number: 8577901
    Abstract: A system may determine an extent to which a document is selected when the document is included in a set of search results, generate a score for the document based, at least in part, on the extent to which the document is selected when the document is included in a set of search results; and rank the document with regard to at least one other document based, at least in part, on the score.
    Type: Grant
    Filed: September 30, 2011
    Date of Patent: November 5, 2013
    Assignee: Google Inc.
    Inventors: Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Olcan Sercinoglu, Simon Tong
  • Patent number: 8577893
    Abstract: A system ranks documents based on contexts associated with the documents. The system identifies a reference in a first document, where the reference is associated with a second document. The system analyzes a portion of the first document associated with the reference, identifies a rare word (or words) from the portion, creates a context identifier based on the rare word(s), and ranks the second document based on the context identifier.
    Type: Grant
    Filed: March 15, 2004
    Date of Patent: November 5, 2013
    Assignee: Google Inc.
    Inventors: Anna Patterson, Paul Haahr
  • Patent number: 8561185
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for privacy protection. In one aspect, a method includes accessing personally identifiable information (PII) type definitions that characterize PII types; identifying PII type information included in content of a web page, the PII type information being information matching at least one PII type definition; identifying secondary information included in the content of the web page, the secondary information being information that is predefined as being associated with PII type information; determining a risk score from the PII type information and the secondary information; and classifying the web page as a personal information exposure risk if the risk score meets a confidentiality threshold, wherein the personal information exposure risk is indicative of the web page including personally identifiable information.
    Type: Grant
    Filed: May 17, 2011
    Date of Patent: October 15, 2013
    Assignee: Google Inc.
    Inventors: Muthuprasanna Muthusrinivasan, Paul Haahr, Matthew D. Cutts
  • Patent number: 8549014
    Abstract: A system may determine a measure of how a content of a document changes over time, generate a score for the document based, at least in part, on the measure of how the content of the document changes over time, and rank the document with regard to at least one other document based, at least in part, on the score.
    Type: Grant
    Filed: September 14, 2011
    Date of Patent: October 1, 2013
    Assignee: Google Inc.
    Inventors: Anurag Acharya, Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Simon Tong
  • Patent number: 8527524
    Abstract: A system may determine a measure of how a content of a document changes over time, generate a score for the document based, at least in part, on the measure of how the content of the document changes over time, and rank the document with regard to at least one other document based, at least in part, on the score.
    Type: Grant
    Filed: June 30, 2011
    Date of Patent: September 3, 2013
    Assignee: Google Inc.
    Inventors: Anurag Acharya, Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Simon Tong
  • Patent number: 8521717
    Abstract: Web pages of a Website may be processed to improve search results. For example, information likely to pertain to more than just the Web page it is directly associated with may be identified. One or more other, related, Web pages that such information is likely to pertain to is also identified. The identified information is associated with the identified other Web page(s) and this association is saved in a way to affect a search result score of the Web page(s).
    Type: Grant
    Filed: April 21, 2011
    Date of Patent: August 27, 2013
    Assignee: Google Inc.
    Inventors: Daniel Egnor, Paul Haahr, Kevin Lackner, John Lamping, Amitabh K. Singhal, Ke Yang
  • Patent number: 8521749
    Abstract: A system may determine a document inception date associated with a document, generate a score for the document based, at least in part, on the document inception date, and rank the document with regard to at least one other document based, at least in part, on the score.
    Type: Grant
    Filed: October 12, 2010
    Date of Patent: August 27, 2013
    Assignee: Google Inc.
    Inventors: Matt Cutts, Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Simon Tong
  • Patent number: 8504584
    Abstract: A system and method for generating query refinement suggestions may include collecting refinement data for at least one received source query. The collected refinement data is then clustered to form at least one cluster. At least one potential refinement query suggestion is identified from the refinement data within the at least one cluster.
    Type: Grant
    Filed: October 11, 2011
    Date of Patent: August 6, 2013
    Assignee: Google Inc.
    Inventors: Steven Baker, Paul Haahr
  • Patent number: 8489604
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for evaluating resource selection processes. One method includes receiving test queries and generating a first group of resources corresponding to a first automated resource selection process and generating a second group of resources corresponding to a second automated resource selection process for each query. Another method includes generating a query results table for use in generating the groups of resources. The query results table maps queries to resources matched to the queries, and maps each resource to a respective score for the resource and the query, and one or more index selection signals for the resource.
    Type: Grant
    Filed: October 26, 2010
    Date of Patent: July 16, 2013
    Assignee: Google Inc.
    Inventors: Adam Sadovsky, Paul Haahr, Trevor Strohman, Per Bjornsson, Jun Xu, Gabriel Schine, Jay Shrauner
  • Patent number: 8473510
    Abstract: A stopword detection component detects stopwords (also stop-phrases) in search queries input to keyword-based information retrieval systems. Potential stopwords are initially identified by comparing the terms in the search query to a list of known stopwords. Context data is then retrieved based on the search query and the identified stopwords. In one implementation, the context data includes documents retrieved from a document index. In another implementation, the context data includes categories relevant to the search query. Sets of retrieved context data are compared to one another to determine if they are substantially similar. If the sets of context data are substantially similar, this fact may be used to infer that the removal of the potential stopword(s) is not material to the search. If the sets of context data are not substantially similar, the potential stopword can be considered material to the search and should not be removed from the query.
    Type: Grant
    Filed: May 31, 2012
    Date of Patent: June 25, 2013
    Assignee: Google Inc.
    Inventors: Simon Tong, Uri Lerner, Amit Singhal, Paul Haahr, Steven Baker
  • Patent number: 8407231
    Abstract: A method may include receiving a document and an initial score for the document; determining that there has been a decrease in a rate or quantity of new links that point to the document over time; classifying the document as stale in response to the determining; decreasing the initial score for the document, resulting in an updated score; and ranking the document with regard to at least one other document based, at least in part, on the score.
    Type: Grant
    Filed: October 1, 2010
    Date of Patent: March 26, 2013
    Assignee: Google Inc.
    Inventors: Anurag Acharya, Matt Cutts, Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Simon Tong
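
Illustrative sketch: the stopword-detection abstract listed above (patents 8965919, 8626787, and 8473510) describes comparing the context data retrieved with and without a potential stopword, and treating the stopword as removable only when the two sets are substantially similar. The Python below is a minimal sketch of that idea, not the patented implementation. The abstract does not say how similarity is measured, so the Jaccard overlap, the 0.8 threshold, the stopword list, and the toy index are all assumptions made for illustration, as are every function and variable name.

```python
from typing import Callable, Dict, Iterable, List, Set

# Hypothetical stopword list; the abstract only refers to "a list of known stopwords".
KNOWN_STOPWORDS: Set[str] = {"the", "a", "of", "in"}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Overlap of two result sets, from 0.0 (disjoint) to 1.0 (identical)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def removable_stopwords(
    query: str,
    retrieve: Callable[[List[str]], Iterable[str]],
    threshold: float = 0.8,  # assumed cutoff for "substantially similar"
) -> List[str]:
    """Return the known stopwords whose removal barely changes the results."""
    terms = query.lower().split()
    baseline = set(retrieve(terms))
    removable = []
    for term in terms:
        if term not in KNOWN_STOPWORDS:
            continue
        # Retrieve context data again with the candidate stopword dropped.
        reduced = [t for t in terms if t != term]
        if jaccard(baseline, set(retrieve(reduced))) >= threshold:
            removable.append(term)
    return removable


# Toy document index standing in for a real retrieval backend (assumption).
TOY_INDEX: Dict[str, Set[str]] = {
    "d1": {"the", "matrix", "film"},
    "d2": {"matrix", "algebra"},
    "d3": {"history", "of", "rome"},
    "d4": {"history", "of", "rome", "empire"},
}


def toy_retrieve(terms: List[str]) -> List[str]:
    """Return ids of documents that contain every query term."""
    return [doc for doc, words in TOY_INDEX.items() if all(t in words for t in terms)]


if __name__ == "__main__":
    print(removable_stopwords("the matrix", toy_retrieve))
    print(removable_stopwords("history of rome", toy_retrieve))
```

In the toy index, dropping "the" from "the matrix" pulls in an unrelated algebra document, so the word is kept as material to the search. Dropping "of" from "history of rome" leaves the result set unchanged, so it is reported as removable.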