Patents by Inventor Paul Haahr

Paul Haahr has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10459989
    Abstract: In general, one aspect of the subject matter described can be embodied in a method that includes, obtaining a plurality of search results responsive to an initial search query, the search results including a first search result that identifies a first resource; determining, using a document-to-query-to-document model, that the first resource is relevant to a first suggested query different from the initial search query; generating a presentation of the search results responsive to the initial search query; and providing the presentation of the search results in response to the initial search query. Each search result in the presentation includes a link to a respective resource, wherein the first search result in the presentation includes a link that, upon a selection by a user, can cause the first suggested query to be submitted to a search engine.
    Type: Grant
    Filed: February 2, 2017
    Date of Patent: October 29, 2019
    Assignee: Google LLC
    Inventors: Paul Haahr, Charles E. Martin
  • Patent number: 10452718
    Abstract: A stopword detection component detects stopwords (also stop-phrases) in search queries input to keyword-based information retrieval systems. Potential stopwords are initially identified by comparing the terms in the search query to a list of known stopwords. Context data is then retrieved based on the search query and the identified stopwords. In one implementation, the context data includes documents retrieved from a document index. In another implementation, the context data includes categories relevant to the search query. Sets of retrieved context data are compared to one another to determine if they are substantially similar. If the sets of context data are substantially similar, this fact may be used to infer that the removal of the potential stopword(s) is not material to the search. If the sets of context data are not substantially similar, the potential stopword can be considered material to the search and should not be removed from the query.
    Type: Grant
    Filed: October 18, 2017
    Date of Patent: October 22, 2019
    Assignee: Google LLC
    Inventors: Simon Tong, Uri Lerner, Amit Singhal, Paul Haahr, Steven Baker
  • Patent number: 10248662
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating descriptive text for images. In one aspect, a method includes identifying a set of seed descriptors for an image in a document that is hosted on a website. For each seed descriptor, structure information is generated that specifies a structure of the document with respect to the image and the seed descriptor. One or more templates are generated for each seed descriptor using the structure information for the seed descriptor. Each template can include image location information, document structure information, image feature information, and a generative rule that generates descriptive text for other images in other documents. Descriptive text for other images is generated using the templates and the other documents. The descriptive text is associated with the images.
    Type: Grant
    Filed: March 20, 2018
    Date of Patent: April 2, 2019
    Assignee: Google LLC
    Inventors: Yonghui Wu, Michael E. Flaster, Randall G. Keller, Paul Haahr
  • Patent number: 10223439
    Abstract: A system and method for generating query refinement suggestions may include collecting refinement data for at least one received source query. The collected refinement data is then clustered to form at least one cluster. At least one potential refinement query suggestion is identified from the refinement data within the at least one cluster.
    Type: Grant
    Filed: November 10, 2016
    Date of Patent: March 5, 2019
    Assignee: Google LLC
    Inventors: Steven Baker, Paul Haahr
  • Publication number: 20180210895
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating descriptive text for images. In one aspect, a method includes identifying a set of seed descriptors for an image in a document that is hosted on a website. For each seed descriptor, structure information is generated that specifies a structure of the document with respect to the image and the seed descriptor. One or more templates are generated for each seed descriptor using the structure information for the seed descriptor. Each template can include image location information, document structure information, image feature information, and a generative rule that generates descriptive text for other images in other documents. Descriptive text for other images is generated using the templates and the other documents. The descriptive text is associated with the images.
    Type: Application
    Filed: March 20, 2018
    Publication date: July 26, 2018
    Inventors: Yonghui Wu, Michael E. Flaster, Randall G. Keller, Paul Haahr
  • Patent number: 10025868
    Abstract: Methods, systems, and apparatus, including computer program products, for establishing preferences for sites that are explicitly identified by a user by either a textual input or an acceptance in response to a recommendation, and adjusting the presentation of search results based on these preferences.
    Type: Grant
    Filed: March 15, 2016
    Date of Patent: July 17, 2018
    Assignee: Google LLC
    Inventors: Bryan C. Horling, Michael Y. Brukman, Owen Otto, Philip A. McDonnell, Kavi J. Goel, Paul Haahr
  • Patent number: 9971790
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating descriptive text for images. In one aspect, a method includes identifying a set of seed descriptors for an image in a document that is hosted on a website. For each seed descriptor, structure information is generated that specifies a structure of the document with respect to the image and the seed descriptor. One or more templates are generated for each seed descriptor using the structure information for the seed descriptor. Each template can include image location information, document structure information, image feature information, and a generative rule that generates descriptive text for other images in other documents. Descriptive text for other images is generated using the templates and the other documents. The descriptive text is associated with the images.
    Type: Grant
    Filed: March 14, 2014
    Date of Patent: May 15, 2018
    Assignee: Google LLC
    Inventors: Yonghui Wu, Michael E. Flaster, Randall G. Keller, Paul Haahr
  • Patent number: 9817920
    Abstract: A stopword detection component detects stopwords (also stop-phrases) in search queries input to keyword-based information retrieval systems. Potential stopwords are initially identified by comparing the terms in the search query to a list of known stopwords. Context data is then retrieved based on the search query and the identified stopwords. In one implementation, the context data includes documents retrieved from a document index. In another implementation, the context data includes categories relevant to the search query. Sets of retrieved context data are compared to one another to determine if they are substantially similar. If the sets of context data are substantially similar, this fact may be used to infer that the removal of the potential stopword(s) is not material to the search. If the sets of context data are not substantially similar, the potential stopword can be considered material to the search and should not be removed from the query.
    Type: Grant
    Filed: February 23, 2015
    Date of Patent: November 14, 2017
    Assignee: Google LLC
    Inventors: Simon Tong, Uri Lerner, Amit Singhal, Paul Haahr, Steven Baker
  • Patent number: 9563692
    Abstract: In general, one aspect of the subject matter described can be embodied in a method that includes, obtaining a plurality of search results responsive to an initial search query, the search results including a first search result that identifies a first resource; determining, using a document-to-query-to-document model, that the first resource is relevant to a first suggested query different from the initial search query; generating a presentation of the search results responsive to the initial search query; and providing the presentation of the search results in response to the initial search query. Each search result in the presentation includes a link to a respective resource, wherein the first search result in the presentation includes a link that, upon a selection by a user, can cause the first suggested query to be submitted to a search engine.
    Type: Grant
    Filed: April 24, 2015
    Date of Patent: February 7, 2017
    Assignee: Google Inc.
    Inventors: Paul Haahr, Charles E. Martin
  • Patent number: 9558233
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining a measure of quality for a resource. In one aspect, a method includes determining a seed score for each seed resource in a set. The seed score for a seed resource can be based on a number of resources that include a link to the seed resource and a number of selections of the links. A set of source resources is identified. A source score is determined for each source resource. The source score for a source resource is based on the seed score for each seed resource linked to by the source resource. Source-referenced resources are identified. A resource score is determined for each source-referenced resource. The resource score for a source-referenced resource can be based on the source score for each source resource that includes a link to the source-referenced resource.
    Type: Grant
    Filed: December 31, 2012
    Date of Patent: January 31, 2017
    Assignee: Google Inc.
    Inventors: Hyung-Jin Kim, Paul Haahr, Kien Ng, Chung Tin Kwok, Moustafa A. Hammad, Sushrut Karanjkar
  • Patent number: 9552388
    Abstract: A system and method for providing search query refinements are presented. A stored query and a stored document are associated as a logical pairing. A weight is assigned to the logical pairing. The search query is issued and a set of search documents is produced. At least one search document is matched to at least one stored document. The stored query and the assigned weight associated with the matching at least one stored document are retrieved. At least one cluster is formed based on the stored query and the assigned weight associated with the matching at least one stored document. The stored query associated with the matching at least one stored document are scored for the at least one cluster relative to at least one other cluster. At least one such scored search query is suggested as a set of query refinements.
    Type: Grant
    Filed: January 31, 2014
    Date of Patent: January 24, 2017
    Assignee: Google Inc.
    Inventors: Paul Haahr, Steven D. Baker
  • Patent number: 9495443
    Abstract: A system and method for generating query refinement suggestions may include collecting refinement data for at least one received source query. The collected refinement data is then clustered to form at least one cluster. At least one potential refinement query suggestion is identified from the refinement data within the at least one cluster.
    Type: Grant
    Filed: July 31, 2013
    Date of Patent: November 15, 2016
    Assignee: Google Inc.
    Inventors: Steven Baker, Paul Haahr
  • Patent number: 9436747
    Abstract: Methods, systems, and apparatus, including computer program products, for generating synthetic queries using seed queries and structural similarity between documents are described. In one aspect, a method includes identifying embedded coding fragments (e.g., HTML tag) from a structured document and a seed query; generating one or more query templates, each query template corresponding to at least one coding fragment, the query template including a generative rule to be used in generating candidate synthetic queries; generating the candidate synthetic queries by applying the query templates to other documents that are hosted on the same web site as the document; identifying terms that match structure of the query templates as candidate synthetic queries; measuring a performance for each of the candidate synthetic queries; and designating as synthetic queries the candidate synthetic queries that have performance measurements exceeding a performance threshold.
    Type: Grant
    Filed: June 25, 2015
    Date of Patent: September 6, 2016
    Assignee: Google Inc.
    Inventors: Steven D. Baker, Michael E. Flaster, Nitin Gupta, Paul Haahr, Srinivasan Venkatachary, Yonghui Wu
  • Patent number: 9317563
    Abstract: Methods, systems, and apparatus, including computer program products, for establishing preferences for sites that are explicitly identified by a user by either a textual input or an acceptance in response to a recommendation, and adjusting the presentation of search results based on these preferences.
    Type: Grant
    Filed: October 23, 2013
    Date of Patent: April 19, 2016
    Assignee: Google Inc.
    Inventors: Bryan C. Horling, Michael Y. Brukman, Owen Otto, Philip A. McDonnell, Kavi J. Goel, Paul Haahr
  • Patent number: 9208233
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using synthetic descriptive text to rank search results. One of the methods includes receiving a search query from a user device; receiving data identifying a plurality of search result resources and respective initial scores for each of the search result resources; determining, from a search engine index, that a particular search result resource of the plurality of search result resources is associated with one or more pieces of synthetic descriptive text, wherein each piece of synthetic descriptive text is generated by applying a respective template to a respective linking resource that links to the particular search result resource; computing a synthetic descriptive text score for the particular search result resource; and adjusting the initial score for the particular search result resource based at least in part on the synthetic descriptive text score.
    Type: Grant
    Filed: December 31, 2012
    Date of Patent: December 8, 2015
    Assignee: Google Inc.
    Inventors: Sundeep Tirumalareddy, Michael E. Flaster, Eric Lehman, Paul Haahr
  • Patent number: 9208232
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating synthetic descriptive text. One of the methods includes identifying a group of linking resources, wherein each of the linking resources includes a link to a respective target resource; determining, from a search engine index, that at least some of the target resources are associated with seed queries; generating term location information that identifies, for each seed query, locations of terms from the seed query in the linking resource that links to the target resource associated with the seed query; generating synthetic descriptive text for the target resources based on the term location information; and associating the synthetic descriptive text with the target resources in the search engine index.
    Type: Grant
    Filed: December 31, 2012
    Date of Patent: December 8, 2015
    Assignee: Google Inc.
    Inventors: Sundeep Tirumalareddy, Michael E. Flaster, Eric Lehman, Paul Haahr, Yonghui Wu
  • Patent number: 9092528
    Abstract: In general, one aspect of the subject matter described can be embodied in a method that includes, for a first document that is included in first search results responsive to a first user-submitted query, selecting a plurality of previously submitted queries for which the first document was a responsive search result. The method can further include determining whether second documents that are relevant to the previously submitted query have at least a threshold level of diversity in comparison to the first search results, wherein second documents are determined to be relevant to the previously submitted query based on data that is indicative of user behavior. The method can additionally include identifying one or more queries from the selected previously submitted queries to provide as first suggested queries, and providing the one or more identified queries as first suggested queries with the first search results for the first user-submitted query.
    Type: Grant
    Filed: November 8, 2013
    Date of Patent: July 28, 2015
    Assignee: Google Inc.
    Inventors: Paul Haahr, Charles E. Martin
  • Publication number: 20150161086
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating descriptive text for images. In one aspect, a method includes identifying a set of seed descriptors for an image in a document that is hosted on a website. For each seed descriptor, structure information is generated that specifies a structure of the document with respect to the image and the seed descriptor. One or more templates are generated for each seed descriptor using the structure information for the seed descriptor. Each template can include image location information, document structure information, image feature information, and a generative rule that generates descriptive text for other images in other documents. Descriptive text for other images is generated using the templates and the other documents. The descriptive text is associated with the images.
    Type: Application
    Filed: March 14, 2014
    Publication date: June 11, 2015
    Applicant: Google Inc.
    Inventors: Yonghui Wu, Michael E. Flaster, Randall G. Keller, Paul Haahr
  • Patent number: 9015802
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for privacy protection. In one aspect, a method includes accessing personally identifiable information (PII) type definitions that characterize PII types; identifying PII type information included in content of a web page, the PII type information being information matching at least one PII type definition; identifying secondary information included in the content of the web page, the secondary information being information that is predefined as being associated with PII type information; determining a risk score from the PII type information and the secondary information; and classifying the web page as a personal information exposure risk if the risk score meets a confidentiality threshold, wherein the personal information exposure risk is indicative of the web page including personally identifiable information.
    Type: Grant
    Filed: September 12, 2013
    Date of Patent: April 21, 2015
    Assignee: Google Inc.
    Inventors: Muthuprasanna Muthusrinivasan, Paul Haahr, Matthew D. Cutts
  • Patent number: 8990210
    Abstract: Web pages of a Website may be processed to improve search results. For example, information likely to pertain to more than just the Web page it is directly associated with may be identified. One or more other, related, Web pages that such information is likely to pertain to is also identified. The identified information is associated with the identified other Web page(s) and this association is saved in a way to affect a search result score of the Web page(s).
    Type: Grant
    Filed: August 15, 2013
    Date of Patent: March 24, 2015
    Assignee: Google Inc.
    Inventors: Daniel Egnor, Paul Haahr, Kevin D. Lacker, John Ogden Lamping, Amitabh K. Singhal, Ke Yang
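
Illustrative sketch: the stopword detection abstract listed above (patent numbers 10452718 and 9817920) describes comparing the context data retrieved with and without a potential stopword, and treating the stopword as material when the two sets are not substantially similar. The Python below is only a rough illustration of that idea, not the patented implementation. The stopword list, the toy index, the AND-style retrieval function, the use of Jaccard similarity, and the 0.8 threshold are all assumptions made for this example; the abstract does not specify any of them.

```python
from typing import Callable, Dict, List, Set

# Hypothetical stopword list; the abstract only refers to "a list of known stopwords".
KNOWN_STOPWORDS: Set[str] = {"the", "a", "of", "in"}

# Hypothetical toy document index standing in for a real document index.
TOY_INDEX: Dict[str, str] = {
    "d1": "the who band tour dates",
    "d2": "who band history",
    "d3": "who is on tour this summer",
    "d4": "history of rome empire",
    "d5": "history of rome and italy",
}


def toy_retrieve(terms: List[str]) -> Set[str]:
    """Return ids of documents that contain every query term (assumed AND semantics)."""
    wanted = set(terms)
    return {doc_id for doc_id, text in TOY_INDEX.items() if wanted <= set(text.split())}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def material_stopwords(
    query: str,
    retrieve: Callable[[List[str]], Set[str]],
    threshold: float = 0.8,  # assumed cutoff for "substantially similar"
) -> List[str]:
    """Return the potential stopwords whose removal changes the retrieved context."""
    terms = query.lower().split()
    baseline = retrieve(terms)
    material: List[str] = []
    for i, term in enumerate(terms):
        if term not in KNOWN_STOPWORDS:
            continue
        # Retrieve context again with this one potential stopword removed.
        reduced = terms[:i] + terms[i + 1:]
        if jaccard(baseline, retrieve(reduced)) < threshold:
            # Context changed noticeably, so the term should stay in the query.
            material.append(term)
    return material


if __name__ == "__main__":
    print(material_stopwords("the who tour", toy_retrieve))
    print(material_stopwords("history of rome", toy_retrieve))
```

In this toy setup, removing "the" from "the who tour" pulls in an unrelated document, so the term is kept, while removing "of" from "history of rome" leaves the retrieved set unchanged, so the term could be dropped. A real system would use ranked retrieval or category data rather than exact set matching.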