Patents by Inventor Monika H. Henzinger

Monika H. Henzinger has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20120290597
    Abstract: Near-duplicate documents may be identified by (a) accepting a set of documents, (b) processing the set of documents to determine a first set of near-duplicate documents using a first document similarity technique, and (c) processing the first set of near-duplicate documents to determine a second set of near-duplicate documents using a second document similarity technique. The first document similarity technique might be token order dependent, and the second document similarity technique might be order independent. The first document similarity technique might be token frequency independent, and the second document similarity technique might be frequency dependent.
    Type: Application
    Filed: September 2, 2011
    Publication date: November 15, 2012
    Applicant: Google Inc.
    Inventor: Monika H. Henzinger
  • Publication number: 20120226705
    Abstract: Methods and apparatus consistent with the invention provide improved organization of documents responsive to a search query. In one embodiment, a search query is received and a list of responsive documents is identified. The responsive documents are organized based in whole or in part on usage statistics.
    Type: Application
    Filed: March 9, 2012
    Publication date: September 6, 2012
    Applicant: Google Inc.
    Inventors: Jeffrey A. Dean, Benedict Gomes, Krishna Bharat, Georges Harik, Monika H. Henzinger
  • Patent number: 8190608
    Abstract: A system performs cross-language query translations. The system receives a search query that includes terms in a first language and determines possible translations of the terms of the search query into a second language. The system also locates documents for use as parallel corpora to aid in the translation by: (1) locating documents in the first language that contain references that match the terms of the search query and identify documents in the second language; (2) locating documents in the first language that contain references that match the terms of the query and refer to other documents in the first language and identify documents in the second language that contain references to the other documents; or (3) locating documents in the first language that match the terms of the query and identify documents in the second language that contain references to the documents in the first language.
    Type: Grant
    Filed: June 30, 2011
    Date of Patent: May 29, 2012
    Assignee: Google Inc.
    Inventors: Luis Gravano, Monika H. Henzinger
  • Patent number: 8156100
    Abstract: Methods and apparatus consistent with the invention provide improved organization of documents responsive to a search query. In one embodiment, a search query is received and a list of responsive documents is identified. The responsive documents are organized based in whole or in part on usage statistics.
    Type: Grant
    Filed: February 24, 2011
    Date of Patent: April 10, 2012
    Assignee: Google Inc.
    Inventors: Jeffrey A. Dean, Benedict A. Gomes, Krishna Bharat, Georges Harik, Monika H. Henzinger
  • Publication number: 20120078871
    Abstract: Improved duplicate and near-duplicate detection techniques may assign a number of fingerprints to a given document by (i) extracting parts from the document, (ii) assigning the extracted parts to one or more of a predetermined number of lists, and (iii) generating a fingerprint from each of the populated lists. Two documents may be considered to be near-duplicates if any one of their fingerprints match.
    Type: Application
    Filed: December 7, 2011
    Publication date: March 29, 2012
    Inventors: William Pugh, Monika H. Henzinger
  • Patent number: 8131751
    Abstract: The present disclosure includes, among other things, systems, methods and program products for selecting subsequences (shingles or tuples) generated from sequences of tokens.
    Type: Grant
    Filed: December 3, 2008
    Date of Patent: March 6, 2012
    Assignee: Google Inc.
    Inventors: Behshad Behzadi, Yaniv Bernstein, Stefan Burkhardt, Monika H. Henzinger, Benjamin Liebald, Richard Tucker
  • Patent number: 8065296
    Abstract: A system may provide items during a time period and determine a quality of the items provided during the time period using a time series model.
    Type: Grant
    Filed: September 29, 2004
    Date of Patent: November 22, 2011
    Assignee: Google Inc.
    Inventors: Alexander Mark Franz, Monika H. Henzinger
  • Patent number: 8060501
    Abstract: Techniques are disclosed that locate implicitly defined semantic structures in a document, such as, for example, implicitly defined lists in an HTML document. The semantic structures can be used in the calculation of distance values between terms in the documents. The distance values may be used, for example, in the generation of ranking scores that indicate a relevance level of the document to a search query.
    Type: Grant
    Filed: March 23, 2010
    Date of Patent: November 15, 2011
    Assignee: Google Inc.
    Inventors: Georges R. Harik, Monika H. Henzinger
  • Patent number: 8055669
    Abstract: A search query for a search engine may be improved by incorporating alternate terms into the search query that are semantically similar to terms of the search query, taking into account information derived from the search query. An initial set of alternate terms that may be semantically similar to the original terms in the search query is generated. The initial set of alternate terms may be compared to information derived from the original search query. One example of such information is a set of documents retrieved in response to a search performed using the initial search query. One or more of the alternate terms may be added to the original search query based on their relationship to the information derived from the original search query.
    Type: Grant
    Filed: March 3, 2003
    Date of Patent: November 8, 2011
    Assignee: Google Inc.
    Inventors: Amit Singhal, Mehran Sahami, John Lamping, Marcin Kaszkiel, Monika H. Henzinger
  • Patent number: 8015162
    Abstract: Near-duplicate documents may be identified by processing an accepted set of documents to determine a first set of near-duplicate documents using a first technique, and processing the first set to determine a second set of near-duplicate documents using a second technique. The first technique might be token order dependent, and the second technique might be order independent. The first technique might be token frequency independent, and the second technique might be frequency dependent. The first technique might determine whether two documents are near-duplicates using representations based on a subset of the words or tokens of the documents, and the second technique might determine whether two documents are near-duplicates using representations based on all of the words or tokens of the documents.
    Type: Grant
    Filed: August 4, 2006
    Date of Patent: September 6, 2011
    Assignee: Google Inc.
    Inventor: Monika H. Henzinger
  • Patent number: 8001118
    Abstract: Methods and apparatus consistent with the invention provide improved organization of documents responsive to a search query. In one embodiment, a search query is received and a list of responsive documents is identified. The responsive documents are organized based in whole or in part on usage statistics.
    Type: Grant
    Filed: March 2, 2001
    Date of Patent: August 16, 2011
    Assignee: Google Inc.
    Inventors: Jeffrey A. Dean, Benedict Gomes, Krishna Bharat, Georges Harik, Monika H. Henzinger
  • Patent number: 7996402
    Abstract: A system performs cross-language query translations. The system receives a search query that includes terms in a first language and determines possible translations of the terms of the search query into a second language. The system also locates documents for use as parallel corpora to aid in the translation by: (1) locating documents in the first language that contain references that match the terms of the search query and identify documents in the second language; (2) locating documents in the first language that contain references that match the terms of the query and refer to other documents in the first language and identify documents in the second language that contain references to the other documents; or (3) locating documents in the first language that match the terms of the query and identify documents in the second language that contain references to the documents in the first language.
    Type: Grant
    Filed: August 31, 2010
    Date of Patent: August 9, 2011
    Assignee: Google Inc.
    Inventors: Luis Gravano, Monika H. Henzinger
  • Publication number: 20110179023
    Abstract: Methods and apparatus consistent with the invention provide improved organization of documents responsive to a search query. In one embodiment, a search query is received and a list of responsive documents is identified. The responsive documents are organized based in whole or in part on usage statistics.
    Type: Application
    Filed: February 24, 2011
    Publication date: July 21, 2011
    Applicant: Google Inc.
    Inventors: Jeffrey A. Dean, Benedict A. Gomes, Krishna Bharat, Georges Harik, Monika H. Henzinger
  • Patent number: 7962469
    Abstract: A system limits search results based on context information. The system obtains the context information and a search query, and obtains a set of references to documents in response to the search query. The system then filters the set of references based on the context information and presents the filtered set of references to a user.
    Type: Grant
    Filed: October 9, 2007
    Date of Patent: June 14, 2011
    Assignee: Google Inc.
    Inventors: Urs Hoelzle, Monika H. Henzinger, David desJardins
  • Patent number: 7814103
    Abstract: A system performs cross-language query translations. The system receives a search query that includes terms in a first language and determines possible translations of the terms of the search query into a second language. The system also locates documents for use as parallel corpora to aid in the translation by: (1) locating documents in the first language that contain references that match the terms of the search query and identify documents in the second language; (2) locating documents in the first language that contain references that match the terms of the query and refer to other documents in the first language and identify documents in the second language that contain references to the other documents; or (3) locating documents in the first language that match the terms of the query and identify documents in the second language that contain references to the documents in the first language.
    Type: Grant
    Filed: August 30, 2006
    Date of Patent: October 12, 2010
    Assignee: Google Inc.
    Inventors: Luis Gravano, Monika H. Henzinger
  • Patent number: 7716216
    Abstract: Techniques are disclosed that locate implicitly defined semantic structures in a document, such as, for example, implicitly defined lists in an HTML document. The semantic structures can be used in the calculation of distance values between terms in the documents. The distance values may be used, for example, in the generation of ranking scores that indicate a relevance level of the document to a search query.
    Type: Grant
    Filed: March 31, 2004
    Date of Patent: May 11, 2010
    Assignee: Google Inc.
    Inventors: Georges R. Harik, Monika H. Henzinger
  • Patent number: 7631310
    Abstract: A load balancer evenly distributes processing loads to multiple computing devices. A data structure may be divided into multiple files, each of which corresponds to an estimated load value. The files are assigned to the computing devices in such a way that the processing load at each of the computing devices and the number of files assigned to each of the computing devices is generally balanced.
    Type: Grant
    Filed: September 1, 2004
    Date of Patent: December 8, 2009
    Assignee: Google Inc.
    Inventor: Monika H. Henzinger
  • Patent number: 7421432
    Abstract: A system facilitates a search by a user. The system detects selection of one or more words in a document currently accessed by the user, generates a search query using the selected word(s), and retrieves a document based on the search query. When the document includes one or more links corresponding to a linked document, the system analyzes each of the links, prefetches the linked documents corresponding to a number of the links, and presents the document to the user. The system receives selection of one of the links and retrieves the linked document corresponding to the selected link. The system identifies one or more pieces of information in the retrieved document, determines a link to a related document for each of the identified pieces of information, and provides the determined links with the related document to the user.
    Type: Grant
    Filed: December 13, 2000
    Date of Patent: September 2, 2008
    Assignee: Google Inc.
    Inventors: Urs Hoelzle, Monika H. Henzinger, Lawrence E. Page
  • Publication number: 20080162478
    Abstract: Improved duplicate and near-duplicate detection techniques may assign a number of fingerprints to a given document by (i) extracting parts from the document, (ii) assigning the extracted parts to one or more of a predetermined number of lists, and (iii) generating a fingerprint from each of the populated lists. Two documents may be considered to be near-duplicates if any one of their fingerprints match.
    Type: Application
    Filed: March 15, 2008
    Publication date: July 3, 2008
    Inventors: William Pugh, Monika H. Henzinger
  • Patent number: 7366718
    Abstract: Improved duplicate and near-duplicate detection techniques may assign a number of fingerprints to a given document by (i) extracting parts from the document, (ii) assigning the extracted parts to one or more of a predetermined number of lists, and (iii) generating a fingerprint from each of the populated lists. Two documents may be considered to be near-duplicates if any one of their fingerprints match.
    Type: Grant
    Filed: June 27, 2003
    Date of Patent: April 29, 2008
    Assignee: Google, Inc.
    Inventors: William Pugh, Monika H. Henzinger
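
The three steps named in the abstract of patent 7366718 (extract parts, assign them to a fixed number of lists, fingerprint each populated list) can be illustrated with a minimal sketch. It is not the patented implementation. Treating words as the "parts", choosing a list by hashing the word, using four lists, and using truncated SHA-1 as the fingerprint function are all assumptions made here for illustration, and the function names are hypothetical.

```python
import hashlib

NUM_LISTS = 4  # assumed value; the abstract only says "a predetermined number"


def _stable_hash(text: str) -> int:
    """Return a 64-bit integer derived from SHA-1 (an illustrative choice)."""
    digest = hashlib.sha1(text.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def fingerprints(document: str, num_lists: int = NUM_LISTS) -> list:
    """Return one (list index, fingerprint) pair per populated list."""
    # (i) Extract parts: here, lowercase whitespace-separated words (assumption).
    parts = document.lower().split()

    # (ii) Assign each part to one list; every occurrence of a given word
    # lands in the same list because the list is chosen by the word's hash.
    lists = [[] for _ in range(num_lists)]
    for part in parts:
        lists[_stable_hash(part) % num_lists].append(part)

    # (iii) Generate a fingerprint from each populated list. The list index
    # is kept so that only fingerprints of corresponding lists can match.
    return [
        (index, _stable_hash(" ".join(members)))
        for index, members in enumerate(lists)
        if members
    ]


def near_duplicates(doc_a: str, doc_b: str, num_lists: int = NUM_LISTS) -> bool:
    """Treat two documents as near-duplicates if any fingerprint matches."""
    shared = set(fingerprints(doc_a, num_lists)) & set(fingerprints(doc_b, num_lists))
    return bool(shared)
```

In this sketch, changing one word alters at most two lists (the one that loses the old word and the one that gains the new word), so two documents that differ by a single word still share a fingerprint whenever a third list is populated.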