Patents by Inventor Christopher A. Olston

Christopher A. Olston has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7673019
    Abstract: A system and method for determining a quantitative measure of qualitative usability of related Web pages. Web pages are accepted that each include at least one hyperlink referencing and proximal cues relating to distal content included in another Web page. An information goal identifying a target Web page is specified. An activation network is formed. A directed graph including nodes corresponding to the Web pages and arcs corresponding to the hyperlinks is built. A weight is assigned to each arc to represent a probability of traversal of the corresponding hyperlink based on a relatedness of keywords in the information goal to the proximal cues included in the referenced Web page. A traversal through the activation network to the node corresponding to the target Web page is evaluated as a quantitative measure of usability.
    Type: Grant
    Filed: November 4, 2005
    Date of Patent: March 2, 2010
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Ed H. Chi, Christopher Olston
  • Publication number: 20090234830
    Abstract: The present invention relates to methods, systems, and computer readable media comprising instructions for identifying needy queries for which additional responsive content is needed. The method of the present invention comprises receiving a query comprising one or more terms and retrieving one or more content items identified as responsive to the query, the one or more content items ranked according to one or more ranking techniques. A score is generated for the one or more ranked content items identified as responsive to the query. A determination is thereafter made as to whether the query is needy based upon a comparison of the one or more scores associated with the one or more content items identified as responsive to the query and a needy query score threshold.
    Type: Application
    Filed: March 11, 2008
    Publication date: September 17, 2009
    Applicant: YAHOO! INC.
    Inventors: Christopher Olston, Sandeep Pandey
  • Publication number: 20090204575
    Abstract: A web crawler loads a policy from a customizable stored module that is separate and distinct from the web crawler's source code. The web crawler follows these policies in determining the order in which the web crawler will visit and index web pages in an index used by an Internet search engine. As a result, the web crawler's behavior can be modified more easily. The web crawler's behavior can be finely tuned to be more efficient and/or to accommodate the particular needs of the search engine. Multiple different policies may be maintained concurrently in separate stored modules, and the web crawler can be instructed to use different modules' policies at different specified times or under different specified circumstances.
    Type: Application
    Filed: February 7, 2008
    Publication date: August 13, 2009
    Inventors: CHRISTOPHER OLSTON, Andrew Tomkins
  • Publication number: 20090182706
    Abstract: Computer-implemented methods, modules and clients relate to expanded, pruned sample table for testing database queries against a base table. The expanded, pruned sample table is formed from the base table by a process of initial sampling, synthesis, and pruning.
    Type: Application
    Filed: January 16, 2008
    Publication date: July 16, 2009
    Inventors: Christopher Olston, Utkarsh Srivastava
  • Publication number: 20090164425
    Abstract: An improved system and method for crawl ordering of a web crawler by impact upon search results of a search engine is provided. Content-independent features of uncrawled web pages may be obtained, and the impact of uncrawled web pages may be estimated for queries of a workload using the content-independent features. The impact of uncrawled web pages may be estimated for queries by computing an expected impact score for uncrawled web pages that match needy queries. Query sketches may be created for a subset of the queries by computing an expected impact score for crawled web pages and uncrawled web pages matching the queries. Web pages may then be selected to fetch using a combined query-based estimate and query-independent estimate of the impact of fetching the web pages on search query results.
    Type: Application
    Filed: December 20, 2007
    Publication date: June 25, 2009
    Applicant: Yahoo! Inc.
    Inventors: Christopher Olston, Sandeep Pandey
  • Publication number: 20080104502
    Abstract: An improved system and method is provided for adaptively refreshing a web page. A base version of the web page may be partitioned into a collection of fragments. Then the collection of fragments may be compared with the corresponding fragments of a recent version of the web page to determine a divergence measurement of the difference between the base version and the recent version of the web page. The divergence measurement may be recorded in a change profile representing a change history of the web page that includes a sequence of numeric pairs indicating a time offset and a divergence measurement of the difference between a version of the web page at the time offset and a base version of the web page. The refresh period for the web page may be adjusted by applying an adaptive refresh policy using the divergence measurements recorded in the change profile.
    Type: Application
    Filed: October 26, 2006
    Publication date: May 1, 2008
    Applicant: Yahoo! Inc.
    Inventor: Christopher Olston
  • Publication number: 20080104257
    Abstract: An improved system and method is provided for adaptively refreshing a web page. A base version of the web page may be partitioned into a collection of fragments. Then the collection of fragments may be compared with the corresponding fragments of a recent version of the web page to determine a divergence measurement of the difference between the base version and the recent version of the web page. The divergence measurement may be recorded in a change profile representing a change history of the web page that includes a sequence of numeric pairs indicating a time offset and a divergence measurement of the difference between a version of the web page at the time offset and a base version of the web page. The refresh period for the web page may be adjusted by applying an adaptive refresh policy using the divergence measurements recorded in the change profile.
    Type: Application
    Filed: October 26, 2006
    Publication date: May 1, 2008
    Applicant: Yahoo! Inc.
    Inventor: Christopher Olston
  • Publication number: 20080104256
    Abstract: An improved system and method is provided for adaptively refreshing a web page. A base version of the web page may be partitioned into a collection of fragments. Then the collection of fragments may be compared with the corresponding fragments of a recent version of the web page to determine a divergence measurement of the difference between the base version and the recent version of the web page. The divergence measurement may be recorded in a change profile representing a change history of the web page that includes a sequence of numeric pairs indicating a time offset and a divergence measurement of the difference between a version of the web page at the time offset and a base version of the web page. The refresh period for the web page may be adjusted by applying an adaptive refresh policy using the divergence measurements recorded in the change profile.
    Type: Application
    Filed: October 26, 2006
    Publication date: May 1, 2008
    Applicant: Yahoo! Inc.
    Inventor: Christopher Olston
  • Publication number: 20080098370
    Abstract: The present invention, in an example embodiment, provides a special-purpose formal language and translator for the parallel processing of large databases in a distributed system. The special-purpose language has features of both a declarative programming language and a procedural programming language and supports the co-grouping of tables, each with an arbitrary alignment function, and the specification of procedural operations to be performed on the resulting co-groups. The language's translator translates a program in the language into optimized structured calls to an application programming interface for implementations of functionality related to the parallel processing of tasks over a distributed system. In an example embodiment, the application programming interface includes interfaces for MapReduce functionality, whose implementations are supplemented by the embodiment.
    Type: Application
    Filed: October 20, 2006
    Publication date: April 24, 2008
    Inventors: Marcus Felipe Fontoura, Vanja Josifovski, Shanmugasundaram Ravikumar, Christopher Olston, Benjamin Clay Reed, Andrew Tomkins
  • Publication number: 20080010250
    Abstract: An improved system and method is provided for searching a collection of objects that may be located in hierarchies of auxiliary information for retrieval of response objects. A framework to perform a generalization search in hierarchies may be used to generalize a search by moving up to a higher level in a hierarchy of taxonomies or to specialize a search by moving down to a lower level in the hierarchy of taxonomies. Once the system may decide to enumerate response objects at a particular level of generalization, a budgeted generalization search may be used for enumerating a set of response objects within a budgeted cost.
    Type: Application
    Filed: July 7, 2006
    Publication date: January 10, 2008
    Applicant: Yahoo! Inc.
    Inventors: Marcus Felipe Fontoura, Vanja Josifovski, Christopher Olston, Shanmugasundaram Ravikumar, Andrew Tomkins
  • Publication number: 20080010251
    Abstract: An improved system and method is provided for searching a collection of objects that may be located in hierarchies of auxiliary information for retrieval of response objects. A framework to perform a generalization search in hierarchies may be used to generalize a search by moving up to a higher level in a hierarchy of taxonomies or to specialize a search by moving down to a lower level in the hierarchy of taxonomies. Once the system may decide to enumerate response objects at a particular level of generalization, a budgeted generalization search may be used for enumerating a set of response objects within a budgeted cost.
    Type: Application
    Filed: July 7, 2006
    Publication date: January 10, 2008
    Applicant: Yahoo! Inc.
    Inventors: Marcus Felipe Fontoura, Vanja Josifovski, Christopher Olston, Shanmugasundaram Ravikumar, Andrew Tomkins
  • Publication number: 20070106758
    Abstract: A system and method for determining a quantitative measure of qualitative usability of related Web pages. Web pages are accepted that each include at least one hyperlink referencing and proximal cues relating to distal content included in another Web page. An information goal identifying a target Web page is specified. An activation network is formed. A directed graph including nodes corresponding to the Web pages and arcs corresponding to the hyperlinks is built. A weight is assigned to each arc to represent a probability of traversal of the corresponding hyperlink based on a relatedness of keywords in the information goal to the proximal cues included in the referenced Web page. A traversal through the activation network to the node corresponding to the target Web page is evaluated as a quantitative measure of usability.
    Type: Application
    Filed: November 4, 2005
    Publication date: May 10, 2007
    Inventors: Ed Chi, Christopher Olston