Abstract: An article is extracted from a document using a decision combiner to process a plurality of reading order alternatives. The text flow analysis generates the plurality of reading order alternatives of separate body text regions.
Type:
Grant
Filed:
October 13, 2004
Date of Patent:
July 13, 2010
Assignee:
Hewlett-Packard Development Company, L.P.
Inventors:
Sherif Yacoub, Jean-Manuel Van Thong, John Burns
Abstract: Search queries are received that should be run against data. As time elapses, new queries and new data may be received. Previously run queries may be referred to as base queries and the data that was searched using the queries may be referred to as base data. The base queries and new queries may be parsed to identify queries that are similar. The similar queries are then combined into a unique query so that multiple queries that are similar are not used to search the same data. The unique queries that are generated are used to search the new data received to generate a first set of search results. The new queries received are used to search the base data to generate a second set of search results. The search results for the new queries are then determined based on the first and second set of search results. Also, the search results for the base queries are determined based on the first set results.
Type:
Grant
Filed:
October 13, 2004
Date of Patent:
June 29, 2010
Assignee:
Yahoo! Inc.
Inventors:
Patrick Loo, Sotirios Matzanas, Ming Zhang, Matthias Eichstaedt, Mitra Naeimi, Jim Fondren
Abstract: A method and interface for managing indices of ordered elements are provided. A subset of elements are selected from an index of ordered elements and displayed on a user interface device. A user can manipulate the display of different subsets of the ordered elements via a user input device. Additional indicia corresponding to the subset of index elements are also displayed on the interface. The additional indicia can include a reference to a visual indicator of index display depth and/or a reference to the location of the displayed elements within the index.
Type:
Grant
Filed:
June 30, 2005
Date of Patent:
June 8, 2010
Assignee:
Microsoft Corporation
Inventors:
George G Robertson, Mary P Czerwinski, Daniel C Robbins
Abstract: Different URLs that actually reference the same web page or other web resource are detected and that information is used to only download one instance of a web page or web resource from a web site. All web pages or web resources downloaded from a web server are compared to identify which are substantially identical. Once identical web pages or web resources with different URLs are found, the different URLs are then analyzed to identify what portions of the URL are essential for identifying a particular web page or web resource, and what portions are irrelevant. Once this has been done for each set of substantially identical web pages or web resources (also referred to as an “equivalence class” herein), these per-equivalence-class rules are generalized to trans-equivalence-class rules.
Abstract: Various embodiments of a method, apparatus and article of manufacture to manage an index are provided. A circular index, having an index size, is provided. The circular index stores information to reference data in a sequential list. Accesses to the index and the list are monitored to provide at least one performance indicator. The performance indicator represents an effect of the index on accessing items in the list. The index size is changed based on the at least one performance indicator. The monitoring of the accesses and the changing of the index size are repeated.
Type:
Grant
Filed:
October 12, 2004
Date of Patent:
November 13, 2007
Assignee:
International Business Machines Corporation