Patents by Inventor Runping Qi
Runping Qi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 8655888
Abstract: Provided are a method, system, and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values.
Type: Grant
Filed: December 22, 2011
Date of Patent: February 18, 2014
Assignee: International Business Machines Corporation
Inventors: Marcus F. Fontoura, Ronny Lempel, Runping Qi, Jason Y. Zien
-
Patent number: 8346759
Abstract: Provided are a system and article of manufacture for searching documents for ranges of numeric values. A number of posting lists is generated, wherein each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored. A query on a query range of values within the set of values is received and a determination is made of a minimum number of posting lists associated with consecutive values that together include the query range of values. The determined posting lists are merged.
Type: Grant
Filed: August 6, 2008
Date of Patent: January 1, 2013
Assignee: International Business Machines Corporation
Inventors: Marcus Felipe Fontoura, Ronny Lempel, Runping Qi, Jason Yeong Zien
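
Illustrative sketch (not taken from the patent): one way to picture range posting lists is to keep a list for every power-of-two-aligned range of integer values, then answer a query by merging a small set of those lists that exactly tile the query range. The dyadic layout, the single value per document, and the names `build_posting_lists`, `cover`, and `range_query` are all assumptions made for this example; the abstract does not say how ranges are chosen.

```python
from collections import defaultdict


def build_posting_lists(doc_values, bits):
    """Map each dyadic range (level, index) to the sorted doc ids inside it.

    Assumption: every document carries one integer value in [0, 2**bits).
    A range (level, index) covers values index * 2**level up to
    (index + 1) * 2**level - 1.
    """
    lists = defaultdict(set)
    for doc_id, value in doc_values.items():
        for level in range(bits + 1):
            lists[(level, value >> level)].add(doc_id)
    return {key: sorted(ids) for key, ids in lists.items()}


def cover(lo, hi):
    """Return dyadic ranges that together tile the inclusive range [lo, hi]."""
    ranges = []
    level = 0
    hi += 1  # work with a half-open interval [lo, hi)
    while lo < hi:
        if lo & 1:  # left edge is not aligned at the next level: emit it
            ranges.append((level, lo))
            lo += 1
        if hi & 1:  # right edge is not aligned at the next level: emit it
            hi -= 1
            ranges.append((level, hi))
        lo >>= 1
        hi >>= 1
        level += 1
    return ranges


def range_query(lists, lo, hi):
    """Merge (union) the posting lists selected by cover(lo, hi)."""
    result = set()
    for key in cover(lo, hi):
        result.update(lists.get(key, []))
    return sorted(result)


doc_values = {"d1": 3, "d2": 5, "d3": 9, "d4": 12}  # hypothetical data
lists = build_posting_lists(doc_values, bits=4)
matches = range_query(lists, 3, 9)
```

With these values, the query for 3 through 9 reads three lists (covering 3, 4 to 7, and 8 to 9) instead of seven single-value lists.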
-
Patent number: 8296304
Abstract: Disclosed is a method, system, and program for handling redirects in documents. At least one equivalence class that includes documents that are connected through a redirect. Cycles for each equivalence class are detected, wherein documents in a cycle are marked so that they are not indexed. Incomplete chains for each equivalence class are detected, wherein documents in an incomplete chain are marked so that they are not indexed. A representative for each equivalence class is selected.
Type: Grant
Filed: January 26, 2004
Date of Patent: October 23, 2012
Assignee: International Business Machines Corporation
Inventors: Marcus F. Fontoura, Andreas Neumann, Runping Qi, Eugene J. Shekita
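
Illustrative sketch (not taken from the patent): the redirect handling described above can be pictured as following each document's redirect chain and sorting the outcome into three cases. The function name `classify_redirects`, the dictionary layout, and the choice of the final redirect target as the representative are assumptions for this example; the abstract does not state how the representative is selected.

```python
def classify_redirects(redirects, fetched):
    """Follow each redirect chain and decide what to do with its documents.

    redirects: url -> url it redirects to (hypothetical input format)
    fetched:   set of urls whose content was actually retrieved
    Returns (representative, skipped):
      representative: url -> final target chosen to stand for its class
      skipped:        url -> "cycle" or "incomplete" (not to be indexed)
    """
    representative = {}
    skipped = {}
    for start in fetched:
        seen = []
        url = start
        # Walk the chain until it ends or revisits a url.
        while url in redirects and url not in seen:
            seen.append(url)
            url = redirects[url]
        if url in seen:
            skipped[start] = "cycle"        # chain is, or leads into, a loop
        elif url not in fetched:
            skipped[start] = "incomplete"   # chain ends at an unfetched url
        else:
            representative[start] = url     # assumption: final target wins
    return representative, skipped


redirects = {"a": "b", "b": "c", "x": "y", "y": "x", "p": "q"}
fetched = {"a", "b", "c", "x", "y", "p"}
representative, skipped = classify_redirects(redirects, fetched)
```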
-
Patent number: 8271498
Abstract: Provided are a method, system, and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values.
Type: Grant
Filed: August 12, 2008
Date of Patent: September 18, 2012
Assignee: International Business Machines Corporation
Inventors: Marcus Felipe Fontoura, Ronny Lempel, Runping Qi, Jason Yeong Zien
-
Publication number: 20120096016
Abstract: Provided are a method, system, and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values.
Type: Application
Filed: December 22, 2011
Publication date: April 19, 2012
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Marcus Felipe Fontoura, Ronny Lempel, Runping Qi, Jason Yeong Zien
-
Patent number: 8131726
Abstract: A method for indexing a plurality of documents, that includes a plurality of duplicate documents, first identifies one or more duplicate groups of documents from among the plurality of documents. Then, one index of content for the duplicate group is created instead of indexing the content from every document within the duplicate group. However, in contrast to the content index, an index of metadata for each of the documents in the duplicate group is created. Thus the content of each duplicate group is indexed only once, while a search engine using such indexing techniques retains the capability to answer queries as if the duplicated content was indexed for each document of the group.
Type: Grant
Filed: January 12, 2005
Date of Patent: March 6, 2012
Assignee: International Business Machines Corporation
Inventors: Andrei Z. Broder, Marcus Felipe Fontoura, Michael Herscovici, Ronny Lempel, John Ai McPherson, Jr., Andreas Neumann, Runping Qi, Eugene Jon Shekita
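
Illustrative sketch (not taken from the patent): the idea of indexing duplicated content once while still indexing metadata per document can be shown with two small in-memory indexes. Detecting duplicates by an exact content hash, the whitespace tokenizer, and the names `build_indexes` and `search` are assumptions for this example; the abstract does not specify how duplicate groups are identified.

```python
import hashlib
from collections import defaultdict


def build_indexes(docs):
    """docs: doc_id -> (content, metadata dict). Hypothetical input format.

    Returns (content_index, meta_index, groups):
      content_index: token -> keys of duplicate groups containing the token
      meta_index:    (field, value) -> doc ids carrying that metadata
      groups:        group key -> doc ids sharing identical content
    """
    groups = defaultdict(list)
    content_index = defaultdict(set)
    meta_index = defaultdict(set)
    for doc_id, (content, meta) in docs.items():
        # Assumption: duplicates are byte-identical, so a hash identifies them.
        key = hashlib.sha256(content.encode("utf-8")).hexdigest()
        if key not in groups:
            # Content is tokenized and indexed only for the first copy.
            for token in content.lower().split():
                content_index[token].add(key)
        groups[key].append(doc_id)
        # Metadata is indexed for every document, duplicate or not.
        for field, value in meta.items():
            meta_index[(field, value)].add(doc_id)
    return content_index, meta_index, groups


def search(content_index, meta_index, groups, token, meta=None):
    """Expand matching groups to all member documents, then filter on metadata."""
    hits = {d for key in content_index.get(token, ()) for d in groups[key]}
    if meta is not None:
        hits &= meta_index.get(meta, set())
    return sorted(hits)


docs = {
    1: ("hello world", {"host": "a.example"}),
    2: ("hello world", {"host": "b.example"}),
    3: ("other text", {"host": "a.example"}),
}
content_index, meta_index, groups = build_indexes(docs)
```

A content query such as `search(content_index, meta_index, groups, "hello")` still returns both copies, even though the text was indexed once.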
-
Patent number: 8032532
Abstract: A method and system for querying multifaceted information. An inverted index is constructed to include unique indexed tokens associated with posting lists of one or more documents. An indexed token is either a facet token included in a document as an annotation or a path prefix of the facet token. The annotation indicates a path within a tree structure representing a facet that includes the document. The tree structure includes nodes representing categories of documents. A query is received that includes constraints on documents. The constraints are associated with indexed tokens and corresponding posting lists. An execution of the query includes identifying the corresponding posting lists by utilizing the constraints and the inverted index and intersecting the posting lists to obtain a query result.
Type: Grant
Filed: May 21, 2008
Date of Patent: October 4, 2011
Assignee: International Business Machines Corporation
Inventors: Andrei Z. Broder, Nadav Eiron, Felipe Marcus Fontoura, Ronny Lempel, Ning Li, John Ai McPherson, Jr., Andreas Neumann, Shila Ofek-Koifman, Runping Qi, Eugene J. Shekita
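
Illustrative sketch (not taken from the patent): indexing every path prefix of a facet annotation lets a constraint on a category match all documents filed anywhere beneath it. The slash-separated path syntax, the sample facets, and the names `path_prefixes`, `build_index`, and `query` are assumptions for this example.

```python
from collections import defaultdict


def path_prefixes(path):
    """'location/usa/california' -> ['location', 'location/usa', 'location/usa/california']"""
    parts = path.split("/")
    return ["/".join(parts[:i]) for i in range(1, len(parts) + 1)]


def build_index(annotations):
    """annotations: doc_id -> list of facet paths (hypothetical input format).

    Each facet token and each of its path prefixes becomes an indexed token
    whose posting list is the set of documents annotated at or below it.
    """
    index = defaultdict(set)
    for doc_id, paths in annotations.items():
        for path in paths:
            for token in path_prefixes(path):
                index[token].add(doc_id)
    return index


def query(index, constraints):
    """Look up one posting list per constraint and intersect them."""
    postings = [index.get(c, set()) for c in constraints]
    return sorted(set.intersection(*postings)) if postings else []


annotations = {
    "d1": ["location/usa/california", "topic/sports"],
    "d2": ["location/usa/texas", "topic/sports"],
    "d3": ["location/france", "topic/food"],
}
index = build_index(annotations)
usa_sports = query(index, ["location/usa", "topic/sports"])
```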
-
Patent number: 7783626
Abstract: Provided is a technique for building an index. A new index_{i+1} is built and an anchor text table_{i+1} and a duplicates table_{i+1} are output using a store_i, a delta store, and previously generated global analysis computations_i, wherein the previously generated global analysis computations_i include an anchor text table_i, a rank table_i, and a duplicates table_i. New global analysis computations_{i+1} are generated using the anchor text table_{i+1}, the duplicates table_{i+1}, and the previously generated global analysis computations_i.
Type: Grant
Filed: August 17, 2007
Date of Patent: August 24, 2010
Assignee: International Business Machines Corporation
Inventors: Marcus Felipe Fontoura, Reiner Kraft, Tony Kai-Chi Leung, John A. McPherson, Jr., Andreas Neumann, Runping Qi, Sridhar Rajagopalan, Eugene J. Shekita, Jason Yeong Zien
-
Patent number: 7552109
Abstract: A collaborative focused crawler crawls documents on a network locating documents that match multiple focus topics. The collaborative crawler comprises a fetcher and a focus engine. The fetcher prioritizes which documents to crawl based on a set of rules, obtains documents from the network, and outputs crawled documents to the focus engine. The focus engine determines whether a fetched document is relevant to any of the multiple focus topics. The focus engine determines whether fetched documents are disallowed. If a fetched document is disallowed, the present system may place the URL for that web document in a blacklist, a list of URLs that may not be crawled. URLs may be disallowed if they match a disallowed topic or if they fail a set of rules designed for a web space focus, for example, domain rules, IP address rules, and prefix rules.
Type: Grant
Filed: October 15, 2003
Date of Patent: June 23, 2009
Assignee: International Business Machines Corporation
Inventors: Srinivasan Balasubramanian, Laurent Chavet, Runping Qi
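
Illustrative sketch (not taken from the patent): the web-space rules mentioned above can be pictured as a filter that checks each fetched URL against domain and prefix rules and blacklists the failures. Topic matching and IP address rules are left out, and the rule format and the names `is_disallowed` and `filter_fetched` are assumptions for this example.

```python
from urllib.parse import urlparse


def is_disallowed(url, allowed_domains, blocked_prefixes):
    """Apply a domain rule and a prefix rule to one URL (both hypothetical)."""
    host = urlparse(url).hostname or ""
    in_domain = any(host == d or host.endswith("." + d) for d in allowed_domains)
    blocked = any(url.startswith(p) for p in blocked_prefixes)
    return (not in_domain) or blocked


def filter_fetched(urls, allowed_domains, blocked_prefixes, blacklist):
    """Keep allowed URLs; add disallowed ones to the blacklist set in place."""
    kept = []
    for url in urls:
        if url in blacklist:
            continue  # already known to be off limits
        if is_disallowed(url, allowed_domains, blocked_prefixes):
            blacklist.add(url)
        else:
            kept.append(url)
    return kept


blacklist = set()
kept = filter_fetched(
    [
        "http://docs.example.com/guide",
        "http://example.com/private/report",
        "http://other.test/page",
    ],
    allowed_domains=["example.com"],
    blocked_prefixes=["http://example.com/private/"],
    blacklist=blacklist,
)
```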
-
Patent number: 7496568
Abstract: A method for querying multifaceted information. An inverted index is constructed to include unique indexed tokens associated with posting lists of one or more documents. An indexed token is either a facet token included in a document as an annotation or a path prefix of the facet token. The annotation indicates a path within a tree structure representing a facet that includes the document. The tree structure includes nodes representing categories of documents. Constructing the inverted index includes generating a full path token and an associated full path token posting list. A query is received that includes constraints on documents. The constraints are associated with indexed tokens and corresponding posting lists. An execution of the query includes identifying the corresponding posting lists by utilizing the constraints and the inverted index and intersecting the posting lists to obtain a query result.
Type: Grant
Filed: November 30, 2006
Date of Patent: February 24, 2009
Assignee: International Business Machines Corporation
Inventors: Andrei Z. Broder, Nadav Eiron, Felipe Marcus Fontoura, Ronny Lempel, Ning Li, John Ai McPherson, Jr., Andreas Neumann, Shila Ofek-Koifman, Runping Qi, Eugene J. Shekita
-
Publication number: 20080301130
Abstract: Provided are a method, system, and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values.
Type: Application
Filed: August 12, 2008
Publication date: December 4, 2008
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Marcus Felipe Fontoura, Ronny Lempel, Runping Qi, Jason Yeong Zien
-
Patent number: 7461064
Abstract: Provided are a method, system, and program for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents having values within the range of consecutive values associated with the posting list. Each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored.
Type: Grant
Filed: September 24, 2004
Date of Patent: December 2, 2008
Assignee: International Business Machines Corporation
Inventors: Marcus Felipe Fontoura, Ronny Lempel, Runping Qi, Jason Yeong Zien
-
Publication number: 20080294634
Abstract: Provided are a system and article of manufacture for searching documents for ranges of numeric values. Document identifiers for documents include at least one value that is a member of a set of values. A number of posting lists is generated, wherein each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents including at least one value within the range of consecutive values associated with the posting list, and wherein each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored, wherein the posting lists are used to process a query on a range of values within the set of values. A query on a query range of values within the set of values is received and a determination is made of a minimum number of posting lists associated with consecutive values that together include the query range of values.
Type: Application
Filed: August 6, 2008
Publication date: November 27, 2008
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Marcus Felipe Fontoura, Ronny Lempel, Runping Qi, Jason Yeong Zien
-
Publication number: 20080222117
Abstract: A method and system for querying multifaceted information. An inverted index is constructed to include unique indexed tokens associated with posting lists of one or more documents. An indexed token is either a facet token included in a document as an annotation or a path prefix of the facet token. The annotation indicates a path within a tree structure representing a facet that includes the document. The tree structure includes nodes representing categories of documents. A query is received that includes constraints on documents. The constraints are associated with indexed tokens and corresponding posting lists. An execution of the query includes identifying the corresponding posting lists by utilizing the constraints and the inverted index and intersecting the posting lists to obtain a query result.
Type: Application
Filed: May 21, 2008
Publication date: September 11, 2008
Inventors: Andrei Z. Broder, Nadav Eiron, Felipe Marcus Fontoura, Ronny Lempel, Ning Li, John Ai McPherson, Andreas Neumann, Shila Ofek-Koifman, Runping Qi, Eugene J. Shekita
-
Publication number: 20080133473
Abstract: A method for querying multifaceted information. An inverted index is constructed to include unique indexed tokens associated with posting lists of one or more documents. An indexed token is either a facet token included in a document as an annotation or a path prefix of the facet token. The annotation indicates a path within a tree structure representing a facet that includes the document. The tree structure includes nodes representing categories of documents. Constructing the inverted index includes generating a full path token and an associated full path token posting list. A query is received that includes constraints on documents. The constraints are associated with indexed tokens and corresponding posting lists. An execution of the query includes identifying the corresponding posting lists by utilizing the constraints and the inverted index and intersecting the posting lists to obtain a query result.
Type: Application
Filed: November 30, 2006
Publication date: June 5, 2008
Inventors: Andrei Z. Broder, Nadav Eiron, Felipe Marcus Fontoura, Ronny Lempel, Ning Li, John Ai McPherson, Andreas Neumann, Shila Ofek-Koifman, Runping Qi, Eugene J. Shekita
-
Publication number: 20070282829
Abstract: Disclosed is a technique for building an index. A new index_{i+1} is built and an anchor text table_{i+1} and a duplicates table_{i+1} are output using a store_i, a delta store, and previously generated global analysis computations_i, wherein the previously generated global analysis computations_i include an anchor text table_i, a rank table_i, and a duplicates table_i. New global analysis computations_{i+1} are generated using the anchor text table_{i+1}, the duplicates table_{i+1}, and the previously generated global analysis computations_i.
Type: Application
Filed: August 17, 2007
Publication date: December 6, 2007
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Marcus Fontoura, Reiner Kraft, Tony Leung, John McPherson, Andreas Neumann, Runping Qi, Sridhar Rajagopalan, Eugene Shekita, Jason Zien
-
Patent number: 7293005
Abstract: Disclosed is a technique for building an index in which global analysis computations and index creation are pipelined, wherein the global analysis computations share intermediate results.
Type: Grant
Filed: January 26, 2004
Date of Patent: November 6, 2007
Assignee: International Business Machines Corporation
Inventors: Marcus F. Fontoura, Reiner Kraft, Tony K. Leung, John Ai McPherson, Jr., Andreas Neumann, Runping Qi, Sridhar Rajagopalan, Eugene J. Shekita, Jason Yeong Zien
-
Patent number: 7103589
Abstract: A system and method for searching, accessing, and updating database records on a plurality of geographically dispersed databases is provided. Update databases consisting of a plurality of update records having at least one field capable of being set to at least one particular value are used to update records in target databases. When a database search is conducted on one or more target databases, a search is also conducted on update databases associated with the target databases. The results of the searches are processed. If the at least one field in an update record is set to a predefined value, then all returned records equivalent to the update record are discarded. Otherwise, the update record is added to the search result.
Type: Grant
Filed: March 22, 2000
Date of Patent: September 5, 2006
Assignee: Metro One Telecommunications, Inc.
Inventors: Michael A. Kepler, Christopher A. Huey, Runping Qi, Christopher A. Wake
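
Illustrative sketch (not taken from the patent): the search-time merge described above can be pictured as running the same search on the target records and the update records, then letting a flagged update record suppress its equivalents. Treating records with the same `key` as equivalent, using a `status` field with the value `"deleted"` as the predefined value, and the name `merged_search` are assumptions for this example.

```python
def merged_search(target, updates, predicate):
    """Search target and update records together and reconcile the results.

    target, updates: lists of dict records (hypothetical format)
    predicate:       function deciding whether a record matches the search
    """
    results = [r for r in target if predicate(r)]
    for update in updates:
        if not predicate(update):
            continue
        if update["status"] == "deleted":
            # Flagged update: discard every returned record equivalent to it.
            results = [r for r in results if r["key"] != update["key"]]
        else:
            # Otherwise the update record joins the search result.
            results.append(update)
    return results


target = [
    {"key": 1, "name": "Ann", "status": "active"},
    {"key": 2, "name": "Bob", "status": "active"},
]
updates = [
    {"key": 2, "name": "Bob", "status": "deleted"},
    {"key": 3, "name": "Bo", "status": "active"},
]
found = merged_search(target, updates, lambda r: r["name"].startswith("B"))
```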
-
Publication number: 20060155739
Abstract: A method for indexing a plurality of documents, that includes a plurality of duplicate documents, first identifies one or more duplicate groups of documents from among the plurality of documents. Then, one index of content for the duplicate group is created instead of indexing the content from every document within the duplicate group. However, in contrast to the content index, an index of metadata for each of the documents in the duplicate group is created. Thus the content of each duplicate group is indexed only once, while a search engine using such indexing techniques retains the capability to answer queries as if the duplicated content was indexed for each document of the group.
Type: Application
Filed: January 12, 2005
Publication date: July 13, 2006
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Andrei Broder, Marcus Fontoura, Michael Herscovici, Ronny Lempel, John McPherson, Andreas Neumann, Runping Qi, Eugene Shekita
-
Publication number: 20060074962
Abstract: Provided are a method, system, and program for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents having values within the range of consecutive values associated with the posting list. Each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored.
Type: Application
Filed: September 24, 2004
Publication date: April 6, 2006
Inventors: Marcus Fontoura, Ronny Lempel, Runping Qi, Jason Zien