Patents by Inventor David Carmel

David Carmel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Prediction of query difficulty for a generic search engine

Publication number: 20060085399

Abstract: A query difficulty prediction unit includes a query difficulty predictor to determine the extent of overlap between query documents received from a search engine operating on an input query and sub-query documents received from the search engine operating on sub-queries of the input query. The unit generates a query difficulty prediction from the extent of overlap.

Type: Application

Filed: October 19, 2004

Publication date: April 20, 2006

Applicant: International Business Machines Corporation

Inventors: David Carmel, Lawrence Darlow, Shai Fine, Elad Yom-Tov
Automatic query refinement

Patent number: 6941297

Abstract: A method for Automatic Query Refinement, the method including extracting at least one lexical affinity from at least one document in a search result set of documents, the set resulting from the application of a first query to a document search engine, adding the lexical affinity to the first query, thereby forming a new query, and applying the new query to the document search engine, resulting in a new search result set.

Type: Grant

Filed: July 31, 2002

Date of Patent: September 6, 2005

Assignee: International Business Machines Corporation

Inventors: David Carmel, Eitan Farchi, Yael Petruschka
System, method and computer program product for performing unstructured information management and automatic text analysis, and providing multiple document views derived from different document tokenizations

Publication number: 20040243645

Abstract: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique makes use of a two-level searching technique. Also disclosed is system, method and computer program product to process document data. The method includes inputting a document and operating at least one text analysis engine that comprises a plurality of coupled annotators for tokenizing document data for identifying and annotating a particular type of semantic content. Operating the at least one text analysis engine generates a plurality of views of a document, where each of the plurality of views are derived from a different tokenization of the document.

Type: Application

Filed: May 30, 2003

Publication date: December 2, 2004

Applicant: International Business Machines Corporation

Inventors: Andrei Z. Broder, David Carmel, Arthur C. Ciccolo, David Ferrucci, Yoelle Maarek, Yosi Mass, Aya Soffer, Wlodek W. Zadrozny
System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a weighted and (WAND)

Publication number: 20040243557

Abstract: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique makes use of a two-level searching technique. A search query includes a search operator containing of a plurality of search sub-expressions each having an associated weight value. The search engine returns a document or documents having a weight value sum that exceeds a threshold weight value sum. The search operator is implemented as a Boolean predicate that functions as a Weighted AND (WAND).

Type: Application

Filed: May 30, 2003

Publication date: December 2, 2004

Applicant: International Business Machines Corporation

Inventors: Andrei Z. Broder, David Carmel, Michael Herscovici, Aya Soffer, Jason Zien
Lossy index compression

Publication number: 20040158580

Abstract: An apparatus is provided for performing a method (FIG. 2) for pruning an index of a corpus of text documents, wherein the method includes steps for ranking (50) the postings in the index and pruning (48) from the index the postings below a given level in the ranking. The pruning methods of the invention are lossy, since some document postings are removed from the full index; however, the user cannot differentiate the lossy index from the full index.

Type: Application

Filed: January 9, 2004

Publication date: August 12, 2004

Inventors: David Carmel, Doron Cohen, Ronald Fagin, Eitan Farchi, Michael Herscovici, Yoelle Maarek, Aya Soffer
Indexing and querying semi-structured documents

Publication number: 20040128615

Abstract: A method for indexing a semi-structured document, the method including arranging at least one structure entity of a semi-structured document into at least one node of a context structure tree, associating a unique context identifier with any of the structure entities, creating, for any value of any of the structure entities, a context-modified value by appending a context delimiter and the context identifier to the value, and inserting the context-modified value into a free-text tree.

Type: Application

Filed: December 27, 2002

Publication date: July 1, 2004

Applicant: International Business Machines Corporation

Inventors: David Carmel, Naama Kraus, Benjamin Mandler
Temporal link analysis of linked entities

Publication number: 20040128273

Abstract: A method for gathering and recording temporal information for a linked entity, the method including identifying a link related activity within a linked source entity, and recording a time stamp in association with the link related activity.

Type: Application

Filed: December 31, 2002

Publication date: July 1, 2004

Applicant: International Business Machines Corporation

Inventors: Einat Amitay, David Carmel, Michael Herscovici, Ronny Lempel, Aya Soffer, Uri Weiss
Automatic query refinement

Publication number: 20040034632

Abstract: A method for Automatic Query Refinement, the method including extracting at least one lexical affinity from at least one document in a search result set of documents, the set resulting from the application of a first query to a document search engine, adding the lexical affinity to the first query, thereby forming a new query, and applying the new query to the document search engine, resulting in a new search result set.

Type: Application

Filed: July 31, 2002

Publication date: February 19, 2004

Applicant: International Business Machines Corporation

Inventors: David Carmel, Eitan Farchi, Yael Petruschka
Information search using knowledge agents

Publication number: 20040030688

Abstract: A method for searching a corpus of documents, such as the World Wide Web, includes defining a knowledge domain and identifying a set of reference documents in the corpus pertinent to the domain. Upon inputting a query, the corpus is searched using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the query. The set of reference documents is updated with the found documents that are most relevant to the domain. The updated set is used in searching the corpus for information in the domain relevant to subsequent queries.

Type: Application

Filed: August 1, 2003

Publication date: February 12, 2004

Applicant: International Business Machines Corporation

Inventors: Yariv Aridor, David Carmel, Michael Herscovici, Yoelle Maarek-Smadja, Aya Soffer, Ronny Lempel
Information search using knowledge agents

Patent number: 6636848

Abstract: A method for searching a corpus of documents, such as the World Wide Web, includes defining a knowledge domain and identifying a set of reference documents in the corpus pertinent to the domain. Upon inputting a query, the corpus is searched using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the query. The set of reference documents is updated with the found documents that are most relevant to the domain. The updated set is used in searching the corpus for information in the domain relevant to subsequent queries.

Type: Grant

Filed: July 6, 2000

Date of Patent: October 21, 2003

Assignee: International Business Machines Corporation

Inventors: Yariv Aridor, David Carmel, Michael Herscovici, Yoelle Maarek-Smadja, Aya Soffer, Ronny Lempel

prev 1 2 3 4 5