Patents by Inventor David Carmel

David Carmel has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20060085399
    Abstract: A query difficulty prediction unit includes a query difficulty predictor to determine the extent of overlap between query documents received from a search engine operating on an input query and sub-query documents received from the search engine operating on sub-queries of the input query. The unit generates a query difficulty prediction from the extent of overlap.
    Type: Application
    Filed: October 19, 2004
    Publication date: April 20, 2006
    Applicant: International Business Machines Corporation
    Inventors: David Carmel, Lawrence Darlow, Shai Fine, Elad Yom-Tov
  • Patent number: 6941297
    Abstract: A method for Automatic Query Refinement, the method including extracting at least one lexical affinity from at least one document in a search result set of documents, the set resulting from the application of a first query to a document search engine, adding the lexical affinity to the first query, thereby forming a new query, and applying the new query to the document search engine, resulting in a new search result set.
    Type: Grant
    Filed: July 31, 2002
    Date of Patent: September 6, 2005
    Assignee: International Business Machines Corporation
    Inventors: David Carmel, Eitan Farchi, Yael Petruschka
  • Publication number: 20040243645
    Abstract: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique is a two-level technique. Also disclosed are a system, method and computer program product to process document data. The method includes inputting a document and operating at least one text analysis engine that comprises a plurality of coupled annotators for tokenizing document data and for identifying and annotating a particular type of semantic content. Operating the at least one text analysis engine generates a plurality of views of a document, where each of the plurality of views is derived from a different tokenization of the document.
    Type: Application
    Filed: May 30, 2003
    Publication date: December 2, 2004
    Applicant: International Business Machines Corporation
    Inventors: Andrei Z. Broder, David Carmel, Arthur C. Ciccolo, David Ferrucci, Yoelle Maarek, Yosi Mass, Aya Soffer, Wlodek W. Zadrozny
  • Publication number: 20040243557
    Abstract: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique is a two-level technique. A search query includes a search operator containing a plurality of search sub-expressions, each having an associated weight value. The search engine returns a document or documents having a weight value sum that exceeds a threshold weight value sum. The search operator is implemented as a Boolean predicate that functions as a Weighted AND (WAND).
    Type: Application
    Filed: May 30, 2003
    Publication date: December 2, 2004
    Applicant: International Business Machines Corporation
    Inventors: Andrei Z. Broder, David Carmel, Michael Herscovici, Aya Soffer, Jason Zien
  • Publication number: 20040158580
    Abstract: An apparatus is provided for performing a method (FIG. 2) for pruning an index of a corpus of text documents, wherein the method includes steps for ranking (50) the postings in the index and pruning (48) from the index the postings below a given level in the ranking. The pruning methods of the invention are lossy, since some document postings are removed from the full index; however, the user cannot differentiate the lossy index from the full index.
    Type: Application
    Filed: January 9, 2004
    Publication date: August 12, 2004
    Inventors: David Carmel, Doron Cohen, Ronald Fagin, Eitan Farchi, Michael Herscovici, Yoelle Maarek, Aya Soffer
  • Publication number: 20040128615
    Abstract: A method for indexing a semi-structured document, the method including arranging at least one structure entity of a semi-structured document into at least one node of a context structure tree, associating a unique context identifier with any of the structure entities, creating, for any value of any of the structure entities, a context-modified value by appending a context delimiter and the context identifier to the value, and inserting the context-modified value into a free-text tree.
    Type: Application
    Filed: December 27, 2002
    Publication date: July 1, 2004
    Applicant: International Business Machines Corporation
    Inventors: David Carmel, Naama Kraus, Benjamin Mandler
  • Publication number: 20040128273
    Abstract: A method for gathering and recording temporal information for a linked entity, the method including identifying a link-related activity within a linked source entity, and recording a time stamp in association with the link-related activity.
    Type: Application
    Filed: December 31, 2002
    Publication date: July 1, 2004
    Applicant: International Business Machines Corporation
    Inventors: Einat Amitay, David Carmel, Michael Herscovici, Ronny Lempel, Aya Soffer, Uri Weiss
  • Publication number: 20040034632
    Abstract: A method for Automatic Query Refinement, the method including extracting at least one lexical affinity from at least one document in a search result set of documents, the set resulting from the application of a first query to a document search engine, adding the lexical affinity to the first query, thereby forming a new query, and applying the new query to the document search engine, resulting in a new search result set.
    Type: Application
    Filed: July 31, 2002
    Publication date: February 19, 2004
    Applicant: International Business Machines Corporation
    Inventors: David Carmel, Eitan Farchi, Yael Petruschka
  • Publication number: 20040030688
    Abstract: A method for searching a corpus of documents, such as the World Wide Web, includes defining a knowledge domain and identifying a set of reference documents in the corpus pertinent to the domain. Upon inputting a query, the corpus is searched using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the query. The set of reference documents is updated with the found documents that are most relevant to the domain. The updated set is used in searching the corpus for information in the domain relevant to subsequent queries.
    Type: Application
    Filed: August 1, 2003
    Publication date: February 12, 2004
    Applicant: International Business Machines Corporation
    Inventors: Yariv Aridor, David Carmel, Michael Herscovici, Yoelle Maarek-Smadja, Aya Soffer, Ronny Lempel
  • Patent number: 6636848
    Abstract: A method for searching a corpus of documents, such as the World Wide Web, includes defining a knowledge domain and identifying a set of reference documents in the corpus pertinent to the domain. Upon inputting a query, the corpus is searched using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the query. The set of reference documents is updated with the found documents that are most relevant to the domain. The updated set is used in searching the corpus for information in the domain relevant to subsequent queries.
    Type: Grant
    Filed: July 6, 2000
    Date of Patent: October 21, 2003
    Assignee: International Business Machines Corporation
    Inventors: Yariv Aridor, David Carmel, Michael Herscovici, Yoelle Maarek-Smadja, Aya Soffer, Ronny Lempel
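
The Weighted AND (WAND) operator summarized in the abstract of publication 20040243557 can be illustrated with a minimal Python sketch. This is not the patented implementation. The function names `wand` and `matches`, the use of simple term membership as the sub-expression test, and the inclusive comparison (`>=`) against the threshold are all assumptions made for illustration; the abstract only states that the weight sum must exceed a threshold.

```python
from typing import Dict, Iterable, Set, Tuple


def wand(sub_expressions: Iterable[Tuple[bool, float]], threshold: float) -> bool:
    """Hypothetical WAND predicate.

    Each sub-expression is a (satisfied, weight) pair. The predicate is true
    when the summed weight of the satisfied sub-expressions reaches the
    threshold. The inclusive comparison is an assumption of this sketch.
    """
    total = sum(weight for satisfied, weight in sub_expressions if satisfied)
    return total >= threshold


def matches(document_terms: Set[str], weighted_terms: Dict[str, float], threshold: float) -> bool:
    """Hypothetical helper: treat 'term occurs in the document' as the sub-expression."""
    return wand(
        ((term in document_terms, weight) for term, weight in weighted_terms.items()),
        threshold,
    )


# Illustrative data only; these terms and weights do not come from the patent.
weights = {"query": 1.0, "index": 1.0, "prune": 1.0}
doc = {"query", "index"}

# Threshold equal to the sum of all weights behaves like a plain AND.
and_like = matches(doc, weights, threshold=3.0)
# Threshold equal to the smallest weight behaves like a plain OR.
or_like = matches(doc, weights, threshold=1.0)
```

With unit weights, raising the threshold from the smallest weight toward the sum of all weights moves the predicate from OR-like to AND-like behavior, which is one way to read the "Weighted AND" name.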