Patents by Inventor Jason Zien

Jason Zien has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20070112763
    Abstract: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique makes use of a two-level searching technique. A search query includes a search operator containing of a plurality of search sub-expressions each having an associated weight value. The search engine returns a document or documents having a weight value sum that exceeds a threshold weight value sum. The search operator is implemented as a Boolean predicate that functions as a Weighted AND (WAND).
    Type: Application
    Filed: November 30, 2006
    Publication date: May 17, 2007
    Inventors: Andrei Broder, David Carmel, Michael Herscovici, Aya Soffer, Jason Zien
  • Patent number: 7146361
    Abstract: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique makes use of a two-level searching technique. A search query includes a search operator containing of a plurality of search sub-expressions each having an associated weight value. The search engine returns a document or documents having a weight value sum that exceeds a threshold weight value sum. The search operator is implemented as a Boolean predicate that functions as a Weighted AND (WAND).
    Type: Grant
    Filed: May 30, 2003
    Date of Patent: December 5, 2006
    Assignee: International Business Machines Corporation
    Inventors: Andrei Z Broder, David Carmel, Michael Herscovici, Aya Soffer, Jason Zien
  • Publication number: 20060259482
    Abstract: Provided are techniques for computer-based electronic Information Retrieval (IR). An extended inverted index structure by generating one or more lexical affinities (LA), wherein each of the one or more lexical affinities comprises two or more search items found in proximity in one or more documents in a pool of documents, and generating a posting list for each of the one or more lexical affinities, wherein each posting list is associated with a specific lexical affinity and contains document identifying information for each of the one or more documents in the pool that contains the specific lexical affinity and a location within the document where the specific lexical affinity occurs.
    Type: Application
    Filed: January 18, 2006
    Publication date: November 16, 2006
    Inventors: Peter Altevogt, Marcus Fontoura, Silvio Wiedrich, Jason Zien
  • Publication number: 20060074962
    Abstract: Provided are a method, system, and program for searching documents for ranges of numeric values. Document identifiers for documents are accessed, wherein the documents include at least one value that is a member of a set of values. A number of posting lists are generated. Each posting list is associated with a range of consecutive values within the set of values and includes document identifiers for documents having values within the range of consecutive values associated with the posting list. Each document identifier is associated with one value in the set of values included in the document identified by the document identifier. The generated posting lists are stored.
    Type: Application
    Filed: September 24, 2004
    Publication date: April 6, 2006
    Inventors: Marcus Fontoura, Ronny Lempel, Runping Qi, Jason Zien
  • Publication number: 20050165838
    Abstract: Disclosed is a technique for indexing data. A token is received. It is determined whether a data field associated with the token is a fixed width. When the data field is a fixed width, the token is designated as one for which fixed width sort is to be performed. When the data field is a variable length, the token is designated as one for which a variable width sort is to be performed.
    Type: Application
    Filed: January 26, 2004
    Publication date: July 28, 2005
    Inventors: Marcus Fontoura, Andreas Neumann, Sridhar Rajagopalan, Eugene Shekita, Jason Zien
  • Publication number: 20050165718
    Abstract: Disclosed is a technique for building an index in which global analysis computations and index creation are pipelined, wherein the global analysis computations share intermediate results.
    Type: Application
    Filed: January 26, 2004
    Publication date: July 28, 2005
    Inventors: Marcus Fontoura, Reiner Kraft, Tony Leung, John McPherson, Andreas Neumann, Runping Qi, Sridhar Rajagopalan, Eugene Shekita, Jason Zien
  • Publication number: 20050060290
    Abstract: A query is received and parsed to generate a set of query terms. Statistical information is identified regarding each of the query terms and different permutations of the query terms. Additionally, lexical affinities associated with the permutations of query terms are identified. Next, the query is classified into a query category and a set of ranking parameters and routing information (associated with the query category) are identified. The query is then issued to a search engine by applying the identified ranking parameters and routing information, whereupon the search engine executes the query and forwards search results that can be accessed by an application using an API (e.g., the results can be viewed via a browser).
    Type: Application
    Filed: September 15, 2003
    Publication date: March 17, 2005
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Michael Herscovici, Reiner Kraft, Ronny Lempel, Jason Zien
  • Publication number: 20040243557
    Abstract: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique makes use of a two-level searching technique. A search query includes a search operator containing of a plurality of search sub-expressions each having an associated weight value. The search engine returns a document or documents having a weight value sum that exceeds a threshold weight value sum. The search operator is implemented as a Boolean predicate that functions as a Weighted AND (WAND).
    Type: Application
    Filed: May 30, 2003
    Publication date: December 2, 2004
    Applicant: International Business Machines Corporation
    Inventors: Andrei Z. Broder, David Carmel, Michael Herscovici, Aya Soffer, Jason Zien