Patents Assigned to Clearwell Systems, Inc.
  • Publication number: 20120296891
    Abstract: Techniques are provided for automatic sampling evaluation. An automatic sampling evaluation system enables users to evaluate convergence of one or more search processes. For example, given a set of searches that were validated by human review, a system can implement a retrieval process that samples one or more non-retrieved collections. Each individual document's similarity in the one or more non-retrieved collections is automatically evaluated to other documents in any retrieved sets. Given a goal of achieving a high recall, documents with high similarity can then be analyzed for additional noun phrases that may be used for a next iteration of a search. Convergence can be expected if the information gain in the new feedback loop is less than previous iterations, and if the additional documents identified are below a certain threshold document count.
    Type: Application
    Filed: May 18, 2011
    Publication date: November 22, 2012
    Applicant: Clearwell Systems, Inc.
    Inventor: Venkat Rangan
  • Publication number: 20120209853
    Abstract: A set of trigrams can be generated for each document in a plurality of documents processed by an e-discovery system. Each trigram in the set of trigrams for a given document is a sequence of three terms in the given document. A set of trigrams for each similar document is then determined based on the set of trigrams for the original document. To facilitate identification of the similar documents, a full text index is then generated for the plurality of documents and the set of trigrams for each document are indexed into the full text index, as individual terms. Queries can be generated into the full text index based on trigrams of a document to determine other similar or near-duplicate documents. After a set of potentially similar documents are identified, a separate distance criteria can be applied to evaluate the level of similarity between the two documents in an efficient way.
    Type: Application
    Filed: February 16, 2011
    Publication date: August 16, 2012
    Applicant: Clearwell Systems, inc.
    Inventors: Malay Desai, Medha Shewale, Venkat Rangan
  • Publication number: 20120209847
    Abstract: In various embodiments, a semantic space associated with a corpus of electronically stored information (ESI) may be created and used for concept searches. Documents (and any other objects in the ESI, in general) may be represented as vectors in the semantic space. Vectors may correspond to identifiers, such as, for example, indexed terms. The semantic space for a corpus of ESI can be used in information filtering, information retrieval, indexing, and relevancy rankings.
    Type: Application
    Filed: February 16, 2011
    Publication date: August 16, 2012
    Applicant: Clearwell Systems, Inc.
    Inventor: Venkat Rangan
  • Publication number: 20120158728
    Abstract: The invention provides for techniques to process and produce email documents. The techniques provide for organizing a first plurality of email documents into a plurality of document groups, reviewing a document group from the plurality of document groups, and associating a review content with the document group. The techniques provide for ways to propagate the review content to one or more email documents associated with the document group and producing a second plurality of email documents. The techniques provide for annotating one or more email documents in accordance with the review content. Depending on the embodiment, review content may include text, graphics, audio, tag, and multimedia information. Produced documents can be searched and browsed in accordance with information in the review content. Email documents can be grouped by information in meta information and/or header information associated with the email documents into various groups, including threads or conversations, for example.
    Type: Application
    Filed: February 27, 2012
    Publication date: June 21, 2012
    Applicant: CLEARWELL SYSTEMS, INC.
    Inventors: Mohan Kumar, Gary Lehrman, Hari Krishna Dara
  • Patent number: 8171393
    Abstract: The invention provides techniques for efficiently organizing and reviewing electronic documents to be produced in the course of a discovery process. The technique provides for marking the master or pivot document with review information, and identifying a plurality of duplicate documents related to the master or pivot document. The technique provides for reviewing a master or pivot document and propagating the review information to a set of related documents. The technique provides for producing a plurality of electronic documents where each of the electronic documents is marked up in accordance with the review information. The method provides for organizing the plurality of electronic documents so it can be presented and searched in an efficient manner.
    Type: Grant
    Filed: April 16, 2008
    Date of Patent: May 1, 2012
    Assignee: Clearwell Systems, Inc.
    Inventors: Venkat Rangan, Anurag Kashyap, Prem Kumar, Malay Desai
  • Patent number: 8032598
    Abstract: A system of ranking e-mail threads is disclosed. The system receives e-mail messages, and determines e-mail threads in response to the e-mail messages. The system determines an e-mail rank associated with each e-mail message in the e-mail threads, where an e-mail rank associated with an e-mail message is determined in response to a sender identifier related to the e-mail message. The system also determines a thread rank for each e-mail thread, where a thread rank associated with an e-mail thread is determined in response to e-mail ranks of each e-mail message associated with each respective e-mail thread. The system then determines an ordering of the e-mail threads based on the thread rank associated with each e-mail thread.
    Type: Grant
    Filed: January 23, 2007
    Date of Patent: October 4, 2011
    Assignee: Clearwell Systems, Inc.
    Inventors: Yongqiang He, Mohan Kumar, Venkat Rangan
  • Patent number: 7899871
    Abstract: A method for processing e-mails includes receiving a plurality of e-mails. For each e-mail in the plurality of e-mails, a feature representation is generated for an e-mail based on a set of noun phrases associated with the e-mail. A set of topics associated with the plurality of e-mails is generated based on the feature representation for each e-mail. Sentence structure associated with the e-mail and parts of speech associated with the e-mail may be determined. The parts of speech, including a set of noun phrases associated with the e-mail, may be used to generate the feature representation for the e-mail.
    Type: Grant
    Filed: August 14, 2007
    Date of Patent: March 1, 2011
    Assignee: Clearwell Systems, Inc.
    Inventors: Mohan Kumar, Venkat Rangan
  • Patent number: 7743051
    Abstract: Methods and systems for searching e-mails are disclosed. In one embodiment, a method for searching e-mails includes receiving input indicative of one or more search terms. A query plan is determined based on the one or more search terms. The method includes performing a search in response to the query plan to determine information related to the one or more e-mails. Then, a set of results are generated based on the information related to the one or more e-mails.
    Type: Grant
    Filed: August 14, 2007
    Date of Patent: June 22, 2010
    Assignee: Clearwell Systems, Inc.
    Inventors: Anurag Kashyap, Malay Desai, Venkat Rangan, Gary Lehrman
  • Publication number: 20100030798
    Abstract: The invention provides for techniques to process and produce email documents. The techniques provide for organizing a first plurality of email documents into a plurality of document groups, reviewing a document group from the plurality of document groups, and associating a review content with the document group. The techniques provide for ways to propagate the review content to one or more email documents associated with the document group and producing a second plurality of email documents. The techniques provide for annotating one or more email documents in accordance with the review content. Depending on the embodiment, review content may include text, graphics, audio, tag, and multimedia information. Produced documents can be searched and browsed in accordance with information in the review content. Email documents can be grouped by information in meta information and/or header information associated with the email documents into various groups, including threads or conversations, for example.
    Type: Application
    Filed: July 29, 2008
    Publication date: February 4, 2010
    Applicant: Clearwell Systems, Inc.
    Inventors: Mohan Kumar, Gary Lehrman, Hari Krishna Dara
  • Patent number: 7657603
    Abstract: A system for processing e-mail messages receives, from an e-mail repository, a transactional e-mail message comprising message attribute data. The system places the transactional e-mail message in an e-mail thread in response to the message attribute data of the transaction e-mail message. The system determines whether one or more derived e-mail messages are included in the transactional e-mail message. If one or more derived e-mail messages are included, the system determines derived message attribute data of the one or more derived e-mail messages. The system then places the derived e-mail message in the e-mail thread in response to the derived message attribute data of the derived e-mail message.
    Type: Grant
    Filed: July 13, 2006
    Date of Patent: February 2, 2010
    Assignee: Clearwell Systems, Inc.
    Inventors: Yongqiang He, Mohan Kumar, Venkat Rangan
  • Publication number: 20090265609
    Abstract: The invention provides techniques for efficiently organizing and reviewing electronic documents to be produced in the course of a discovery process. The technique provides for marking the master or pivot document with review information, and identifying a plurality of duplicate documents related to the master or pivot document. The technique provides for reviewing a master or pivot document and propagating the review information to a set of related documents. The technique provides for producing a plurality of electronic documents where each of the electronic documents is marked up in accordance with the review information. The method provides for organizing the plurality of electronic documents so it can be presented and searched in an efficient manner.
    Type: Application
    Filed: April 16, 2008
    Publication date: October 22, 2009
    Applicant: Clearwell Systems, Inc.
    Inventors: Venkat Rangan, Anurag Kashyap, Prem Kumar, Malay Desai
  • Patent number: 7593995
    Abstract: A system of ranking e-mail threads is disclosed. The system receives e-mail messages, and determines e-mail threads in response to the e-mail messages. The system determines an e-mail rank associated with each e-mail message in the e-mail threads, where an e-mail rank associated with an e-mail message is determined in response to a sender identifier related to the e-mail message. The system also determines a thread rank for each e-mail thread, where a thread rank associated with an e-mail thread is determined in response to e-mail ranks of each e-mail message associated with each respective e-mail thread. The system then determines an ordering of the e-mail threads based on the thread rank associated with each e-mail thread.
    Type: Grant
    Filed: July 13, 2006
    Date of Patent: September 22, 2009
    Assignee: Clearwell Systems, Inc.
    Inventors: Yongqiang He, Mohan Kumar, Venkat Rangan