Patents Assigned to Clearwell Systems, Inc.
-
Publication number: 20120296891
Abstract: Techniques are provided for automatic sampling evaluation. An automatic sampling evaluation system enables users to evaluate convergence of one or more search processes. For example, given a set of searches that were validated by human review, a system can implement a retrieval process that samples one or more non-retrieved collections. The similarity of each document in the one or more non-retrieved collections to other documents in any retrieved sets is automatically evaluated. Given a goal of achieving a high recall, documents with high similarity can then be analyzed for additional noun phrases that may be used for a next iteration of a search. Convergence can be expected if the information gain in the new feedback loop is less than in previous iterations, and if the number of additional documents identified is below a certain threshold.
Type: Application
Filed: May 18, 2011
Publication date: November 22, 2012
Applicant: Clearwell Systems, Inc.
Inventor: Venkat Rangan
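The convergence test described in this abstract can be illustrated with a minimal sketch. It is not taken from the application: the function name, the per-iteration lists, and the threshold parameter are all hypothetical, and the abstract does not say how information gain is measured.

```python
def has_converged(gains, new_doc_counts, max_new_docs):
    """Hypothetical convergence check for an iterative search process.

    gains[i] and new_doc_counts[i] are assumed measurements for feedback
    loop i: the information gain and the number of additional documents
    identified. max_new_docs is an assumed threshold document count.
    """
    if len(gains) < 2 or not new_doc_counts:
        return False  # nothing to compare the latest loop against
    latest_gain = gains[-1]
    # Condition 1: the newest loop gained less than every earlier loop.
    gain_dropped = all(latest_gain < earlier for earlier in gains[:-1])
    # Condition 2: the newest loop found fewer documents than the threshold.
    few_new_docs = new_doc_counts[-1] < max_new_docs
    return gain_dropped and few_new_docs
```

Both conditions must hold, so a loop that adds few documents but still shows a rising information gain is not treated as converged.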
-
Publication number: 20120209853
Abstract: A set of trigrams can be generated for each document in a plurality of documents processed by an e-discovery system. Each trigram in the set of trigrams for a given document is a sequence of three terms in the given document. A set of trigrams for each similar document is then determined based on the set of trigrams for the original document. To facilitate identification of the similar documents, a full text index is then generated for the plurality of documents and the set of trigrams for each document is indexed into the full text index, as individual terms. Queries can be generated into the full text index based on trigrams of a document to determine other similar or near-duplicate documents. After a set of potentially similar documents is identified, a separate distance criterion can be applied to evaluate the level of similarity between the two documents in an efficient way.
Type: Application
Filed: February 16, 2011
Publication date: August 16, 2012
Applicant: Clearwell Systems, Inc.
Inventors: Malay Desai, Medha Shewale, Venkat Rangan
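The trigram approach in this abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the application's implementation: a plain dictionary stands in for the full text index, tokenization is a simple regular expression, and Jaccard overlap is swapped in as the distance criterion because the abstract does not name one. All function names are hypothetical.

```python
import re


def word_trigrams(text):
    """Return the set of three-term sequences in a document."""
    terms = re.findall(r"\w+", text.lower())
    return {" ".join(terms[i:i + 3]) for i in range(len(terms) - 2)}


def build_index(docs):
    """Index every trigram as an individual term: trigram -> document ids.

    A dict is an assumed stand-in for a real full text index.
    """
    index = {}
    for doc_id, text in docs.items():
        for trigram in word_trigrams(text):
            index.setdefault(trigram, set()).add(doc_id)
    return index


def candidates(index, text):
    """Query the index with a document's trigrams; count shared trigrams."""
    hits = {}
    for trigram in word_trigrams(text):
        for doc_id in index.get(trigram, ()):
            hits[doc_id] = hits.get(doc_id, 0) + 1
    return hits


def jaccard(a, b):
    """Assumed distance criterion: overlap of two trigram sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0
```

The index lookup narrows the corpus to documents sharing at least one trigram, so the more expensive pairwise comparison runs only on that candidate set.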
-
Publication number: 20120209847
Abstract: In various embodiments, a semantic space associated with a corpus of electronically stored information (ESI) may be created and used for concept searches. Documents (and any other objects in the ESI, in general) may be represented as vectors in the semantic space. Vectors may correspond to identifiers, such as, for example, indexed terms. The semantic space for a corpus of ESI can be used in information filtering, information retrieval, indexing, and relevancy rankings.
Type: Application
Filed: February 16, 2011
Publication date: August 16, 2012
Applicant: Clearwell Systems, Inc.
Inventor: Venkat Rangan
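As a rough illustration of ranking in a vector space, the sketch below assumes each indexed term already has a vector, builds a document vector by summing its term vectors, and ranks by cosine similarity. The abstract specifies none of these choices; the summation, the cosine measure, and every name here are assumptions made for the example.

```python
import math


def sum_vectors(vectors, dims):
    """Add term vectors component-wise into one vector of length dims."""
    total = [0.0] * dims
    for vector in vectors:
        for i, value in enumerate(vector):
            total[i] += value
    return total


def cosine(u, v):
    """Cosine similarity; 0.0 when either vector has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank(query_terms, docs, term_vectors, dims):
    """Order document ids by similarity to the query, most similar first.

    docs maps a document id to its list of terms; term_vectors maps a term
    to its (assumed, precomputed) vector in the semantic space.
    """
    query = sum_vectors(
        (term_vectors[t] for t in query_terms if t in term_vectors), dims
    )
    scored = []
    for doc_id, terms in docs.items():
        doc = sum_vectors((term_vectors[t] for t in terms if t in term_vectors), dims)
        scored.append((cosine(query, doc), doc_id))
    return [doc_id for _, doc_id in sorted(scored, reverse=True)]
```

With such vectors, a query for one term can rank a document that contains only a related term above an unrelated one, which is the kind of concept search the abstract describes.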
-
Publication number: 20120158728
Abstract: The invention provides for techniques to process and produce email documents. The techniques provide for organizing a first plurality of email documents into a plurality of document groups, reviewing a document group from the plurality of document groups, and associating a review content with the document group. The techniques provide for ways to propagate the review content to one or more email documents associated with the document group and producing a second plurality of email documents. The techniques provide for annotating one or more email documents in accordance with the review content. Depending on the embodiment, review content may include text, graphics, audio, tag, and multimedia information. Produced documents can be searched and browsed in accordance with information in the review content. Email documents can be grouped by information in meta information and/or header information associated with the email documents into various groups, including threads or conversations, for example.
Type: Application
Filed: February 27, 2012
Publication date: June 21, 2012
Applicant: CLEARWELL SYSTEMS, INC.
Inventors: Mohan Kumar, Gary Lehrman, Hari Krishna Dara
-
Patent number: 8171393
Abstract: The invention provides techniques for efficiently organizing and reviewing electronic documents to be produced in the course of a discovery process. The technique provides for marking the master or pivot document with review information, and identifying a plurality of duplicate documents related to the master or pivot document. The technique provides for reviewing a master or pivot document and propagating the review information to a set of related documents. The technique provides for producing a plurality of electronic documents where each of the electronic documents is marked up in accordance with the review information. The method provides for organizing the plurality of electronic documents so they can be presented and searched in an efficient manner.
Type: Grant
Filed: April 16, 2008
Date of Patent: May 1, 2012
Assignee: Clearwell Systems, Inc.
Inventors: Venkat Rangan, Anurag Kashyap, Prem Kumar, Malay Desai
-
Patent number: 8032598
Abstract: A system of ranking e-mail threads is disclosed. The system receives e-mail messages, and determines e-mail threads in response to the e-mail messages. The system determines an e-mail rank associated with each e-mail message in the e-mail threads, where an e-mail rank associated with an e-mail message is determined in response to a sender identifier related to the e-mail message. The system also determines a thread rank for each e-mail thread, where a thread rank associated with an e-mail thread is determined in response to e-mail ranks of each e-mail message associated with each respective e-mail thread. The system then determines an ordering of the e-mail threads based on the thread rank associated with each e-mail thread.
Type: Grant
Filed: January 23, 2007
Date of Patent: October 4, 2011
Assignee: Clearwell Systems, Inc.
Inventors: Yongqiang He, Mohan Kumar, Venkat Rangan
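The ranking flow in this abstract (sender identifier to e-mail rank, e-mail ranks to thread rank, thread rank to ordering) can be sketched as below. The per-sender weight table and the use of a mean to combine e-mail ranks are assumptions made for illustration; the abstract does not state how either rank is computed.

```python
def rank_threads(threads, sender_weight, default=0.0):
    """Order thread ids from highest to lowest thread rank.

    threads maps a thread id to the sender identifiers of its messages.
    sender_weight is a hypothetical table giving each sender a score,
    which serves here as the e-mail rank of that sender's messages.
    """
    thread_rank = {}
    for thread_id, senders in threads.items():
        # E-mail rank: derived from the sender identifier alone.
        email_ranks = [sender_weight.get(sender, default) for sender in senders]
        # Thread rank: assumed to be the mean of its e-mail ranks.
        if email_ranks:
            thread_rank[thread_id] = sum(email_ranks) / len(email_ranks)
        else:
            thread_rank[thread_id] = default
    return sorted(thread_rank, key=lambda t: thread_rank[t], reverse=True)
```

A mean keeps long threads from outranking short ones purely by message count; a sum or maximum would be an equally plausible reading of the abstract.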
-
Patent number: 7899871
Abstract: A method for processing e-mails includes receiving a plurality of e-mails. For each e-mail in the plurality of e-mails, a feature representation is generated for the e-mail based on a set of noun phrases associated with the e-mail. A set of topics associated with the plurality of e-mails is generated based on the feature representation for each e-mail. Sentence structure associated with the e-mail and parts of speech associated with the e-mail may be determined. The parts of speech, including a set of noun phrases associated with the e-mail, may be used to generate the feature representation for the e-mail.
Type: Grant
Filed: August 14, 2007
Date of Patent: March 1, 2011
Assignee: Clearwell Systems, Inc.
Inventors: Mohan Kumar, Venkat Rangan
-
Patent number: 7743051
Abstract: Methods and systems for searching e-mails are disclosed. In one embodiment, a method for searching e-mails includes receiving input indicative of one or more search terms. A query plan is determined based on the one or more search terms. The method includes performing a search in response to the query plan to determine information related to the one or more e-mails. Then, a set of results is generated based on the information related to the one or more e-mails.
Type: Grant
Filed: August 14, 2007
Date of Patent: June 22, 2010
Assignee: Clearwell Systems, Inc.
Inventors: Anurag Kashyap, Malay Desai, Venkat Rangan, Gary Lehrman
-
Publication number: 20100030798
Abstract: The invention provides for techniques to process and produce email documents. The techniques provide for organizing a first plurality of email documents into a plurality of document groups, reviewing a document group from the plurality of document groups, and associating a review content with the document group. The techniques provide for ways to propagate the review content to one or more email documents associated with the document group and producing a second plurality of email documents. The techniques provide for annotating one or more email documents in accordance with the review content. Depending on the embodiment, review content may include text, graphics, audio, tag, and multimedia information. Produced documents can be searched and browsed in accordance with information in the review content. Email documents can be grouped by information in meta information and/or header information associated with the email documents into various groups, including threads or conversations, for example.
Type: Application
Filed: July 29, 2008
Publication date: February 4, 2010
Applicant: Clearwell Systems, Inc.
Inventors: Mohan Kumar, Gary Lehrman, Hari Krishna Dara
-
Patent number: 7657603
Abstract: A system for processing e-mail messages receives, from an e-mail repository, a transactional e-mail message comprising message attribute data. The system places the transactional e-mail message in an e-mail thread in response to the message attribute data of the transactional e-mail message. The system determines whether one or more derived e-mail messages are included in the transactional e-mail message. If one or more derived e-mail messages are included, the system determines derived message attribute data of the one or more derived e-mail messages. The system then places the derived e-mail message in the e-mail thread in response to the derived message attribute data of the derived e-mail message.
Type: Grant
Filed: July 13, 2006
Date of Patent: February 2, 2010
Assignee: Clearwell Systems, Inc.
Inventors: Yongqiang He, Mohan Kumar, Venkat Rangan
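One way to picture the handling of derived messages is the sketch below. It assumes a derived message is a quoted message embedded in the body after an "-----Original Message-----" marker, that its attribute data is the header block following the marker, and that threads are keyed by normalized subject. None of these details come from the abstract; the marker, the message dictionary layout, and the function names are hypothetical.

```python
import re

# Assumed marker that introduces a quoted (derived) message in a body.
MARKER = "-----Original Message-----"


def normalize_subject(subject):
    """Strip leading reply/forward prefixes so replies share a thread key."""
    stripped = re.sub(r"^(?:\s*(?:re|fw|fwd):)+\s*", "", subject, flags=re.IGNORECASE)
    return stripped.strip().lower()


def derived_messages(body):
    """Return attribute dicts for messages quoted inside a message body."""
    derived = []
    for segment in body.split(MARKER)[1:]:
        attrs = {}
        for line in segment.strip().splitlines():
            key, sep, value = line.partition(":")
            if not sep:
                break  # first non-header line ends the quoted header block
            attrs[key.strip().lower()] = value.strip()
        if "subject" in attrs:
            derived.append(attrs)
    return derived


def place_in_threads(message, threads):
    """Place a message, then any derived messages, into subject-keyed threads."""
    key = normalize_subject(message["subject"])
    threads.setdefault(key, []).append(message.get("from"))
    for attrs in derived_messages(message.get("body", "")):
        derived_key = normalize_subject(attrs["subject"])
        threads.setdefault(derived_key, []).append(attrs.get("from"))
    return threads
```

Recovering quoted messages this way lets a thread include a message that never appeared in the repository on its own, for example when only the reply was collected.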
-
Publication number: 20090265609
Abstract: The invention provides techniques for efficiently organizing and reviewing electronic documents to be produced in the course of a discovery process. The technique provides for marking the master or pivot document with review information, and identifying a plurality of duplicate documents related to the master or pivot document. The technique provides for reviewing a master or pivot document and propagating the review information to a set of related documents. The technique provides for producing a plurality of electronic documents where each of the electronic documents is marked up in accordance with the review information. The method provides for organizing the plurality of electronic documents so they can be presented and searched in an efficient manner.
Type: Application
Filed: April 16, 2008
Publication date: October 22, 2009
Applicant: Clearwell Systems, Inc.
Inventors: Venkat Rangan, Anurag Kashyap, Prem Kumar, Malay Desai
-
Patent number: 7593995
Abstract: A system of ranking e-mail threads is disclosed. The system receives e-mail messages, and determines e-mail threads in response to the e-mail messages. The system determines an e-mail rank associated with each e-mail message in the e-mail threads, where an e-mail rank associated with an e-mail message is determined in response to a sender identifier related to the e-mail message. The system also determines a thread rank for each e-mail thread, where a thread rank associated with an e-mail thread is determined in response to e-mail ranks of each e-mail message associated with each respective e-mail thread. The system then determines an ordering of the e-mail threads based on the thread rank associated with each e-mail thread.
Type: Grant
Filed: July 13, 2006
Date of Patent: September 22, 2009
Assignee: Clearwell Systems, Inc.
Inventors: Yongqiang He, Mohan Kumar, Venkat Rangan