Patents Assigned to Text IQ, Inc.

Automatic document classification

Patent number: 12493641

Abstract: A method may include obtaining a set of documents. Text objects from a text object database generated based on the set of documents may be compared to a search term object to identify relevant text objects that match the search term object. A context object for each of the relevant text objects that indicate usage of the relevant text objects within the documents corresponding to the relevant text objects may be determined, and context objects may be grouped according to similarities between the context objects. A first or second classification may be applied to each context object based on one or more criteria, and the first and second classifications may also be applied to each group based on the classifications of the context objects within the groups. Documents within the set of documents may be given the first or second classifications based on relations and similarities to the classified groups.

Type: Grant

Filed: December 6, 2021

Date of Patent: December 9, 2025

Assignee: Text IQ, Inc.

Inventors: Xiao Liu, Jasneet Singh Sabharwal, Apoorv Agarwal, Ethan Benjamin, Xing Zeng
Identifying documents that contain potential code words using a machine learning model

Patent number: 12314665

Abstract: Identifying documents that contain potential code words using a machine learning model. In some embodiments, a method may include receiving documents, identifying a first corpus and a second corpus in the documents, extracting a first set of word embeddings from the first corpus and a second set of word embeddings from the second corpus, generating a first vector space for the first set of word embeddings and a second vector space for the second set of word embeddings using a machine learning model, performing a vector rotation to improve alignment of the first set of word embeddings with the second set of word embeddings, identifying a word embedding in the first vector space that is not aligned with a corresponding word embedding in the second vector space as a potential code word, and identifying one or more documents that contain the potential code word in the first corpus.

Type: Grant

Filed: February 20, 2024

Date of Patent: May 27, 2025

Assignee: Text IQ, Inc.

Inventors: Apoorv Agarwal, Ethan Benjamin, Jasneet Sabharwal
Automatic document classification

Patent number: 12125000

Abstract: A method to automatically classify emails may include generating multiple entity data objects using entities identified in receiver and sender fields of emails and categorizing the multiple entity data objects into a first set of data objects and a second set of data objects. The method may also include extracting all tokens from each email and searching the extracted tokens for tokens associated with the data objects of the first set of data objects. The method may further include identifying the emails that include the extracted tokens that are associated with the data objects of the first set of data objects, identifying a particular data object of the first set of data objects to which an identified email corresponds, and automatically classifying the identified email in the first category in response to identifying the particular data object of the first set of data objects to which an identified email corresponds.

Type: Grant

Filed: September 29, 2022

Date of Patent: October 22, 2024

Assignee: TEXT IQ, INC.

Inventors: Apoorv Agarwal, Ethan Benjamin, Jasneet Singh Sabharwal
Identifying documents that contain potential code words using a machine learning model

Patent number: 11907660

Abstract: Identifying documents that contain potential code words using a machine learning model. In some embodiments, a method may include receiving documents, identifying a first corpus and a second corpus in the documents, extracting a first set of word embeddings from the first corpus and a second set of word embeddings from the second corpus, generating a first vector space for the first set of word embeddings and a second vector space for the second set of word embeddings using a machine learning model, performing a vector rotation to improve alignment of the first set of word embeddings with the second set of word embeddings, identifying a word embedding in the first vector space that is not aligned with a corresponding word embedding in the second vector space as a potential code word, and identifying one or more documents that contain the potential code word in the first corpus.

Type: Grant

Filed: March 9, 2023

Date of Patent: February 20, 2024

Assignee: TEXT IQ, INC.

Inventors: Apoorv Agarwal, Ethan Benjamin, Jasneet Sabharwal
Identifying and ranking potentially privileged documents using a machine learning topic model

Patent number: 11631021

Abstract: A method for identifying and ranking potentially privileged documents using a machine learning topic model may include receiving a set of documents. The method may also include, for each of two or more documents in the set of documents, extracting a set of spans from the document, generating, using a machine learning topic model, a set of topics and a subset of legal topics for the set of spans, generating a vector of probabilities for each span with a probability being assigned to each topic in the set of topics for the span, assigning a score to one or more spans in the set of spans by summing the probabilities in the vector that are assigned to a topic in the subset of legal topics, and assigning a score to the document. The method may further include ranking the two or more documents by their assigned scores.

Type: Grant

Filed: October 2, 2019

Date of Patent: April 18, 2023

Assignee: Text IQ, Inc.

Inventors: Ethan Benjamin, Apoorv Agarwal
Identifying documents that contain potential code words using a machine learning model

Patent number: 11625534

Abstract: Identifying documents that contain potential code words using a machine learning model. In some embodiments, a method may include receiving documents, identifying a first corpus and a second corpus in the documents, extracting a first set of word embeddings from the first corpus and a second set of word embeddings from the second corpus, generating a first vector space for the first set of word embeddings and a second vector space for the second set of word embeddings using a machine learning model, performing a vector rotation to improve alignment of the first set of word embeddings with the second set of word embeddings, identifying a word embedding in the first vector space that is not aligned with a corresponding word embedding in the second vector space as a potential code word, and identifying one or more documents that contain the potential code word in the first corpus.

Type: Grant

Filed: February 10, 2020

Date of Patent: April 11, 2023

Assignee: Text IQ, Inc.

Inventors: Apoorv Agarwal, Ethan Benjamin, Jasneet Sabharwal
Automatic document classification

Patent number: 11574287

Abstract: A method to automatically classify emails may include generating multiple entity data objects using entities identified in receiver and sender fields of emails and categorizing the multiple entity data objects into a first set of data objects and a second set of data objects. The method may also include extracting all tokens from each email and searching the extracted tokens for tokens associated with the data objects of the first set of data objects. The method may further include identifying the emails that include the extracted tokens that are associated with the data objects of the first set of data objects, identifying a particular data object of the first set of data objects to which an identified email corresponds, and automatically classifying the identified email in the first category in response to identifying the particular data object of the first set of data objects to which an identified email corresponds.

Type: Grant

Filed: June 24, 2021

Date of Patent: February 7, 2023

Assignee: Text IQ, Inc.

Inventors: Apoorv Agarwal, Ethan Benjamin, Jasneet Singh Sabharwal

Automatic document classification

Identifying documents that contain potential code words using a machine learning model

Automatic document classification

Identifying documents that contain potential code words using a machine learning model

Identifying and ranking potentially privileged documents using a machine learning topic model

Identifying documents that contain potential code words using a machine learning model

Automatic document classification