Patents Assigned to Text IQ, Inc.
  • Patent number: 12125000
    Abstract: A method to automatically classify emails may include generating multiple entity data objects using entities identified in receiver and sender fields of emails and categorizing the multiple entity data objects into a first set of data objects and a second set of data objects. The method may also include extracting all tokens from each email and searching the extracted tokens for tokens associated with the data objects of the first set of data objects. The method may further include identifying the emails that include the extracted tokens that are associated with the data objects of the first set of data objects, identifying a particular data object of the first set of data objects to which an identified email corresponds, and automatically classifying the identified email in the first category in response to identifying the particular data object of the first set of data objects to which an identified email corresponds.
    Type: Grant
    Filed: September 29, 2022
    Date of Patent: October 22, 2024
    Assignee: TEXT IQ, INC.
    Inventors: Apoorv Agarwal, Ethan Benjamin, Jasneet Singh Sabharwal
  • Patent number: 11907660
    Abstract: Identifying documents that contain potential code words using a machine learning model. In some embodiments, a method may include receiving documents, identifying a first corpus and a second corpus in the documents, extracting a first set of word embeddings from the first corpus and a second set of word embeddings from the second corpus, generating a first vector space for the first set of word embeddings and a second vector space for the second set of word embeddings using a machine learning model, performing a vector rotation to improve alignment of the first set of word embeddings with the second set of word embeddings, identifying a word embedding in the first vector space that is not aligned with a corresponding word embedding in the second vector space as a potential code word, and identifying one or more documents that contain the potential code word in the first corpus.
    Type: Grant
    Filed: March 9, 2023
    Date of Patent: February 20, 2024
    Assignee: TEXT IQ, INC.
    Inventors: Apoorv Agarwal, Ethan Benjamin, Jasneet Sabharwal
  • Patent number: 11631021
    Abstract: A method for identifying and ranking potentially privileged documents using a machine learning topic model may include receiving a set of documents. The method may also include, for each of two or more documents in the set of documents, extracting a set of spans from the document, generating, using a machine learning topic model, a set of topics and a subset of legal topics for the set of spans, generating a vector of probabilities for each span with a probability being assigned to each topic in the set of topics for the span, assigning a score to one or more spans in the set of spans by summing the probabilities in the vector that are assigned to a topic in the subset of legal topics, and assigning a score to the document. The method may further include ranking the two or more documents by their assigned scores.
    Type: Grant
    Filed: October 2, 2019
    Date of Patent: April 18, 2023
    Assignee: Text IQ, Inc.
    Inventors: Ethan Benjamin, Apoorv Agarwal
  • Patent number: 11625534
    Abstract: Identifying documents that contain potential code words using a machine learning model. In some embodiments, a method may include receiving documents, identifying a first corpus and a second corpus in the documents, extracting a first set of word embeddings from the first corpus and a second set of word embeddings from the second corpus, generating a first vector space for the first set of word embeddings and a second vector space for the second set of word embeddings using a machine learning model, performing a vector rotation to improve alignment of the first set of word embeddings with the second set of word embeddings, identifying a word embedding in the first vector space that is not aligned with a corresponding word embedding in the second vector space as a potential code word, and identifying one or more documents that contain the potential code word in the first corpus.
    Type: Grant
    Filed: February 10, 2020
    Date of Patent: April 11, 2023
    Assignee: Text IQ, Inc.
    Inventors: Apoorv Agarwal, Ethan Benjamin, Jasneet Sabharwal
  • Patent number: 11574287
    Abstract: A method to automatically classify emails may include generating multiple entity data objects using entities identified in receiver and sender fields of emails and categorizing the multiple entity data objects into a first set of data objects and a second set of data objects. The method may also include extracting all tokens from each email and searching the extracted tokens for tokens associated with the data objects of the first set of data objects. The method may further include identifying the emails that include the extracted tokens that are associated with the data objects of the first set of data objects, identifying a particular data object of the first set of data objects to which an identified email corresponds, and automatically classifying the identified email in the first category in response to identifying the particular data object of the first set of data objects to which an identified email corresponds.
    Type: Grant
    Filed: June 24, 2021
    Date of Patent: February 7, 2023
    Assignee: Text IQ, Inc.
    Inventors: Apoorv Agarwal, Ethan Benjamin, Jasneet Singh Sabharwal