Patents by Inventor Ben Colman

Ben Colman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20260024319
    Abstract: A method for training a model for classifying videos as real or fake can include generating image tiles and audio data segments from an input video, generating a sequence of image embeddings based on the image tiles using a visual encoder and a sequence of audio embeddings based on the audio data segments using an audio encoder, transforming, using a V2A network, a first subset of the sequence of image embeddings into synthetic audio embeddings, transforming, using an A2V network, a first subset of the sequence of audio embeddings into synthetic image embeddings, updating the sequence of image embeddings by using the synthetic image embeddings, updating the sequence of audio embeddings using the synthetic audio embeddings, training the encoders and the networks using the updated sequences of image embeddings and audio embeddings, and training a classifier using the trained encoders and the trained networks.
    Type: Application
    Filed: August 7, 2025
    Publication date: January 22, 2026
    Applicant: Reality Defender, Inc.
    Inventors: Gaurav BHARAJ, Trevine OORLOFF, Surya KOPPISETTI, Nicolò BONETTINI, Ben COLMAN, Ali SHAHRIYARI
  • Publication number: 20250363996
    Abstract: Audio deepfake detection (ADD) is crucial to combat the potential misuse of synthesized speech from generative AI models. Existing ADD models suffer from generalization issues, with a large performance discrepancy seen between in-domain and out-of-domain data. Also, the black-box nature of the existing models limits their use in real-world scenarios where interpretation capabilities are required. Described is a new ADD training framework that explicitly uses the Style and LInguistics Mismatch (SLIM) in the fake class to separate it from the real class. The style-linguistics dependency is learned through a self-supervised pretraining stage, where only real samples are needed. Using frozen frontend encoders, SLIM outperforms benchmark methods on out-of-domain datasets while providing competitive results on in-domain datasets. The features learned by SLIM can be directly used to quantify the style-linguistics mismatch of deepfake samples, hence facilitating explainability.
    Type: Application
    Filed: May 20, 2025
    Publication date: November 27, 2025
    Applicant: Reality Defender, Inc.
    Inventors: Gaurav BHARAJ, Surya KOPPISETTI, Yi ZHU, Trang TRAN, Ben COLMAN, Ali SHAHRIYARI
  • Patent number: 12475686
    Abstract: An exemplary method for detecting deepfake images and providing customized analysis comprises: receiving, from a user, a textual user inquiry regarding an image; inputting the textual inquiry and the image into a deepfake detection model, wherein the deepfake detection model comprises: an image encoder for generating a plurality of image embeddings based on the image; a text encoder for generating a plurality of textual embeddings based on the textual inquiry; one or more layers for generating a plurality of answer embeddings; and a language model for generating a textual analysis based on the plurality of answer embeddings; and outputting the textual analysis, wherein the textual analysis includes a classification result of whether the image is fake and further includes one or more visual features in the image and one or more attributes of the one or more visual features that contribute to the classification result.
    Type: Grant
    Filed: March 28, 2025
    Date of Patent: November 18, 2025
    Assignee: Reality Defender, Inc.
    Inventors: Gaurav Bharaj, Yue Zhang, Ben Colman, Ali Shahriyari
  • Patent number: 12411910
    Abstract: An exemplary method for detecting fake audios comprises: converting audio data into an image representation of the audio data; providing the image representation of the audio data to a trained machine-learning model, the machine learning model: generating, using a trained self-attention branch, one or more representation embeddings corresponding to the image representation of the audio data; and receiving, using a trained classifier component, the one or more representation embeddings and outputting a classification result. The machine-learning model is trained by: in a first stage, training one or more self- and cross-attention components via contrastive learning, each self- and cross-attention component comprises a first self-attention branch, a second self-attention branch, and a cross-attention branch; and in a second stage, training the classifier component; and providing the classification result.
    Type: Grant
    Filed: November 20, 2024
    Date of Patent: September 9, 2025
    Assignee: Reality Defender, Inc.
    Inventors: Gaurav Bharaj, Chirag Goel, Surya Koppisetti, Ben Colman, Ali Shahriyari
  • Patent number: 12412376
    Abstract: A method for training a model for classifying videos as real or fake can include generating image tiles and audio data segments from an input video, generating a sequence of image embeddings based on the image tiles using a visual encoder and a sequence of audio embeddings based on the audio data segments using an audio encoder, transforming, using a V2A network, a first subset of the sequence of image embeddings into synthetic audio embeddings, transforming, using an A2V network, a first subset of the sequence of audio embeddings into synthetic image embeddings, updating the sequence of image embeddings by using the synthetic image embeddings, updating the sequence of audio embeddings using the synthetic audio embeddings, training the encoders and the networks using the updated sequences of image embeddings and audio embeddings, and training a classifier using the trained encoders and the trained networks.
    Type: Grant
    Filed: June 14, 2024
    Date of Patent: September 9, 2025
    Assignee: Reality Defender, Inc.
    Inventors: Gaurav Bharaj, Trevine Oorloff, Surya Koppisetti, Nicolò Bonettini, Ben Colman, Ali Shahriyari
  • Publication number: 20250245296
    Abstract: An exemplary method for detecting fake audios comprises: converting audio data into an image representation of the audio data; providing the image representation of the audio data to a trained machine-learning model, the machine learning model: generating, using a trained self-attention branch, one or more representation embeddings corresponding to the image representation of the audio data; and receiving, using a trained classifier component, the one or more representation embeddings and outputting a classification result. The machine-learning model is trained by: in a first stage, training one or more self- and cross-attention components via contrastive learning, each self- and cross-attention component comprises a first self-attention branch, a second self-attention branch, and a cross-attention branch; and in a second stage, training the classifier component; and providing the classification result.
    Type: Application
    Filed: November 20, 2024
    Publication date: July 31, 2025
    Applicant: Reality Defender, Inc.
    Inventors: Gaurav BHARAJ, Chirag GOEL, Surya KOPPISETTI, Ben COLMAN, Ali SHAHRIYARI
  • Publication number: 20250225773
    Abstract: An exemplary method for detecting deepfake images and providing customized analysis comprises: receiving, from a user, a textual user inquiry regarding an image; inputting the textual inquiry and the image into a deepfake detection model, wherein the deepfake detection model comprises: an image encoder for generating a plurality of image embeddings based on the image; a text encoder for generating a plurality of textual embeddings based on the textual inquiry; one or more layers for generating a plurality of answer embeddings; and a language model for generating a textual analysis based on the plurality of answer embeddings; and outputting the textual analysis, wherein the textual analysis includes a classification result of whether the image is fake and further includes one or more visual features in the image and one or more attributes of the one or more visual features that contribute to the classification result.
    Type: Application
    Filed: March 28, 2025
    Publication date: July 10, 2025
    Applicant: Reality Defender, Inc.
    Inventors: Gaurav BHARAJ, Yue ZHANG, Ben COLMAN, Ali SHAHRIYARI
  • Publication number: 20250200948
    Abstract: An exemplary method for reducing bias in a training image dataset for training a machine-learning model comprises: receiving a plurality of text strings comprising at least one text string describing each image in the training image dataset; generating a plurality of embeddings based on the plurality of text strings; identifying, based on the plurality of embeddings, a plurality of visual features in the training image dataset; identifying one or more correlations between the plurality of visual features in the training image dataset; receiving a user input identifying at least one biased correlation from the one or more correlations; and training the machine-learning model at least partially by adjusting one or more data sampling weights associated with one or more training images in the training image dataset based on the user input.
    Type: Application
    Filed: February 27, 2025
    Publication date: June 19, 2025
    Applicant: Reality Defender, Inc.
    Inventors: Gaurav BHARAJ, Miao ZHANG, Zee FRYER, Ben COLMAN, Ali SHAHRIYARI
  • Publication number: 20250166358
    Abstract: A method for training a model for classifying videos as real or fake can include generating image tiles and audio data segments from an input video, generating a sequence of image embeddings based on the image tiles using a visual encoder and a sequence of audio embeddings based on the audio data segments using an audio encoder, transforming, using a V2A network, a first subset of the sequence of image embeddings into synthetic audio embeddings, transforming, using an A2V network, a first subset of the sequence of audio embeddings into synthetic image embeddings, updating the sequence of image embeddings by using the synthetic image embeddings, updating the sequence of audio embeddings using the synthetic audio embeddings, training the encoders and the networks using the updated sequences of image embeddings and audio embeddings, and training a classifier using the trained encoders and the trained networks.
    Type: Application
    Filed: June 14, 2024
    Publication date: May 22, 2025
    Applicant: Reality Defender, Inc.
    Inventors: Gaurav BHARAJ, Trevine OORLOFF, Surya KOPPISETTI, Nicolò BONETTINI, Ben COLMAN, Ali SHAHRIYARI
  • Patent number: 12288379
    Abstract: An exemplary method for detecting deepfake images and providing customized analysis comprises: receiving, from a user, a textual user inquiry regarding an image; inputting the textual inquiry and the image into a deepfake detection model, wherein the deepfake detection model comprises: an image encoder for generating a plurality of image embeddings based on the image; a text encoder for generating a plurality of textual embeddings based on the textual inquiry; one or more layers for generating a plurality of answer embeddings; and a language model for generating a textual analysis based on the plurality of answer embeddings; and outputting the textual analysis, wherein the textual analysis includes a classification result of whether the image is fake and further includes one or more visual features in the image and one or more attributes of the one or more visual features that contribute to the classification result.
    Type: Grant
    Filed: June 21, 2024
    Date of Patent: April 29, 2025
    Assignee: Reality Defender, Inc.
    Inventors: Gaurav Bharaj, Yue Zhang, Ben Colman, Ali Shahriyari
  • Patent number: 12277753
    Abstract: An exemplary method for reducing bias in a training image dataset for training a machine-learning model comprises: receiving a plurality of text strings comprising at least one text string describing each image in the training image dataset; generating a plurality of embeddings based on the plurality of text strings; identifying, based on the plurality of embeddings, a plurality of visual features in the training image dataset; identifying one or more correlations between the plurality of visual features in the training image dataset; receiving a user input identifying at least one biased correlation from the one or more correlations; and training the machine-learning model at least partially by adjusting one or more data sampling weights associated with one or more training images in the training image dataset based on the user input.
    Type: Grant
    Filed: June 21, 2024
    Date of Patent: April 15, 2025
    Assignee: Reality Defender, Inc.
    Inventors: Gaurav Bharaj, Miao Zhang, Zee Fryer, Ben Colman, Ali Shahriyari
  • Patent number: 12189712
    Abstract: An exemplary method for detecting fake audios comprises: converting audio data into an image representation of the audio data; providing the image representation of the audio data to a trained machine-learning model, the machine learning model: generating, using a trained self-attention branch, one or more representation embeddings corresponding to the image representation of the audio data; and receiving, using a trained classifier component, the one or more representation embeddings and outputting a classification result. The machine-learning model is trained by: in a first stage, training one or more self- and cross-attention components via contrastive learning, each self- and cross-attention component comprises a first self-attention branch, a second self-attention branch, and a cross-attention branch; and in a second stage, training the classifier component; and providing the classification result.
    Type: Grant
    Filed: January 29, 2024
    Date of Patent: January 7, 2025
    Assignee: Reality Defender, Inc.
    Inventors: Gaurav Bharaj, Chirag Goel, Surya Koppisetti, Ben Colman, Ali Shahriyari
  • Publication number: 20240106234
    Abstract: Power-management techniques are described. In the power-management techniques, a computer system (such as a cloud-based computer system or a local computer system) dynamically controls an energy source that provides electrical power to a load at a location. Notably, the computer system selectively transitions between use of a (typically external) energy source (such as a power grid associated with a utility or power supplier, hydroelectric, a generator or a solar array) and an energy storage device (such as a battery or a battery array, thermal power storage, a mechanical flywheel, or another type of energy storage device) at the location to provide electrical power to at least a dominant (e.g., highest power-consuming) load at the location. Note that the selective transitions may be based at least in part on a price of electricity and/or carbon intensity of electricity.
    Type: Application
    Filed: September 7, 2023
    Publication date: March 28, 2024
    Applicant: Energy Applied, Inc.
    Inventors: Pierre Duchesne-Vallade, Ben Colman, Somsack Lavivanh
  • Patent number: D487174
    Type: Grant
    Filed: March 11, 2003
    Date of Patent: February 24, 2004
    Assignee: The Procter & Gamble Company
    Inventors: Ben Colman, Paulus Antonius Augustinus Höfte, Johannes Lambertus Maria Mensink, Leonard Joseph Keller, Jr.
  • Patent number: D498568
    Type: Grant
    Filed: March 11, 2003
    Date of Patent: November 16, 2004
    Assignee: The Procter & Gamble Company
    Inventors: Ben Colman, Paulus Antonius Augustinus Höfte, Johannes Lambertus Maria Mensink, Leonard Joseph Keller, Jr.