Patents Assigned to PINDROP SECURITY, INC.
  • Patent number: 12621382
    Abstract: Systems, methods, and computer-readable media for call classification and for training a model for call classification, an example method comprising: receiving DTMF information from a plurality of calls; determining, for each of the calls, a feature vector including statistics based on DTMF information such as DTMF residual signal comprising channel noise and additive noise; training a model for classification; comparing a new call feature vector to the model; predicting a device type and geographic location based on the comparison of the new call feature vector to the model; classifying the call as spoofed or genuine; and authenticating a call or altering an IVR call flow.
    Type: Grant
    Filed: May 31, 2024
    Date of Patent: May 5, 2026
    Assignee: Pindrop Security, Inc.
    Inventors: Nick Gaubitch, Scott Strong, John Cornwell, Hassan Kingravi, David Dewey
  • Publication number: 20260112357
    Abstract: Disclosed are systems and methods including software processes executed by a server that detect audio-based synthetic speech (“deepfakes”) in a call conversation. The server applies an NLP engine to transcribe call audio and analyze the text for anomalous patterns to detect synthetic speech. Additionally or alternatively, the server executes a voice “liveness” detection system for detecting machine speech, such as synthetic speech or replayed speech. The system performs phrase repetition detection, background change detection, and passive voice liveness detection in call audio signals to detect liveness of a speech utterance. An automated model update module allows the liveness detection model to adapt to new types of presentation attacks, based on the human provided feedback.
    Type: Application
    Filed: December 10, 2025
    Publication date: April 23, 2026
    Applicant: PINDROP SECURITY, INC.
    Inventors: Umair ALTAF, Sai Pradeep PERI, Lakshay PHATELA, Payas GUPTA, Yitao SUN, Svetlana AFANASEVA, Kailash PATIL, Elie KHOURY, Bradley MAGNETTA, Vijay BALASUBRAMANIYAN, Tianxiang CHEN
  • Patent number: 12609132
    Abstract: A computer may train a single-class machine learning using normal speech recordings. The machine learning model or any other model may estimate the normal range of parameters of a physical speech production model based on the normal speech recordings. For example, the computer may use a source-filter model of speech production, where voiced speech is represented by a pulse train and unvoiced speech by a random noise and a combination of the pulse train and the random noise is passed through an auto-regressive filter that emulates the human vocal tract. The computer leverages the fact that intentional modification of human voice introduces errors to source-filter model or any other physical model of speech production. The computer may identify anomalies in the physical model to generate a voice modification score for an audio signal. The voice modification score may indicate a degree of abnormality of human voice in the audio signal.
    Type: Grant
    Filed: September 26, 2022
    Date of Patent: April 21, 2026
    Assignee: Pindrop Security, Inc.
    Inventors: David Looney, Nikolay D. Gaubitch
  • Patent number: 12592239
    Abstract: Disclosed are systems and methods including software processes executed by a server that detect audio-based synthetic speech (“deepfakes”) in a call conversation. Embodiments include systems and methods for detecting fraudulent presentation attacks using multiple functional engines that implement various fraud-detection techniques, to produce calibrated scores and/or fused scores. A computer may, for example, evaluate the audio quality of speech signals within audio signals, where speech signals contain the speech portions having speaker utterances.
    Type: Grant
    Filed: April 25, 2024
    Date of Patent: March 31, 2026
    Assignee: Pindrop Security, Inc.
    Inventors: Elie Khoury, Ganesh Sivaraman, Tianxiang Chen, Nikolay Gaubitch, David Looney, Amit Gupta, Vijay Balasubramaniyan, Nicholas Klein, Anthony Stankus
  • Patent number: 12592220
    Abstract: Disclosed are systems and methods including software processes executed by a server that detect audio-based synthetic speech (“deepfakes”) in a call conversation. The server applies an NLP engine to transcribe call audio and analyze the text for anomalous patterns to detect synthetic speech. Additionally or alternatively, the server executes a voice “liveness” detection system for detecting machine speech, such as synthetic speech or replayed speech. The system performs phrase repetition detection, background change detection, and passive voice liveness detection in call audio signals to detect liveness of a speech utterance. An automated model update module allows the liveness detection model to adapt to new types of presentation attacks, based on the human provided feedback.
    Type: Grant
    Filed: November 9, 2023
    Date of Patent: March 31, 2026
    Assignee: Pindrop Security, Inc.
    Inventors: Umair Altaf, Sai Pradeep Peri, Lakshay Phatela, Payas Gupta, Yitao Sun, Svetlana Afanaseva, Kailash Patil, Elie Khoury, Bradley Magnetta, Vijay Balasubramaniyan, Tianxiang Chen
  • Publication number: 20260075084
    Abstract: Embodiments disclosed herein include software processes and of machine-learning architectures for detecting and mitigating against synthetic speech instances. A computer analyzes audio speech data and metadata received with contact events associated with source identifiers. The computer executes machine-learning architecture(s) that determine whether the contact events likely include human-generated speech or machine-generated synthetic speech. The computer may determine the likelihood that contact events represent a DoS attack launched by a source device, by analyzing behavior features in metadata associated with the source identifier. The computer determines whether the contact events originated from the source user device having the source identifier launched a DoS attack and, if so, may update a blocklist.
    Type: Application
    Filed: September 10, 2025
    Publication date: March 12, 2026
    Applicant: Pindrop Security, Inc.
    Inventors: Vijay Balasubramaniyan, Amit Gupta
  • Publication number: 20260065913
    Abstract: Disclosed are systems and methods including software processes executed by a server that detect machine-generated synthetic singing vocals in a vocal audio signal of an audio signal using a multi-stage machine-learning architecture. A singing detector identifies vocal segments containing singing. A singing liveness detector includes a fakeprint embedding extractor that extracts fakeprint feature vector embeddings representing artifacts of machine-generated vocal signals, scoring layers or classifier layers to generate a singing liveness score for identifying the likelihood a vocal signal is human-generated or synthetic. An optional singer detector includes a vocalprint embedding extractor that extracts vocalprint feature vector embeddings representing singer-specific vocal identity characteristics and generates a singer identification score or attribution score for identifying a particular singer in the vocal signal.
    Type: Application
    Filed: August 25, 2025
    Publication date: March 5, 2026
    Applicant: PINDROP SECURITY, INC.
    Inventors: Hemlata Tak, Ricardo Casal, Ganesh Sivaraman, Elie Khoury
  • Publication number: 20260057057
    Abstract: Embodiments described herein provide for a voice biometrics system execute machine-learning architectures capable of passive, active, continuous, or static operations, or a combination thereof. Systems passively and/or continuously, in some cases in addition to actively and/or statically, enrolling speakers as the speakers speak into or around an edge device (e.g., car, television, radio, phone). The system identifies users on the fly without requiring a new speaker to mirror prompted utterances for reconfiguring operations. The system manages speaker profiles as speakers provide utterances to the system. Machine-learning architectures implement a passive and continuous voice biometrics system, possibly without knowledge of speaker identities. The system creates identities in an unsupervised manner, sometimes passively enrolling and recognizing known or unknown speakers.
    Type: Application
    Filed: October 29, 2025
    Publication date: February 26, 2026
    Applicant: PINDROP SECURITY, INC.
    Inventors: Elie KHOURY, Ganesh SIVARAMAN, Avrosh KUMAR, Ivan ANTOLIC-SOBAN
  • Patent number: 12562150
    Abstract: Disclosed are systems and methods including software processes executed by a server that detect audio-based synthetic speech (“deepfakes”) in a call conversation. The server applies an NLP engine to transcribe call audio and analyze the text for anomalous patterns to detect synthetic speech. Additionally or alternatively, the server executes a voice “liveness” detection system for detecting machine speech, such as synthetic speech or replayed speech. The system performs phrase repetition detection, background change detection, and passive voice liveness detection in call audio signals to detect liveness of a speech utterance. An automated model update module allows the liveness detection model to adapt to new types of presentation attacks, based on the human provided feedback.
    Type: Grant
    Filed: November 9, 2023
    Date of Patent: February 24, 2026
    Assignee: Pindrop Security, Inc.
    Inventors: Umair Altaf, Sai Pradeep Peri, Lakshay Phatela, Payas Gupta, Yitao Sun, Svetlana Afanaseva, Kailash Patil, Elie Khoury, Bradley Magnetta, Vijay Balasubramaniyan, Tianxiang Chen
  • Publication number: 20260052158
    Abstract: Embodiments include a computing device that executes software routines and/or one or more machine-learning architectures providing improved omni-channel authentication solutions. Embodiments include one or more computing devices that provide an authentication interface by which various communication channels may deposit contact or session data received via a first-channel session into a non-transitory storage medium of an authentication database for another channel to obtain and employ (e.g., verify users). This allows the customer to access an online data channel and enter the contact center through a telephony communication channel, but further allows the enterprise contact center systems to passively maintain access to various types of information about the user's identity captured from each contact channel, allowing the call center to request or capture authenticating information (e.g.
    Type: Application
    Filed: October 27, 2025
    Publication date: February 19, 2026
    Applicant: Pindrop Security, Inc.
    Inventors: MohammedAli MERCHANT, Payas GUPTA
  • Publication number: 20260052210
    Abstract: Disclosed herein are embodiments of systems, methods, and products comprises an authentication server for caller ID verification. When a caller makes a phone call, the server receives the phone call and verifies whether the phone call is from a registered device associated with the phone number. The server queries the registered device to retrieve one or more current call states via an authentication function on the registered device. The server compares the states and/or state transitions to the observed states and/or state transitions of the phone call. If the registered device states and/or state transitions match the observed phone call states and/or state transitions, the server verifies that the phone call is from the registered device and not some imposter's device. If there is no such match, the server rejects the phone call before the call phone is connected or terminates the phone call after the phone call is connected.
    Type: Application
    Filed: October 27, 2025
    Publication date: February 19, 2026
    Applicant: PINDROP SECURITY, INC.
    Inventors: Payas GUPTA, Terry NELMS, II
  • Publication number: 20260045262
    Abstract: Disclosed are systems and methods including computing-processes executing machine-learning architectures for voice biometrics, in which the machine-learning architecture implements one or more language compensation functions. Embodiments include an embedding extraction engine (sometimes referred to as an “embedding extractor”) that extracts speaker embeddings and determines a speaker similarity score for determine or verifying the likelihood that speakers in different audio signals are the same speaker. The machine-learning architecture further includes a multi-class language classifier that determines a language likelihood score that indicates the likelihood that a particular audio signal includes a spoken language. The features and functions of the machine-learning architecture described herein may implement the various language compensation techniques to provide more accurate speaker recognition results, regardless of the language spoken by the speaker.
    Type: Application
    Filed: October 15, 2025
    Publication date: February 12, 2026
    Applicant: PINDROP SECURITY, INC.
    Inventors: Elie KHOURY, Tianxiang CHEN, Avrosh KUMAR, Ganesh SIVARAMAN, Kedar PHATAK
  • Publication number: 20260025461
    Abstract: Disclosed are systems and methods including computing-processes, which may include layers of machine-learning architectures, for assessing risk for calls directed to call center systems using carrier signaling metadata. A computer evaluates carrier signaling metadata to perform various new risk-scoring techniques to determine riskiness of calls and authenticate calls. When determining a risk score for an incoming call is received at a call center system, the computer may obtain certain metadata values from inbound metadata, prior call metadata, or from third-party telecommunications services and executes processes for determining the risk score for the call. The risk score operations include several scoring components, including appliance print scoring, carrier detection scoring, ANI location detection scoring, location similarity scoring, and JIP-ANI location similarity scoring, among others.
    Type: Application
    Filed: September 29, 2025
    Publication date: January 22, 2026
    Applicant: Pindrop Security, Inc.
    Inventors: Ricky Casal, Vinay Maddali, Payas Gupta, Kailash Patil
  • Publication number: 20260018177
    Abstract: Embodiments described herein provide for systems and methods for implementing a neural network architecture for spoof detection in audio signals. The neural network architecture contains a layers defining embedding extractors that extract embeddings from input audio signals. Spoofprint embeddings are generated for particular system enrollees to detect attempts to spoof the enrollee's voice. Optionally, voiceprint embeddings are generated for the system enrollees to recognize the enrollee's voice. The voiceprints are extracted using features related to the enrollee's voice. The spoofprints are extracted using features related to features of how the enrollee speaks and other artifacts. The spoofprints facilitate detection of efforts to fool voice biometrics using synthesized speech (e.g., deepfakes) that spoof and emulate the enrollee's voice.
    Type: Application
    Filed: September 15, 2025
    Publication date: January 15, 2026
    Applicant: PINDROP SECURITY, INC.
    Inventors: Tianxiang CHEN, Elie KHOURY
  • Patent number: 12525224
    Abstract: Disclosed are systems and methods including software processes executed by a server that detect audio-based synthetic speech (“deepfakes”) in a call conversation. The server applies an NLP engine to transcribe call audio and analyze the text for anomalous patterns to detect synthetic speech. Additionally or alternatively, the server executes a voice “liveness” detection system for detecting machine speech, such as synthetic speech or replayed speech. The system performs phrase repetition detection, background change detection, and passive voice liveness detection in call audio signals to detect liveness of a speech utterance. An automated model update module allows the liveness detection model to adapt to new types of presentation attacks, based on the human provided feedback.
    Type: Grant
    Filed: November 9, 2023
    Date of Patent: January 13, 2026
    Assignee: Pindrop Security, Inc.
    Inventors: Umair Altaf, Sai Pradeep Peri, Lakshay Phatela, Payas Gupta, Yitao Sun, Svetlana Afanaseva, Kailash Patil, Elie Khoury, Bradley Magnetta, Vijay Balasubramaniyan, Tianxiang Chen
  • Patent number: 12525244
    Abstract: The present invention is directed to a deep neural network (DNN) having a triplet network architecture, which is suitable to perform speaker recognition. In particular, the DNN includes three feed-forward neural networks, which are trained according to a batch process utilizing a cohort set of negative training samples. After each batch of training samples is processed, the DNN may be trained according to a loss function, e.g., utilizing a cosine measure of similarity between respective samples, along with positive and negative margins, to provide a robust representation of voiceprints.
    Type: Grant
    Filed: January 25, 2024
    Date of Patent: January 13, 2026
    Assignee: Pindrop Security, Inc.
    Inventors: Elie Khoury, Matthew Garland
  • Patent number: 12512101
    Abstract: The present invention is directed to a deep neural network (DNN) having a triplet network architecture, which is suitable to perform speaker recognition. In particular, the DNN includes three feed-forward neural networks, which are trained according to a batch process utilizing a cohort set of negative training samples. After each batch of training samples is processed, the DNN may be trained according to a loss function, e.g., utilizing a cosine measure of similarity between respective samples, along with positive and negative margins, to provide a robust representation of voiceprints.
    Type: Grant
    Filed: October 10, 2022
    Date of Patent: December 30, 2025
    Assignee: Pindrop Security, Inc.
    Inventors: Elie Khoury, Matthew Garland
  • Patent number: 12488072
    Abstract: Embodiments described herein provide for a voice biometrics system execute machine-learning architectures capable of passive, active, continuous, or static operations, or a combination thereof. Systems passively and/or continuously, in some cases in addition to actively and/or statically, enrolling speakers as the speakers speak into or around an edge device (e.g., car, television, radio, phone). The system identifies users on the fly without requiring a new speaker to mirror prompted utterances for reconfiguring operations. The system manages speaker profiles as speakers provide utterances to the system. Machine-learning architectures implement a passive and continuous voice biometrics system, possibly without knowledge of speaker identities. The system creates identities in an unsupervised manner, sometimes passively enrolling and recognizing known or unknown speakers.
    Type: Grant
    Filed: April 15, 2021
    Date of Patent: December 2, 2025
    Assignee: Pindrop Security, Inc.
    Inventors: Elie Khoury, Ganesh Sivaraman, Avrosh Kumar, Ivan Antolic-Soban
  • Patent number: 12489760
    Abstract: Embodiments include a computing device that executes software routines and/or one or more machine-learning architectures providing improved omni-channel authentication solutions. Embodiments include one or more computing devices that provide an authentication interface by which various communication channels may deposit contact or session data received via a first-channel session into a non-transitory storage medium of an authentication database for another channel to obtain and employ (e.g., verify users). This allows the customer to access an online data channel and enter the contact center through a telephony communication channel, but further allows the enterprise contact center systems to passively maintain access to various types of information about the user's identity captured from each contact channel, allowing the call center to request or capture authenticating information (e.g.
    Type: Grant
    Filed: August 17, 2023
    Date of Patent: December 2, 2025
    Assignee: Pindrop Security, Inc.
    Inventors: Mohammedali Merchant, Payas Gupta
  • Publication number: 20250363196
    Abstract: Embodiments described herein provide for automatically authenticating operation requests and end-users who submit operation requests during contact events. A server obtains an operation request for an operation originated at an end-user device. The server generates a voice-based one-time password (OTP) using contextual information associated with the requested operation. The server generates and transmits an OTP prompt having text representing the OTP for display at a user interface of the user device. The server receives a response including an audio signal that contains the recording of the user speaking the OTP text aloud. The server uses the audio signal to authenticate the user and the operation request based on the speaker's voice, the accuracy of the user speaking the OTP, and liveness or fraud detection features extracted from the audio signal or metadata from the user device.
    Type: Application
    Filed: May 22, 2025
    Publication date: November 27, 2025
    Applicant: PINDROP SECURITY, INC.
    Inventors: Amit GUPTA, MohammedAli MERCHANT, Vijay BALASUBRAMANIYAN