Patents by Inventor Ganesh Sivaraman

Ganesh Sivaraman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11756564
    Abstract: A computer may segment a noisy audio signal into audio frames and execute a deep neural network (DNN) to estimate an instantaneous function of clean speech spectrum and noisy audio spectrum in the audio frame. This instantaneous function may correspond to a ratio of an a-priori signal to noise ratio (SNR) and an a-posteriori SNR of the audio frame. The computer may add estimated instantaneous function to the original noisy audio frame to output an enhanced speech audio frame.
    Type: Grant
    Filed: June 14, 2019
    Date of Patent: September 12, 2023
    Assignee: PINDROP SECURITY, INC.
    Inventors: Ganesh Sivaraman, Elie Khoury
  • Patent number: 11715460
    Abstract: Described herein are systems and methods for improved audio analysis using a computer-executed neural network having one or more in-network data augmentation layers. The systems described herein help ease or avoid unwanted strain on computing resources by employing the data augmentation techniques within the layers of the neural network. The in-network data augmentation layers will produce various types of simulated audio data when the computer applies the neural network on an inputted audio signal during a training phase, enrollment phase, and/or testing phase. Subsequent layers of the neural network (e.g., convolutional layer, pooling layer, data augmentation layer) ingest the simulated audio data and the inputted audio signal and perform various operations.
    Type: Grant
    Filed: October 8, 2020
    Date of Patent: August 1, 2023
    Assignee: PINDROP SECURITY, INC.
    Inventors: Elie Khoury, Ganesh Sivaraman, Tianxiang Chen, Amruta Vidwans
  • Publication number: 20230214608
    Abstract: Systems, asset tag devices, and methods are described herein to provide asset, or object, tracking using a plurality of beacons. The asset tag devices may be affixed, or coupled to, the asset, or object, and may receive beacon advertising packets from the beacons, and then transmit, or broadcast, tag advertising packets including information from the received beacon advertising packets, which may be used to determine an approximate location of the asset tag device, and in turn, the asset or object.
    Type: Application
    Filed: December 29, 2022
    Publication date: July 6, 2023
    Inventors: Kurt Larson, John Young, Ganesh Sivaraman
  • Publication number: 20230137652
    Abstract: Disclosed are systems and methods including computing-processes executing machine-learning architectures for voice biometrics, in which the machine-learning architecture implements one or more language compensation functions. Embodiments include an embedding extraction engine (sometimes referred to as an “embedding extractor”) that extracts speaker embeddings and determines a speaker similarity score for determine or verifying the likelihood that speakers in different audio signals are the same speaker. The machine-learning architecture further includes a multi-class language classifier that determines a language likelihood score that indicates the likelihood that a particular audio signal includes a spoken language. The features and functions of the machine-learning architecture described herein may implement the various language compensation techniques to provide more accurate speaker recognition results, regardless of the language spoken by the speaker.
    Type: Application
    Filed: October 31, 2022
    Publication date: May 4, 2023
    Applicant: Pindrop Security, Inc.
    Inventors: Elie KHOURY, Tianxiang CHEN, Avrosh KUMAR, Ganesh SIVARAMAN, Kedar PHATAK
  • Publication number: 20230107741
    Abstract: Disclosed are systems and methods including computing-processes executing machine-learning architectures implementing label distribution loss functions to improve age estimation performance and generalization. The machine-learning architecture includes a front-end neural network architecture defining a speaker embedding extraction engine of the machine-learning architecture, and a backend neural network architecture defining an age estimation engine of the machine-learning architecture. The embedding extractor is trained to extract low-level acoustic features of a speaker's speech, such as mel-frequency cepstral coefficients (MFCCs), from audio signals, and then extract a feature vector or speaker embedding vector that mathematically represents the low-level features of the speaker. The age estimator is trained to generate an estimated age for the speaker and a Gaussian probability distribution around the estimated age, by applying the various types of layers of the age estimator on the speaker embedding.
    Type: Application
    Filed: October 5, 2022
    Publication date: April 6, 2023
    Applicant: Pindrop Security, Inc.
    Inventors: Amruta SARAF, Elie KHOURY, Ganesh SIVARAMAN
  • Publication number: 20220084509
    Abstract: Embodiments described herein provide for a machine-learning architecture system that enhances the speech audio of a user-defined target speaker by suppressing interfering speakers, as well as background noise and reverberations. The machine-learning architecture includes a speech separation engine for separating the speech signal of a target speaker from a mixture of multiple speakers' speech, and a noise suppression engine for suppressing various types of noise in the input audio signal. The speaker-specific speech enhancement architecture performs speaker mixture separation and background noise suppression to enhance the perceptual quality of the speech audio. The output of the machine-learning architecture is an enhanced audio signal improving the voice quality of a target speaker on a single-channel audio input containing a mixture of speaker speech signals and various types of noise.
    Type: Application
    Filed: September 14, 2021
    Publication date: March 17, 2022
    Applicant: PINDROP SECURITY, INC.
    Inventors: Ganesh SIVARAMAN, Avrosh KUMAR, Elie KHOURY
  • Publication number: 20210326421
    Abstract: Embodiments described herein provide for a voice biometrics system execute machine-learning architectures capable of passive, active, continuous, or static operations, or a combination thereof. Systems passively and/or continuously, in some cases in addition to actively and/or statically, enrolling speakers as the speakers speak into or around an edge device (e.g., car, television, radio, phone). The system identifies users on the fly without requiring a new speaker to mirror prompted utterances for reconfiguring operations. The system manages speaker profiles as speakers provide utterances to the system. Machine-learning architectures implement a passive and continuous voice biometrics system, possibly without knowledge of speaker identities. The system creates identities in an unsupervised manner, sometimes passively enrolling and recognizing known or unknown speakers.
    Type: Application
    Filed: April 15, 2021
    Publication date: October 21, 2021
    Inventors: Elie KHOURY, Ganesh SIVARAMAN, Avrosh KUMAR, Ivan ANTOLIC-SOBAN
  • Publication number: 20210241776
    Abstract: Embodiments described herein provide for systems and methods for voice-based cross-channel enrollment and authentication. The systems control for and mitigate against variations in audio signals received across any number of communications channels by training and employing a neural network architecture comprising a speaker verification neural network and a bandwidth expansion neural network. The bandwidth expansion neural network is trained on narrowband audio signals to produce and generate estimated wideband audio signals corresponding to the narrowband audio signals. These estimated wideband audio signals may be fed into one or more downstream applications, such as the speaker verification neural network or embedding extraction neural network. The speaker verification neural network can then compare and score inbound embeddings for a current call against enrolled embeddings, regardless of the channel used to receive the inbound signal or enrollment signal.
    Type: Application
    Filed: February 2, 2021
    Publication date: August 5, 2021
    Inventors: Ganesh SIVARAMAN, Elie KHOURY, Avrosh KUMAR
  • Publication number: 20210110813
    Abstract: Described herein are systems and methods for improved audio analysis using a computer-executed neural network having one or more in-network data augmentation layers. The systems described herein help ease or avoid unwanted strain on computing resources by employing the data augmentation techniques within the layers of the neural network. The in-network data augmentation layers will produce various types of simulated audio data when the computer applies the neural network on an inputted audio signal during a training phase, enrollment phase, and/or testing phase. Subsequent layers of the neural network (e.g., convolutional layer, pooling layer, data augmentation layer) ingest the simulated audio data and the inputted audio signal and perform various operations.
    Type: Application
    Filed: October 8, 2020
    Publication date: April 15, 2021
    Inventors: Elie KHOURY, Ganesh SIVARAMAN, Tianxiang CHEN, Amruta VIDWANS
  • Publication number: 20190385630
    Abstract: A computer may segment a noisy audio signal into audio frames and execute a deep neural network (DNN) to estimate an instantaneous function of clean speech spectrum and noisy audio spectrum in the audio frame. This instantaneous function may correspond to a ratio of an a-priori signal to noise ratio (SNR) and an a-posteriori SNR of the audio frame. The computer may add estimated instantaneous function to the original noisy audio frame to output an enhanced speech audio frame.
    Type: Application
    Filed: June 14, 2019
    Publication date: December 19, 2019
    Inventors: Ganesh Sivaraman, Elie Khoury
  • Patent number: 9792343
    Abstract: The present invention provides a method, a network device and a system for allowing for resuming a preceding incomplete synchronization session is provided, wherein the preceding incomplete synchronization session has been interrupted during its performing. In principle the resuming of the preceding incomplete synchronization session is based on the following operations according to the inventive concept. A communication connection for synchronization of data between a first and a second device is establishing. The first and the second device comprise each a predefined set of data records to be synchronized. A first and a second update identifier are communicated between the first and the second device. The first update identifier specifies a preceding complete synchronization session having been performed between them and the second update identifier specifies a preceding incomplete synchronization session having been performed between them.
    Type: Grant
    Filed: November 8, 2002
    Date of Patent: October 17, 2017
    Assignee: Nokia Technologies Oy
    Inventors: Ganesh Sivaraman, Riku Mettälä
  • Patent number: 7711792
    Abstract: The present invention provides a method for generating a request for information about a data store. The request codes a data store descriptor which is used to characterize or identify a matching data store from a plurality of data stores. The re-transmitted information comprises retrieved information necessary for the requesting device or the applications executed thereon to access the identified data store, respectively. The present invention provides a method for generating a response in consequence on receiving a request for information a about data store. The data store descriptor coded in the request is used to characterize or identify a matching data store from a plurality of data stores and information about the data store is retrieved. This retrieved information coded in a response is transmitted to the previously requesting device enabling the requesting device or the applications executed thereon to access the identified data store, respectively.
    Type: Grant
    Filed: June 27, 2003
    Date of Patent: May 4, 2010
    Assignee: Nokia Corporation
    Inventors: Ganesh Sivaraman, Riku Mettala, Jussi Piispanen
  • Publication number: 20040205263
    Abstract: The present invention provides a method, a network device and a system for allowing for resuming a preceding incomplete synchronization session is provided, wherein the preceding incomplete synchronization session has been interrupted during its performing. In principle the resuming of the preceding incomplete synchronization session is based on the following operations according to the inventive concept. A communication connection for synchronization of data between a first and a second device is establishing. The first and the second device comprise each a predefined set of data records to be synchronized. A first and a second update identifier are communicated between the first and the second device. The first update identifier specifies a preceding complete synchronization session having been performed between them and the second update identifier specifies a preceding incomplete synchronization session having been performed between them.
    Type: Application
    Filed: November 8, 2002
    Publication date: October 14, 2004
    Applicant: Nokia Corporation
    Inventors: Ganesh Sivaraman, Riku Mettala
  • Publication number: 20040088372
    Abstract: The present invention provides a method for generating a request for information about a data store. The request codes a data store descriptor which is used to characterize or identify a matching data store from a plurality of data stores. The re-transmitted information comprises retrieved information necessary for the requesting device or the applications executed thereon to access the identified data store, respectively. The present invention provides a method for generating a response in consequence on receiving a request for information a about data store. The data store descriptor coded in the request is used to characterize or identify a matching data store from a plurality of data stores and information about the data store is retrieved. This retrieved information coded in a response is transmitted to the previously requesting device enabling the requesting device or the applications executed thereon to access the identified data store, respectively.
    Type: Application
    Filed: June 27, 2003
    Publication date: May 6, 2004
    Applicant: Nokia Corporation
    Inventors: Ganesh Sivaraman, Riku Mettala, Jussi Piispanen
  • Publication number: 20040044799
    Abstract: The present invention provides a method, a network device and a system for allowing for resuming a preceding incomplete synchronization session is provided, wherein the preceding incomplete synchronization session has been interrupted during its performing. In principle the resuming of the preceding incomplete synchronization session is based on the following operations according to the inventive concept. A communication connection for synchronization of data between a first and a second device is establishing. The first and the second device comprise each a predefined set of data records to be synchronized. A first and a second update identifier are communicated between the first and the second device. The first update identifier specifies a preceding complete synchronization session having been performed between them and the second update identifier specifies a preceding incomplete synchronization session having been performed between them.
    Type: Application
    Filed: September 3, 2002
    Publication date: March 4, 2004
    Applicant: Nokia Corporation
    Inventors: Ganesh Sivaraman, Riku Mettala