Patents by Inventor Sarangarajan Parthasarathy

Sarangarajan Parthasarathy has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data

Patent number: 6405166

Abstract: A multimedia search apparatus and method for searching multimedia content using speaker detection to segment the multimedia content. The multimedia search apparatus receives a search request from a user device. The search request identifies the target speaker for which the search is to be conducted. Based on the search request, the multimedia search apparatus retrieves multimedia content from a multimedia database. The multimedia search apparatus retrieves models, such as Gaussian Mixture Models (GMMs), from a model storage device, corresponding to the target speaker and background data. Based on the retrieved models, the multimedia search device searches the multimedia data of the multimedia content and segments the multimedia data. The segments are identified by calculating an average normalized score for a block of frames of the multimedia data and determining if the average normalized score for the block of frames exceeds one or more predetermined thresholds.

Type: Grant

Filed: October 15, 2001

Date of Patent: June 11, 2002

Assignee: AT&T Corp.

Inventors: Qian Huang, Ivan Magrin-Chagnolleau, Sarangarajan Parthasarathy, Aaron Edward Rosenberg
Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data

Publication number: 20020029144

Abstract: A multimedia search apparatus and method for searching multimedia content using speaker detection to segment the multimedia content. The multimedia search apparatus receives a search request from a user device. The search request identifies the target speaker for which the search is to be conducted. Based on the search request, the multimedia search apparatus retrieves multimedia content from a multimedia database. The multimedia search apparatus retrieves models, such as Gaussian Mixture Models (GMMs), from a model storage device, corresponding to the target speaker and background data. Based on the retrieved models, the multimedia search device searches the multimedia data of the multimedia content and segments the multimedia data. The segments are identified by calculating an average normalized score for a block of frames of the multimedia data and determining if the average normalized score for the block of frames exceeds one or more predetermined thresholds.

Type: Application

Filed: October 15, 2001

Publication date: March 7, 2002

Applicant: AT&T Corp.

Inventors: Qian Huang, Ivan Magrin-Chagnolleau, Sarangarajan Parthasarathy, Aaron Edward Rosenberg
Method and apparatus for speaker identification using mixture discriminant analysis to develop speaker models

Patent number: 6330536

Abstract: A speaker identification system is provided that constructs speaker models using a discriminant analysis technique where the data in each class is modeled by Gaussian mixtures. The speaker identification method and apparatus determines the identity of a speaker, as one of a small group, based on a sentence-length password utterance. A speaker's utterance is received and a sequence of a first set of feature vectors are computed based on the received utterance. The first set of feature vectors are then transformed into a second set of feature vectors using transformations specific to a particular segmentation unit, and likelihood scores of the second set of feature vectors are computed using speaker models trained using mixture discriminant analysis. The likelihood scores are then combined to determine an utterance score and the speaker's identity is validated based on the utterance score. The speaker identification method and apparatus also includes training and enrollment phases.

Type: Grant

Filed: March 16, 2001

Date of Patent: December 11, 2001

Assignee: AT&T Corp.

Inventors: Sarangarajan Parthasarathy, Aaron E. Rosenberg
Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data

Patent number: 6317710

Abstract: A multimedia search apparatus and method for searching multimedia content using speaker detection to segment the multimedia content. The multimedia search apparatus receives a search request from a user device. The search request identifies the target speaker for which the search is to be conducted. Based on the search request, the multimedia search apparatus retrieves multimedia content from a multimedia database. The multimedia search apparatus retrieves models, such as Gaussian Mixture Models (GMMs), from a model storage device, corresponding to the target speaker and background data. Based on the retrieved models, the multimedia search device searches the audio data of the multimedia content and segments the audio data. The segments are identified by calculating an average normalized score for a block of frames of the audio data and determining if the average normalized score for the block of frames exceeds one or more predetermined thresholds.

Type: Grant

Filed: July 14, 1999

Date of Patent: November 13, 2001

Assignee: AT&T Corp.

Inventors: Qian Huang, Ivan Magrin-Chagnolleau, Sarangarajan Parthasarathy, Aaron Edward Rosenberg
Method and apparatus for speaker identification using mixture discriminant analysis to develop speaker models

Patent number: 6233555

Abstract: A speaker identification system is provided that constructs speaker models using a discriminant analysis technique where the data in each class is modeled by Gaussian mixtures. The speaker identification method and apparatus determines the identity of a speaker, as one of a small group, based on a sentence-length password utterance. A speaker's utterance is received and a sequence of a first set of feature vectors are computed based on the received utterance. The first set of feature vectors are then transformed into a second set of feature vectors using transformations specific to a particular segmentation unit, and likelihood scores of the second set of feature vectors are computed using speaker models trained using mixture discriminant analysis. The likelihood scores are then combined to determine an utterance score and the speaker's identity is validated based on the utterance score. The speaker identification method and apparatus also includes training and enrollment phases.

Type: Grant

Filed: November 24, 1998

Date of Patent: May 15, 2001

Assignee: AT&T Corporation

Inventors: Sarangarajan Parthasarathy, Aaron E. Rosenberg
Speaker identification with user-selected password phrases

Patent number: 5913192

Abstract: A speaker identification system includes a speaker-independent phrase recognizer. The speaker-independent phrase recognizer scores a password utterance against all the sets of phonetic transcriptions in a lexicon database to determine the N best speaker-independent scores, determines the N best sets of phonetic transcriptions based on the N best speaker-independent scores, and determines the N best possible identities. A speaker-dependent phrase recognizer retrieves the hidden Markov model corresponding to each of the N best possible identities, and scores the password utterance against each of the N hidden Markov models to generate a speaker-dependent score for each of the N best possible identities. A score processor coupled to the outputs of the speaker-independent phrase recognizer and the speaker-dependent phrase recognizer determines a putative identity. A verifier coupled to the score processor authenticates the determined putative identity.

Type: Grant

Filed: August 22, 1997

Date of Patent: June 15, 1999

Assignee: AT&T Corp

Inventors: Sarangarajan Parthasarathy, Aaron Edward Rosenberg

prev 1 2 3 4 5

Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data

Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data

Method and apparatus for speaker identification using mixture discriminant analysis to develop speaker models

Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data

Method and apparatus for speaker identification using mixture discriminant analysis to develop speaker models

Speaker identification with user-selected password phrases