Patents by Inventor Vishwa N. Gupta

Vishwa N. Gupta has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Content-based video copy detection

Patent number: 8671109

Abstract: A method to detect video copying based on content. The method comprises providing a set of reference data elements derived from a set of reference video frames in a reference video stream; providing a set of query data elements derived from a set of query video frames in a query video stream, each of the query data elements having a corresponding query data element identifier; associating with each of the reference data elements a fingerprint selected from among the query data element identifiers; and determining a similarity measure for the query video stream relative to the reference video stream by a comparison of the query data element identifiers to the fingerprints.

Type: Grant

Filed: December 2, 2011

Date of Patent: March 11, 2014

Assignee: CRIM (Centre de Recherche Informatique de Montreal)

Inventors: Vishwa N. Gupta, Parisa Darvish Zadeh Varcheie
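
The fingerprinting idea in the abstract above can be illustrated with a minimal sketch. It assumes details the abstract does not state: each frame is reduced to a single number, the fingerprint of a reference frame is the identifier of its nearest query frame, and the similarity measure is the best fraction of fingerprints that line up with the query identifiers at some offset. The function names are hypothetical and this is not the patented implementation.

```python
def nearest_query_id(ref_feature, query_features):
    """Return the identifier (index) of the query frame closest to a reference frame."""
    return min(range(len(query_features)),
               key=lambda q: abs(query_features[q] - ref_feature))


def copy_similarity(reference_features, query_features):
    """Return (similarity, offset) for the best alignment of query against reference.

    Assumption: similarity is the fraction of query frames whose identifier
    equals the fingerprint stored at the aligned reference frame.
    """
    n_q = len(query_features)
    if n_q == 0:
        return 0.0, None
    # One fingerprint per reference frame, chosen from the query identifiers.
    fingerprints = [nearest_query_id(r, query_features) for r in reference_features]
    best_count, best_offset = 0, None
    for offset in range(len(reference_features) - n_q + 1):
        count = sum(1 for q in range(n_q) if fingerprints[offset + q] == q)
        if count > best_count:
            best_count, best_offset = count, offset
    return best_count / n_q, best_offset


reference = [10.0, 20.0, 1.0, 2.0, 3.0, 4.0, 30.0, 40.0]
query = [1.0, 2.0, 3.0, 4.0]
similarity, offset = copy_similarity(reference, query)
print(similarity, offset)
```

In this toy input the query frames appear verbatim starting at the third reference frame, so every fingerprint in that window matches its query identifier.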
Content-based video copy detection

Publication number: 20120143915

Abstract: A method to detect video copying based on content. The method comprises providing a set of reference data elements derived from a set of reference video frames in a reference video stream; providing a set of query data elements derived from a set of query video frames in a query video stream, each of the query data elements having a corresponding query data element identifier; associating with each of the reference data elements a fingerprint selected from among the query data element identifiers; and determining a similarity measure for the query video stream relative to the reference video stream by a comparison of the query data element identifiers to the fingerprints.

Type: Application

Filed: December 2, 2011

Publication date: June 7, 2012

Applicant: CRIM (Centre de Recherche Informatique de Montreal)

Inventors: Vishwa N. Gupta, Parisa Darvish Zadeh Varcheie
Method and apparatus for performing text to speech synthesis

Patent number: 6980834

Abstract: A data communication terminal, such as a cellular telephone, capable of synthesizing speech. The data communication terminal can establish a communication session with a base station over a transmission facility implementing a voice channel and a data channel. The data communication terminal includes a speech synthesizer engine that receives from the remote entity a signal transmitted over the data channel and that conveys the vocal tract characteristics of the message to be delivered as a spoken announcement. The base station generates the signal containing the vocal tract characteristics from a text-based signal of the message to be synthesized. The invention also extends to a base station that can convert the text based message to be synthesized into a signal containing vocal tract characteristics and that sends the signal to a remote terminal over the data channel.

Type: Grant

Filed: December 5, 2002

Date of Patent: December 27, 2005

Assignee: Nortel Networks Limited

Inventors: Vishwa N. Gupta, Paul Boucher
Method and apparatus for performing text to speech synthesis

Publication number: 20030083105

Abstract: A data communication terminal, such as a cellular telephone, capable of synthesizing speech. The data communication terminal can establish a communication session with a base station over a transmission facility implementing a voice channel and a data channel. The data communication terminal includes a speech synthesizer engine that receives from the remote entity a signal transmitted over the data channel and that conveys the vocal tract characteristics of the message to be delivered as a spoken announcement. The base station generates the signal containing the vocal tract characteristics from a text-based signal of the message to be synthesized. The invention also extends to a base station that can convert the text based message to be synthesized into a signal containing vocal tract characteristics and that sends the signal to a remote terminal over the data channel.

Type: Application

Filed: December 5, 2002

Publication date: May 1, 2003

Inventors: Vishwa N. Gupta, Paul Boucher
Method and apparatus for performing text to speech synthesis

Patent number: 6516207

Abstract: A data communication terminal, such as a cellular telephone, capable of synthesizing speech. The data communication terminal can establish a communication session with a base station over a transmission facility implementing a voice channel and a data channel. The data communication terminal includes a speech synthesizer engine that receives from the remote entity a signal transmitted over the data channel and that conveys the vocal tract characteristics of the message to be delivered as a spoken announcement. The base station generates the signal containing the vocal tract characteristics from a text-based signal of the message to be synthesized. The invention also extends to a base station that can convert the text based message to be synthesized into a signal containing vocal tract characteristics and that sends the signal to a remote terminal over the data channel.

Type: Grant

Filed: December 7, 1999

Date of Patent: February 4, 2003

Assignee: Nortel Networks Limited

Inventors: Vishwa N. Gupta, Paul Boucher
Method and apparatus for speech recognition

Patent number: 6092045

Abstract: Comparing a series of observations representing unknown speech, to stored models representing known speech, the series of observations being divided into at least two blocks each comprising two or more of the observations, is carried out in an order which makes better use of memory. First, the observations in one of the blocks are compared (31), to a subset comprising one or more of the models, to determine a likelihood of a match to each of the one or more models. This step is repeated (33) for models other than those in the subset; and the whole process is repeated (34) for each block.

Type: Grant

Filed: July 21, 1998

Date of Patent: July 18, 2000

Assignee: Nortel Networks Corporation

Inventors: Peter R. Stubley, Andre Gillet, Vishwa N. Gupta, Christopher K. Toulson, David B. Peters
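
The evaluation order described in the abstract above (block by block, and within each block one subset of models at a time) can be sketched as nested loops. The scoring function, the scalar observations, and the parameter names are placeholder assumptions for illustration only; the abstract does not specify them.

```python
def blockwise_scores(observations, models, score, block_size=2, models_per_subset=1):
    """Accumulate a match score per model, visiting observations block by block.

    observations: list of observations for the unknown speech
    models: dict mapping model name to model parameters
    score: callable(observation, model) returning a log-likelihood-like value (assumed)
    """
    totals = {name: 0.0 for name in models}
    names = list(models)
    # Outer loop: one block of observations at a time.
    for start in range(0, len(observations), block_size):
        block = observations[start:start + block_size]
        # Middle loop: one subset of models at a time, so only that subset
        # needs to be held in fast memory while the block is scored.
        for s in range(0, len(names), models_per_subset):
            for name in names[s:s + models_per_subset]:
                for obs in block:
                    totals[name] += score(obs, models[name])
    return totals


def toy_score(obs, mean):
    """Hypothetical score: negative squared distance to a scalar model mean."""
    return -(obs - mean) ** 2


toy_models = {"a": 0.0, "b": 1.0}
toy_observations = [0.0, 0.1, 0.2, 0.1]
totals = blockwise_scores(toy_observations, toy_models, toy_score)
print(max(totals, key=totals.get))
```

The totals are the same as scoring every observation against every model in any other order; only the order of memory accesses changes.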
Updating markov models based on speech input and additional information for automated telephone directory assistance

Patent number: 5644680

Abstract: In methods and apparatus for at least partially automating a telephone directory assistance function, directory assistance callers are prompted to speak locality or called entity names associated with desired directory listings. A speech recognition algorithm is applied to speech signals received in response to prompting to determine spoken locality or called entity names. Desired telephone numbers are released to callers, and released telephone numbers are used to confirm or correct at least some of the recognized locality or called entity names. Speech signal representations labelled with the confirmed or corrected names are used as labelled speech tokens to refine prior training of the speech recognition algorithm. The training refinement automatically adjusts for deficiencies in prior training of the speech recognition algorithm and to long term changes in the speech patterns of directory assistance callers served by a particular directory assistance installation.

Type: Grant

Filed: May 25, 1995

Date of Patent: July 1, 1997

Assignee: Northern Telecom Limited

Inventors: Gregory J. Bielby, Vishwa N. Gupta, Lauren C. Hodgson, Matthew Lennig, R. Douglas Sharp, Hans A. Wasmeier
Speech recognition method using a two-pass search

Patent number: 5515475

Abstract: A method of recognizing speech comprises searching a vocabulary of words for a match to an unknown utterance. Words in the vocabulary are represented by concatenated allophone models and the vocabulary is represented as a network. On a first pass of the search, a one-state duration constrained model is used to search the vocabulary network. The one-state model has as its transition probability the maximum observed transitional probability (model distance) of the unknown utterance for the corresponding allophone model. Words having top scores are chosen from the first pass search and, in a second pass of the search, rescored using a full Viterbi trellis with the complete allophone models and model distances. The rescores are sorted to provide a few top choices. Using a second set of speech parameters these few top choices are again rescored. Comparison of the scores using each set of speech parameters determines a recognition choice. Post processing is also possible to further enhance recognition accuracy.

Type: Grant

Filed: June 24, 1993

Date of Patent: May 7, 1996

Assignee: Northern Telecom Limited

Inventors: Vishwa N. Gupta, Matthew Lennig
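
The overall control flow of a two-pass search (a cheap first pass to pick a shortlist, then a more expensive rescoring of only those candidates) can be sketched as follows. This sketch does not implement the one-state duration-constrained model or the Viterbi trellis named in the abstract above; both scorers are hypothetical stand-ins operating on strings, and the function and parameter names are assumptions.

```python
def two_pass_search(vocabulary, utterance, fast_score, full_score, shortlist_size=3):
    """Return the shortlisted words ordered by the second-pass score, best first."""
    # First pass: rank the whole vocabulary with the cheap scorer.
    ranked = sorted(vocabulary, key=lambda w: fast_score(w, utterance), reverse=True)
    shortlist = ranked[:shortlist_size]
    # Second pass: rescore only the shortlist with the expensive scorer.
    return sorted(shortlist, key=lambda w: full_score(w, utterance), reverse=True)


def toy_fast_score(word, utterance):
    """Hypothetical cheap score: penalise length mismatch only."""
    return -abs(len(word) - len(utterance))


def toy_full_score(word, utterance):
    """Hypothetical detailed score: count of positions with matching characters."""
    return sum(1 for a, b in zip(word, utterance) if a == b)


words = ["montreal", "monterey", "toronto", "ottawa"]
choices = two_pass_search(words, "montreol", toy_fast_score, toy_full_score, shortlist_size=2)
print(choices)
```

The design point is that the expensive scorer runs on `shortlist_size` candidates rather than on the whole vocabulary.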
Method and apparatus for training speech recognition algorithms for directory assistance applications

Patent number: 5488652

Abstract: In methods and apparatus for at least partially automating a telephone directory assistance function, directory assistance callers are prompted to speak locality or called entity names associated with desired directory listings. A speech recognition algorithm is applied to speech signals received in response to prompting to determine spoken locality or called entity names. Desired telephone numbers are released to callers, and released telephone numbers are used to confirm or correct at least some of the recognized locality or called entity names. Speech signal representations labelled with the confirmed or corrected names are used as labelled speech tokens to refine prior training of the speech recognition algorithm. The training refinement automatically adjusts for deficiencies in prior training of the speech recognition algorithm and to long term changes in the speech patterns of directory assistance callers served by a particular directory assistance installation.

Type: Grant

Filed: April 14, 1994

Date of Patent: January 30, 1996

Assignee: Northern Telecom Limited

Inventors: Gregory J. Bielby, Vishwa N. Gupta, Lauren C. Hodgson, Matthew Lennig, R. Douglas Sharp, Hans A. Wasmeier
Phoneme based speech recognition

Patent number: 5390278

Abstract: A flexible vocabulary speech recognition system is provided for recognizing speech transmitted via the public switched telephone network. The flexible vocabulary recognition (FVR) system is a phoneme based system. The phonemes are modelled as hidden Markov models. The vocabulary is represented as concatenated phoneme models. The phoneme models are trained using Viterbi training enhanced by: substituting the covariance matrix of given phonemes by others, applying energy level thresholds and voiced, unvoiced, silence labelling constraints during Viterbi training. Specific vocabulary members, such as digits, are represented by allophone models. A* searching of the lexical network is facilitated by providing a reduced network which provides estimate scores used to evaluate the recognition path through the lexical network. Joint recognition and rejection of out-of-vocabulary words are provided by using both cepstrum and LSP parameter vectors.

Type: Grant

Filed: October 8, 1991

Date of Patent: February 14, 1995

Assignee: Bell Canada

Inventors: Vishwa N. Gupta, Matthew Lennig, Patrick J. Kenny, Christopher K. Toulson
Speech recognition

Patent number: 4956865

Abstract: In a speech recognizer, for recognizing unknown utterances in isolated-word speech or continuous speech, improved recognition accuracy is obtained by augmenting the usual spectral representation of the unknown utterance with a dynamic component. A corresponding dynamic component is provided in the templates with which the spectral representation of the utterance is compared. In preferred embodiments, the representation is mel-based cepstral and the dynamic components comprise vector differences between pairs of primary cepstra. Preferably the time interval between each pair is about 50 milliseconds. It is also preferable to compute a dynamic perceptual loudness component along with the dynamic parameters.

Type: Grant

Filed: May 2, 1988

Date of Patent: September 11, 1990

Assignee: Northern Telecom Limited

Inventors: Matthew Lennig, Paul Mermelstein, Vishwa N. Gupta
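
The dynamic component described in the abstract above, a vector difference between a pair of cepstra about 50 milliseconds apart, can be sketched as below. The 10 ms frame step, the forward-looking choice of pair, the clamping at the end of the utterance, and the function name are assumptions made for illustration; the perceptual loudness component is not modelled.

```python
def add_dynamic_features(cepstra, frame_step_ms=10.0, interval_ms=50.0):
    """Append to each cepstral vector its difference from a later frame.

    cepstra: list of equal-length lists of floats, one per frame
    frame_step_ms: assumed spacing between consecutive frames
    interval_ms: time between the two frames of each pair (about 50 ms in the abstract)
    """
    lag = max(1, round(interval_ms / frame_step_ms))
    last = len(cepstra) - 1
    augmented = []
    for t, current in enumerate(cepstra):
        later = cepstra[min(t + lag, last)]  # clamp at the final frame (assumption)
        delta = [b - a for a, b in zip(current, later)]
        augmented.append(list(current) + delta)
    return augmented


# Seven toy frames of two coefficients each, rising linearly over time.
frames = [[float(t), 2.0 * t] for t in range(7)]
features = add_dynamic_features(frames)
print(features[0])
```

Each output vector is twice as long as its input: the static coefficients followed by the dynamic ones.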