Patents by Inventor Sunil K. Gupta
Sunil K. Gupta has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 7664267Abstract: An encryption device and method and decryption device and method which implement a bit-based encryption scheme and hardware design. The encryption device includes a random number generator, receiving a main key, determining a working key using at least one random number and outputting a working key, a model, receiving the main key, the working key and plain text to be encoded and generating at least two frequency counts. The encryption device further includes an encoder, which outputs encoded text based on the working key, the plain text and the at least two frequency counts. The encryption device and method and decryption device and method process encrypted text that is based upon a stream structure with an unlimited key length and may be compressed by 50%. The encoded text is changeable with different environments even for the same plain text and the same key. Operations of the hardware design are based on arithmetic additions and shifts, and not multiplications and divisions.Type: GrantFiled: June 30, 2005Date of Patent: February 16, 2010Assignee: Agere Systems Inc.Inventors: Chenggang Duan, Fengguang Zhao, Sunil K. Gupta
-
Patent number: 7373294Abstract: The intonation of speech is modified by an appropriate combination of resampling and time-domain harmonic scaling. Resampling increases (upsampling) or decreases (downsampling) the number of data points in a signal. Harmonic scaling adds or removes pitch cycles to or from a signal. The pitch of a speech signal can be increased by combining downsampling with harmonic scaling that adds an appropriate number of pitch cycles. Alternatively, pitch can be decreased by combining upsampling with harmonic scaling that removes an appropriate number of pitch cycles. The present invention can be implemented in an automated speech-therapy tool that is able to modify the intonation of prerecorded reference speech signals for playback to a user to emphasize the correct pronunciation by increasing the pitch of selected portions of words or phrases that the user had previously mispronounced.Type: GrantFiled: May 15, 2003Date of Patent: May 13, 2008Assignee: Lucent Technologies Inc.Inventors: Juergen Cezanne, Sunil K. Gupta, Chetan Vinchhi
-
Patent number: 7302389Abstract: A computer-based system generates alternative phonetic transcriptions for a target word or phrase corresponding to specific phonological processes that replace individual phonemes or clusters of two or more phonemes with replacement phonemes. The system compares a user's speech with a list of possible transcriptions that includes the base (i.e., correct) transcription of the test target as well as the different alternative transcriptions, to identify the transcription that best matches the user's. In a speech therapy application, the system identifies the phonological process(es), if any, associated with the user's speech and generates statistics over multiple test targets that can be used to diagnose the user's specific phonological disorders. The system can also be implemented in other contexts such as foreign language instruction and automated attendant applications to cover a wide variety and range of accents and/or phonological disorders.Type: GrantFiled: August 8, 2003Date of Patent: November 27, 2007Assignee: Lucent Technologies Inc.Inventors: Sunil K. Gupta, Prabhu Raghavan, Chetan Vinchhi
-
Patent number: 7299188Abstract: A method and apparatus for generating a pronunciation score by receiving a user phrase intended to conform to a reference phrase and processing the user phrase in accordance with at least one of an articulation-scoring engine, a duration scoring engine and an intonation-scoring engine to derive thereby the pronunciation score. The scores provided by the various scoring engines are adapted to provide a visual and/or numerical feedback that provides information pertaining to correctness or incorrectness in one or more speech-features such as intonation, articulation, voicing, phoneme error and relative word duration. Such useful interactive feedback will allow a user to quickly identify the problem area and take remedial action in reciting “tutor” sentences or phrases.Type: GrantFiled: February 10, 2003Date of Patent: November 20, 2007Assignee: Lucent Technologies Inc.Inventors: Sunil K. Gupta, ZiYi Lu, Prabhu Raghavan, Zulfiquar Sayeed, Aravind Sethuraman, Chetan Vinchhi
-
Patent number: 7266127Abstract: The system includes a jitter buffer for receiving speech packets in a Voice over Internet Protocol (VoIP) system, a playback device for adjusting the playback speed of the received speed packets, and a jitter buffer manager for detecting out of sequence packets in the jitter buffer and for sending commands to the playback device to adjust playback speed based on the detection. The speech signal is played back at the nominal speed when there are no out of sequence packets. The playback speed is decreased when an out of sequence packet is detected, thereby tending to increase the jitter buffer length. When an out of sequence packet arrives, the playback speed is increased in order to restore jitter buffer length to its nominal length.Type: GrantFiled: February 8, 2002Date of Patent: September 4, 2007Assignee: Lucent Technologies Inc.Inventors: Sunil K. Gupta, Chetan Vinchhi
-
Patent number: 7219059Abstract: A method and apparatus for generating a pronunciation score by receiving a user phrase intended to conform to a reference phrase and processing the user phrase in accordance with at least one of an articulation-scoring engine, a duration scoring engine and an intonation-scoring engine to derive thereby the pronunciation score.Type: GrantFiled: July 3, 2002Date of Patent: May 15, 2007Assignee: Lucent Technologies Inc.Inventors: Sunil K. Gupta, Ziyi Lu, Fengguang Zhao
-
Publication number: 20040243412Abstract: A computer-based automatic speech recognition (ASR) system generates a sequence of text material used to train the ASR system. The system compares the sequence of text material to inputs corresponding to a user's speech utterances of that text material in order to update the speech models (e.g., phoneme templates) used during normal ASR processing. The ASR system is able to generate a user-dependent sequence of text material for adapting the speech models, where at least some of the text material is based on the evaluation of previous user utterances. In this way, the system can be trained more efficiently by concentrating on particular speech models that are more problematic than others for the particular user (or group of users).Type: ApplicationFiled: May 29, 2003Publication date: December 2, 2004Inventors: Sunil K. Gupta, Prabhu Raghavan
-
Publication number: 20040230430Abstract: A computer-based system generates alternative phonetic transcriptions for a target word or phrase corresponding to specific phonological processes that replace individual phonemes or clusters of two or more phonemes with replacement phonemes. The system compares a user's speech with a list of possible transcriptions that includes the base (i.e., correct) transcription of the test target as well as the different alternative transcriptions, to identify the transcription that best matches the user's. In a speech therapy application, the system identifies the phonological process(es), if any, associated with the user's speech and generates statistics over multiple test targets that can be used to diagnose the user's specific phonological disorders. The system can also be implemented in other contexts such as foreign language instruction and automated attendant applications to cover a wide variety and range of accents and/or phonological disorders.Type: ApplicationFiled: August 8, 2003Publication date: November 18, 2004Inventors: Sunil K. Gupta, Prabhu Raghavan, Chetan Vinchhi
-
Publication number: 20040230431Abstract: A computer-based system generates alternative pronunciations for a test word or phrase corresponding to specific phonological processes that replace individual phonemes or clusters of two or more phonemes with replacement phonemes. The system compares a user's pronunciation with a list of possible pronunciations that includes the base (i.e., correct) pronunciation of the test target as well as the different alternative pronunciations to identify the pronunciation that best matches the user's. The system identifies the phonological process(es), if any, associated with the user's pronunciation and generates statistics over multiple test targets that can be used to diagnose, in a speech therapy context, the user's specific phonological disorders.Type: ApplicationFiled: May 14, 2003Publication date: November 18, 2004Inventors: Sunil K. Gupta, Prabhu Raghavan, Chetan Vinchhi
-
Publication number: 20040230421Abstract: The intonation of speech is modified by an appropriate combination of resampling and time-domain harmonic scaling. Resampling increases (upsampling) or decreases (downsampling) the number of data points in a signal. Harmonic scaling adds or removes pitch cycles to or from a signal. The pitch of a speech signal can be increased by combining downsampling with harmonic scaling that adds an appropriate number of pitch cycles. Alternatively, pitch can be decreased by combining upsampling with harmonic scaling that removes an appropriate number of pitch cycles. The present invention can be implemented in an automated speech-therapy tool that is able to modify the intonation of prerecorded reference speech signals for playback to a user to emphasize the correct pronunciation by increasing the pitch of selected portions of words or phrases that the user had previously mispronounced.Type: ApplicationFiled: May 15, 2003Publication date: November 18, 2004Inventors: Juergen Cezanne, Sunil K. Gupta, Chetan Vinchhi
-
Publication number: 20040006468Abstract: A method and apparatus for generating a pronunciation score by receiving a user phrase intended to conform to a reference phrase and processing the user phrase in accordance with at least one of an articulation-scoring engine, a duration scoring engine and an intonation-scoring engine to derive thereby the pronunciation score.Type: ApplicationFiled: July 3, 2002Publication date: January 8, 2004Applicant: LUCENT TECHNOLOGIES INC.Inventors: Sunil K. Gupta, Ziyi Lu, Fengguang Zhao
-
Publication number: 20040006461Abstract: A method and apparatus for generating a pronunciation score by receiving a user phrase intended to conform to a reference phrase and processing the user phrase in accordance with at least one of an articulation-scoring engine, a duration scoring engine and an intonation-scoring engine to derive thereby the pronunciation score. The scores provided by the various scoring engines are adapted to provide a visual and/or numerical feedback that provides information pertaining to correctness or incorrectness in one or more speech-features such as intonation, articulation, voicing, phoneme error and relative word duration. Such useful interactive feedback will allow a user to quickly identify the problem area and take remedial action in reciting “tutor” sentences or phrases.Type: ApplicationFiled: February 10, 2003Publication date: January 8, 2004Inventors: Sunil K. Gupta, ZiYi Lu, Prabhu Raghavan, Zulfiquar Sayeed, Aravind Sethuraman, Chetan Vinchhi
-
Publication number: 20030152093Abstract: The system includes a jitter buffer for receiving speech packets in a Voice over Internet Protocol (VoIP) system, a playback device for adjusting the playback speed of the received speed packets, and a jitter buffer manager for detecting out of sequence packets in the jitter buffer and for sending commands to the playback device to adjust playback speed based on the detection. The speech signal is played back at the nominal speed when there are no out of sequence packets. The playback speed is decreased when an out of sequence packet is detected, thereby tending to increase the jitter buffer length. When an out of sequence packet arrives, the playback speed is increased in order to restore jitter buffer length to its nominal length.Type: ApplicationFiled: February 8, 2002Publication date: August 14, 2003Inventors: Sunil K. Gupta, Chetan Vinchhi
-
Publication number: 20030120493Abstract: The system includes a client device in communication with a server. The client device receives an input speech utterance in a voice dialog via an input device from a user of the system. The client device includes a speech recognition engine that compares the received input speech to stored recognition vocabulary representing a currently active vocabulary. The speech recognition engine recognizes the received utterance, and an application dynamically updates the recognition vocabulary. The dynamic update of the active vocabulary can also be initiated from the server, depending upon the client application being run at the client device. The server generates a result that is sent to the client device via a suitable communication path. The client application also provides the ability to customize voice-activated commands in the recognition vocabulary related to common client device functions, by using a speaker-training feature of the speech recognition engine.Type: ApplicationFiled: December 21, 2001Publication date: June 26, 2003Inventor: Sunil K. Gupta
-
Publication number: 20030023439Abstract: A method and system of recognizing speech based in part on an observation that a speaker naturally pauses and speaks smaller subgroups of speech units or digits that form part of a complete longer speech sequence. In the method, subgroups of speech units are processed by the system between a human's natural pauses. This pause is detected by the system and the subgroup is processed in order to provide a recognition result, which is a best representation of the input subgroup. The recognition result is immediately repeated back to the user for verification. The user is prompted to repeat a subgroup for re-recognition and re-verification if a rejection criteria is met; otherwise the processing steps are repeated for remaining subgroups until it has been determined that the complete speech sequence has been accurately recognized.Type: ApplicationFiled: May 2, 2001Publication date: January 30, 2003Inventors: Gregory Ciurpita, Sunil K. Gupta, Prabhu Ragavan
-
Publication number: 20020094080Abstract: An encryption device and method and decryption device and method which implement a bit-based encryption scheme and hardware design. The encryption device includes a random number generator, receiving a main key, determining a working key using at least one random number and outputting a working key, a model, receiving the main key, the working key and plain text to be encoded and generating at least two frequency counts. The encryption device further includes an encoder, which outputs encoded text based on the working key, the plain text and the at least two frequency counts. The encryption device and method and decryption device and method progress encrypted text that is based upon a stream structure with an unlimited key length and may be compressed by 50%. The encoded text is changeable with different environments even for the same plain text and the same key. Operations of the hardware design are based on arithmetic additions and shifts, and not multiplications and divisions.Type: ApplicationFiled: November 30, 2000Publication date: July 18, 2002Inventors: Chenggang Duan, Fengguang Zhao, Sunil K. Gupta
-
Patent number: 6280740Abstract: Vaccine formulations comprising recombinant early (E) and late (L) proteins of papillomavirus and oxidized mannan as well as methods of making and using the formulations are provided.Type: GrantFiled: January 14, 2000Date of Patent: August 28, 2001Assignee: Merck & Co., Inc.Inventors: Sunil K. Gupta, George E. Mark, III
-
Patent number: 6138095Abstract: Speech recognition in which the log probabilities of the null and alternative hypothesis are computed for an input speech sample by comparison with specific stored speech vocabularies/grammars and with general speech characteristics. The difference in probabilities is normalized by the magnitude of the null hypothesis to derive a likelihood factor which is compared with a rejection threshold that is utterance-length dependent. Advantageously, a high-order polynomial representation of the rejection threshold length dependency may be simplified by a series of piece-wise constants which are stored as rejection thresholds to be selected in accordance with the length of the input speech sample.Type: GrantFiled: September 3, 1998Date of Patent: October 24, 2000Assignee: Lucent Technologies Inc.Inventors: Sunil K. Gupta, Frank Kao-Ping Soong
-
Patent number: 6009390Abstract: In a speech recognition system, tied-mixture hidden Markov models (HMMs) are used to match, in the maximum likelihood sense, the phonemes of spoken words given the acoustic input thereof. In a well known manner, such speech recognition requires computation of state observation likelihoods (SOLs). Because of the use of HMMs, each SOL computation involves a substantial number of Gaussian kernels and mixture component weights. In accordance with the invention, the number of Gaussian kernels is cut down to reduce the computational complexity and increase the efficiency of memory access to the kernels. For example, only the non-zero mixture component weights and the Gaussian kernels associated therewith are considered in the SOL computation. In accordance with an aspect of the invention, only a subset of the Gaussian kernels of significant values, regardless of the values of the associated mixture component weights, are considered in the SOL computation.Type: GrantFiled: September 11, 1997Date of Patent: December 28, 1999Assignee: Lucent Technologies Inc.Inventors: Sunil K. Gupta, Raziel Haimi-Cohen, Frank K. Soong
-
Patent number: 5995926Abstract: In a speech recognition system for performing voice dialing, an inventive connected digit recognizer is employed to recognize a sequence of spoken digits. The inventive recognizer generates the maximum-likelihood digit sequence corresponding to the spoken sequence in accordance with the Viterbi algorithm. However, unlike a prior art connected digit recognizer, the inventive recognizer does not assume that a digit model in a sequence can be followed by any digit model with equal probability. Rather, the inventive recognizer takes into account, for each digit model being decided on, a conditional probability that that digit model would follow a given digit model preceding thereto.Type: GrantFiled: July 21, 1997Date of Patent: November 30, 1999Assignee: Lucent Technologies Inc.Inventors: Sunil K. Gupta, Frank K. Soong