Patents Examined by Vijay Chawan

System and method for disambiguating phonetic input

Patent number: 7395203

Abstract: A system and method for inputting Chinese characters using Pinyin without requiring the entry of a delimiter key between Pinyin entries in a reduced keyboard is disclosed. The system searches for all possible single or multiple Pinyin spellings based on the entered Latin alphabets. Once the user has completed the inputting of the Pinyin spellings for desired Chinese phrase or characters, all possible matching phrases or characters are displayed on screen and off-screen due to screen size. The user then scrolls through a list of matching phrases or characters and selects the desired one by clicking.

Type: Grant

Filed: July 30, 2003

Date of Patent: July 1, 2008

Assignee: Tegic Communications, Inc.

Inventors: Jianchao Wu, Jenny Huang-Yu Lai, Lian He, Pim van Meurs, Keng Chong Wong, Lu Zhang
System and method for mobile automatic speech recognition

Patent number: 7386443

Abstract: A system and method of updating automatic speech recognition parameters on a mobile device are disclosed. The method comprises storing user account-specific adaptation data associated with ASR on a computing device associated with a wireless network, generating new ASR adaptation parameters based on transmitted information from the mobile device when a communication channel between the computing device and the mobile device becomes available and transmitting the new ASR adaptation data to the mobile device when a communication channel between the computing device and the mobile device becomes available. The new ASR adaptation data on the mobile device more accurately recognizes user utterances.

Type: Grant

Filed: January 9, 2004

Date of Patent: June 10, 2008

Assignee: AT&T Corp.

Inventors: Sarangarajan Parthasarathy, Richard Cameron Rose
Biometric voice authentication

Patent number: 7386448

Abstract: A system and method enrolls a speaker with an enrollment utterance and authenticates a user with a biometric analysis of an authentication utterance, without the need for a PIN (Personal Identification Number). During authentication, the system uses the same authentication utterance to identify who a speaker claims to be with speaker recognition, and verify whether is the speaker is actually the claimed person. Thus, it is not necessary for the speaker to identify biometric data using a PIN. The biometric analysis includes a neural tree network to determine unique aspects of the authentication utterances for comparison to the enrollment authentication. The biometric analysis leverages a statistical analysis using Hidden Markov Models to before authorizing the speaker.

Type: Grant

Filed: June 24, 2004

Date of Patent: June 10, 2008

Assignee: T-Netix, Inc.

Inventors: John C. Poss, Dag Boye, Mark W. Mobley
Simple noise suppression model

Patent number: 7379866

Abstract: An approach for efficiently reducing background noise from speech signal in real-time applications is presented. A noisy input speech signal is processed through an inverse filter when the spectrum tilt of the input signal is not that of a pure background noise model the noisy input signal is also filtered in order to reduce the spectrum valley areas of the noisy input signal when the background noise is present.

Type: Grant

Filed: March 11, 2004

Date of Patent: May 27, 2008

Assignee: Mindspeed Technologies, Inc.

Inventor: Yang Gao
System and methods for concealing errors in data transmission

Patent number: 7379865

Abstract: A frame erasure concealment device and method that is based on reestimating gain parameters for a code excited linear prediction (CELP) coder is disclosed. During operation, when a frame in a stream of received data is detected as being erased, the coding parameters, especially an adaptive codebook gain gp and a fixed codebook gain gc, of the erased and subsequent frames can be reestimated by a gain matching procedure. By using this technique with the IS-641 speech coder, it has been found that the present invention improves frame erasure concealment device and method improve the speech quality under various channel conditions, compared with a conventional extrapolation-based concealment algorithm.

Type: Grant

Filed: October 26, 2001

Date of Patent: May 27, 2008

Assignee: AT&T Corp.

Inventors: Hong-Goo Kang, Hong Kook Kim
Speech recognition device to mark parts of a recognized text

Patent number: 7376560

Abstract: In a transcription device (1) for transcribing a spoken text (GT) into a recognized text (ET) and for editing incorrectly recognized parts of the recognized text (ET), marking means (12, 15, 17) are provided that are arranged for the partly automatic and partly manual marking of parts of the spoken text (GT) and/or of the recognized text (ET) that have a common characteristic. As a result, subsequent unified processing of marked parts of the text that have common characteristics becomes possible.

Type: Grant

Filed: October 9, 2002

Date of Patent: May 20, 2008

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Heinrich Bartosik, Kresimir Rajic
Automated speech recognition filter

Patent number: 7373297

Abstract: An automated speech recognition filter is disclosed. The automated speech recognition filter device provides a speech signal to an automated speech platform that approximates an original speech signal as spoken into a transceiver by a user. In providing the speech signal, the automated speech recognition filter determines various models representative of a cumulative signal degradation of the original speech signal from various devices along a transmission signal path and a reception signal path between the transceiver and a device housing the filter. The automated speech platform can thereby provide an audio signal corresponding to a context of the original speech signal.

Type: Grant

Filed: February 6, 2004

Date of Patent: May 13, 2008

Assignee: General Motors Corporation

Inventors: Stephen C. Habermas, Ognjen Todic, Kai-Ten Feng, Jane F. MacFarlane
Compressing language models with Golomb coding

Patent number: 7363225

Abstract: A list of integer values is generated from n-grams of a user input. The list of integer values is sorted. Differences between adjacent integer values in the list are calculated. Each calculated difference is encoded using a Golomb code. A Golomb compressed language model is accessed to identify likely matches.

Type: Grant

Filed: June 23, 2005

Date of Patent: April 22, 2008

Assignee: Microsoft Corporation

Inventors: Kenneth Church, Bo Thiesson, Edward Hart, Jr.
Method for entering text

Patent number: 7363224

Abstract: In a method of entering text into a device a first character input is provided that is indicative of a first character of a text entry. Next, a vocalization of the text entry is captured. A probable word candidate is then identified for a first word of the vocalization based upon the first character input and an analysis of the vocalization. Finally, the probable word candidate is displayed for a user.

Type: Grant

Filed: December 30, 2003

Date of Patent: April 22, 2008

Assignee: Microsoft Corporation

Inventors: Xuendong D. Huang, Alejandro Acero, Kuansan Wang, Milind Mahajan
Text-to-speech and image generation of multimedia attachments to e-mail

Patent number: 7356470

Abstract: A multi-mail system and method is disclosed in which a sender may convey and a recipient can realize emotional aspects associated with substantive content of a multi-mail message by receiving a message that is more than textual in nature. Voice recognition technology and programmatic relation of sound and graphics may be used to produce a talking image. In one embodiment, the image may include the user's own visual and/or audio likeness. In an alternate embodiment, the image may comprise any available visual and/or audio display selected by the user. The multi-mail message may be inputted by a user in a text format and transposed into a format including the selected image and/or voice. In an alternate embodiment, a spoken message may be converted into a format including the selected image and/or voice. The formatted messages are then stored and/or transmitted via an email system or some other electronic network.

Type: Grant

Filed: October 18, 2005

Date of Patent: April 8, 2008

Inventors: Adam Roth, Geoffrey O'Sullivan, Barclay A. Dunn
Method and apparatus to eliminate discontinuities in adaptively filtered signals

Patent number: 7353168

Abstract: A method to eliminate discontinuities in an adaptively filtered signal includes filtering a beginning portion of a current signal frame using a past set of filter coefficients, thereby producing a first filtered frame portion. The method also includes filtering the beginning portion of the current signal frame using a current set of filter coefficients, thereby producing a second filtered frame portion. The method also includes modifying the second filtered frame portion with the first filtered frame portion so as to smooth a possible filtered signal discontinuity between the second filtered frame portion and a past filtered frame produced using the past filter coefficients.

Type: Grant

Filed: June 28, 2002

Date of Patent: April 1, 2008

Assignee: Broadcom Corporation

Inventors: Jes Thyssen, Chris C Lee, Juin-Hwey Chen
Perceptual harmonic cepstral coefficients as the front-end for speech recognition

Patent number: 7337107

Abstract: Pitch estimation and classification into voiced, unvoiced and transitional speech were performed by a spectro-temporal auto-correlation technique. A peak picking formula was then employed. A weighting function was then applied to the power spectrum. The harmonics weighted power spectrum underwent mel-scaled band-pass filtering, and the log-energy of the filter's output was discrete cosine transformed to produce cepstral coefficients. A within-filter cubic-root amplitude compression was applied to reduce amplitude variation without compromise of the gain invariance properties.

Type: Grant

Filed: October 2, 2001

Date of Patent: February 26, 2008

Assignee: The Regents of the University of California

Inventors: Kenneth Rose, Liang Gu
Method and apparatus for transmitting an audio stream having additional payload in a hidden sub-channel

Patent number: 7330812

Abstract: Methods and apparatus are provided for communicating an audio stream. A perceptual mask is estimated for an audio stream, based on the perceptual threshold of the human auditory system. A hidden sub-channel is dynamically allocated substantially below the estimated perceptual mask based on the characteristics of the audio stream, in which additional payload is transmitted. The additional payload can be related to components of the audio stream that would not otherwise be transmitted in a narrowband signal, or to concurrent services that can be accessed while the audio stream is being transmitted. A suitable receiver can recover the additional payload, whereas the audio stream will be virtually unaffected from a human auditory standpoint when received by a traditional receiver. A coding scheme is also provided in which a portion of a codec is used to code an upper-band portion of an audio stream, while the narrowband portion is left uncoded.

Type: Grant

Filed: September 10, 2003

Date of Patent: February 12, 2008

Assignee: National Research Council of Canada

Inventor: Heping Ding
Transferring compressed audio via a playback buffer

Patent number: 7328148

Abstract: A method for transferring real time information on a record carrier, typically bitstream audio on an optical disc, which method comprises encoding consecutive segments of the real time information to compressed real time data in frames, and determining a buffer occupancy for at least one frame, which buffer occupancy is indicative of an amount of compressed real time data to be present in the playback buffer at the start of decoding said frame. A signal is transmitted carrying the compressed real time data and the buffer occupancy, which data are received, stored in a playback buffer and finally decoded. The retrieving and/or the decoding is controlled in dependence on said transferred buffer occupancy. A playback buffer can be used effectively without risk for underflow or overflow. Also a method for recording audio information on a record carrier, a recording device, a record carrier and a playback device are described.

Type: Grant

Filed: August 24, 2005

Date of Patent: February 5, 2008

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Johannes M. M. Verbakel, Josephus J. M. M. Geelen
Systems and methods for dynamically analyzing temporality in speech

Patent number: 7324944

Abstract: Systems and methods for dynamically analyzing temporality in an individual's speech in order to selectively categorize the speech fluency of the individual and/or to selectively provide speech training based on the results of the dynamic analysis. Temporal variables in one or more speech samples are dynamically quantified. The temporal variables in combination with a dynamic process, which is derived from analyses of temporality in the speech of native speakers and language learners, are used to provide a fluency score that identifies a proficiency of the individual. In some implementations, temporal variables are measured instantaneously.

Type: Grant

Filed: December 11, 2003

Date of Patent: January 29, 2008

Assignee: Brigham Young University, Technology Transfer Office

Inventors: Lynne Hansen, Joshua Rowe
Method and apparatus for determining an estimate

Patent number: 7318028

Abstract: For determining an estimate of a need for information units for encoding a signal, a measure for the distribution of the energy in the frequency band is taken into account in addition to the admissible interference for a frequency band and an energy of the frequency band. With this, a better estimate of the need for information units is obtained, so that coding can be done more efficiently and more accurately.

Type: Grant

Filed: August 31, 2006

Date of Patent: January 8, 2008

Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.

Inventors: Michael Schug, Johannes Hilpert, Stefan Geyersberger, Max Neuendorf
Error correction in speech recognition

Patent number: 7315818

Abstract: New techniques and systems may be implemented to improve error correction in speech recognition. These new techniques and systems may be implemented to correct errors in speech recognition systems may be used in a standard desktop environment, in a mobile environment, or in any other type of environment that can receive and/or present recognized speech.

Type: Grant

Filed: May 11, 2005

Date of Patent: January 1, 2008

Assignee: Nuance Communications, Inc.

Inventors: Daniell Stevens, Robert Roth, Joel M. Gould, Michael J. Newman, Dean Sturtevant, Charles E. Ingold, David Abrahams, Allan Gold
Distributed network based message processing system for text-to-speech streaming data

Patent number: 7313528

Abstract: Text-to-speech streaming data is output to an end user using a distributed network based message processing system. The distributed network system includes a user access server that controls access of registered users to the data retrieval system. An internetwork data retrieval system server retrieves raw data from an internetwork. A text-to-speech server converts the raw data to an audible speech data. A memory storage output device stores a streaming media file containing the audible speech data and a streaming media server transmits the audible speech data to the registered users via the internetwork.

Type: Grant

Filed: July 31, 2003

Date of Patent: December 25, 2007

Assignee: Sprint Communications Company L.P.

Inventor: Eric Miller
Wirelessly loaded speaking consumer product container method and system

Patent number: 7305344

Abstract: A method of communicating product use instructions to patient includes: (a.) providing a group of product container with microprocessors, and, (b.) providing a central processor separate from the product containers. Each product container has the microprocessor attached to the container. The microprocessor includes: (a)(i) a wave file receiving chip; (a)(ii) a wave file storage means; (a)(iii) a wave file audio playback means; (a)(iv) an audio playback start means; and (a)(v) a power supply within the microprocessor. The central processor includes: (b)(i) user input means; (b)(ii) text-to-speech means; (b)(iii) wave file means to create a wave file from the text-to-speech means; and (b)(iv) wireless transmission means to transmit the wave file to the microprocessor wave file receiving chips of a plurality of OTC containers simultaneously.

Type: Grant

Filed: July 16, 2004

Date of Patent: December 4, 2007

Assignee: iVoice, Inc.

Inventors: Kenneth P. Glynn, Jerome R. Mahoney
Automatic assessment of phonological processes

Patent number: 7302389

Abstract: A computer-based system generates alternative phonetic transcriptions for a target word or phrase corresponding to specific phonological processes that replace individual phonemes or clusters of two or more phonemes with replacement phonemes. The system compares a user's speech with a list of possible transcriptions that includes the base (i.e., correct) transcription of the test target as well as the different alternative transcriptions, to identify the transcription that best matches the user's. In a speech therapy application, the system identifies the phonological process(es), if any, associated with the user's speech and generates statistics over multiple test targets that can be used to diagnose the user's specific phonological disorders. The system can also be implemented in other contexts such as foreign language instruction and automated attendant applications to cover a wide variety and range of accents and/or phonological disorders.

Type: Grant

Filed: August 8, 2003

Date of Patent: November 27, 2007

Assignee: Lucent Technologies Inc.

Inventors: Sunil K. Gupta, Prabhu Raghavan, Chetan Vinchhi

prev 1 2 3 4 5 6 … next