Patents Examined by Talivaldis Ivars Šmits

Method of automatic recognition of a spelled speech utterance

Patent number: 6725197

Abstract: The invention relates to a method of automatic recognition of an at least partly spelled speech utterance, with a speech recognition unit (2) based on statistical models that include a linguistic speech model (6).

Type: Grant

Filed: October 12, 1999

Date of Patent: April 20, 2004

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Friedhelm Wuppermann, Volker Stahl

Secure remote voice activation system using a password

Patent number: 6581036

Abstract: A method of secure remote control by voice wherein the digitization and speech recognition functions are separated, which involves receiving an audible voice password in a remote controller, digitizing the voice password, and transmitting the digitized voice password and an ID from the controller to a base station. The method also includes confirming the ID and the password in the base station, receiving an audible voice command in the controller, and digitizing the command. The method still further includes transmitting the digitized command from the controller to the base station, confirming the command to indicate transmission of a desired control signal by the base station, and transmitting the control signal from the base station in response to the command.

Type: Grant

Filed: October 19, 2000

Date of Patent: June 17, 2003

Assignee: Var LLC

Inventor: Gordon H. Varney, Jr.

Audio financial data system

Patent number: 6574600

Abstract: A financial data system is disclosed that receives real-time data, uses a set of pre-determined rules to prioritize the data and provide a priority value, and then delivers the highest priority data by way of multiple audio channels. A key aspect of the invention is the use of data manipulation according to the priority value to adjust delivery volume, provide selective vocalization compression, add additional audio channels, or to override an existing comment when required. As a result of the invention, a significant amount of information may be aurally delivered to a user including properties of events as they change in response to changing financial conditions.

Type: Grant

Filed: January 14, 2000

Date of Patent: June 3, 2003

Assignee: MarketSound L.L.C.

Inventors: Bradley S. Fishman, Wade J. Vagle

Confidence measure system using a near-miss pattern

Patent number: 6571210

Abstract: A method and system of performing confidence measure in a speech recognition system includes receiving an utterance of input speech and creating a near-miss pattern or a near-miss list of possible word entries for the utterance. Each word entry includes an associated value of probability that the utterance corresponds to the word entry. The near-miss list of possible word entries is compared with corresponding stored near-miss confidence templates. Each word in the vocabulary (or keyword list) has a near-miss confidence template, which includes a list of word entries, and each word entry in each list includes an associated value. Confidence measure for a particular hypothesis word is performed based on the comparison of the values in the near-miss list of possible word entries with the values of the corresponding near-miss confidence template.
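
The comparison step in this abstract can be illustrated with a minimal sketch. The distance measure (mean absolute difference over the template's entries), the treatment of missing words, and every name below are assumptions made for illustration; the abstract does not specify how the values are compared.

```python
def near_miss_confidence(near_miss, template, missing=0.0):
    """Score how closely a near-miss list matches a stored template.

    near_miss and template map word entries to probability-like values
    in [0, 1]. The result lies in [0, 1]; higher means a closer match.
    The scoring rule is a hypothetical choice, not taken from the patent.
    """
    if not template:
        raise ValueError("template must not be empty")
    total = 0.0
    for word, expected in template.items():
        # Words absent from the near-miss list count as the `missing` value.
        observed = near_miss.get(word, missing)
        total += abs(observed - expected)
    return 1.0 - total / len(template)


# Hypothetical template for the hypothesis word "writer".
template = {"writer": 0.6, "rider": 0.3, "raider": 0.1}
good = near_miss_confidence({"writer": 0.6, "rider": 0.3, "raider": 0.1}, template)
poor = near_miss_confidence({"writer": 0.2, "lighter": 0.8}, template)
```

Under these assumptions, a hypothesis would be accepted when its score exceeds a tuned threshold; `good` scores higher than `poor` because its near-miss values line up with the template.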

Type: Grant

Filed: November 13, 1998

Date of Patent: May 27, 2003

Assignee: Microsoft Corporation

Inventors: Hsiao-Wuen Hon, Asela J. R. Gunawardana

Model adaptation of neural tree networks and other fused models for speaker verification

Patent number: 6519561

Abstract: The model adaptation system of the present invention is a speaker verification system that embodies the capability to adapt models learned during the enrollment component to track aging of a user's voice. The system has the advantage of only requiring a single enrollment for the user. The model adaptation system and methods can be applied to several types of speaker recognition models including neural tree networks (NTN), Gaussian Mixture Models (GMMs), and dynamic time warping (DTW) or to multiple models (i.e., combinations of NTNs, GMMs and DTW). Moreover, the present invention can be applied to text-dependent or text-independent systems.

Type: Grant

Filed: November 3, 1998

Date of Patent: February 11, 2003

Assignee: T-Netix, Inc.

Inventors: Kevin Farrell, William Mistretta

Network application software services containing a speech recognition capability

Patent number: 6434526

Abstract: Speech recognition software is provided in combination with application specific software on a communications network. Analog voice data is digitized at a user's location, identified as voice data, and transmitted to the application software residing at a central location. The network server receiving data identified as voice data transmits it to a speech server. Speech recognition software resident at the speech server contains a dictionary and modules tailored to the voice of each of the users of the speech recognition software. As the user speaks, a translation of the dictation is transmitted back to the user's location and appears in print on the user's computer screen for examination and if necessary, voice or typed correction of its contents. Multiple users have interleaved access to the speech recognition software so that transmission back to each of the users is contemporaneous.

Type: Grant

Filed: June 29, 1998

Date of Patent: August 13, 2002

Assignee: International Business Machines Corporation

Inventors: Frank Cilurzo, Roger Matthew Miller

Audio compression circuit and method

Patent number: 6405164

Abstract: An audio compressor utilizes a switched charging state rectifier which produces an output proportional to the magnitude of an input signal but with controlled attack/release times. The rectified voltage is input to a logical selector, which provides logical control signals which are a function of the rectified voltage. In a preferred embodiment, the control signals of the logical selector are used to select the switch positions of a switched resistor ladder. The switched resistor ladder is used to provide a resistance path in an op-amp feedback amplifier, thereby enabling the gain of the op-amp amplifier to be adjusted in steps by the selector as the rectified signal level varies.
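
The signal path described in this abstract is an analog circuit, but its behavior can be approximated numerically. The sketch below is a discrete-time model under stated assumptions: the attack and release coefficients, the thresholds, and the gain steps are hypothetical values, and each gain step stands in for one switch position of the resistor ladder.

```python
def envelope(samples, attack=0.5, release=0.05):
    """Rectify the signal and smooth it with separate attack/release rates."""
    env, out = 0.0, []
    for x in samples:
        level = abs(x)  # full-wave rectification
        coeff = attack if level > env else release
        env += coeff * (level - env)
        out.append(env)
    return out


def select_gain(level, thresholds=(0.25, 0.5, 0.75), gains=(1.0, 0.75, 0.5, 0.25)):
    """Map the rectified level to one of a fixed set of gain steps."""
    step = sum(1 for t in thresholds if level >= t)
    return gains[step]


def compress(samples):
    """Apply the stepped gain chosen from the envelope to each sample."""
    return [x * select_gain(e) for x, e in zip(samples, envelope(samples))]
```

Because the gain changes in discrete steps rather than continuously, louder passages are attenuated by whichever step the envelope currently selects.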

Type: Grant

Filed: December 30, 1999

Date of Patent: June 11, 2002

Assignee: Engineering Consortium, Inc.

Inventor: Hoang Minh Pinai

N-best search for continuous speech recognition using Viterbi pruning for non-output differentiation states

Patent number: 6374220

Abstract: A method for N-best search for continuous speech recognition with limited storage space includes the steps of Viterbi pruning word level (same word, different time alignment, thus non-output differentiation) states and keeping the N-best sub-optimal paths for sentence level (output differentiation) states.
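
The pruning rule in this abstract can be sketched on a flat list of hypotheses. This is not a decoder: it only assumes that each hypothesis is a word sequence with a log score, that hypotheses sharing a word sequence differ only in time alignment, and that the function and variable names are hypothetical.

```python
import heapq


def prune(hypotheses, n):
    """Keep one best score per word sequence, then the n best sequences.

    hypotheses is an iterable of (words, log_score) pairs. Entries with
    the same words but different scores model different time alignments.
    """
    best = {}
    for words, score in hypotheses:
        words = tuple(words)
        # Viterbi-style pruning: same output, keep only the best alignment.
        if words not in best or score > best[words]:
            best[words] = score
    # Distinct outputs compete for the n available slots.
    return heapq.nlargest(n, best.items(), key=lambda item: item[1])


hyps = [
    (("call", "home"), -10.0),
    (("call", "home"), -12.0),  # same words, worse alignment
    (("call", "rome"), -11.0),
    (("tall", "home"), -15.0),
]
top2 = prune(hyps, 2)
```

Collapsing alignments first keeps the N slots from being filled by duplicates of the same sentence, which is the storage saving the abstract points to.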

Type: Grant

Filed: July 15, 1999

Date of Patent: April 16, 2002

Assignee: Texas Instruments Incorporated

Inventor: Yu-Hung Kao

Concatenation of speech segments by use of a speech synthesizer

Patent number: 6366883

Abstract: In a speech synthesizer apparatus, a weighting coefficient training controller calculates acoustic distances in second acoustic feature parameters between one target phoneme from the same phoneme and the phoneme candidates other than the target phoneme based on first acoustic feature parameters and prosodic feature parameters, and determines weighting coefficient vectors for respective target phonemes defining degrees of contribution to the second acoustic feature parameters for respective phoneme candidates by executing a predetermined statistical analysis therefor.

Type: Grant

Filed: February 16, 1999

Date of Patent: April 2, 2002

Assignee: ATR Interpreting Telecommunications

Inventors: Nick Campbell, Andrew Hunt

Method and apparatus for suppressing acoustic background noise in a communication system by equalization of pre- and post-comb-filtered subband spectral energies

Patent number: 6366880

Abstract: A noise suppression system implemented in a communication system provides an improved level of quality during severe signal-to-noise ratio (SNR) conditions. The noise suppression system, inter alia, incorporates a frequency domain comb-filtering (289) technique which supplements a traditional spectral noise suppression method. The invention includes a real cepstrum generator (285) for an input signal (285) G(k) to produce a likely voiced speech pitch lag component and converting a result to frequency domain to obtain a comb-filter function (290) C(k), applying input signal (291) G(k) to comb-filter function (290) C(k), and equalizing the energies of the corresponding pre and post filtered subbands, to produce a signal (293) G″(k) to be used for noise suppression. This prevents high frequency components from being unnecessarily attenuated, thereby reducing muffling effects of prior art comb-filters.
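
The energy equalization step named in this abstract can be sketched in isolation. The code below assumes the pre-filter spectrum G(k) and the comb-filtered spectrum are already available as lists of bin values, and that subbands are given as bin index boundaries; the function name, the band layout, and the per-band square-root gain are illustrative assumptions rather than details from the patent.

```python
import math


def equalize_subbands(pre, post, band_edges):
    """Rescale each subband of `post` so its energy matches `pre`.

    pre and post are equal-length lists of spectral bin values (real or
    complex). band_edges lists bin indices; consecutive pairs delimit
    one subband each.
    """
    out = list(post)
    for lo, hi in zip(band_edges, band_edges[1:]):
        e_pre = sum(abs(v) ** 2 for v in pre[lo:hi])
        e_post = sum(abs(v) ** 2 for v in post[lo:hi])
        if e_post > 0.0:  # leave a silent band untouched
            gain = math.sqrt(e_pre / e_post)
            for k in range(lo, hi):
                out[k] = post[k] * gain
    return out


pre = [2.0, 2.0, 1.0, 1.0]
post = [2.0, 0.0, 0.5, 0.5]  # hypothetical comb-filtered bins
eq = equalize_subbands(pre, post, [0, 2, 4])
```

The comb filter's shape within each band is preserved, while the band's total energy is restored to its pre-filter level, which is how the sketch avoids attenuating whole bands.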

Type: Grant

Filed: November 30, 1999

Date of Patent: April 2, 2002

Assignee: Motorola, Inc.

Inventor: James Patrick Ashley