Patents Examined by David D. Knepper

Method and system for speech frame error concealment in speech decoding

Patent number: 6968309

Abstract: A method and system for concealing errors in one or more bad frames in a speech sequence as part of an encoded bit stream received in a decoder. When the speech sequence is voiced, the LTP-parameters in the bad frames are replaced by the corresponding parameters in the last frame. When the speech sequence is unvoiced, the LTP-parameters in the bad frames are replaced by values calculated based on the LTP history along with an adaptively-limited random term.

Type: Grant

Filed: October 31, 2000

Date of Patent: November 22, 2005

Assignee: Nokia Mobile Phones Ltd.

Inventors: Jari Mäkinen, Hannu Mikkola, Janne Vainio, Jani Rotola-Pukkila
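
The concealment rule in the abstract of patent 6968309 can be illustrated with a minimal sketch. It is not the patented method. The function name, the history buffer, the use of a median as the history-based value, and the rule that ties the random range to the spread of the history are all assumptions made for illustration.

```python
import random
from statistics import median_low


def conceal_ltp_lag(lag_history, voiced, max_jitter=4, rng=None):
    """Return a substitute LTP lag for a bad frame.

    lag_history: lags of recent good frames, oldest first (hypothetical buffer).
    voiced:      True if the speech sequence is classified as voiced.
    max_jitter:  hypothetical upper bound on the random term.
    """
    if not lag_history:
        raise ValueError("need at least one good frame in the history")
    if voiced:
        # Voiced speech: reuse the parameter from the last frame.
        return lag_history[-1]
    rng = rng or random.Random()
    # Unvoiced speech: start from a value derived from the LTP history.
    # Assumption: the low median of the history is used as that value.
    base = median_low(lag_history)
    # Add a random term whose range is limited adaptively.
    # Assumption: the limit follows the spread of the history, capped at max_jitter.
    spread = max(lag_history) - min(lag_history)
    limit = min(max_jitter, spread)
    return base + rng.randint(-limit, limit)
```

With a history of `[40, 42, 45]`, a voiced bad frame gets lag 45, while an unvoiced one gets a value within 4 of the median lag 42.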
Process for the automatic generation of a textual expression from a semantic representation using a computer system

Patent number: 6965856

Abstract: A procedure for the automatic generation of a textual expression from a semantic representation by a computer-system is described. With the procedure, a statistical model is determined by the computer-system on a plurality of pre-determined pairs of semantic representations and associated expressions and stored. A semantic representation, from which an associated expression is determined by the computer system by means of the statistical model, is presented to the computer system. These steps are repeated by the computer system for further semantic representations if necessary.

Type: Grant

Filed: June 30, 1999

Date of Patent: November 15, 2005

Assignee: International Business Machines Corporation

Inventor: Thomas Stuermer
Speechdriven setting of a language of interaction

Patent number: 6963836

Abstract: A voice controlled electronic device includes a controller (12, 13, 14) for initiating individual functions of the electronic device. The controller also establishes a language attribute associated with a language for interaction with the user. The controller ensures that at least part of the interaction with the user takes place substantially in the associated language. The electronic device includes an input (1) for receiving voice commands. A speech recognizer (4) recognizes at least one voice command in the speech input. The voice command is associated with a predetermined first control function of a device, and a distinct second function of establishing the language attribute. The controller sets the language attribute according to the second function of the recognized command.

Type: Grant

Filed: December 17, 2001

Date of Patent: November 8, 2005

Assignee: Koninklijke Philips Electronics, N.V.

Inventor: Henricus Antonius Wilhelmus Van Gestel
Vector quantization with a non-structured codebook for audio compression

Patent number: 6952671

Abstract: According to one embodiment of the invention, a multistage vector list quantizer comprises a first stage quantizer to select candidate first stage codewords from a plurality of first stage codewords, a reference table memory storing a set of second stage codewords for each first stage codeword, and a second stage codebook constructor to generate a reduced complexity second stage codebook that is the union of sets corresponding to the candidate first stage codewords selected by the first stage quantizer.

Type: Grant

Filed: August 25, 2000

Date of Patent: October 4, 2005

Assignee: XVD Corporation

Inventors: Victor Kolesnik, Boris Kudryashov, Eugeny Ovsjannikov, Sergey Petrov, Boris Trojanovsky
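
The codebook construction in the abstract of patent 6952671 can be sketched as follows. This is a hedged illustration, not the patented quantizer: the function names, the squared-error distance, and the exhaustive search over candidate pairs are assumptions. Here `table[i]` plays the role of the reference table and holds the indices of the second stage codewords associated with first stage codeword `i`.

```python
def sqdist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def build_reduced_codebook(candidates, table):
    """Union of the second stage index sets of the candidate first stage codewords."""
    reduced = set()
    for i in candidates:
        reduced.update(table[i])
    return sorted(reduced)


def quantize(vec, stage1, table, stage2, n_candidates=2):
    """Return (first stage index, second stage index) for vec."""
    # First stage: keep the n_candidates closest first stage codewords.
    order = sorted(range(len(stage1)), key=lambda i: sqdist(vec, stage1[i]))
    candidates = order[:n_candidates]
    # Second stage: search only the reduced codebook, not the full one.
    reduced = build_reduced_codebook(candidates, table)

    def error(pair):
        i, j = pair
        approx = [s + t for s, t in zip(stage1[i], stage2[j])]
        return sqdist(vec, approx)

    return min(((i, j) for i in candidates for j in reduced), key=error)
```

The saving comes from the second stage search, which visits only the union of the listed sets instead of every second stage codeword.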
Scalable compression of audio and other signals

Patent number: 6947886

Abstract: Disclosed are scalable quantizers for audio and other signals characterized by a non-uniform, perception-based distortion metric, that operate in a common companded domain which includes both the base-layer and one or more enhancement-layers. The common companded domain is designed to permit use of the same unweighted MSE metric for optimal quantization parameter selection in multiple layers, exploiting the statistical dependence of the enhancement-layer signal on the quantization parameters used in the preceding layer. One embodiment features an asymptotically optimal entropy coded uniform scalar quantizer. Another embodiment is an improved bit rate scalable multi-layer Advanced Audio Coder (AAC) which extends the scalability of the asymptotically optimal entropy coded uniform scalar quantizer to systems with non-uniform base-layer quantization, selecting the enhancement-layer quantization methodology to be used in a particular band based on the preceding layer quantization coefficients.

Type: Grant

Filed: February 21, 2003

Date of Patent: September 20, 2005

Assignee: The Regents of the University of California

Inventors: Kenneth Rose, Ashish Aggarwal, Shankar L. Regunathan
Method and system for predicting problematic dialog situations in a task classification system

Patent number: 6941266

Abstract: The invention concerns a system and method of predicting problematic dialogs in a task classification system based on the user's input communications. The method may include determining whether a task classification decision can be made based on a first automated dialog exchange with the user. If the task classification decision cannot be made, the method may determine whether the probability of conducting a successful automated dialog with the user, based on the first dialog exchange, exceeds a first threshold. The successful dialog may be defined as a dialog exchange between an automated dialog system and the user that results in at least one of processing of the user's input communication and routing the user's input communication. The method may further operate such that if the first threshold is exceeded, further dialog is conducted with the user. Otherwise, the user may be directed to a human for assistance.

Type: Grant

Filed: November 15, 2000

Date of Patent: September 6, 2005

Assignee: AT&T Corp.

Inventors: Allen Louis Gorin, Irene Langkilde Geary, Diane Judith Litman, Marilyn Ann Walker, Jeremy H. Wright
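
The decision flow in the abstract of patent 6941266 reduces to a small branch, sketched below. The function name, the return labels, and the default threshold of 0.5 are hypothetical. How the success probability is estimated is not covered here and is simply passed in.

```python
def next_action(task_decided, success_probability, threshold=0.5):
    """Pick the next step after the first automated dialog exchange.

    task_decided:        True if a task classification decision could be made.
    success_probability: estimated chance that automation will succeed
                         (how it is estimated is outside this sketch).
    threshold:           hypothetical value for the "first threshold".
    """
    if task_decided:
        # A decision was possible, so the request can be processed or routed.
        return "route"
    if success_probability > threshold:
        # Automation still looks promising: conduct further dialog.
        return "continue_dialog"
    # Otherwise direct the user to a human for assistance.
    return "transfer_to_human"
```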
Device and method for coding speech to be recognized (STBR) at a near end

Patent number: 6934678

Abstract: In a mobile wireless communication system automatic speech recognition is performed in a distributed manner using a mobile station based near or front end stage which extracts and vector quantizes recognition feature parameters from frames of an utterance and an infrastructure based back or far end stage which reverses the vector quantization to recover the feature parameters and subjects the feature parameters to a Hidden Markov Model (HMM) evaluation to obtain a recognition decision for the utterance. In order to conserve network capacity, the size (Sz) of the codebook used for the vector quantization, and the corresponding number of bits (B) per codebook index, are adapted on a dialogue-by-dialogue basis in relation to the vocabulary size |V| for the dialogue. The adaptation, performed at the front end, accomplishes a tradeoff between expected recognition rate RR and expected bitrate BR by optimizing a metric which is a function of both.

Type: Grant

Filed: September 25, 2000

Date of Patent: August 23, 2005

Assignee: Koninklijke Philips Electronics N.V.

Inventor: Yin-Pin Yang
Information processing apparatus and method for generating derivative information from vocal-containing musical information

Patent number: 6931377

Abstract: An information processing apparatus for separating input musical number information into a vocal information part containing lyrics in a first language and an accompaniment information part, and for producing second musical number information made of the accompaniment part and a translated vocal information part superimposed thereon. A vocal separation unit separates the first vocal information part and the accompaniment information part from the input first musical information. A processing unit generates first language lyric information by speech recognition of the separated first vocal information part, translates the generated first language lyric information into second language lyric information, and supplies the second language lyric information. A synthesis unit synthesizes the supplied second language lyric information, the accompaniment information part, and the separated first vocal information part to generate second musical information.

Type: Grant

Filed: August 28, 1998

Date of Patent: August 16, 2005

Assignee: Sony Corporation

Inventor: Kenji Seya
Apparatus and method for adding information to a machine translation dictionary

Patent number: 6920419

Abstract: Given a source text, a desired translation of the source text into a target language, and a machine-readable dictionary, a first set of morphemes in the target language is generated from the source text, typically by using the dictionary to perform a machine translation of the source text. The second text is analyzed into a second set of morphemes in the target language. Differences between the first and second sets of morphemes are found, and morphemes corresponding to the differences are taken from the source text. Existing information including these source-text morphemes is extracted from the dictionary, and new information to be added to the dictionary is automatically generated from the extracted information and the differences. This process generates comparatively short dictionary entries, corresponding only to the differences between the two sets of morphemes, and therefore creates useful dictionary entries while saving dictionary space.

Type: Grant

Filed: March 26, 2002

Date of Patent: July 19, 2005

Assignee: Oki Electric Industry Co., Ltd.

Inventors: Mihoko Kitamura, Toshiki Murata
Method and apparatus for testing digital channels in a wireless communication system

Patent number: 6917916

Abstract: In a digital channel of a digital wireless communication system including at least one mobile station, at least one base transceiver station in communication with the mobile station, a transcoder configured to provide a signal conversion between vocoder frames and pulse code modulation, and a mobile switching center for interconnecting the digital wireless communication system to a public switched telephone network, a method and apparatus for determining a fault in the digital channel is disclosed. The method includes generating a first set of vocoder input parameters from a speech input signal, and generating a second set of vocoder input parameters from an output signal substantially equivalent to the speech input signal as it is received at a mobile station via the digital channel. The method further includes calculating a metric based on the first and the second set of vocoder input parameters, and subsequently determining a fault in the digital channel using the metric.

Type: Grant

Filed: December 13, 2001

Date of Patent: July 12, 2005

Assignee: Motorola, Inc.

Inventors: Chris B. Curtis, Joseph T. Marino, Jr., Bruce A. Fette
Speech translation device and computer readable medium

Patent number: 6917920

Abstract: A translation device which has both advantages of a table look-up translation device and advantages of a machine translation device by leading the user's utterance through a sentence template suitable for the user's intent of speech is realized. Since the translation device searches for sentence templates suitable for the user's intent of speech with an orally inputted keyword and displays retrieved sentences, the user's utterance can be led. In addition, the user is free from a troublesome manipulation for replacing a word since an expression uttered by the user is inserted into a replaceable portion (slot) within the sentence template, and the translation device translates a resulting sentence with the replaced expression embedded in the slot.

Type: Grant

Filed: January 6, 2000

Date of Patent: July 12, 2005

Assignee: Hitachi, Ltd.

Inventors: Atsuko Koizumi, Hiroyuki Kaji, Yasunari Obuchi, Yoshinori Kitahara
Multistage inverse quantization having the plurality of frequency bands

Patent number: 6904404

Abstract: With respect to audio signal coding and decoding apparatuses, there is provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even though it does not use all of the data from the coding apparatus, and a decoding apparatus corresponding to the coding apparatus. A quantization unit constituting a coding apparatus includes a first sub-quantization unit comprising sub-quantization units for low-band, intermediate-band, and high-band; a second sub-quantization unit for quantizing quantization errors from the first sub-quantization unit; and a third sub-quantization unit for quantizing quantization errors which have been processed by the first sub-quantization unit and the second sub-quantization unit.

Type: Grant

Filed: January 8, 1999

Date of Patent: June 7, 2005

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Takeshi Norimatsu, Shuji Miyasaka, Yoshihisa Nakatoh, Mineo Tsushima, Tomokazu Ishikawa
Using signal to noise ratio of a speech signal to adjust thresholds for extracting speech parameters for coding the speech signal

Patent number: 6898566

Abstract: There are provided speech coding methods and systems for estimating a plurality of speech parameters of a speech signal for coding the speech signal using one of a plurality of speech coding algorithms, wherein the plurality of speech parameters includes pitch information and is calculated using a plurality of thresholds. An example method includes estimating a background noise level in the speech signal to determine a signal to noise ratio (SNR) for the speech signal, adjusting one or more of the plurality of thresholds based on the SNR to generate one or more SNR adjusted thresholds, analyzing the speech signal to extract the pitch information using the one or more SNR adjusted thresholds, and repeating the estimating, the adjusting and the analyzing to code the speech signal using one of the plurality of speech coding algorithms.

Type: Grant

Filed: August 16, 2000

Date of Patent: May 24, 2005

Assignee: Mindspeed Technologies, Inc.

Inventors: Adil Benyassine, Huan-Yu Su
Method of and apparatus for configuring and controlling home entertainment systems through natural language and spoken commands using a natural language server

Patent number: 6895379

Abstract: When a user's request is entered it is then transmitted to a network interface unit which digitizes and stores the request. The digitized request and information about the user's network of devices is then transmitted from the network interface unit to a natural language server, preferably over the internet. The natural language server then processes the request and generates commands necessary to complete the request within the user's network of devices. These commands are then transmitted from the natural language server to the network interface unit. The network interface unit then transmits the commands to the appropriate devices within the network of devices. The devices within the network of devices then execute the received commands to complete the user's request.

Type: Grant

Filed: March 27, 2002

Date of Patent: May 17, 2005

Assignees: Sony Corporation, Sony Electronics Inc.

Inventors: Scott David Smyers, Glen David Stone, Bruce Alan Fairman
Hash function based transcription database

Patent number: 6892176

Abstract: A hash function based data retrieval system for use with a lexicon database of a data processing system is disclosed. The system comprises a RAM, a disk memory, and a data file residing in the disk memory, wherein the data file contains stored data and is organized into nests. The system further comprises a hashing data structure residing in RAM, wherein the data structure is designed to occupy a fixed amount of memory independent of content of the data file. A data retrieval module is operable to identify a nest using a hash function that is based on parameters selected according to characteristics of the data file. The hash function is further designed to be optimized for content of the data file. The hash function is further designed to produce hash values based on the fixed amount of memory.

Type: Grant

Filed: December 18, 2001

Date of Patent: May 10, 2005

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Kirill Stoimenov, Alexander Nikolov
Speech bandwidth extension

Patent number: 6889182

Abstract: A common narrow-band speech signal is expanded into a wide-band speech signal. The expanded speech signal gives the impression of a wide-band speech signal regardless of what type of vocoder is used. Extending the narrow-band speech signal into a lower range involves analyzing the narrow-band speech signal to generate one or more parameters, and synthesizing a lower frequency-band signal based on at least one of the one or more parameters. The synthesized lower frequency-band signal is then combined with a signal that is derived from (e.g., via up-sampling) the narrow-band speech signal. In preferred embodiments, a pitch frequency parameter is generated, and generation of the lower frequency-band signal includes generating continuous sine tones that are frequency shifted with the pitch frequency parameter.

Type: Grant

Filed: December 20, 2001

Date of Patent: May 3, 2005

Assignee: Telefonaktiebolaget L M Ericsson (publ)

Inventor: Harald Gustafsson
Method and apparatus for improved voice activity detection in a packet voice network

Patent number: 6889187

Abstract: A method and apparatus for detecting and transmitting voice signals in a packet voice network system. The method and apparatus make use of a voice activity detection (VAD) unit at a transmitter, for determining if an input signal contains active audio information or passive audio information, where the input signal includes a plurality of frames. For one or more frames of the input signal containing active audio information, the VAD computes a hangover time period. This computation includes determining whether the hangover time period has a fixed duration or a variable duration on the basis of characteristics of the active audio information contained in the one or more frames. When the VAD detects a frame containing passive audio information subsequent to the one or more frames containing active audio information, the input signal is suppressed after the expiry of the computed hangover time period from the detection of the passive audio information.

Type: Grant

Filed: December 26, 2001

Date of Patent: May 3, 2005

Assignee: Nortel Networks Limited

Inventor: Shude Zhang
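
The hangover behaviour in the abstract of patent 6889187 can be sketched as follows. The abstract does not say which characteristics of the active audio select a fixed or a variable hangover, so this sketch assumes the length of the active burst is used. All names and default values are hypothetical.

```python
def compute_hangover(burst_len, fixed=2, short_burst=3, max_hangover=6):
    """Hangover length in frames for an active burst of burst_len frames.

    Assumption: short bursts get a fixed hangover, longer bursts a
    variable one that grows with the burst and is capped at max_hangover.
    """
    if burst_len < short_burst:
        return fixed
    return min(max_hangover, fixed + burst_len // 2)


def transmit_mask(active_flags):
    """Return one bool per frame: True if the frame is transmitted."""
    mask = []
    burst_len = 0   # length of the current run of active frames
    remaining = 0   # hangover frames still to be transmitted
    for active in active_flags:
        if active:
            burst_len += 1
            # Recompute the hangover while the burst is still active.
            remaining = compute_hangover(burst_len)
            mask.append(True)
        else:
            burst_len = 0
            if remaining > 0:
                # Passive frame inside the hangover period: still sent.
                remaining -= 1
                mask.append(True)
            else:
                # Hangover expired: the input signal is suppressed.
                mask.append(False)
    return mask
```

For one active frame followed by four passive frames, the mask is `[True, True, True, False, False]`: the two passive frames inside the fixed hangover are still transmitted.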
Methods and apparatus for controlling an electronic device

Patent number: 6889188

Abstract: Methods and apparatus for controlling an electronic device connected to a network are provided. The methods and apparatus described herein convert a text based device list and/or a text based function list into text based voice prompt scripts. The voice prompt scripts are then read to a user via a text-to-speech engine. The user responds with a voice command for a device. The voice command is converted to text by a voice recognition engine. This text is then used to send a command to the electronic device via the network.

Type: Grant

Filed: November 22, 2002

Date of Patent: May 3, 2005

Assignee: Intel Corporation

Inventors: Benjamin T. Metzler, Wayne D. Trantow
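
The text handling in the abstract of patent 6889188 can be sketched in two steps: turning a device list into a prompt script, and turning recognized text into a command. The function names, the prompt wording, and the word-matching rule are assumptions. The text-to-speech engine, the voice recognition engine, and the network transport are outside this sketch.

```python
def device_prompt(devices):
    """Build a text prompt script from a text based device list."""
    if not devices:
        return "No devices are available."
    if len(devices) == 1:
        return f"The available device is {devices[0]}."
    listed = ", ".join(devices[:-1]) + f" or {devices[-1]}"
    return f"Which device would you like to control: {listed}?"


def parse_command(text, devices, functions):
    """Map recognized text to a device and function, or None if unclear.

    Assumption: device and function names are single words that appear
    verbatim in the recognized text.
    """
    words = text.lower().split()
    device = next((d for d in devices if d.lower() in words), None)
    function = next((f for f in functions if f.lower() in words), None)
    if device is None or function is None:
        return None
    return {"device": device, "function": function}
```

The dictionary returned by `parse_command` is what a caller would translate into a network command for the chosen device.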
System and method for matching a textual input to a lexical knowledge base and for utilizing results of that match

Patent number: 6871174

Abstract: The present invention can be used in a natural language processing system to determine a relationship (such as similarity in meaning) between two textual segments. The relationship can be identified or determined based on logical graphs generated from the textual segments. A relationship between first and second logical graphs is determined. This is accomplished regardless of whether there is an exact match between the first and second logical graphs. In one embodiment, the first graph represents an input textual discourse unit. The second graph, in one embodiment, represents information in a lexical knowledge base (LKB). The input graph can be matched against the second graph, if they have similar meaning, even if the two differ lexically or structurally.

Type: Grant

Filed: May 17, 2000

Date of Patent: March 22, 2005

Assignee: Microsoft Corporation

Inventors: William B. Dolan, Michael Barnett, Stephen D. Richardson, Arul A. Menezes, Lucretia H. Vanderwende
Method and system for testing algorithm compliancy

Patent number: 6856953

Abstract: The present invention relates to an algorithm for ensuring compliancy of an algorithm module when integrated in a real time software system. The compliancy tests may include a memory test, interrupt test, latency test and other tests, as well as combinations thereof. An inventive aspect of the present invention relates to a unit test harness for verifying that a software algorithm module meets performance and functional requirements when integrated in a complete real-time software system. A software algorithm module eliminates or reduces unwanted behavior by the caller or other software on a real-time software system due to incorrect operations, which may involve interrupts, memory usage, register usage and/or other factors.

Type: Grant

Filed: December 19, 2001

Date of Patent: February 15, 2005

Assignee: GlobespanVirata, Inc.

Inventors: Matthew Randmaa, Murali Anantha, David Lindsay, Keith Dillon
