Patents Examined by Kinari Patel

Decoding method, speech coding processing unit and a network element

Patent number: 6850883

Abstract: This invention is related to tandem free operation (TFO) in mobile cellular systems. The present invention implements a tandem free operation by using a special feedback loop which makes the decoded parameters available, performs the comfort noise insertion and bad frame handling operations, produces the parameter quantisation indices corresponding to the output of these operations, and synchronises the speech encoders and the speech decoders in the transmission path from the uplink mobile station to the downlink mobile station. This functionality is realized by partly decoding and re-encoding the parameters and synchronising and resetting the quantiser prediction memories in a specific manner. A basic idea of the invention is, that during BFH and CNI processes, a re-encoding block produces models of encoded speech parameters from the BFH/CNI processed speech parameters. These models of encoded speech parameters are then transmitted to the receiving end.

Type: Grant

Filed: February 9, 1998

Date of Patent: February 1, 2005

Assignee: Nokia Networks Oy

Inventors: Pekka Kapanen, Janne Vainio
Learning of dialogue states and language model of spoken information system

Patent number: 6839671

Abstract: In this invention dialogue states for a dialogue model are created using a training corpus of example human—human dialogues. Dialogue states are modelled at the turn level rather than at the move level, and the dialogue states are derived from the training corpus. The range of operator dialogue utterances is actually quite small in many services and therefore may be categorized into a set of predetermined meanings. This is an important assumption which is not true of general conversation, but is often true of conversations between telephone operators and people. Phrases are specified which have specific substitution and deletion penalties, for example the two phrases “I would like to” and “can I” may be specified as a possible substitution with low or zero penalty. Thus allows common equivalent phrases are given low substitution penalties. Insignificant phrases such as ‘erm’ are given low or zero deletion penalties.

Type: Grant

Filed: December 19, 2000

Date of Patent: January 4, 2005

Assignee: British Telecommunications public limited company

Inventors: David J. Attwater, Michael D. Edgington, Peter J. Durston
Virtual pet device and control program recording medium therefor

Patent number: 6772121

Abstract: A special relation is formed between an owner and a virtual pet and a sense for feeling a pet more similar to a true pet is produced. Voice inputted from a microphone 22 is recognized by a recognizing part 20. Then, a speaker data base 12 is referred to to identify a speaking individual. In the data base 12, prescribed words and phrases are registered and events corresponding to the words and phrases are stored in an event list 31. If the words and phrases spoken by the individual to a pet satisfy an event generating condition, events (crying, swinging a tail, etc.) corresponding to the recognized words and phrases will be performed.

Type: Grant

Filed: October 23, 2000

Date of Patent: August 3, 2004

Assignee: Namco, Ltd.

Inventor: Yasumichi Kaneko
Automated speech recognition filter

Patent number: 6772118

Abstract: An automated speech recognition filter is disclosed. The automated speech recognition filter device provides a speech signal to an automated speech platform that approximates an original speech signal as spoken into a transceiver by a user. In providing the speech signal, the automated speech recognition filter determines various models representative of a cumulative signal degradation of the original speech signal from various devices along a transmission signal path and a reception signal path between the transceiver and a device housing the filter. The automated speech platform can thereby provide an audio signal corresponding to a context of the original speech signal.

Type: Grant

Filed: January 4, 2002

Date of Patent: August 3, 2004

Assignee: General Motors Corporation

Inventors: Stephen C. Habermas, Ognjen Todic, Kai-Ten Feng, Jane F. MacFarlane
Method and apparatus for transient detection and non-distortion time scaling

Patent number: 6766300

Abstract: A method and apparatus for transient detection and time-scaling an audio signal detects transients and scales only intervals located between transients to avoid artifacts. In one embodiment, the transient detection process compares frequency characteristic energy between succeeding windows of the audio signal and calculates values of an energy curve where the energy increases. Transients are detected at maxima of the energy curve.

Type: Grant

Filed: August 20, 1999

Date of Patent: July 20, 2004

Assignee: Creative Technology Ltd.

Inventor: Jean Laroche
Forward error correction in speech coding

Patent number: 6757654

Abstract: An improved forward error correction (FEC) technique for coding speech data provides an encoder module which primary-encodes an input speech signal using a primary synthesis model to produce primary-encoded data, and redundant-encodes the input speech signal using a redundant synthesis model to produce redundant-encoded data. A packetizer combines the primary-encoded data and the redundant-encoded data into a series of packets and transmits the packets over a packet-based network, such as an Internet Protocol (IP) network. A decoding module primary-decodes the packets using the primary synthesis model, and redundant-decodes the packets using the redundant synthesis model. The technique provides interaction between the primary synthesis model and the redundant synthesis model during and after decoding to improve the quality of a synthesized output speech signal. Such “interaction,” for instance, may take the form of updating states in one model using the other model.

Type: Grant

Filed: May 11, 2000

Date of Patent: June 29, 2004

Assignee: Telefonaktiebolaget LM Ericsson

Inventors: Magnus Westerlund, Anders Nohlgren, Jonas Svedberg, Anders Uvliden, Jim Sundqvist
System and method for automatic voice recognition using mapping

Patent number: 6754629

Abstract: A method and system that combines voice recognition engines and resolves differences between the results of individual voice recognition engines using a mapping function. Speaker independent voice recognition engines and speaker-dependent voice recognition engines are combined. Hidden Markov Model (HMM) engines and Dynamic Time Warping (DTW) engines are combined.

Type: Grant

Filed: September 8, 2000

Date of Patent: June 22, 2004

Assignee: Qualcomm Incorporated

Inventors: Yingyong Qi, Ning Bi, Harinath Garudadri
Speaker recognition using cohort-specific feature transforms

Patent number: 6754628

Abstract: Methods and apparatus for facilitating speaker recognition, wherein, from target data that is provided relating to a target speaker and background data that is provided relating to at least one background speaker, a set of cohort data is selected from the background data that has at least one proximate characteristic with respect to the target data. The target data and the cohort data are then combined in a manner to produce at least one new cohort model for use in subsequent speaker verification. Similar methods and apparatus are contemplated for non-voice-based applications, such as verification through fingerprints.

Type: Grant

Filed: June 13, 2000

Date of Patent: June 22, 2004

Assignee: International Business Machines Corporation

Inventors: Upendra V. Chaudhari, Stephane H. Maes, Jiri Navratil
Fast implementation of MPEG audio coding

Patent number: 6754618

Abstract: A communication system is disclosed in one embodiment of the present invention to include an encoder circuit responsive to an audio signal for performing compression on the audio signal and adaptive to generate an audio output signal based upon the compressed audio signal, the encoder circuit for sampling the audio signal to generated sampled signals, each sampled signals having a real and an imaginary component associated therewith, each sampled signal having an energy and a phase defined within a current block and each sampled signal being transformed to have a real and an imaginary component, a previous block preceding the current block and a block preceding the previous block, the encoder circuit for calculating the phase of the samples of the current block using the real and the imaginary components of the samples of the previous block and the block preceding the previous block, wherein calculations for determining the unpredictability measure is reduced by avoiding trigonometric calculations of the samp

Type: Grant

Filed: June 7, 2000

Date of Patent: June 22, 2004

Assignee: Cirrus Logic, Inc.

Inventors: Konstantinos Konstantinides, Shaomei Chen, Linjun Zhou
TI window compression/expansion method

Patent number: 6748363

Abstract: According to the present invention, there is developed a proprietary technology for compressing the window tables of audio coders to ⅛ their original size (or less) without any loss of quality. This technology can be applied to all transform based audio coders, or any audio coder that uses a windowing stage. The novel technique for reducing storage requirements for the window tables of audio coders is based on multiple differentiation. Since the difference between any two adjacent samples in the first difference signal is small, so it is more efficient to store this difference. This technique can be carried out several more times, until the returns get smaller, and the computational requirements to “undo” the compression go up. The optimum number of times to differentiate is dependent on the particular application and the window shape.

Type: Grant

Filed: June 28, 2000

Date of Patent: June 8, 2004

Assignee: Texas Instruments Incorporated

Inventors: Charles D. Lueck, Alec C. Robinson, Jonathan L. Rowlands, Jeffrey S. Hayes
Method, apparatus, and system for efficient rate control in audio encoding

Patent number: 6732071

Abstract: According to one aspect of the invention, a method is provided in which audio samples representing an input audio signal are received. The input audio samples are transformed into a vector of spectral values in a frequency domain. A value of a quantizing parameter is determined that satisfies one or more criteria based, at least in part, on a modified Newtonian search process, the determined value of the quantizing parameter being used to quantize the respective vector of spectral values to generate a vector of quantized values.

Type: Grant

Filed: September 27, 2001

Date of Patent: May 4, 2004

Assignee: Intel Corporation

Inventors: Alex A. Lopez-Estrada, Mark P. VanDeusen
Sound synthesizing apparatus and method, telephone apparatus, and program service medium

Patent number: 6732075

Abstract: In a sound synthesizer, a noise adder generates a noise signal having a frequency band of 3,400 to 4,600 Hz, adjusts the gain of the noise signal, and adds the gain-adjusted noise signal to an excitation source after being filled with zeros by a zero-filling circuit, thereby providing a wide-band excitation source which is rather flat. The signal gain is adjusted by determining a narrow-band excitation source or a power of the wide-band excitation source after being filled with zeros and fitting the gain to the narrow-band excitation source or the power.

Type: Grant

Filed: April 20, 2000

Date of Patent: May 4, 2004

Assignee: Sony Corporation

Inventors: Shiro Omori, Masayuki Nishiguchi
Automatic speech recognition caller input rate control

Patent number: 6728671

Abstract: An apparatus, method and system provide for caller input rate control for automatic speech recognition (ASR) components within interactive communication systems. The various embodiments provide a concentrator functionality to increase the capacity of ASR systems, and provide a delay functionality to continue to provide service to callers during congestion or overload conditions. The delay functionality provides various associated delay modes, including the insertion of silent periods within messages or prompts played to a caller, and providing increased message duration. The preferred method embodiment determines a usage level of a plurality of ASR input channels, and when the usage level is greater than a first predetermined threshold, provides an associated delay mode for a message output on an output channel of the plurality of output channels.

Type: Grant

Filed: March 29, 2000

Date of Patent: April 27, 2004

Assignee: Lucent Technologies Inc.

Inventor: Mark Alan Johnson
Digital data player, and data processing method and data storage medium for the same

Patent number: 6721709

Abstract: In the digital player, a decoder, having at least first and second decoding functions, identifies a type of digital data output from a data storage medium. Based on the identification, the decoder selects one of the first and second decoding functions, and decodes the output digital data using the selected decoding function. A converter converts the decoded digital data to analog. To store digital data on the data storage medium, an encoder encodes a first type of digital data at a first compression rate, and stores the encoded first type of digital data on the data storage medium, which stores digital data of a second type encoded at a second compression rate. The first compression rate is higher than the second compression rate. A selector, having at least a first and second input for the first type of digital data, selectively outputs the first type of digital data received at one of the first and second inputs to the encoder.

Type: Grant

Filed: April 24, 2000

Date of Patent: April 13, 2004

Assignee: LG Electronics Inc.

Inventors: Jae-Ha Yoo, Myeong-Joon Kang
Method and apparatus for audible fast-forward or reverse of compressed audio content

Patent number: 6721710

Abstract: A method for performing audible fast-forward or reverse of audio content represented in a compressed format, such as, but not limited to, MPEG-1 Layer 3 (MP3) or MPEG-2 Advance Audio Coding (AAC) employs a fast-forward controller which performs fast-forward or reverse by repeatedly skipping forward or reverse in the compressed audio data stream, retrieving a block of data, and then splicing these data blocks back together. A decoder is then used to decode each of these blocks, to detect when a block switch has occurred (a splice in the data stream), and to quickly resynchronize at each transition. Hierarchical or multiplexed data streams may be decoded using a cascade of decoders each employing this technique. The decoder uses a robust sync search for performing resynchronization and error recovery.

Type: Grant

Filed: October 17, 2000

Date of Patent: April 13, 2004

Assignee: Texas Instrument Incorporated

Inventors: Charles D. Lueck, Alec C. Robinson, Jonathan L. Rowlands
Digital cordless phone system for improving distance of speech communication using error concealment and method thereof

Patent number: 6714912

Abstract: A digital cordless phone system and a communication method for improving the distance of speech communication using error concealment are provided. The digital cordless phone system has a base unit and a remote unit, and each of the base and remote units has a first codec, a data processing unit, a controller, a modem and a switch. The first codec encodes voice data in a first encoding mode, and decodes the data encoded in the first encoding mode in a first decoding mode.

Type: Grant

Filed: January 29, 2001

Date of Patent: March 30, 2004

Assignee: Samsung Electronics, Co., Ltd.

Inventor: Yoon-yung Lee
Audio signal coding and decoding methods and apparatus and recording media with programs therefor

Patent number: 6658382

Abstract: An input signal is time-frequency transformed, then the frequency-domain coefficients are divided into coefficient segments of about 100 Hz width to generate a sequence of coefficient segments, and the sequence of coefficient segments is split into subbands each consisting of plural coefficient segments. A threshold value is determined based on the intensity of each coefficient segment in each subband. The intensity of each coefficient segment is compared with the threshold value, and the coefficient segments are classified into low- and high-intensity groups. The coefficient segments are quantized for each group, or they are flattened respectively and then quantized through recombination.

Type: Grant

Filed: March 23, 2000

Date of Patent: December 2, 2003

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Naoki Iwakami, Takehiro Moriya, Akio Jin, Kazuaki Chikira, Takeshi Mori
Vector quantization and decoding apparatus for speech signals and method thereof

Patent number: 6631347

Abstract: A vector quantizing apparatus, a decoding apparatus, a vector quantization method, and a decoding method are provided. Upon encoding of a speech signal by the vector quantization apparatus and method, the advantages of vector quantization are maximized by quantizing the speech signal using KLT-based classified codebooks and the eigenvalues and eigenvectors of the speech signal. The vector quantization apparatus includes a codebook group, a Karhunen-Loéve Transform (KLT) unit, first and second selection units and a transmission unit. The codebook group has a plurality of codebooks that store the code vectors for a speech signal, and the codebooks are classified using KLT domain statistics for the speech signal. The KLT unit transforms an input speech signal to a KLT domain. The first selection unit selects an optimal codebook from the codebooks in the codebook group on the basis of the eigenvalue set of the covariance matrix of the input speech signal obtained by KLT.

Type: Grant

Filed: September 5, 2002

Date of Patent: October 7, 2003

Assignee: Samsung Electronics Co., Ltd.

Inventors: Moo Young Kim, Willem Bastiaan Kleijn
Maximum entropy and maximum likelihood criteria for feature selection from multivariate data

Patent number: 6609094

Abstract: Improvements in speech recognition systems are achieved by considering projections of the high dimensional data on lower dimensional subspaces, subsequently by estimating the univariate probability densities via known univariate techniques, and then by reconstructing the density in the original higher dimensional space from the collection of univariate densities so obtained. The reconstructed density is by no means unique unless further restrictions on the estimated density are imposed. The variety of choices of candidate univariate densities as well as the choices of subspaces on which to project the data including their number further add to this non-uniqueness. Probability density functions are then considered that maximize certain optimality criterion as a solution to this problem. Specifically, those probability density function's that either maximize the entropy functional, or alternatively, the likelihood associated with the data are considered.

Type: Grant

Filed: May 22, 2000

Date of Patent: August 19, 2003

Assignee: International Business Machines Corporation

Inventors: Sankar Basu, Charles A. Micchelli, Peder Olsen