Post-transmission Patents (Class 704/228)

Method and system of correcting spectral deformations in the voice, introduced by a communication network

Patent number: 7359857

Abstract: A technique for correcting the voice spectral deformations introduced by a communication network. Prior to the operation of equalization of the voice signal of a speaker, the constitution of classes of speakers is communicated, with one voice reference per class. Then, for a given speaker, the classification of this speaker is communicated, that is to say his allocation to a class from predefined classification criteria in order to make a voice reference which is closest to his own correspond to him. Then, for that given speaker, communicating the equalization of the digitized signal of the voice of the speaker carried out with, as a reference spectrum, the voice reference of the class to which the speaker has been allocated. This technique applies to the correction of the timbre of the voice in switched telephone networks, in ISDN networks and in mobile networks.

Type: Grant

Filed: November 25, 2003

Date of Patent: April 15, 2008

Assignee: France Telecom

Inventors: Gaël Mahe, André Gilloire
Method and apparatus for active noise cancellation

Publication number: 20080082326

Abstract: In one embodiment, the present invention is a method and apparatus for active noise cancellation. In one embodiment, a method for recognizing user speech in an audio signal received by a media system (where the audio signal includes user speech and ambient audio output produced by the media system and/or other devices) includes canceling portions of the audio signal associated with the ambient audio output and applying speech recognition processing to an uncancelled remainder of the audio signal.

Type: Application

Filed: September 28, 2006

Publication date: April 3, 2008

Inventors: Anand Venkataraman, Venkata Ramana Rao Gadde, Martin Graciarena
Multi-sensory speech enhancement using a clean speech prior

Patent number: 7346504

Abstract: A method and apparatus determine a channel response for an alternative sensor using an alternative sensor signal and an air conduction microphone signal. The channel response and a prior probability distribution for clean speech values are then used to estimate a clean speech value.

Type: Grant

Filed: June 20, 2005

Date of Patent: March 18, 2008

Assignee: Microsoft Corporation

Inventors: Zicheng Liu, Alejandro Acero, Zhengyou Zhang
Digital signal coding and decoding methods and apparatuses and programs therefor

Patent number: 7337112

Abstract: At the coder side, bits of samples of each frame of an input digital signal are concatenated every digit common to the samples across each frame to generate equi-order bit sequences, which are output as packets. At the decoding side, the input equi-order sequences are arranged inversely to their arrangement at the coder side to reconstruct sample sequences. When a packet dropout occurs, a missing information compensating part 430 corrects the reconstructed sample sequences in a manner to reduce an error between the spectral envelope of the reconstructed sample sequence concerned and a known spectral envelope.

Type: Grant

Filed: December 14, 2006

Date of Patent: February 26, 2008

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Takehiro Moriya, Akio Jin, Takeshi Mori, Kazunaga Ikeda
Packet Loss Concealment Based On Forced Waveform Alignment After Packet Loss

Publication number: 20080046235

Abstract: A packet loss concealment method and system is described that attempts to reduce or eliminate destructive interference that can occur when an extrapolated waveform representing a lost segment of a speech or audio signal is merged with a good segment after a packet loss. This is achieved by guiding a waveform extrapolation that is performed to replace the bad segment using a waveform available in the first good segment or segments after the packet loss. In another aspect of the invention, a selection is made between a packet loss concealment method that performs the aforementioned guided waveform extrapolation and one that does not. The selection may be made responsive to determining whether the first good segment or segments after the packet loss are available and also to whether a segment preceding the lost segment and the first good segment following the lost segment are deemed voiced.

Type: Application

Filed: July 31, 2007

Publication date: February 21, 2008

Applicant: BROADCOM CORPORATION

Inventor: Juin-Hwey Chen
Re-phasing of Decoder States After Packet Loss

Publication number: 20080046237

Abstract: A technique is described herein for updating a state of a decoder configured to decode a series of frames representing an encoded audio signal. In accordance with the technique, an output audio signal associated with a lost frame in the series of frames is synthesized. The decoder state is set to align with the synthesized output audio signal at a frame boundary. An extrapolated signal is generated based on the synthesized output audio signal. A time lag is calculated between the extrapolated signal and a decoded audio signal associated with a first received frame after the lost frame in the series of frames, wherein the time lag represents a phase difference between the extrapolated signal and the decoded audio signal. The decoder state is then reset based on the time lag.

Type: Application

Filed: August 15, 2007

Publication date: February 21, 2008

Applicant: BROADCOM CORPORATION

Inventors: Robert W. Zopf, Jes Thyssen, Juin-Hwey Chen
Constrained and Controlled Decoding After Packet Loss

Publication number: 20080046236

Abstract: A technique is described herein for reducing audible artifacts in an audio output signal generated by decoding a received frame in a series of frames representing an encoded audio signal in a predictive coding system. In accordance with the technique, it is determined if the received frame is one of a predefined number of received frames that follow a lost frame in the series of the frames. Responsive to determining that the received frame is one of the predefined number of received frames, at least one parameter or signal associated with the decoding of the received frame is altered from a state associated with normal decoding. The received frame is then decoded in accordance with the at least one parameter or signal to generate a decoded audio signal. The audio output signal is then generated based on the decoded audio signal.

Type: Application

Filed: August 15, 2007

Publication date: February 21, 2008

Applicant: BROADCOM CORPORATION

Inventors: Jes Thyssen, Juin-Hwey Chen, Robert W. Zopf
Apparatus and method for canceling residual echo in a mobile terminal of a mobile communication system

Patent number: 7330738

Abstract: An apparatus for canceling an echo signal in a mobile terminal of a mobile communication system. A double talk detector (DTD) receives a first signal by canceling an estimated echo signal from a signal received through a microphone, outputs the first signal, and outputs the first and a second signal comprising a background noise signal and a residual echo signal during a non-double talk. An Auto-Regressive (AR) analysis and inverse filtering unit receives the second signal from the DTD, and whitens the second signal. A pitch analysis and inverse filtering unit receives the whitened signal, and cancels a pitch value remaining therein by performing pitch analysis and inverse filtering on the whitened signal. A noise canceller receives the pitch-cancelled whitened signal and the first signal output from the DTD, canceling a residual echo signal and a background noise signal from the first signal using the pitch-cancelled whitened signal.

Type: Grant

Filed: December 10, 2004

Date of Patent: February 12, 2008

Assignee: Samsung Electronics Co., Ltd

Inventors: Sang-Ki Kang, Gang-Youl Kim, Jung-Soung Lee, Hyun-Soo Kim
Audio decoder with dynamic adjustment of signal modification

Patent number: 7328151

Abstract: Methods and devices for dynamically adjusting a multi-band signal-modification profile based on a psychoacoustic model are disclosed. In one arrangement, the encoding parameter side information is used to estimate encoding noise of an encoded signal. The signal spectrum of the signal is estimated. Adjustments to the multi-band signal-modification profile are determined using the estimated noise and signal spectrum and a psychoacoustic profile.

Type: Grant

Filed: March 22, 2002

Date of Patent: February 5, 2008

Assignee: Sound ID

Inventor: Hannes Muesch
Systems and methods for TV navigation with compressed voice-activated commands

Patent number: 7321857

Abstract: A method, apparatus and system that receives speech commands at a remote control device microphone, digitizes those input speech commands, compresses the digitized speech commands, multiplexes control commands with the compressed digitized speech commands, and transmits the compressed digitized speech commands to an electronic device, such as a digital home communication terminal (DHCT). The electronic device decompresses and interprets the speech commands to allow the remote control operator to control the electronic device. Because speech recognition is performed at the electronic device, rather than at the remote control device, the remote control does not have to interpret and transmit infrared signals that represent user commands. This simplifies the processing and voice recognition capabilities required by the remote control.

Type: Grant

Filed: January 10, 2005

Date of Patent: January 22, 2008

Assignee: Scientific-Atlanta, Inc.

Inventors: Arturo A. Rodriguez, David A. Sedacca, Albert Garcia
Method for improving speech quality in speech transmission tasks

Patent number: 7318025

Abstract: A method for calculating the amplification factor, which co-determines the volume, for a speech signal transmitted in encoded form includes dividing the speech signal into short temporal signal segments. The individual signal segments are encoded and transmitted separately from each other, and the amplification factor for each signal segment is calculated, transmitted and used by the decoder to reconstruct the signal. The amplification factor is determined by minimizing the value E(g_opt2)=(1-a)*f1(g_opt2)+a*f2(g_opt2), the weighting factor a being determined taking into account both the periodicity and the stationarity of the encoded speech signal.

Type: Grant

Filed: March 8, 2001

Date of Patent: January 8, 2008

Assignee: Deutsche Telekom AG

Inventors: Alexander Kyrill Fischer, Christoph Erdmann
Method and system for a waveform attenuation technique for predictive speech coding based on extrapolation of speech waveform

Patent number: 7308406

Abstract: A method and system are provided for processing an extrapolated signal including a number of consecutive replacement frames. The method comprises attenuating a portion of the extrapolated signal when the extrapolated signal reaches a predetermined duration. The attenuating produces an output signal having an attenuated portion, wherein the output signal includes the number of consecutive replacement frames. Each of the consecutive frames within the attenuated portion is attenuated by applying an attenuation window with a starting magnitude value of approximately 1 and including a unique ending magnitude. The unique ending magnitudes decrease over time.

Type: Grant

Filed: June 28, 2002

Date of Patent: December 11, 2007

Assignee: Broadcom Corporation

Inventor: Juin-Hwey Chen
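
The per-frame attenuation windows described in the abstract of patent 7308406 above can be sketched as follows. This is a minimal illustration, not the patented method: the linear window shape, the linear spacing of ending magnitudes, and the names attenuation_windows, attenuate and final_gain are all assumptions introduced here. It follows the abstract's wording literally: every window starts at 1 and each frame gets its own, successively lower, ending magnitude.

```python
def attenuation_windows(num_frames, frame_len, final_gain=0.0):
    """Build one attenuation window per replacement frame.

    Assumption (not from the patent): each window is a straight line
    from 1.0 down to its ending magnitude, and the ending magnitudes
    fall linearly from frame to frame until the last one reaches
    final_gain.
    """
    if num_frames < 1 or frame_len < 2:
        raise ValueError("need at least 1 frame and 2 samples per frame")
    windows = []
    for k in range(1, num_frames + 1):
        # Ending magnitude for frame k: unique and decreasing with k.
        end = 1.0 - (1.0 - final_gain) * k / num_frames
        step = (end - 1.0) / (frame_len - 1)
        windows.append([1.0 + step * n for n in range(frame_len)])
    return windows


def attenuate(frames, final_gain=0.0):
    """Multiply each equal-length frame by its own window."""
    windows = attenuation_windows(len(frames), len(frames[0]), final_gain)
    return [[s * w for s, w in zip(frame, win)]
            for frame, win in zip(frames, windows)]
```

Calling attenuate on the frames that lie past the predetermined duration would leave earlier frames untouched and fade the later ones progressively.
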
Apparatus and method for concealing erased periodic signal data

Patent number: 7305338

Abstract: Circuitry and a method compensate for the erasure of speech signal data or similar periodic signal data, by substitution using past periodic signal data input. After a predetermined number of latest periodic signal data have been saved, whether or not an erasure occurs is determined with every periodic signal data sequence, which is a unit of processing. When an erasure occurs, one of the saved periodic signal data sequences, which lies in a determined segment to be used, is used to generate synthetic data for substitution. The position of the segment to be used is determined such that when the erasure continues over several units of processing, the position varies gradually from one unit of processing to the next.

Type: Grant

Filed: May 14, 2004

Date of Patent: December 4, 2007

Assignee: Oki Electric Industry Co., Ltd.

Inventors: Atsushi Tashiro, Hiromi Aoyagi, Masashi Takada
Speech restoration system and method for concealing packet losses

Patent number: 7302385

Abstract: Provided are a speech restoration system and method for concealing packet losses.

Type: Grant

Filed: July 7, 2003

Date of Patent: November 27, 2007

Assignee: Electronics and Telecommunications Research Institute

Inventors: Ho Sang Sung, Dae Hwan Hwang, Moon Keun Lee, Ki Seung Lee, Young Cheol Park, Dae Hee Youn
Voice quality analysis of speech packets by substituting coded reference speech for the coded speech in received packets

Patent number: 7299176

Abstract: A system and method for voice quality analysis include the ability to receive packets in a voice stream and to generate a receipt indicator for the packets. The system and method also include the ability to substitute a reference voice sample for the voice data in the packets and to compare the voice data in the voice-substituted packets to the reference voice sample to determine voice quality.

Type: Grant

Filed: September 19, 2002

Date of Patent: November 20, 2007

Inventors: Yueh-ju Lee, Shang-Pin Chang, Phuong Luong, Hang Shi, Frank C. Lin, Yu-Lun Huang
Robust noise estimation

Publication number: 20070265843

Abstract: An enhancement system improves the estimate of noise from a received signal. The system includes a spectrum monitor that divides a portion of the signal at more than one frequency resolution. Adaptation logic derives a noise adaptation factor of the received signal. A plurality of devices tracks the characteristics of an estimated noise in the received signal and modifies multiple noise adaptation rates. Weighting logic applies the modified noise adaptation rates derived from the signal divided at a first frequency resolution to the signal divided at a second frequency resolution.

Type: Application

Filed: December 22, 2006

Publication date: November 15, 2007

Inventor: Phillip A. Hetherington
Method and apparatus for multi-sensory speech enhancement on a mobile device

Patent number: 7283850

Abstract: A mobile device includes an air conduction microphone and an alternative sensor that provides an alternative sensor signal indicative of speech. A communication interface permits the mobile device to communicate directly with other mobile devices.

Type: Grant

Filed: October 12, 2004

Date of Patent: October 16, 2007

Assignee: Microsoft Corporation

Inventors: Randy Phyllis Granovetter, Michael J. Sinclair, Zhengyou Zhang, Zicheng Liu
Noise suppression

Patent number: 7283956

Abstract: A method and apparatus for noise suppression is described herein. The channel gain is controlled based on a degree of variability of the background noise. The noise variability estimate is used in conjunction with a variable attenuation concept to produce a family of gain curves that are adaptively suited for a variety of combinations of long-term peak SNR and noise variability. More specifically, a measure of the variability of the background noise is used to provide an optimized threshold that reduces the occurrence of non-stationary background noise entering into the transition region of the gain curve.

Type: Grant

Filed: September 18, 2002

Date of Patent: October 16, 2007

Assignee: Motorola, Inc.

Inventors: James Patrick Ashley, Tenkasi Vaideeswaran Ramabadran, Michael Joseph McLaughlin
Method for determining intensity parameters of background noise in speech pauses of voice signals

Patent number: 7277847

Abstract: A method for determining intensity characteristics of background noise during speech pauses of speech signals includes determining a proportion of speech pauses in the undisturbed source speech signal so as to define a frequency threshold. The disturbed speech signal is divided into short successive signal elements, an intensity value is determined for each of the signal elements, and a cumulative relative frequency distribution is formed from the determined intensity values of the signal elements. The cumulative relative frequency distribution is used to determine an intensity threshold value which corresponds to the defined frequency threshold. At least one intensity characteristic of the background noise during the speech pauses is determined using a region of the cumulative relative frequency distribution below the intensity threshold value.

Type: Grant

Filed: April 3, 2002

Date of Patent: October 2, 2007

Assignee: Deutsche Telekom AG

Inventor: Jens Berger
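
The steps listed in the abstract of patent 7277847 above can be sketched as follows. This is a hedged illustration only: the choice of mean power as the intensity value, the use of the mean as the resulting noise characteristic, the treatment of values equal to the threshold as belonging to the pause region, and the function names are assumptions made here. The pause proportion is taken as a given input; the abstract derives it from the undisturbed source signal.

```python
def segment_intensities(signal, seg_len):
    """Mean power of each full, non-overlapping segment of seg_len samples."""
    return [sum(x * x for x in signal[i:i + seg_len]) / seg_len
            for i in range(0, len(signal) - seg_len + 1, seg_len)]


def pause_noise_intensity(signal, seg_len, pause_fraction):
    """Return (intensity_threshold, noise_intensity) for the disturbed signal.

    pause_fraction plays the role of the frequency threshold. Sorting the
    intensities gives the cumulative relative frequency distribution: the
    value at sorted position i has cumulative frequency (i + 1) / N.
    """
    intensities = sorted(segment_intensities(signal, seg_len))
    if not intensities:
        raise ValueError("signal is shorter than one segment")
    # Number of segments that fall at or below the frequency threshold.
    count = max(1, int(pause_fraction * len(intensities)))
    threshold = intensities[count - 1]
    pause_region = intensities[:count]
    return threshold, sum(pause_region) / len(pause_region)
```

For a signal in which half of the segments are quiet, a pause_fraction of 0.5 selects exactly the quiet half, and the returned noise intensity is the mean power of those segments.
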
Accelerated handwritten symbol recognition in a pen based tablet computer

Patent number: 7266236

Abstract: The present invention provides a method and apparatus for accelerated handwritten symbol recognition in a pen based tablet computer. In one embodiment, handwritten symbols are translated into machine readable characters using special purpose hardware. In one embodiment, the special purpose hardware is a recognition processing unit (RPU) which performs feature extraction and recognition. A user inputs the handwritten symbols and software recognition engine preprocesses the input to a reduced form. The data from the preprocessor is sent to the RPU which performs feature extraction and recognition. In one embodiment, the RPU has memory and the RPU operates on data in its memory. In one embodiment, the RPU uses a hidden Markov model (HMM) as a finite state machine that assigns probabilities to a symbol state based on the preprocessed data from the handwritten symbol. In another embodiment, the RPU recognizes collections of symbols, termed “wordlets,” in addition to individual symbols.

Type: Grant

Filed: May 3, 2001

Date of Patent: September 4, 2007

Assignee: California Institute of Technology

Inventors: Kevin Hickerson, Uri Eden
Method and apparatus for identifying noise environments from noisy signals

Patent number: 7266494

Abstract: A method and apparatus are provided for identifying a noise environment for a frame of an input signal based on at least one feature for that frame. To identify the noise environment, a probability for a noise environment is determined by applying the noisy input feature vector to a distribution of noisy training feature vectors. In one embodiment, each noisy training feature vector in the distribution is formed by modifying a set of clean training feature vectors. In one embodiment, the probabilities of the noise environments for past frames are included in the identification of an environment for a current frame. In one embodiment, a correction vector is then selected based on the identified noise environment.

Type: Grant

Filed: November 10, 2004

Date of Patent: September 4, 2007

Assignee: Microsoft Corporation

Inventors: James G. Droppo, Alejandro Acero, Li Deng
Enhancing source coding systems by adaptive transposition

Patent number: 7260520

Abstract: The present invention relates to a new method for enhancement of source coding systems using high-frequency reconstruction. The invention teaches that tonal signals can be classified as either pulse-train-like or non-pulse-train-like. Relying on this classification, significant improvements on the perceived audio quality can be obtained by adaptive switching of transposers. The invention shows that the so-switched transposers must have fundamental differences in their characteristics.

Type: Grant

Filed: December 20, 2001

Date of Patent: August 21, 2007

Assignee: Coding Technologies AB

Inventors: Fredrik Henn, Kristofer Kjörling, Per Ekstrand, Lars Villemoes
Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech

Patent number: 7254536

Abstract: A method and apparatus are provided for reducing noise in a training signal and/or test signal. The noise reduction technique uses a stereo signal formed of two channel signals, each channel containing the same pattern signal. One of the channel signals is “clean” and the other includes additive noise. Using feature vectors from these channel signals, a collection of noise correction and scaling vectors is determined. When a feature vector of a noisy pattern signal is later received, it is multiplied by the best scaling vector for that feature vector and the best correction vector is added to the product to produce a noise reduced feature vector. Under one embodiment, the best scaling and correction vectors are identified by choosing an optimal mixture component for the noisy feature vector. The optimal mixture component is selected based on a distribution of noisy channel feature vectors associated with each mixture component.

Type: Grant

Filed: February 16, 2005

Date of Patent: August 7, 2007

Assignee: Microsoft Corporation

Inventors: Li Deng, Xuedong Huang, Alejandro Acero
Method for fast dynamic estimation of background noise

Patent number: 7246059

Abstract: The invention provides a method and system for dynamically estimating background noise. The system includes a portable communication device, a vocoder, and a voice activated detector. Based on information received by the portable communication device, the vocoder determines parameters related to incoming information including a voicing mode indicative of the periodicity of incoming information. The voice activated detector then compares the voicing mode to a threshold to determine whether a background noise estimate should be updated.

Type: Grant

Filed: July 24, 2003

Date of Patent: July 17, 2007

Assignee: Motorola, Inc.

Inventors: Ali Behboodian, Pratik Desai, Chin Pan Wong
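
The update decision in the abstract of patent 7246059 above can be sketched as follows. This is a hedged illustration: the abstract only says that the voicing mode is compared to a threshold, so the direction of the comparison, the exponential smoothing, and the names update_noise_estimate, threshold and alpha are assumptions introduced here.

```python
def update_noise_estimate(noise_est, frame_energy, voicing_mode,
                          threshold=1, alpha=0.9):
    """Return the new background noise estimate for one frame.

    Assumption: a voicing mode below the threshold means the frame shows
    little periodicity, so it is treated as background noise and folded
    into the estimate; otherwise the estimate is left unchanged.
    """
    if voicing_mode < threshold:
        return alpha * noise_est + (1.0 - alpha) * frame_energy
    return noise_est
```

Because the decision reuses a parameter the vocoder already computes, no separate analysis of the frame is needed before updating the estimate.
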
Low-complexity comfort noise generator

Patent number: 7243065

Abstract: A comfort noise generator (104) suitable for use in a communication system includes a finite impulse response (FIR) filter (136), a random number generator (140), and a coefficient updater (138). The coefficient updater (138) determines an updated set of filter coefficients (142) based on the signal frame of the input signal (102). The updated set of filter coefficients (142) is output to the FIR filter (136). The FIR filter (136) shapes a white noise signal (146) supplied by the random number generator (140) to provide a simulated background noise signal, or comfort noise signal (122). The comfort noise signal (122) is selectively output from an echo suppression system or corresponding method to overwrite or suppress reflected residual echoes.

Type: Grant

Filed: April 8, 2003

Date of Patent: July 10, 2007

Assignee: Freescale Semiconductor, Inc.

Inventors: James Allen Stephens, David L. Barron, Sean S. You
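
The signal path in the abstract of patent 7243065 above, white noise from a random number generator shaped by an FIR filter, can be sketched as follows. This is a minimal sketch under stated assumptions: the Gaussian noise source, the seed parameter and the function names are hypothetical, and the coefficient updater is not modeled because the abstract does not say how the coefficients are derived from the input frame. The coefficients are simply passed in.

```python
import random


def fir_filter(coeffs, samples):
    """Direct-form FIR filter: y[n] = sum over k of coeffs[k] * x[n - k]."""
    out = []
    for n in range(len(samples)):
        acc = 0.0
        for k, c in enumerate(coeffs):
            if n - k >= 0:  # samples before the start count as zero
                acc += c * samples[n - k]
        out.append(acc)
    return out


def comfort_noise(coeffs, num_samples, seed=None):
    """Shape white noise with the given FIR coefficients."""
    rng = random.Random(seed)
    white = [rng.gauss(0.0, 1.0) for _ in range(num_samples)]
    return fir_filter(coeffs, white)
```

In a complete system the coeffs argument would be refreshed for each frame by whatever update rule the coefficient updater implements.
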
Method and apparatus for mitigating the effect of transmission errors in a distributed speech recognition process and system

Patent number: 7243066

Abstract: A method and apparatus for distributed speech recognition serve to mitigate the effect of transmission errors. The method comprises the steps of identifying speech recognition parameters which have been subjected to a transmission error, and processing data to be sent to the speech recognition decoder to ensure that any speech recognition parameters which have been subjected to such an error are excluded from back-end processing. The speech recognition parameters which have been subjected to transmission errors are excluded from back-end processing by replacing those parameters with data which is selected so as to be rejected by the speech recognition decoder as abnormal and/or non-speechlike.

Type: Grant

Filed: June 5, 2001

Date of Patent: July 10, 2007

Assignee: Motorola, Inc.

Inventor: David John Benjamin Pearce
Method and apparatus for performing packet loss or frame erasure concealment

Patent number: 7233897

Abstract: The invention concerns a method and apparatus for performing packet loss or Frame Erasure Concealment (FEC) for a speech coder that does not have a built-in or standard FEC process. A receiver with a decoder receives encoded frames of compressed speech information transmitted from an encoder. A lost frame detector at the receiver determines if an encoded frame has been lost or corrupted in transmission, or erased. If the encoded frame is not erased, the encoded frame is decoded by a decoder and a temporary memory is updated with the decoder's output. A predetermined delay period is applied and the audio frame is then output. If the lost frame detector determines that the encoded frame is erased, a FEC module applies a frame concealment process to the signal. The FEC processing produces natural sounding synthetic speech for the erased frames.

Type: Grant

Filed: June 29, 2005

Date of Patent: June 19, 2007

Assignee: AT&T Corp.

Inventor: David A. Kapilow
Method for replacing corrupted audio data

Patent number: 7206986

Abstract: A decoding method for coded data representing original data. Corrupted data is detected and replaced with buffered data. The buffered data is stored in the buffer a time interval corresponding to an estimated periodicity or an integer multiple thereof before the corrupted data was received. The estimated periodicity is determined by estimating the periodicity of the original data represented by the corrupted data.

Type: Grant

Filed: November 30, 2001

Date of Patent: April 17, 2007

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Jan Stemerdink, Arjan Meijerink
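
The replacement rule in the abstract of patent 7206986 above, copying buffered data from one estimated period (or an integer multiple of it) earlier, can be sketched as follows. This is a hedged sketch: the abstract does not say how the periodicity is estimated, so the normalized autocorrelation search over recent good samples is an assumption, as are the function names and the min_lag and max_lag parameters.

```python
def estimate_period(history, min_lag, max_lag):
    """Pick the lag with the highest mean autocorrelation in the buffer.

    Assumption: periodicity is estimated from previously received good
    samples; ties go to the smallest lag.
    """
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, min(max_lag, len(history) - 1) + 1):
        terms = len(history) - lag
        score = sum(history[i] * history[i - lag]
                    for i in range(lag, len(history))) / terms
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def conceal(history, frame_len, period):
    """Return frame_len replacement samples taken from the buffer.

    The offset is the smallest integer multiple of the period that is at
    least frame_len, so the copied span lies entirely inside the buffer.
    """
    multiples = -(-frame_len // period)  # ceiling division
    offset = multiples * period
    if offset > len(history):
        raise ValueError("buffer too short for this frame and period")
    start = len(history) - offset
    return history[start:start + frame_len]
```

For a perfectly periodic buffer, the samples returned by conceal match what the lost frame would have contained; for real audio they are only an approximation.
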
2-D processing of speech

Patent number: 7203639

Abstract: Acoustic signals are analyzed by two-dimensional (2-D) processing of the one-dimensional (1-D) speech signal in the time-frequency plane. The short-space 2-D Fourier transform of a frequency-related representation (e.g., spectrogram) of the signal is obtained. The 2-D transformation maps harmonically-related signal components to a concentrated entity in the new 2-D plane (compressed frequency-related representation). The series of operations to produce the compressed frequency-related representation is referred to as the “grating compression transform” (GCT), consistent with sine-wave grating patterns in the frequency-related representation reduced to smeared impulses. The GCT provides for speech pitch estimation. The operations may, for example, determine pitch estimates of voiced speech or provide noise filtering or speaker separation in a multiple speaker acoustic signal.

Type: Grant

Filed: September 13, 2002

Date of Patent: April 10, 2007

Assignee: Massachusetts Institute of Technology

Inventor: Thomas F. Quatieri, Jr.
System and method for speech enhancement

Patent number: 7191127

Abstract: A method and apparatus for reducing noise in a speech signal. A handset or remote unit provides users who have a hearing deficiency with a first mode of operation in which noise suppression/speech enhancement algorithms are used during any auditory-related service. There is also provided, in a related mode of operation, speech filtering for reducing noise in a speech signal received through the microphone and outputting the filtered sound to the speaker. The handset includes a microphone for receiving an auditory sound, a receiver for receiving an auditory signal and a speech filter for suppressing noise in the auditory signal and sound. The speech filter also may be configured to shift the frequency and/or alter the intensity of the auditory signal and sound. The speaker is used for amplifying and outputting the enhanced speech component as an audible sound.

Type: Grant

Filed: December 23, 2002

Date of Patent: March 13, 2007

Assignee: Motorola, Inc.

Inventors: Geydi Lorenzo, Charles D. Estes
Speech recognition over lossy networks with rejection threshold

Patent number: 7171359

Abstract: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. Potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

Type: Grant

Filed: July 29, 2004

Date of Patent: January 30, 2007

Assignee: AT&T Corp.

Inventors: Richard Vandervoort Cox, Stephen Michael Marcus, Mazin G. Rahim, Nambirajan Seshadri, Robert Douglas Sharp
Low-power noise characterization over a distributed speech recognition channel

Patent number: 7171356

Abstract: A distributed speech recognition system includes a noise floor estimator to provide a noise floor estimate to a feature extractor which provides a parametric representation of the noise floor estimate. An encoder is included to generate an encoded parametric representation of the noise floor estimate. A front-end controller is also included to determine when at least one of the noise floor estimator, the feature extractor, and the encoder is to be turned on or off and to determine when the noise floor estimator is to provide the noise floor estimate to the feature extractor. Additionally, a decoder is included to generate a decoded parametric representation of the noise floor estimate. A noise model generator creates a statistical model of noise feature vectors based on the decoded parametric representation of the noise floor estimate.

Type: Grant

Filed: June 28, 2002

Date of Patent: January 30, 2007

Assignee: Intel Corporation

Inventors: Michael E Deisher, Robert W Morris
Method of noise estimation using incremental bayes learning

Patent number: 7165026

Abstract: A method and apparatus estimate additive noise in a noisy signal using incremental Bayes learning, where a time-varying noise prior distribution is assumed and hyperparameters (mean and variance) are updated recursively using an approximation for posterior computed at the preceding time step. The additive noise in time domain is represented in the log-spectrum or cepstrum domain before applying incremental Bayes learning. The results of both the mean and variance estimates for the noise for each of separate frames are used to perform speech feature enhancement in the same log-spectrum or cepstrum domain.

Type: Grant

Filed: March 31, 2003

Date of Patent: January 16, 2007

Assignee: Microsoft Corporation

Inventors: Alejandro Acero, Li Deng, James G. Droppo
Sound processing method and apparatus

Patent number: 7162045

Abstract: A sound processing method and apparatus are provided, which are capable of performing sound processing on input audio signals containing a plurality of signal components being different in desired sound processing conditions, in a manner that allows natural sound to be reproduced. An input audio signal of at least one system is separated into a plurality of separated signal components, and each signal component of at least part of the plurality of separated signal components is subjected to individual sound processing according to the signal component, and the plurality of separated signal components are outputted as at least one audio signal after each signal component of the at least part thereof is subjected to the individual sound processing. The plurality of separated signal components are synthesized into a synthesized audio signal, which is then outputted, or alternatively, the plurality of separated signal components are outputted separately as audio signals.

Type: Grant

Filed: June 16, 2000

Date of Patent: January 9, 2007

Assignee: Yamaha Corporation

Inventor: Shigeki Fujii
Automatic gain control for adjusting gain during non-speech portions

Patent number: 7155385

Abstract: An estimate is made of the power of a speech portion of a speech signal that includes speech portions separated by non-speech portions, the power for the speech portion being estimated based on a power envelope that spans the speech portion. The gain of an automatic gain control is not adjusted during the speech portions.

Type: Grant

Filed: May 16, 2002

Date of Patent: December 26, 2006

Assignee: Comerica Bank, as Administrative Agent

Inventors: Alexander Berestesky, David E. Duehren
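
The behavior described in the abstract above (gain held during speech, adjusted only in the gaps) can be sketched in Python as follows. This is a minimal sketch, not the patented implementation: the per-frame powers and speech flags are assumed to be supplied by some upstream detector, and using the peak frame power of a speech portion as its power estimate is an assumption made for illustration.

```python
import math

def agc_gains(frame_powers, is_speech, target_power=1.0, initial_gain=1.0):
    """Return one gain per frame; the gain changes only in non-speech frames."""
    gain = initial_gain
    portion_peak = None  # assumed estimator: peak power over the speech portion
    gains = []
    for power, speech in zip(frame_powers, is_speech):
        if speech:
            # Track the envelope of the current speech portion; do not touch the gain.
            portion_peak = power if portion_peak is None else max(portion_peak, power)
        elif portion_peak is not None and portion_peak > 0:
            # First non-speech frame after a speech portion: apply the new gain.
            gain = math.sqrt(target_power / portion_peak)
            portion_peak = None
        gains.append(gain)
    return gains
```

With this arrangement a loud utterance lowers the gain only once it has ended, so the level never changes in the middle of a word.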
Method and system for an overlap-add technique for predictive decoding based on extrapolation of speech and ringing waveform

Patent number: 7143032

Abstract: A method and system are provided for removing discontinuities associated with synthesizing a corrupted frame output from a decoder including one or more predictive filters. The corrupted frame is representative of one segment of a decoded signal. The method comprises copying a first number of stored samples of the decoded signal in accordance with a time lag and a scaling factor, and calculating a first number of ringing samples output from at least one of the filters.

Type: Grant

Filed: June 28, 2002

Date of Patent: November 28, 2006

Assignee: Broadcom Corporation

Inventor: Juin-Hwey Chen
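
The two ingredients named in the abstract above, copying stored samples according to a time lag and scaling factor and then blending with filter ringing, can be sketched in Python as follows. This is an illustrative sketch only: the ringing samples are passed in as a plain list rather than computed from a predictive filter, and the linear cross-fade weights are an assumption.

```python
def extrapolate(history, lag, scale, count):
    """Copy `count` samples from `lag` samples back, scaling each copy by `scale`."""
    buf = list(history)
    out = []
    for _ in range(count):
        sample = scale * buf[-lag]
        out.append(sample)
        buf.append(sample)  # lets the copy continue past one lag period
    return out

def overlap_add(ringing, extrapolated):
    """Cross-fade from the ringing (fading out) to the extrapolation (fading in).

    Assumes len(extrapolated) >= len(ringing).
    """
    n = len(ringing)
    mixed = []
    for i in range(n):
        w_in = (i + 1) / (n + 1)  # assumed linear ramp
        mixed.append((1.0 - w_in) * ringing[i] + w_in * extrapolated[i])
    return mixed + list(extrapolated[n:])
```

Because the ringing continues smoothly from the last good frame, fading it out while the extrapolated waveform fades in avoids a step at the frame boundary.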
Method of iterative noise estimation in a recursive framework

Patent number: 7139703

Abstract: A method and apparatus estimate additive noise in a noisy signal using an iterative technique within a recursive framework. In particular, the noisy signal is divided into frames and the noise in each frame is determined based on the noise in another frame and the noise determined in a previous iteration for the current frame. In one particular embodiment, the noise found in a previous iteration for a frame is used to define an expansion point for a Taylor series approximation that is used to estimate the noise in the current frame. In one embodiment, noise estimation employs a recursive-Expectation-Maximization framework with a maximum likelihood (ML) criterion. In a further embodiment, noise estimation employs a recursive-Expectation-Maximization framework based on a MAP (maximum a posteriori) criterion.

Type: Grant

Filed: September 6, 2002

Date of Patent: November 21, 2006

Assignee: Microsoft Corporation

Inventors: Alejandro Acero, Li Deng, James G. Droppo
Method and apparatus for reducing random, continuous non-stationary noise in audio signals

Patent number: 7127072

Abstract: There are provided a method and an apparatus for reducing random, continuous, non-stationary noise in audio signals, the noisy audio signal being filtered by means of a predetermined filter function. The filter function is determined dynamically having regard to the current properties of the noisy audio signal and/or its constituent parts, and the filter function is also limited dynamically having regard to the current properties of the noise component contained in the noisy audio signal.

Type: Grant

Filed: December 13, 2001

Date of Patent: October 24, 2006

Inventors: Jan Rademacher, Jörg Bitzer
Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization

Patent number: 7117148

Abstract: A method and apparatus are provided for reducing noise in a signal. Under one aspect of the invention, a correction vector is selected based on a noisy feature vector that represents a noisy signal. The selected correction vector incorporates dynamic aspects of pattern signals. The selected correction vector is then added to the noisy feature vector to produce a cleaned feature vector. In other aspects of the invention, a noise value is produced from an estimate of the noise in a noisy signal. The noise value is subtracted from a value representing a portion of the noisy signal to produce a noise-normalized value. The noise-normalized value is used to select a correction value that is added to the noise-normalized value to produce a cleaned noise-normalized value. The noise value is then added to the cleaned noise-normalized value to produce a cleaned value representing a portion of a cleaned signal.

Type: Grant

Filed: April 5, 2002

Date of Patent: October 3, 2006

Assignee: Microsoft Corporation

Inventors: James G. Droppo, Li Deng, Alejandro Acero
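
The noise-normalization sequence in the abstract above (subtract the noise value, select and add a correction, add the noise value back) can be sketched in Python for a single scalar feature. This is a minimal sketch under assumptions: the correction table of (codeword, correction) pairs and the nearest-codeword selection rule are hypothetical stand-ins for whatever trained selection the patent uses.

```python
def clean_value(noisy, noise, correction_table):
    """Apply a noise-normalized correction to one feature value.

    correction_table: list of (codeword, correction) pairs (hypothetical format).
    """
    normalized = noisy - noise  # noise-normalized value
    # Assumed selection rule: pick the entry whose codeword is nearest.
    _, correction = min(correction_table, key=lambda entry: abs(entry[0] - normalized))
    cleaned_normalized = normalized + correction
    return cleaned_normalized + noise  # restore the noise offset
```

For example, with the table `[(0.0, 0.5), (5.0, -1.0), (10.0, -0.25)]`, a noisy value of 7.0 and a noise value of 3.0 normalize to 4.0, select the correction -1.0, and yield 6.0.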
Method and apparatus for performing packet loss or frame erasure concealment

Patent number: 7117156

Abstract: The invention concerns a method and apparatus for performing packet loss or Frame Erasure Concealment (FEC) for a speech coder that does not have a built-in or standard FEC process. A receiver with a decoder receives encoded frames of compressed speech information transmitted from an encoder. A lost frame detector at the receiver determines if an encoded frame has been lost or corrupted in transmission, or erased. If the encoded frame is not erased, the encoded frame is decoded by a decoder and a temporary memory is updated with the decoder's output. A predetermined delay period is applied and the audio frame is then output. If the lost frame detector determines that the encoded frame is erased, a FEC module applies a frame concealment process to the signal. The FEC processing produces natural sounding synthetic speech for the erased frames.

Type: Grant

Filed: April 19, 2000

Date of Patent: October 3, 2006

Assignee: AT&T Corp.

Inventor: David A. Kapilow
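
The receiver flow in the abstract above can be sketched in Python as a loop over incoming frames. This is an illustrative sketch, not the patented method: `decode` and `conceal` are hypothetical callables supplied by the caller, an erased frame is represented by `None`, and storing concealed audio in the history alongside decoded audio is an assumption.

```python
from collections import deque

def receive(frames, decode, conceal, delay_frames=1, history_frames=4):
    """Decode good frames, conceal erased ones, and output after a fixed delay."""
    history = deque(maxlen=history_frames)  # temporary memory of recent audio
    delay_line = deque()
    output = []
    for encoded in frames:
        if encoded is not None:
            audio = decode(encoded)          # normal path
        else:
            audio = conceal(list(history))   # erased frame: synthesize from history
        history.append(audio)
        delay_line.append(audio)
        if len(delay_line) > delay_frames:
            output.append(delay_line.popleft())
    output.extend(delay_line)  # flush the delay line at end of stream
    return output
```

The fixed delay gives the concealment step a chance to prepare a smooth transition before the affected audio is played out.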
Enhanced conversion of wideband signals to narrowband signals

Patent number: 7113522

Abstract: Wideband speech signals must be converted to narrowband speech signals if the transmission medium or the destination terminal is constructed with narrowband constraints. A typical wideband-to-narrowband conversion method is the elimination of frequencies above 3400 Hz using a low pass filter and a down sampler. However, this method produces a muffled speech sound since the resulting narrowband signal has a flat frequency response. Methods and apparatus are presented herein to enhance the acoustic quality of a wideband-to-narrowband converted signal. A bandwidth switching filter is used to emphasize a mid-range frequency portion of the wideband signal so that the resulting narrowband signal has a non-flat frequency spectrum.

Type: Grant

Filed: January 24, 2001

Date of Patent: September 26, 2006

Assignee: QUALCOMM, Incorporated

Inventors: Khaled H. El-Maleh, Arasanipalai K. Ananthapadmanabhan, Andrew P. DeJaco
5-2-5 matrix encoder and decoder system

Patent number: 7107211

Abstract: A sound reproduction system has been developed, for converting signals on two input channels into surround signals on five or seven output channels and vice-versa. A decoder is included in the sound reproduction system which enhances the correlated component of the input signals in the desired direction and reduces the strength of such signals in channels not associated with the encoded direction, while preserving the apparent loudness of all output channels, the separation between the respective left and right output channels and the total energy of the uncorrelated component of the input channels in each output channel. The decoder may include a uniquely defined matrix that helps to ensure that the surface of the output signals is smooth and continuous.

Type: Grant

Filed: October 17, 2003

Date of Patent: September 12, 2006

Assignee: Harman International Industries, Incorporated

Inventor: David H. Griesinger
Method for suppressing noise as well as a method for recognizing voice signals

Patent number: 7092877

Abstract: The invention relates to a method for suppressing noise interference with the following steps: obtaining an analytic signal from an input signal (Sin); calculating an instantaneous amplitude signal (IA) from the analytic signal; calculating an instantaneous phase signal (IFI) from the analytic signal; non-linearly modifying the instantaneous amplitude signal (IA) into a modified instantaneous amplitude signal (IAmod); and combining the modified instantaneous amplitude signal (IAmod) with the instantaneous phase signal (IFI) into an output signal (Sout).

Type: Grant

Filed: July 31, 2002

Date of Patent: August 15, 2006

Assignee: Turk & Turk Electric GmbH

Inventor: Zlatan Ribic
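
The steps listed in the abstract above can be sketched in pure Python as follows. This is a sketch under assumptions, not the patented method: the analytic signal is built with a direct DFT (adequate only for short inputs), and the particular nonlinearity, a soft expander controlled by a hypothetical `threshold` parameter, is chosen purely for illustration.

```python
import cmath
import math

def analytic_signal(x):
    """Analytic signal via a direct O(n^2) DFT: zero the negative frequencies."""
    n = len(x)
    spectrum = [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
                for k in range(n)]
    for k in range(n):
        if k == 0 or (n % 2 == 0 and k == n // 2):
            continue  # keep DC and Nyquist unchanged
        spectrum[k] *= 2 if k < (n + 1) // 2 else 0
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]

def suppress_noise(x, threshold=0.1):
    """Attenuate low-amplitude parts of x while keeping its phase."""
    out = []
    for z in analytic_signal(x):
        amplitude = abs(z)        # instantaneous amplitude
        phase = cmath.phase(z)    # instantaneous phase
        # Assumed nonlinearity: small amplitudes are reduced far more than large ones.
        modified = amplitude * amplitude / (amplitude + threshold)
        out.append(modified * math.cos(phase))  # recombine amplitude and phase
    return out
```

With the default threshold, a tone of amplitude 1 is scaled by about 0.91, while a tone of amplitude 0.01 is scaled by about 0.09, so weak components are pushed down relative to strong ones.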
Noise-robust feature extraction using multi-layer principal component analysis

Patent number: 7082394

Abstract: Extracting features from signals for use in classification, retrieval, or identification of data represented by those signals uses a “Distortion Discriminant Analysis” (DDA) of a set of training signals to define parameters of a signal feature extractor. The signal feature extractor takes signals having one or more dimensions with a temporal or spatial structure, applies an oriented principal component analysis (OPCA) to limited regions of the signal, aggregates the output of multiple OPCAs that are spatially or temporally adjacent, and applies OPCA to the aggregate. The steps of aggregating adjacent OPCA outputs and applying OPCA to the aggregated values are performed one or more times for extracting low-dimensional noise-robust features from signals, including audio signals, images, video data, or any other time or frequency domain signal. Such extracted features are useful for many tasks, including automatic authentication or identification of particular signals, or particular elements within such signals.

Type: Grant

Filed: June 25, 2002

Date of Patent: July 25, 2006

Assignee: Microsoft Corporation

Inventors: Chris Burges, John Platt
Audio segmentation and classification using threshold values

Patent number: 7080008

Abstract: A portion of an audio signal is separated into multiple frames from which one or more different features are extracted. These different features are used, in combination with a set of rules, to classify the portion of the audio signal into one of multiple different classifications (for example, speech, non-speech, music, environment sound, silence, etc.). In one embodiment, these different features include one or more of line spectrum pairs (LSPs), a noise frame ratio, periodicity of particular bands, spectrum flux features, and energy distribution in one or more of the bands. The line spectrum pairs are also optionally used to segment the audio signal, identifying audio classification changes as well as speaker changes when the audio signal is speech.

Type: Grant

Filed: May 11, 2004

Date of Patent: July 18, 2006

Assignee: Microsoft Corporation

Inventors: Hao Jiang, Hong-Jiang Zhang
System and method for concealment of data loss in digital audio transmission

Patent number: 7069208

Abstract: A system and method for the concealment of errors resulting from missing or corrupted data in the transmission of audio signals in compressed digital packet formats is disclosed. The system utilizes a circular FIFO buffer to store audio frames from the transmitted audio signal, and a beat detector, to identify the presence of beats in the audio signal. The error concealment method replaces erroneous audio frames with error-free audio frames by a process which takes into account the presence and location of the detected beats.

Type: Grant

Filed: January 24, 2001

Date of Patent: June 27, 2006

Assignee: Nokia, Corp.

Inventor: Ye Wang
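
The beat-aware replacement described in the abstract above can be sketched in Python as follows. This is a minimal sketch, not the patented method: lost frames are represented by `None`, the per-frame beat flags are assumed to come from a separate detector (with the flag for a lost frame predicted from the beat pattern), and the rule "reuse the most recent good frame with the same beat status" is an assumption.

```python
from collections import deque

def conceal_stream(frames, beat_flags, buffer_frames=8):
    """Replace lost frames with stored error-free frames, respecting beats."""
    history = deque(maxlen=buffer_frames)  # bounded FIFO of (frame, is_beat) pairs
    output = []
    for frame, is_beat in zip(frames, beat_flags):
        if frame is None:
            # Prefer the most recent good frame whose beat status matches.
            match = next((f for f, b in reversed(history) if b == is_beat), None)
            if match is None and history:
                match = history[-1][0]  # fall back to the latest good frame
            output.append(match)
        else:
            history.append((frame, is_beat))
            output.append(frame)
    return output
```

Substituting a beat frame for a lost beat frame keeps the rhythm intact, which tends to be less noticeable than repeating whatever frame came last.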
Audio decoding apparatus and method for band expansion with aliasing adjustment

Patent number: 7069212

Abstract: An audio decoding apparatus decodes high frequency component signals using a band expander that generates multiple high frequency subband signals from low frequency subband signals divided into multiple subbands and transmitted high frequency encoded information. The apparatus is provided with an aliasing detector and an aliasing remover. The aliasing detector detects the degree of occurrence of aliasing components in the multiple high frequency subband signals generated by the band expander. The aliasing remover suppresses aliasing components in the high frequency subband signals by adjusting the gain used to generate the high frequency subband signals. Thus occurrence of aliasing can be suppressed and the resulting degradation in sound quality can be reduced, even when real-valued subband signals are used in order to reduce the number of operations.

Type: Grant

Filed: September 11, 2003

Date of Patent: June 27, 2006

Assignees: Matsushita Electric Industrial Co., Ltd., NEC Corporation

Inventors: Naoya Tanaka, Osamu Shimada, Mineo Tsushima, Takeshi Norimatsu, Kok Seng Chong, Kim Hann Kuah, Sua Hong Neo, Toshiyuki Nomura, Yuichiro Takamizawa, Masahiro Serizawa
Method of speech recognition with compensation for both channel distortion and background noise

Patent number: 7062433

Abstract: A method of speech recognition with compensation is provided by modifying HMM models trained on clean speech with cepstral mean normalization. For all speech utterances, the mean MFCC vector is calculated for the clean database. This mean MFCC vector is added to the original models. An estimate of the background noise is determined for a given speech utterance. The model mean vectors adapted to the noise are determined. The mean vector of the noisy data over the noisy speech space is determined and this is removed from model mean vectors adapted to noise to get the target model.

Type: Grant

Filed: January 18, 2002

Date of Patent: June 13, 2006

Assignee: Texas Instruments Incorporated

Inventor: Yifan Gong
Audio decoding apparatus and method for band expansion with aliasing suppression

Patent number: 7058571

Abstract: A wideband, high quality audio signal is decoded with few calculations at a low bitrate. Unwanted spectrum components accompanying sinusoidal signal injection by a synthesis subband filter built with real-value operations are suppressed by inserting a suppression signal to subbands adjacent to the subband to which the sine wave is injected. This makes it possible to inject a desired sinusoid with few calculations.

Type: Grant

Filed: July 30, 2003

Date of Patent: June 6, 2006

Assignees: Matsushita Electric Industrial Co., Ltd., NEC Corporation

Inventors: Mineo Tsushima, Naoya Tanaka, Takeshi Norimatsu, Kok Seng Chong, Kim Hann Kuah, Sua Hong Neo, Toshiyuki Nomura, Osamu Shimada, Yuichiro Takamizawa, Masahiro Serizawa
Speech signal decoding method and apparatus using decoded information smoothed to produce reconstructed speech signal of enhanced quality

Patent number: 7050968

Abstract: In a speech signal decoding method, information containing at least a sound source signal, gain, and filter coefficients is decoded from a received bit stream. Voiced speech and unvoiced speech of a speech signal are identified using the decoded information. Smoothing processing based on the decoded information is performed for at least either one of the decoded gain and decoded filter coefficients in the unvoiced speech. The speech signal is decoded by driving a filter having the decoded filter coefficients by an excitation signal obtained by multiplying the decoded sound source signal by the decoded gain using the result of the smoothing processing. A speech signal decoding apparatus is also disclosed.

Type: Grant

Filed: July 27, 2000

Date of Patent: May 23, 2006

Assignee: NEC Corporation

Inventor: Atsushi Murashima
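
The gain smoothing described in the abstract above can be sketched in Python as follows. This is an illustrative sketch only: the voiced/unvoiced flags are assumed to be already derived from the decoded information, and first-order recursive smoothing with a hypothetical coefficient `alpha` stands in for whatever smoothing the patent specifies.

```python
def smooth_unvoiced_gains(gains, voiced, alpha=0.5):
    """Smooth decoded gains in unvoiced frames; pass voiced frames through."""
    smoothed = []
    state = None  # running smoothed gain for the current unvoiced run
    for gain, is_voiced in zip(gains, voiced):
        if is_voiced:
            state = None            # reset at each voiced frame
            smoothed.append(gain)
        else:
            state = gain if state is None else alpha * state + (1.0 - alpha) * gain
            smoothed.append(state)
    return smoothed
```

The excitation for each frame would then be the decoded sound source signal multiplied by the corresponding smoothed gain, which reduces frame-to-frame fluctuation in noise-like segments.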
