Post-transmission Patents (Class 704/228)

Speech post-processing using MDCT coefficients

Patent number: 7590523

Abstract: There is provided a speech post-processor for enhancing a speech signal divided into a plurality of sub-bands in frequency domain. The speech post-processor comprises an envelope modification factor generator configured to use frequency domain coefficients representative of an envelope derived from the plurality of sub-bands to generate an envelope modification factor for the envelope derived from the plurality of sub-bands, where the envelope modification factor is generated using FAC=?ENV/Max+(1??), where FAC is the envelope modification factor, ENV is the envelope, Max is the maximum envelope, and ? is a value between 0 and 1, where ? is a different constant value for each speech coding rate. The speech post-processor further comprises an envelope modifier configured to modify the envelope derived from the plurality of sub-bands by the envelope modification factor corresponding to each of the plurality of sub-bands.

Type: Grant

Filed: March 20, 2006

Date of Patent: September 15, 2009

Assignee: Mindspeed Technologies, Inc.

Inventor: Yang Gao
Robust decoder

Patent number: 7590531

Abstract: Techniques and tools related to delayed or lost coded audio information are described. For example, a concealment technique for one or more missing frames is selected based on one or more factors that include a classification of each of one or more available frames near the one or more missing frames. As another example, information from a concealment signal is used to produce substitute information that is relied on in decoding a subsequent frame. As yet another example, a data structure having nodes corresponding to received packet delays is used to determine a desired decoder packet delay value.

Type: Grant

Filed: August 4, 2005

Date of Patent: September 15, 2009

Assignee: Microsoft Corporation

Inventors: Hosam A. Khalil, Tian Wang, Kazuhito Koishida, Xiaoqin Sun, Wei-Ge Chen
Concealment of frame erasures and method

Patent number: 7587315

Abstract: A decoder for code excited LP encoded frames with both adaptive and fixed codebooks; erased frame concealment uses repetitive excitation plus a smoothing of pitch gain in the next good frame, plus multilevel voicing classification with multiple thresholds of correlations determining linear interpolated adaptive and fixed codebook excitation contributions.

Type: Grant

Filed: February 27, 2002

Date of Patent: September 8, 2009

Assignee: Texas Instruments Incorporated

Inventor: Takahiro Unno
POST FILTER, DECODER, AND POST FILTERING METHOD

Publication number: 20090216527

Abstract: A post filter and a decoder enabling improvement of the sound quality of a decoded signal even when the sound quality of the decoded signal is different from the bands are disclosed. A frequency converting section determines a decoded spectrum. A power spectrum computing section computes the power spectrum from the decoded spectrum. A correction band determining section determines the band in which the power spectrum is corrected according to layer information. A power spectrum correcting section corrects the power spectrum in the corrected band in such a way that the variation along the frequency axis is suppressed. An inverse converting section subjects the corrected power spectrum to inverse conversion to determine an autocorrelation function. An LPC analyzing section determines an LPC coefficient of the determined autocorrelation function.

Type: Application

Filed: June 15, 2006

Publication date: August 27, 2009

Applicant: MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD.

Inventor: Masahiro Oshikiri
Multimode speech coding apparatus and decoding apparatus

Patent number: 7577567

Abstract: Square sum calculator 603 calculates a square sum of evolution in smoothed quantized LSP parameter for each order. A first dynamic parameter is thereby obtained. Square sum calculator 605 calculates a square sum using a square value of each order. The square sum is a second dynamic parameter. Maximum value calculator 606 selects a maximum value from among square values for each order. The maximum value is a third dynamic parameter. The first to third dynamic parameters are output to mode determiner 607, which determines a speech mode by judging the parameters with respective thresholds to output mode information.

Type: Grant

Filed: December 12, 2006

Date of Patent: August 18, 2009

Assignee: Panasonic Corporation

Inventor: Hiroyuki Ehara
Method for recovering target speech based on amplitude distributions of separated signals

Patent number: 7562013

Abstract: The present invention provides a method for recovering target speech based on shapes of amplitude distributions of split spectra obtained by use of blind signal separation.

Type: Grant

Filed: August 31, 2004

Date of Patent: July 14, 2009

Assignee: Kitakyushu Foundation For The Advancement of Industry, Science and Technology

Inventors: Hiromu Gotanda, Keiichi Kaneda, Takeshi Koya
Method and computer program product for generating recognition error correction information

Publication number: 20090144056

Abstract: A method for providing recognition error correction information, the method includes: obtaining metadata associated with a capture of a media item; and generating recognition error correction information in response to the metadata. The recognition error correction information is to be used in a recognition process selected out of a list consisting of an automatic speech recognition process and an optical characters recognition process.

Type: Application

Filed: November 29, 2007

Publication date: June 4, 2009

Inventors: Netta Aizenbud-Reshef, Ella Barkan, Eran Belinsky, Jonathan Joseph Mamou, Yaakov Navon, Boaz Ophir
Noise reduction using correction vectors based on dynamic aspects of speech and noise normalization

Patent number: 7542900

Abstract: A method and apparatus are provided for reducing noise in a signal. Under one aspect of the invention, a correction vector is selected based on a noisy feature vector that represents a noisy signal. The selected correction vector incorporates dynamic aspects of pattern signals. The selected correction vector is then added to the noisy feature vector to produce a cleaned feature vector. In other aspects of the invention, a noise value is produced from an estimate of the noise in a noisy signal. The noise value is subtracted from a value representing a portion of the noisy signal to produce a noise-normalized value. The noise-normalized value is used to select a correction value that is added to the noise-normalized value to produce a cleaned noise-normalized value. The noise value is then added to the cleaned noise-normalized value to produce a cleaned value representing a portion of a cleaned signal.

Type: Grant

Filed: May 5, 2006

Date of Patent: June 2, 2009

Assignee: Microsoft Corporation

Inventors: James G. Droppo, Li Deng, Alejandro Acero
Audio signal quality enhancement in a digital network

Patent number: 7539615

Abstract: The invention relates to a network element (1) and a method for enhancing the quality of digitised analogue signals transmitted in parameterised coded form via a digital network. In order to enable an enhancement of the quality of the digitised analogue signals on network side, the network element comprises means (20, 21) for extracting signals from and insert signals into the network, first processing means (24) for processing the extracted parameters in the parameter domain with functions suitable to enhance the quality of the digitised analogue signals and second processing means (26) for processing the extracted parameters in the linear domain with functions suitable to enhance the quality of the digitised analogue signals. Moreover included analysing and selecting means (23, 27) determine the expected enhancement of quality in the different processing domains and cause a corresponding insertion of processed signals back into the network. The proposed method comprises corresponding steps.

Type: Grant

Filed: December 29, 2000

Date of Patent: May 26, 2009

Assignee: Nokia Siemens Networks Oy

Inventors: Tommi Koistinen, Olli Kirla
System and method for implementing real-time adaptive threshold triggering in acoustic detection systems

Patent number: 7536301

Abstract: A system and method to generate a trigger signal based on a real-time adaptive threshold. The system may include a microphone to receive an audio signal, a device to generate a trigger signal based on a real-time adaptive threshold coupled to the microphone to form an adaptive threshold and generate a trigger signal if a magnitude of the audio signal is greater than a magnitude of the adaptive threshold. The system may also include a waveform capture module coupled to the microphone to receive the audio signal and convert the audio signal into a series of waveform packets and a waveform analysis processor to extract characteristics from the waveform packets.

Type: Grant

Filed: January 3, 2005

Date of Patent: May 19, 2009

Assignee: AAI Corporation

Inventors: James Jaklitsch, Gary Hartman, Jay Markey, Niall B. McNelis
Method for recovering target speech based on speech segment detection under a stationary noise

Patent number: 7533017

Abstract: Method for recovering target speech by extracting signal components falling in a speech segment, which is determined based on separated signals obtained through the Independent Component Analysis, thereby minimizing the residual noise in the recovered target speech.

Type: Grant

Filed: August 31, 2004

Date of Patent: May 12, 2009

Assignee: Kitakyushu Foundation for the Advancement of Industry, Science and Technology

Inventors: Hiromu Gotanda, Keiichi Kaneda, Takeshi Koya
Apparatus and method for producing a data stream and apparatus and method for reading a data stream

Patent number: 7526432

Abstract: An entropy encoder includes an apparatus for producing a data stream which comprises two reference points, of code words of variable lengths, the apparatus comprising a first device for writing at least a part of a code word into the data stream in a first direction of writing, starting from a first reference point, and a second device for writing at least a part of a code word into the data stream in a second direction of writing, which is opposite to the first direction of writing, starting from the other reference point. In particular, when a raster having a plurality of segments is used to write the code words of variable lengths into the data stream, the number of the code words which can be written starting at raster points is doubled, in the best case, such that the data stream of code words of variable lengths is robust toward a propagation of sequence errors.

Type: Grant

Filed: January 22, 2008

Date of Patent: April 28, 2009

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Ralph Sperschneider, Martin Dietz, Daniel Homm, Reinhold Böhm
Method and apparatus using harmonic-model-based front end for robust speech recognition

Patent number: 7516067

Abstract: A system and method are provided that reduce noise in speech signals. The system and method decompose a noisy speech signal into a harmonic component and a residual component. The harmonic component and residual component are then combined as a sum to form a noise-reduced value. In some embodiments, the sum is a weighted sum where the harmonic component is multiplied by a scaling factor. In some embodiments, the noise-reduced value is used in speech recognition.

Type: Grant

Filed: August 25, 2003

Date of Patent: April 7, 2009

Assignee: Microsoft Corporation

Inventors: Michael Seltzer, James Droppo, Alejandro Acero
Adaptive postfiltering methods and systems for decoding speech

Patent number: 7512535

Abstract: A filter controller processes a decoded speech (DS) signal. The DS signal has a spectral envelope including a first plurality of formant peaks having different respective amplitudes. The controller produces, from the DS signal, a spectrally-flattened DS signal that is a time-domain signal. The spectrally-flattened time-domain DS signal has a spectral envelope including a second plurality of formant peaks. Each of the second plurality of formant peaks approximately coincides in frequency with a respective one of the first plurality of formant peaks. Also, the second plurality of formant peaks have approximately equal respective amplitudes. Next, the controller derives, from the spectrally-flattened time-domain DS signal, a set of filter coefficients representative of a filter response that is to be used to filter the DS signal.

Type: Grant

Filed: June 28, 2002

Date of Patent: March 31, 2009

Assignee: Broadcom Corporation

Inventors: Juin-Hwey Chen, Jes Thyssen
Apparatuses for adaptively controlling processing of speech signal and adaptively communicating speech in accordance with conditions of transmitting apparatus side and radio wave and methods thereof

Patent number: 7509255

Abstract: An apparatus for processing a speech signal includes a receiver, a speech signal decoder, a speech rate conversion information detector, and a speech rate converting processor. The receiver receives multiplexed signal of information concerning controls and programs, including speech packets through a transmission line. The decoder decodes the speech signal of packets out of the received signals. The detector detects speech rate conversion execution information in the received signals. The processor subjects the decoded speech signal to a speech rate conversion process if the speech rate conversion execution information indicates that the speech signal has not been subjected to the speech rate conversion process on the transmitting end, and which does not subject the decoded speech signal to the speech rate conversion process if the speech rate conversion execution information indicates that the speech signal has been subjected to the speech rate conversion process on the transmitting end.

Type: Grant

Filed: September 28, 2004

Date of Patent: March 24, 2009

Assignee: Victor Company of Japan, Limited

Inventors: Hiroyuki Takeishi, Yutaka Ichinoi
Speech signal transmission apparatus and method that multiplex and packetize coded information

Patent number: 7502735

Abstract: A speech signal transmission apparatus multiplexes, packetizes, and sends first coded information coded in a normal state and second coded information used for improving the quality of decoded speech when a frame loss occurs. A first error calculating section calculates a first error signal between a target signal and a synthesized signal generated by an adaptive codebook, and a second error calculating section calculates a second error signal between the target signal and a synthesized signal generated by a fixed codebook. An error signal ratio calculating section calculates the ratio of the first error signal to the second error signal. A speech frame classifying section classifies a speech frame according to the magnitude of the ratio, and a decision section decides whether or not to multiplex the second coded information based on the classification result.

Type: Grant

Filed: August 24, 2004

Date of Patent: March 10, 2009

Assignee: Panasonic Corporation

Inventor: Hiroyuki Ehara
Estimation method and apparatus of overall conversational quality taking into account the interaction between quality factors

Patent number: 7499856

Abstract: The delay time and listening quality of a system under test are measured from a signal received therefrom, then the measured delay time and listening quality are transformed to a delay-related degradation and a listening quality degradation on the same quality measure, then the quantity of interaction between the delay-related degradation and the listening quality degradation is calculated, and the delay-related degradation, the listening quality degradation and the quantity of interaction are added together to obtain an overall degradation. The overall degradation is transformed to a subjective evaluation value to estimate the overall speech quality.

Type: Grant

Filed: December 22, 2003

Date of Patent: March 3, 2009

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Akira Takahashi, Jun Okamoto, Ginga Kawaguti
BUZZ REDUCTION FOR LOW-COMPLEXITY FRAME ERASURE CONCEALMENT

Publication number: 20090055171

Abstract: A system is described that performs periodic waveform extrapolation based frame erasure concealment (FEC) to generate frames of an output speech signal corresponding to erased frames of encoded bit-stream in a manner reduces buzzy and tonal artifacts in the output speech signal. An embodiment of the invention uses a multiple of a pitch period associated with previously-decoded speech to perform periodic waveform extrapolation for consecutively-erased frames in a frame erasure beyond the first erased frame. An embodiment of the invention also attenuates the extrapolated signal after a threshold number of erased frames so as to reduce the FEC output signal to zero, wherein the threshold number of erased frames is dependent at least in part on the pitch period associated with the previously-decoded speech.

Type: Application

Filed: July 24, 2008

Publication date: February 26, 2009

Applicant: BROADCOM CORPORATION

Inventor: Robert W. Zopf
Digitized audio data reproduction apparatus, and integrated circuit device to output audio signals

Patent number: 7496507

Abstract: An audio data reproduction apparatus, integrated circuit device, and computer program product configured to implement a method for reproducing audio data that includes receiving an audio stream, decoding the audio stream, processing the audio stream or a signal produced by the decoding, selecting the signal produced by the decoding or a signal produced by the processing, and outputting the audio stream, the signal produced by the decoding, or the signal produced by the processing.

Type: Grant

Filed: March 8, 2005

Date of Patent: February 24, 2009

Assignee: Kabushiki Kaisha Toshiba

Inventor: Setsuo Terasaki
Generating LSF vectors

Patent number: 7493255

Abstract: To alleviate problems of signal aliasing and to reduce complexity, Linear Predictive Coefficients (LPCS) are calculated from samples of audio signals and Line Spectral Frequency (LSF) vectors are extracted from the LPCs with a rate higher than a desired vector rate, the LSF vectors comprising values of different LSF parameters. Next, an LSF track is formed for at least one of the LSF parameters. At least one of the formed LSF tracks is then low pass filtered. Finally, decimated LSF vectors are reconstructed from the low pass filtered LSF tracks, the decimated number corresponding to the desired vector rate. The invention equally relates to a corresponding computer program, to corresponding devices and to a corresponding communication network.

Type: Grant

Filed: April 10, 2003

Date of Patent: February 17, 2009

Assignee: Nokia Corporation

Inventors: Khaldoon Taha Al-Naimi, Stephane Villette, Ahmet Kondoz
Audio signal processing

Patent number: 7490044

Abstract: An audio system for processing two channels of audio input to provide more than two output channels. The input may be conventional stereo material or compressed audio signal data. The audio processing includes separating the input signals into frequency bands and processing the frequency bands according to processes which may differ from band to band. The audio processing includes no processing of L?R signals.

Type: Grant

Filed: June 8, 2004

Date of Patent: February 10, 2009

Assignee: Bose Corporation

Inventor: Abhijit Kulkarni
Apparatus, program storage device and method for testing speech recognition in the mobile environment of a vehicle

Patent number: 7487084

Abstract: A testing arrangement provided for speech recognition systems in vehicles. Preferably included are a “mobile client” secured in the vehicle and driven around at a desired speed, an audio system and speaker which plays back a set of prerecorded utterances stored digitally in a computer arrangement such that the speech of a human being is simulated, transmission of the speech signal to a server, followed by speech recognition and signal-to-noise ratio (SNR) computation. Here, the acceptability of the vehicular speech recognition system is preferably determined via comparison with pre-specified standards of recognition accuracy and SNR values.

Type: Grant

Filed: July 31, 2002

Date of Patent: February 3, 2009

Assignee: International Business Machines Corporation

Inventors: Andrew Aaron, Subrata K. Das, David M. Lubensky
Method for adaptive filtering

Patent number: 7478040

Abstract: A method for adaptive long-term filtering of an audio signal, such as a decoded speech signal. The method includes measuring a smoothed periodicity of an audio signal segment, such as an audio frame, wherein the smoothed periodicity is measured by low-pass filtering an instantaneous periodicity of the audio signal segment. The periodicity of the audio signal segment is then increased in a manner that depends upon whether the smoothed periodicity is less than a predetermined threshold. By utilizing a smoothed periodicity measurement in this fashion, more accurate control of the post-filter is provided as compared to conventional solutions. Additionally, the method includes deriving filters by interpolating between filter responses of adjacent audio signal segments to minimize distortion at segment boundaries.

Type: Grant

Filed: October 20, 2004

Date of Patent: January 13, 2009

Assignee: Broadcom Corporation

Inventors: Jes Thyssen, Juin-Hwey Chen
LOW COMPLEXITY DECODER FOR COMPLEX TRANSFORM CODING OF MULTI-CHANNEL SOUND

Publication number: 20080319739

Abstract: A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.

Type: Application

Filed: June 22, 2007

Publication date: December 25, 2008

Applicant: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
Robust separation of speech signals in a noisy environment

Patent number: 7464029

Abstract: A method for improving the quality of a speech signal extracted from a noisy acoustic environment is provided. In one approach, a signal separation process is associated with a voice activity detector. The voice activity detector is a two-channel detector, which enables a particularly robust and accurate detection of voice activity. When speech is detected, the voice activity detector generates a control signal. The control signal is used to activate, adjust, or control signal separation processes or post-processing operations to improve the quality of the resulting speech signal. In another approach, a signal separation process is provided as a learning stage and an output stage. The learning stage aggressively adjusts to current acoustic conditions, and passes coefficients to the output stage. The output stage adapts more slowly, and generates a speech-content signal and a noise dominant signal.

Type: Grant

Filed: July 22, 2005

Date of Patent: December 9, 2008

Assignee: QUALCOMM Incorporated

Inventors: Erik Visser, Jeremy Toman, Kwokleung Chan
Noise-robust feature extraction using multi-layer principal component analysis

Patent number: 7457749

Abstract: Extracting features from signals for use in classification, retrieval, or identification of data represented by those signals uses a “Distortion Discriminant Analysis” (DDA) of a set of training signals to define parameters of a signal feature extractor. The signal feature extractor takes signals having one or more dimensions with a temporal or spatial structure, applies an oriented principal component analysis (OPCA) to limited regions of the signal, aggregates the output of multiple OPCAs that are spatially or temporally adjacent, and applies OPCA to the aggregate. The steps of aggregating adjacent OPCA outputs and applying OPCA to the aggregated values are performed one or more times for extracting low-dimensional noise-robust features from signals, including audio signals, images, video data, or any other time or frequency domain signal. Such extracted features are useful for many tasks, including automatic authentication or identification of particular signals, or particular elements within such signals.

Type: Grant

Filed: June 7, 2006

Date of Patent: November 25, 2008

Assignee: Microsoft Corporation

Inventors: Chris Burges, John Platt
Separating multiple audio signals recorded as a single mixed signal

Patent number: 7454333

Abstract: A method according to the invention separates multiple audio signals recorded as a mixed signal via a single channel. The mixed signal is A/D converted and sampled. A sliding window is applied to the samples to obtain frames. The logarithms of the power spectra of the frames are determined. From the spectra, the a posteriori probabilities of pairs of spectra are determined. The probabilities are used to obtain Fourier spectra for each individual signal in each frame. The invention provides a minimum-mean-squared error metho or a soft mask method for making this determination. The Fourier spectra are inverted to obtain corresponding signals, which are concatenated to recover the individual signals.

Type: Grant

Filed: September 13, 2004

Date of Patent: November 18, 2008

Assignee: Mitsubishi Electric Research Lab, Inc.

Inventors: Bhiksha Ramakrishnan, Aarthi M. Reddy
Subtractive cancellation of harmonic noise

Patent number: 7453963

Abstract: A common problem in audio processing is that a useful signal is disturbed by one or more sinusoidal noises that should be suppressed. One embodiment of the invention provides a method of canceling a sinusoidal disturbance of unknown frequency in a disturbed useful signal. The method comprises the steps of estimating parameters of the sinusoidal disturbance including amplitude, phase and frequency; generating a reference signal on the basis of the estimated parameters; and subtracting the reference signal from the disturbed useful signal. According to one embodiment of the present invention, the estimation is performed by an Extended Kalman filter.

Type: Grant

Filed: May 25, 2005

Date of Patent: November 18, 2008

Assignee: Honda Research Institute Europe GmbH

Inventors: Frank Joublin, Martin Heckmann, Björn Schölling
Detection of signal modifications in audio streams with embedded code

Patent number: 7451092

Abstract: An encoder transforms at least a portion of a signal, counts the resulting transform coefficients having a zero value, and encodes the signal with the zero count. A decoder decodes the signal in order to recover the zero count. The decoder may also determine its own zero count of the signal as received and may compare the zero count that it determines to the recovered zero count. The decoder may be arranged to detect compression/decompression based upon results from the comparison, and/or the decoder may be arranged to prevent use of a device based upon results from the comparison.

Type: Grant

Filed: March 5, 2004

Date of Patent: November 11, 2008

Assignee: Nielsen Media Research, Inc. a Delaware corporation

Inventor: Venugopal Srinivasan
Method and apparatus for multi-sensory speech enhancement

Patent number: 7447630

Abstract: A method and system use an alternative sensor signal received from a sensor other than an air conduction microphone to estimate a clean speech value. The estimation uses either the alternative sensor signal alone, or in conjunction with the air conduction microphone signal. The clean speech value is estimated without using a model trained from noisy training data collected from an air conduction microphone. Under one embodiment, correction vectors are added to a vector formed from the alternative sensor signal in order to form a filter, which is applied to the air conductive microphone signal to produce the clean speech estimate. In other embodiments, the pitch of a speech signal is determined from the alternative sensor signal and is used to decompose an air conduction microphone signal. The decomposed signal is then used to determine a clean signal estimate.

Type: Grant

Filed: November 26, 2003

Date of Patent: November 4, 2008

Assignee: Microsoft Corporation

Inventors: Zicheng Liu, Michael J. Sinclair, Alejandro Acero, Xuedong D. Huang, James G. Droppo, Li Deng, Zhengyou Zhang, Yanli Zheng
Method, device and program for extracting and recognizing voice

Patent number: 7440892

Abstract: In a method of extracting voice components free of noise components from voice signals input through a single microphone, a signal-decomposing unit extracts independent signal components from the voice signals input through a single microphone by using a plurality of filters that permit the passage of signal components of different frequency bands. A signal-synthesizing unit synthesizes the signal components according to a first rule to form a first synthesized signal, and synthesizes the signal components according to a second rule to form a second synthesized signal. The first and second rules are so determined that a difference becomes a maximum between the probability density function of the first synthesized signal and the probability density function of the second synthesized signal. An output selection unit selectively produces a synthesized signal having a large difference from the Gaussian distribution between the synthesized signals.

Type: Grant

Filed: March 8, 2005

Date of Patent: October 21, 2008

Assignee: Denso Corporation

Inventor: Shinichi Tamura
Telecommunications system for minimizing the effect of white noise data packets for the generation of required white noise on transmission channel utilization

Patent number: 7436786

Abstract: Minimizing the effects of the requisite AGWN packets on transmission channel utilization without diminishing any of the aesthetic quality of the AGWN white noise on the voice or audio communication.

Type: Grant

Filed: December 9, 2003

Date of Patent: October 14, 2008

Assignee: International Business Machines Corporation

Inventor: Oliver Keren Ban
Method and system for optimizing denoising parameters using compressibility

Patent number: 7436969

Abstract: In various embodiments of the present invention, a noisy signal denoiser is tuned and optimized by selecting denoiser parameters that provide relatively highly compressible denoiser output. When the original signal can be compared to the output of a denoiser, the denoiser can be accurately tuned and adjusted in order to produce a denoised signal that resembles as closely as possible the clear signal originally transmitted through a noise-introducing channel. However, when the clear signal is not available, as in many communications applications, other methods are needed. By adjusting the parameters to provide a denoised signal that is globally or locally maximally compressible, the denoiser can be optimized despite inaccessibility of the original, clear signal.

Type: Grant

Filed: September 2, 2004

Date of Patent: October 14, 2008

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Gadiel Seroussi, Sergio Verdu, Marcelo Weinberger, Itschak Weissman, Erik Ordentlich
Apparatus and method for producing a data stream and apparatus and method for reading a data stream

Patent number: 7433825

Abstract: An entropy encoder includes an apparatus for producing a data stream which comprises two reference points, of code words of variable lengths, the apparatus comprising a first device for writing at least a part of a code word into the data stream in a first direction of writing, starting from a first reference point, and a second device for writing at least a part of a code word into the data stream in a second direction of writing, which is opposite to the first direction of writing, starting from the other reference point. In particular, when a raster having a plurality of segments is used to write the code words of variable lengths into the data stream, the number of the code words which can be written starting at raster points is doubled, in the best case, such that the data stream of code words of variable lengths is robust toward a propagation of sequence errors.

Type: Grant

Filed: January 17, 2000

Date of Patent: October 7, 2008

Assignee: Fraunhofer-Gesellschaft zur Foerderling der Angewandten Forschung E.V.

Inventors: Ralph Sperschneider, Martin Dietz, Daniel Homm, Reinhold Böhm
Method and apparatus for encoding and decoding pause information

Patent number: 7433822

Abstract: At an audio source, pause information is added to audio data, the combination of which is subsequently packetized. The resulting packets are transmitted to an audio destination via a network in which different packets may be subjected to varying levels of delay. At the audio destination, the pause information may be used to insert pauses at appropriate times to accommodate the occurrence of delays in packet delivery. In one embodiment, pauses are inserted based on a hierarchy of pause types. During pauses, audio filler information may be injected. In this manner, the effects of variable network delays upon reconstructed audio may be mitigated.

Type: Grant

Filed: April 25, 2005

Date of Patent: October 7, 2008

Assignee: Research In Motion Limited

Inventors: Dale R. Buchholz, Bashar Jano, Ira Gerson
Impulse noise suppression device

Patent number: 7430255

Abstract: Digital audio data with error detection bits added thereto is inputted to an error detecting and correcting device (4). The correcting device (4) corrects an error when the error is detected in the digital audio data. The digital audio data outputted from the error detecting and correcting device (4) is inputted to an impulse noise suppressing circuit (6). The suppressing circuit (6) operates for a predetermined time period when the correcting device (4) detects an error.

Type: Grant

Filed: October 8, 2002

Date of Patent: September 30, 2008

Assignee: TOA Corporation

Inventors: Takako Shibuya, Tomohisa Tanaka
Speech signal decoding method and apparatus using decoded information smoothed to produce reconstructed speech signal to enhanced quality

Patent number: 7426465

Abstract: In a speech signal decoding method, information containing at least a sound source signal, gain, and filter coefficients is decoded from a received bit stream. Voiced speech and unvoiced speech of a speech signal are identified using the decoded information. Smoothing processing based on the decoded information is performed for at least either one of the decoded gain and decoded filter coefficients in the unvoiced speech. The speech signal is decoded by driving a filter having the decoded filter coefficients by an excitation signal obtained by multiplying the decoded sound source signal by the decoded gain using the result of the smoothing processing. A speech signal decoding apparatus is also disclosed.

Type: Grant

Filed: January 20, 2006

Date of Patent: September 16, 2008

Assignee: NEC Corporation

Inventor: Atsushi Murashima
Speech quality assessment with noise masking

Patent number: 7412375

Abstract: A method and apparatus for assessing the perceptual quality of stereo speech signals transmitted via a telecommunications network and recorded acoustically from an acoustic terminal device in which a mono reference signal comprising a single channel is aligned with a degraded stereo signal comprising a left and a right channel; a delay between each channel of said degraded signal and said reference signal is estimated; a noise masking indicator in dependence upon said estimated delays is generated; the level of the stereo signals is adjusted in dependence upon said noise masking indicator; a set of perceptually relevant parameters for each of said reference and degraded signals are generated; the perceptually relevant parameters of the reference signal with the perceptually relevant parameters of the degraded signal to generate a disturbance profile are compared; and a speech quality prediction is generated in dependence upon said disturbance profile.

Type: Grant

Filed: June 22, 2004

Date of Patent: August 12, 2008

Assignee: Psytechnics Limited

Inventors: Tom Goldstein, Paul Alexander Barrett, Antony William Rix
Bit error concealment methods for speech coding

Patent number: 7406411

Abstract: A system and method of concealing bit errors in a signal are provided. An exemplary method detects bit errors in an input signal having at least a current signal segment and a previous signal segment. The previous signal segment has a log-gain value qlg(m?1) and immediately precedes the current signal segment. The method comprises estimating a level lvl(m?1) of the input signal and determining a log-gain value qlg(m) of the current signal segment within the input signal. The method also comprises determining a difference between the gain value of the current signal segment and the previous signal segment and determining whether the difference exceeds a threshold. wherein the threshold is adaptive to the input signal level.

Type: Grant

Filed: August 19, 2002

Date of Patent: July 29, 2008

Assignee: Broadcom Corporation

Inventor: Juin-Hwey Chen
Control system outputting received speech with display of a predetermined effect or image corresponding to its ambient noise power spectrum

Patent number: 7403895

Abstract: A control system receiving an input signal, comprising speech with ambient noise, determines the ambient noise power spectrum, retrieves control information corresponding to the closest-matching ambient noise power spectrum, and outputs the input signal along with display of a predetermined effect or image corresponding to the ambient noise power spectrum.

Type: Grant

Filed: June 24, 2003

Date of Patent: July 22, 2008

Assignee: Fujitsu Limited

Inventors: Toru Iwamoto, Naoya Takahashi
Processing a speech frame in a radio system

Patent number: 7401022

Abstract: The invention relates to method of processing a speech frame in radio system, a radio system, a mobile station in radio system, and a network of radio system. In the method a speech frame having propagated over a radio path is channel-decoded. If on the basis of the channel-decoding the speech frame is free of defects, it is inferred from the value of at least one speech parameter in the channel-decoded speech frame whether the speech frame contains speech that is decodable by means of a speech decoder; and if, according to the inference, the speech frame does contain speech that is decodable by means of a speech decoder, the speech frame is decoded by means of a speech decoder; and if, according to the inference, the speech frame does not contain speech that would be decodable by means of the speech decoder, the speech frame is not decoded.

Type: Grant

Filed: September 17, 2001

Date of Patent: July 15, 2008

Assignee: Nokia Corporation

Inventor: Petri Ahonen
SPECTRO-TEMPORAL VARYING APPROACH FOR SPEECH ENHANCEMENT

Publication number: 20080167866

Abstract: The present system proposes a technique called the spectro-temporal varying technique, to compute the suppression gain. This method is motivated by the perceptual properties of human auditory system; specifically, that the human ear has higher frequency resolution in the lower frequencies band and less frequency resolution in the higher frequencies, and also that the important speech information in the high frequencies are consonants which usually have random noise spectral shape. A second property of the human auditory system is that the human ear has lower temporal resolution in the lower frequencies and higher temporal resolution in the higher frequencies. Based on that, the system uses a spectro-temporal varying method which introduces the concept of frequency-smoothing by modifying the estimation of the a posteriori SNR. In addition, the system also makes the a priori SNR time-smoothing factor depend on frequency.

Type: Application

Filed: December 20, 2007

Publication date: July 10, 2008

Applicant: HARMAN INTERNATIONAL INDUSTRIES, INC.

Inventors: Phil A. Hetherington, Xueman Li
Method and Apparatus for Robust Speech Activity Detection

Publication number: 20080147389

Abstract: A method and apparatus for robust speech activity detection is disclosed. The method may include calculating autocorrelations by filtering input signals using order statistic filtering, averaging the autocorrelations over a time period, obtaining a voiced speech feature from the averaged autocorrelations, classifying the input signal as one of speech and non-speech based on the obtained voiced speech feature, and outputting only the classified speech signals or the input signals along with the speech/non-speech classification information, to an automated speech recognizer.

Type: Application

Filed: December 15, 2006

Publication date: June 19, 2008

Applicant: Motorola, Inc.

Inventor: Dusan Macho
Method for speech coding, method for speech decoding and their apparatuses

Patent number: 7383177

Abstract: A high quality speech is reproduced with a small data amount in speech coding and decoding for performing compression coding and decoding of a speech signal to a digital signal. In speech coding method according to a code-excited linear prediction (CELP) speech coding, a noise level of a speech in a concerning coding period is evaluated by using a code or coding result of at least one of spectrum information, power information, and pitch information, and various excitation codebooks are used based on an evaluation result.

Type: Grant

Filed: July 26, 2005

Date of Patent: June 3, 2008

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventor: Tadashi Yamaura
Pitch adaptive equalization for improved audio

Patent number: 7383175

Abstract: A pitch adaptive circuit (200) includes an equalizer control circuit (206) that evaluates the pitch of the speech signals that are being processed and depending on the pitch information, the equalizer control circuit (206) selects an equalizer (208, 210) to shape the decoded speech signals. By selecting the best equalizer (208 or 210) to use based on the pitch information, improvements in audio quality are provided automatically without user intervention.

Type: Grant

Filed: March 25, 2003

Date of Patent: June 3, 2008

Assignee: Motorola, Inc.

Inventors: Patrick J. Doran, Stephen S. Shiao
Method of cascading noise reduction algorithms to avoid speech distortion

Patent number: 7383179

Abstract: A method of reducing noise by cascading a plurality of noise reduction algorithms is provided. A sequence of noise reduction algorithms are applied to the noisy signal. The noise reduction algorithms are cascaded together, with the final noise reduction algorithm in the sequence providing the system output signal. The sequence of noise reduction algorithms includes a plurality of noise reduction algorithms that are sufficiently different from each other such that resulting distortions and artifacts are sufficiently different to result in reduced human perception of the artifact and distortion levels in the system output signal.

Type: Grant

Filed: September 28, 2004

Date of Patent: June 3, 2008

Assignee: Clarity Technologies, Inc.

Inventors: Rogerio G. Alves, Kuan-Chich Yen, Jeff Chisholm
System and methods for concealing errors in data transmission

Patent number: 7379865

Abstract: A frame erasure concealment device and method that is based on reestimating gain parameters for a code excited linear prediction (CELP) coder is disclosed. During operation, when a frame in a stream of received data is detected as being erased, the coding parameters, especially an adaptive codebook gain gp and a fixed codebook gain gc, of the erased and subsequent frames can be reestimated by a gain matching procedure. By using this technique with the IS-641 speech coder, it has been found that the present invention improves frame erasure concealment device and method improve the speech quality under various channel conditions, compared with a conventional extrapolation-based concealment algorithm.

Type: Grant

Filed: October 26, 2001

Date of Patent: May 27, 2008

Assignee: AT&T Corp.

Inventors: Hong-Goo Kang, Hong Kook Kim
Simple noise suppression model

Patent number: 7379866

Abstract: An approach for efficiently reducing background noise from speech signal in real-time applications is presented. A noisy input speech signal is processed through an inverse filter when the spectrum tilt of the input signal is not that of a pure background noise model the noisy input signal is also filtered in order to reduce the spectrum valley areas of the noisy input signal when the background noise is present.

Type: Grant

Filed: March 11, 2004

Date of Patent: May 27, 2008

Assignee: Mindspeed Technologies, Inc.

Inventor: Yang Gao
Automated speech recognition filter

Patent number: 7373297

Abstract: An automated speech recognition filter is disclosed. The automated speech recognition filter device provides a speech signal to an automated speech platform that approximates an original speech signal as spoken into a transceiver by a user. In providing the speech signal, the automated speech recognition filter determines various models representative of a cumulative signal degradation of the original speech signal from various devices along a transmission signal path and a reception signal path between the transceiver and a device housing the filter. The automated speech platform can thereby provide an audio signal corresponding to a context of the original speech signal.

Type: Grant

Filed: February 6, 2004

Date of Patent: May 13, 2008

Assignee: General Motors Corporation

Inventors: Stephen C. Habermas, Ognjen Todic, Kai-Ten Feng, Jane F. MacFarlane
Method for speech coding, method for speech decoding and their apparatuses

Patent number: 7363220

Abstract: A high quality speech is reproduced with a small data amount in speech coding and decoding for performing compression coding and decoding of a speech signal to a digital signal.

Type: Grant

Filed: March 28, 2005

Date of Patent: April 22, 2008

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventor: Tadashi Yamaura

prev … 3 4 5 6 7 8 9 10 11 next