With Content Reduction Encoding Patents (Class 704/501)

WARPED SPECTRAL AND FINE ESTIMATE AUDIO ENCODING

Publication number: 20120143599

Abstract: A warped spectral estimate of an original audio signal can be used to encode a representation of a fine estimate of the original signal. The representation of the warped spectral estimate and the representation of the fine estimate can be sent to a speech recognition system. The representation of the warped spectral estimate can be passed to a speech recognition engine, where it may be used for speech recognition. The representation of the warped spectral estimate can also be used along with the representation of the fine estimate to reconstruct a representation of the original audio signal.

Type: Application

Filed: December 3, 2010

Publication date: June 7, 2012

Applicant: Microsoft Corporation

Inventors: Michael L. Seltzer, James G. Droppo, Henrique S. Malvar, Alejandro Acero, Xing Fan
High-quality encoding at low-bit rates

Patent number: 8195452

Abstract: Methods and devices provide improved perceived quality of an audio (or other) coded signal at a low bit-rate. An input signal may be split into an outlier portion and a stationary portion. The outlier portion of the input signal may be encoded. The stationary portion may be divided into subvectors. Each subvector may be classified as trivial or non-trivial. Each trivial subvector may be encoded using a pre-defined pattern. Each non-trivial subvector may be encoded with at least one location of at least one significant sample and a sign of the significant sample.

Type: Grant

Filed: June 12, 2008

Date of Patent: June 5, 2012

Assignee: Nokia Corporation

Inventors: Ioan Tabus, Adriana Vasilache
Scheme for generating a parametric representation for low-bit rate applications

Patent number: 8194861

Abstract: For generating a parametric representation of a multi-channel signal especially suitable for low-bit rate applications, only the location of the maximum of the sound energy within a replay setup is encoded and transmitted using direction parameter information. For multi-channel reconstruction, the energy distribution of the output channels identified by the direction parameter information is controlled by the direction parameter information, while the energy distribution in the remaining ambience channels is not controlled by the direction parameter information.

Type: Grant

Filed: October 16, 2006

Date of Patent: June 5, 2012

Assignee: Dolby International AB

Inventors: Fredrik Henn, Jonas Roeden
Data reproduction apparatus and data reproduction method

Patent number: 8195317

Abstract: A data reproduction apparatus includes: arithmetic means for calculating difference data that indicate a difference between left-channel and right-channel data that have been compressed in a predetermined compression format; higher harmonic component generation means for generating a higher harmonic component, which was lost during compression, by performing, when the difference data's signal level exceeds a predetermined threshold, a digital limiter process that suppresses the signal level to the threshold; and adding means for adding the higher harmonic component to the left-channel and right-channel data to reproduce original data before being compressed.

Type: Grant

Filed: February 27, 2008

Date of Patent: June 5, 2012

Assignee: Sony Corporation

Inventors: Tokihiko Sawashi, Yasuyuki Kino
Device, method, and program for encoding/decoding of speech with function of encoding silent period

Patent number: 8195469

Abstract: A speech decoding device of the invention smoothes, in decoding speech signal in a voice-less period, RMS and filter coefficients which is discontinuously transmitted, and provides them to a synthesis filter. Thereby, it is capable of preventing discontinuous changing of the filter coefficient caused by the intermittent transmission of the filter coefficient. As a result, a quality of decoding can be improved. Also, to remove an effect, caused by the smoothing process, from the filter coefficients or the RMS which are transmitted in the past frames, a smoothing factor is adjusted not to perform smoothing while a certain time period (or a certain number of frames) from when a transition is made from a voice period from a voice-less period, or when a decoded feature parameter satisfies a predetermined condition.

Type: Grant

Filed: May 31, 2000

Date of Patent: June 5, 2012

Assignee: NEC Corporation

Inventors: Masahiro Serizawa, Hironori Ito
Audio data packet format and decoding method thereof and method for correcting mobile communication terminal codec setup error and mobile communication terminal performance same

Patent number: 8195470

Abstract: Disclosed is an audio data packet format for transmitting an MPEG-4 HE-AAC frame via a voice channel of a mobile communication network, a method for decoding the audio data packet format, a method for correcting a codec setup error by identifying a codec used to encode sound source data inserted into a data field of voice slot data, based on the sequence number of the voice slot data, and correcting the codec setup error when a codec set up in a mobile communication terminal is different from the codec used to encode the sound source data, and a mobile communication terminal adapted to correct a codec setup error.

Type: Grant

Filed: October 31, 2006

Date of Patent: June 5, 2012

Assignee: SK Telecom Co., Ltd.

Inventors: Seongsoo Park, Seongkeun Kim, Sehyun Oh
High quality time-scaling and pitch-scaling of audio signals

Patent number: 8195472

Abstract: In one alternative, an audio signal is analyzed using multiple psychoacoustic criteria to identify a region of the signal in which time scaling and/or pitch shifting processing would be inaudible or minimally audible, and the signal is time scaled and/or pitch shifted within that region. In another alternative, the signal is divided into auditory events, and the signal is time scaled and/or pitch shifted within an auditory event. In a further alternative, the signal is divided into auditory events, and the auditory events are analyzed using a psychoacoustic criterion to identify those auditory events in which the time scaling and/or pitch shifting processing of the signal would be inaudible or minimally audible. Further alternatives provide for multiple channels of audio.

Type: Grant

Filed: October 26, 2009

Date of Patent: June 5, 2012

Assignee: Dolby Laboratories Licensing Corporation

Inventor: Brett Graham Crockett
Playback of compressed media files without quantization gaps

Patent number: 8190441

Abstract: Playback by a decoder of a lossy compressed digital media file without quantization gaps is disclosed. The digital media file can be formed of a number of audio samples grouped into a corresponding number of audio frames. As a method, one embodiment can be carried out by identifying an encoder used to compress the digital media file; obtaining an encoder delay value for the identified encoder; obtaining a decoder delay value for the decoder; determining a audio sample count corresponding to a last valid audio sample; setting a re-synchronization after seek option marker N audio frames from the last valid audio sample; and decoding valid audio samples using the encoder delay value, the decoder delay value, and the sample count corresponding to the last valid audio sample.

Type: Grant

Filed: September 11, 2006

Date of Patent: May 29, 2012

Assignee: Apple Inc.

Inventor: William S. Kincaid
Method and system for dual mode subband acoustic echo canceller with integrated noise suppression

Patent number: 8180648

Abstract: Certain aspects of a method and system for a dual mode subband acoustic echo canceller with integrated noise suppression may include splitting an input signal into a lowband component and a highband component. The subbands of each of the lowband component and the highband component may be processed in order to reduce an echo associated with the input signal and to suppress the noise associated with the input signal.

Type: Grant

Filed: July 25, 2011

Date of Patent: May 15, 2012

Assignee: Broadcom Corporation

Inventors: Wilfrid LeBlanc, Jes Thyssen
Method for limiting adaptive excitation gain in an audio decoder

Patent number: 8180632

Abstract: Decoder for an audio signal coded by a coder including a long-term prediction filter wherein the decoder comprises: a block (211) for detecting transmission frame losses; a module (222) for calculating values of an error indication function representative of the cumulative adaptive excitation error during decoding following said transmission frame loss, an arbitrary value being assigned to said adaptive excitation gain for the lost frame; a module (213) for calculating an error indication parameter from said values of the error indication function; a comparator (214) for comparing said error indication parameter to at least one given threshold; and a discriminator (215) adapted to determine as a function of the results supplied by the comparator (214) a value of at least one adaptive excitation gain to be used by the decoder.

Type: Grant

Filed: February 13, 2007

Date of Patent: May 15, 2012

Assignee: France Telecom

Inventors: Balazs Kovesi, David Virette
Wideband audio signal coding/decoding device and method

Patent number: 8170885

Abstract: Disclosed is a wideband audio signal coding/decoding device and method that may code a wideband audio signal while maintaining a low bit rate. The wideband audio signal coding device includes an enhancement layer that extracts a first spectrum parameter from an inputted wideband signal having a first bandwidth, quantizes the extracted first spectrum parameter, and converts the extracted first spectrum parameter into a second spectrum parameter; and a coding unit that extracts a narrowband signal from the inputted wideband signal and codes the narrowband signal based on the second spectrum parameter provided from the enhancement layer, wherein the narrowband signal has a second bandwidth smaller than the first bandwidth. The wideband audio signal coding/decoding device and method may code a wideband audio signal while maintaining a low bit rate.

Type: Grant

Filed: October 15, 2008

Date of Patent: May 1, 2012

Assignee: Gwangju Institute of Science and Technology

Inventors: Hong Kook Kim, Young Han Lee
MULTI-FUNCTIONAL AUDIO DISTRIBUTION SYSTEM AND METHOD FOR MOVIE THEATERS AND OTHER PUBLIC AND PRIVATE VENUES

Publication number: 20120095749

Abstract: Audiovisual presentation methods, systems and apparatus for improving and enhancing the listening experience of attendees of audiovisual presentations. An exemplary audiovisual presentation system includes an audio processing and distribution unit (APDU) configured to generate and broadcast a wireless audio service containing audio of an audiovisual presentation (e.g., soundtrack and dialogue audio of a movie, in the case of a movie presentation) throughout an audiovisual presentation room or space (e.g., a movie theater, in the case of a movie presentation). The wireless audio service is received by mobile receiving devices (MRDs) having or comprising headsets, headphones or earbuds, through which MRD users listen to the audio of the audiovisual presentation provided by the wireless audio service while viewing images of the audiovisual presentation.

Type: Application

Filed: October 13, 2011

Publication date: April 19, 2012

Inventor: Antonio Capretta
Apparatus and method for encoding/decoding signal

Patent number: 8160258

Abstract: An encoding method and apparatus and a decoding method and apparatus are provided. The decoding method includes extracting a down-mix signal and down-mix identification information from an input bitstream, determining, based on the down-mix identification information, whether the down-mix signal is a 3D down-mix signal obtained by performing a three-dimensional (3D) rendering operation, and if the down-mix signal is not 3D down-mix signal, generating a 3D down-mix signal by performing a 3D rendering operation. Accordingly, it is possible to efficiently encode multi-channel signals with 3D effects and to adaptively restore and reproduce audio signals with optimum sound quality according to the characteristics of an audio reproduction environment.

Type: Grant

Filed: February 7, 2007

Date of Patent: April 17, 2012

Assignee: LG Electronics Inc.

Inventors: Yang Won Jung, Hee Suk Pang, Hyen O Oh, Dong Soo Kim, Jae Hyun Lim
Apparatus and method of encoding and decoding audio signal

Patent number: 8155153

Abstract: In one embodiment, the method includes receiving audio frame data having at least first and second channel data. The first and second channel data include a plurality of blocks, where the blocks are classified by a block type. The first and second channel data is provided jointly if the first and second channel data are paired with each other. The method further includes obtaining frame length information indicating a length of the audio frame data, and obtaining block information indicating the block type. The block information corresponds to the first and second channel data being common when the channel data are paired. The first and second channel data are lossless decoded based on the frame length information and the block information.

Type: Grant

Filed: September 23, 2008

Date of Patent: April 10, 2012

Assignee: LG Electronics Inc.

Inventor: Tilman Liebchen
Apparatus and method of encoding and decoding audio signal

Patent number: 8155144

Abstract: In one embodiment, the method includes receiving audio frame data having at least first and second channel data. The first and second channel data includes a plurality of blocks, where the blocks are classified by a block type. The first and second channel data is provided jointly if the first and second channel data are paired with each other. Block information indicating the block type is obtained. The block information corresponds to the first and second channel data being common when the first and second channel data are paired. The first and second channel data are lossless decoded based on the block information.

Type: Grant

Filed: September 23, 2008

Date of Patent: April 10, 2012

Assignee: LG Electronics Inc.

Inventor: Tilman Liebchen
Apparatus and method of encoding and decoding audio signal

Patent number: 8155152

Abstract: In one embodiment, the method includes receiving audio frame data having at least first and second channel data. The first and second channel data include a plurality of blocks, where the blocks are classified by a block type. The first and second channel data are provided jointly if the first and second channel data are paired with each other. The embodiment further includes obtaining block information indicating the block type, and lossless decoding the first and second channel data based on the block information.

Type: Grant

Filed: September 23, 2008

Date of Patent: April 10, 2012

Assignee: LG Electronics Inc.

Inventor: Tilman Liebchen
Audio decoding of multi-audio-object signal using upmixing

Patent number: 8155971

Abstract: A method for decoding a multi-audio-object signal having audio signals of first and second types encoded therein, the multi-audio-object signal having a downmix signal and side information having level information of the audio signals of the first and second types in a first predetermined time/frequency resolution, the method including computing a prediction coefficient matrix C based on the level information; and up-mixing the downmix signal based on the prediction coefficients to obtain a first and/or a second up-mix audio signal approximating the audio signals of the first and second types, respectively, wherein up-mixing yields the first and/or second up-mix signals S1 and S2 from the downmix signal d according to a computation representable by ( S 1 S 2 ) = D - 1 ? { ( 1 C ) ? d + H } , with “1” denoting—depending on the number of channels of d—a scalar, or an identity matrix, and D?1 being a matrix uniquely determined by a downmix prescription according

Type: Grant

Filed: October 17, 2008

Date of Patent: April 10, 2012

Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung e.V.

Inventors: Oliver Hellmuth, Johannes Hilpert, Leonid Terentiev, Cornelia Falch, Andreas Hoelzer, Juergen Herre
SPATIAL AUDIO ENCODING AND REPRODUCTION OF DIFFUSE SOUND

Publication number: 20120082319

Abstract: A method and apparatus processes multi-channel audio by encoding, transmitting or recording “dry” audio tracks or “stems” in synchronous relationship with time-variable metadata controlled by a content producer and representing a desired degree and quality of diffusion. Audio tracks are compressed and transmitted in connection with synchronized metadata representing diffusion and preferably also mix and delay parameters. The separation of audio stems from diffusion metadata facilitates the customization of playback at the receiver, taking into account the characteristics of local playback environment.

Type: Application

Filed: September 8, 2011

Publication date: April 5, 2012

Inventors: Jean-Marc Jot, Stephen Roger Hastings, James D. Johnston
Apparatus and method of encoding and decoding audio signal

Patent number: 8149876

Abstract: In one embodiment, the method includes receiving audio frame data having at least first and second channel data. The first and second channel data includes a plurality of blocks, where the blocks are classified by a block type. The embodiment further includes obtaining frame length information indicating a length of the audio frame data, and obtaining block information indicating the block type. The block information corresponds to the first and second channel data being common when the first and second channel data are paired. The first and second channel data are lossless decoded based on the frame length information and the block information.

Type: Grant

Filed: September 23, 2008

Date of Patent: April 3, 2012

Assignee: LG Electronics Inc.

Inventor: Tilman Liebchen
Apparatus and method of encoding and decoding audio signal

Patent number: 8149877

Abstract: In one embodiment, the method includes receiving audio frame data having at least first and second channel data. The first and second channel data includes a plurality of blocks, where the blocks are classified by a block type. Block information indicating the block type is obtained. The block information corresponds to the first and second channel data being common when the first and second channel data are paired. The first and second channel data is lossless decoded based on the block information.

Type: Grant

Filed: September 23, 2008

Date of Patent: April 3, 2012

Assignee: LG Electronics Inc.

Inventor: Tilman Liebchen
Apparatus and method of encoding and decoding audio signal

Patent number: 8149878

Abstract: In one embodiment, the method includes receiving audio frame data having at least first and second channel data. The first and second channel data includes a plurality of blocks, where the blocks are classified by a block type. The first and second channel data is provided jointly if the first and second channel data are paired with each other. The method further includes obtaining frame length information indicating a length of the audio frame data, obtaining block information indicating a block type, and lossless decoding the first and second channel data based on the frame length information and the block information.

Type: Grant

Filed: September 23, 2008

Date of Patent: April 3, 2012

Assignee: LG Electronics Inc.

Inventor: Tilman Liebchen
Device and method for generating a coded multi-channel signal and device and method for decoding a coded multi-channel signal

Patent number: 8145498

Abstract: In a multi-channel encoder generating several different parameter sets for reconstructing a multi-channel output signal using at least one transmission channel, the data stream is written such that the two parameter sets are decodable independently of each other. Thus, a multi-channel decoder is enabled to skip a parameter set which is marked as optional and/or has a higher version number when reading the data stream and still to perform a valid multi-channel reconstruction using a data set marked as mandatory or a data set having a sufficiently low version number. This achieves a flexible encoder/decoder concept suitable for future updates characterized by backward compatibility and reliability.

Type: Grant

Filed: March 2, 2007

Date of Patent: March 27, 2012

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Juergen Herre, Ralph Sperschneider, Johannes Hilpert, Karsten Linzmeier, Harald Popp
Method and apparatus for implementing speech decoding in speech decoder field of the invention

Patent number: 8145480

Abstract: The present disclosure relates to a decoding method and apparatus. The method includes: receiving data frames from the coder; if any erroneous frame appears, calculating a pitch lag parameter of the erroneous frame; decoding the data frames according to the calculated pitch lag parameter of the erroneous frame, and obtaining decoded data. The process of determining the pitch lag parameter includes: determining the number of continuous erroneous frames and the pitch lag parameter of the previous frame; adjusting the pitch lag parameter of the previous frame according to the number of the continuous erroneous frames and a preset adjustment policy, and calculating and determining the pitch lag parameter of a current erroneous frame, wherein the preset adjustment policy is adjusting the determined pitch lag parameter of the current erroneous frame within a preset value range according to the number of the continuous erroneous frames.

Type: Grant

Filed: April 20, 2009

Date of Patent: March 27, 2012

Assignee: Huawei Technologies Co., Ltd.

Inventors: Jianfeng Xu, Lijing Xu, Qing Zhang, Wei Li, Shenghu Sang, Zhengzhong Du, Chen Hu
Method, device and system for signal encoding and decoding

Patent number: 8140343

Abstract: A method, device and system for signal encoding and decoding, the method comprising: encoding a core layer signal to obtain a core layer signal code; selecting an enhancement sample point that requires enhancement layer signal encoding according to the core layer signal code and the number of bits that can be used by an enhancement layer; obtaining an enhancement layer signal code of the enhancement sample point; and outputting a bit stream, where the bit stream includes the core layer signal code and the enhancement layer signal code. In embodiments of the present invention, according to the number of bits that can be used by the enhancement layer, the enhancement sample point that requires enhancement layer signal encoding is selected; the enhancement layer signal of the selected enhancement sample point is encoded and decoded; when no sufficient bits are available for the enhancement layer, the enhancement quality of the core layer can be improved.

Type: Grant

Filed: August 15, 2011

Date of Patent: March 20, 2012

Assignee: Huawei Technologies Co., Ltd.

Inventors: Chen Hu, Zexin Liu, Lei Miao, Longyin Chen, Qing Zhang, Wei Xiao, Herve Marcel Taddei
Methods, apparatuses and system for encoding and decoding signal

Patent number: 8135593

Abstract: Methods and apparatuses for encoding a signal and decoding a signal and a system for encoding and decoding are provided. The method for encoding a signal includes performing a classification decision process on high frequency signals of input signals, adaptively encoding the high frequency signals according to the result of the classification decision process, and outputting a bitstream including codes of low frequency signals of the input signals, adaptive codes of the high frequency signals, and the result of the classification decision process. The classification decision process is performed on the high frequency signals, and adaptive encoding or adaptive decoding is performed according to the result of the classification decision process, so the quality of voice and audio output signals is improved.

Type: Grant

Filed: May 3, 2011

Date of Patent: March 13, 2012

Assignee: Huawei Technologies Co., Ltd.

Inventors: Lei Miao, Zexin Liu, Longyin Chen, Chen Hu, Wei Xiao, Herve Marcel Taddei, Qing Zhang
Encoding an information signal

Patent number: 8126721

Abstract: The transient problem may be sufficiently addressed, and for this purpose, a further delay on the side of the decoding may be reduced if a new SBR frame class is used wherein the frame boundaries are not shifted, i.e. the grid boundaries are still synchronized with the frame boundaries, but wherein a transient position indication is additionally used as a syntax element so as to be used, on the encoder and/or decoder sides, within the frames of these new frame class for determining the grid boundaries within these frames.

Type: Grant

Filed: October 18, 2007

Date of Patent: February 28, 2012

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Markus Schnell, Michael Schuldt, Manfred Lutzky, Manuel Jander
Bases dictionary for low complexity matching pursuits data coding and decoding

Patent number: 8121848

Abstract: Embodiments related to utilizing substantially optimal entries for a relatively low complexity dictionary for matching pursuits coding is disclosed. In various embodiments, methods are invoked for determining a substantially optimal entry from a bases dictionary comprising a plurality of entries; and utilizing the substantially optimal entry in a relatively low complexity matching pursuits data coding. In various embodiments, a system is provided comprising a bases dictionary comprising a plurality of entries each with a width of 15 or less; a signal to be coded; and a selection module configured to receive at least one of the plurality of entries from the bases dictionary, to calculate an inner product between the at least one of the plurality of entries and the signal to be coded, and to select the entry from the at least one of the plurality of entries that produces a maximum inner product for use in at least partially coding the signal to be coded.

Type: Grant

Filed: March 17, 2006

Date of Patent: February 21, 2012

Assignee: Pan Pacific Plasma LLC

Inventor: Donald M. Monro
Encoding apparatus and encoding method

Patent number: 8121850

Abstract: An encoding device and an encoding method are provided for encoding by reducing the number of samples to be processed when encoding higher-band spectrum data according to lower-band spectrum data in a wide-band signal. The device and the method can obtain a high-quality decoded signal even if a large quantization distortion is caused in the lower-band spectrum data. When encoding higher-band spectrum data in a signal to be encoded, according to lower-band spectrum data in the signal, only for a part (a head portion) of the higher-band spectrum data, the lower-band spectrum data after being quantized is subjected to approximate partial search and higher-band spectrum data is generated according to the search result.

Type: Grant

Filed: May 9, 2007

Date of Patent: February 21, 2012

Assignee: Panasonic Corporation

Inventors: Tomofumi Yamanashi, Kaoru Sato, Toshiyuki Morii
Multi-staging recursive audio frame-based resampling and time mapping

Patent number: 8117039

Abstract: A multi-stage recursive sample rate converter (“SRC”) typically embodied as digital signal processor provides for an efficient structure for converting digital audio samples at one frequency, such as 48 kHz, to another frequency, such as 44.1 kHz. A parameter codebook comprising memory stores parameters used at a plurality of stages by the SRC. For each stage, a controller coordinates the SRC to use the appropriate set of parameters from the codebook, process an input audio sample stream, and store the intermediate results in a buffer. The controller then causes the intermediate results to be processed again as input to the SRC in a subsequent stage of processing using a different set of parameters. The process is repeated until all stages are completed, and the final results are the output digital audio data stream at the desired sampling rate.

Type: Grant

Filed: December 15, 2008

Date of Patent: February 14, 2012

Assignee: Ericsson Television, Inc.

Inventors: Zhicheng Lancelot Wang, Jianguang Jiang
Method and apparatus for introducing information into a data stream and method and apparatus for encoding an audio signal

Patent number: 8117027

Abstract: Techniques for introducing information into a data stream first obtains the spectral values of the short-term spectrum of the audio signal. Separately, information to be introduced are combined with a spread sequence obtaining a spread information signal, whereupon a spectral representation of the spread information is generated, then weighted with an established psychoacoustic maskable noise energy to generate a weighted information signal, wherein energy of the introduced information is substantially equal to or below the psychoacoustic masking threshold. The weighted information signal and the spectral values of the short-term spectrum of the audio signal are then summed and afterwards processed again to obtain a processed data stream including audio information and information to be introduced.

Type: Grant

Filed: September 25, 2008

Date of Patent: February 14, 2012

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Christian Neubauer, Juergen Herre, Karlheinz Brandenburg, Eric Allamanche
Efficient and scalable parametric stereo coding for low bitrate audio coding applications

Patent number: 8116460

Abstract: The present invention provides improvements to prior art audio codecs that generate a stereo-illusion through post-processing of a received mono signal. These improvements are accomplished by extraction of stereo-image describing parameters at the encoder side, which are transmitted and subsequently used for control of a stereo generator at the decoder side. Furthermore, the invention bridges the gap between simple pseudo-stereo methods, and current methods of true stereo-coding, by using a new form of parametric stereo coding. A stereo-balance parameter is introduced, which enables more advanced stereo modes, and in addition forms the basis of a new method of stereo-coding of spectral envelopes, of particular use in systems where guided HFR (High Frequency Reconstruction) is employed. As a special case, the application of this stereo-coding scheme in scalable HFR-based codecs is described.

Type: Grant

Filed: September 28, 2005

Date of Patent: February 14, 2012

Assignee: Coding Technologies AB

Inventors: Fredrik Henn, Kristofer Kjorling, Lars Liljeryd, Jonas Roden, Jonas Engdegard
Method and device for code conversion between audio encoding/decoding methods and storage medium thereof

Patent number: 8117028

Abstract: When performing audio communication by using different encoding/decoding methods, a code obtained by encoding audio by a certain method is converted into a code decodable by another method with a high audio quality and a small calculation amount. In a code conversion device for converting a first code string into a second code string, an audio decoding circuit acquires a first linear prediction coefficient and excitation signal information from the first code string and drives the filter having the first linear prediction coefficient by the excitation signal obtained from the excitation signal information, thereby creating a first audio signal. A fixed codebook code generation circuit uses the fixed codebook information and minimizes the distance between the second audio signal generated from the information obtained from the second code string and the first audio signal, thereby obtaining the fixed codebook information in the second code string.

Type: Grant

Filed: May 22, 2003

Date of Patent: February 14, 2012

Assignee: NEC Corporation

Inventor: Atsushi Murashima
Enhanced method for signal shaping in multi-channel audio reconstruction

Patent number: 8116459

Abstract: The present invention is based on the finding that a reconstructed output channel, reconstructed with a multi-channel reconstructor using at least one downmix channel derived by downmixing a plurality of original channels and using a parameter representation including additional information on a temporal fine structure of an original channel can be reconstructed efficiently with high quality, when a generator for generating a direct signal component and a diffuse signal component based on the downmix channel is used. The quality can be essentially enhanced, if only the direct signal component is modified such that the temporal fine structure of the reconstructed output channel is fitting a desired temporal fine structure, indicated by the additional information on the temporal fine structure transmitted.

Type: Grant

Filed: May 18, 2006

Date of Patent: February 14, 2012

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Disch, Karsten Linzmeier, Juergen Herre, Harald Popp
Universal container for audio data

Patent number: 8117038

Abstract: Storing audio data encoded in any of a plurality of different audio encoding formats is enabled by parametrically defining the underlying format in which the audio data is encoded, in audio format and packet table chunks. A flag can be used to manage storage of the size of the audio data portion of the file, such that premature termination of an audio recording session does not result in an unreadable corrupted file. This capability can be enabled by initially setting the flag to a value that does not correspond to a valid audio data size and that indicates that the last chunk in the file contains the audio data. State information for the audio data, to effectively denote a version of the file, and a dependency indicator for dependent metadata, may be maintained, where the dependency indicator indicates the state of the audio data on which the metadata is dependent.

Type: Grant

Filed: April 25, 2008

Date of Patent: February 14, 2012

Assignee: Apple Inc.

Inventors: William G. Stewart, James E. McCartney, Douglas S. Wyatt
Stereo encoding device, and stereo signal predicting method

Patent number: 8112286

Abstract: A prediction performance between individual channels of a stereo signal is improved to improve a sound quality of a decoded signal. A first low pass filter LPF interrupts a high-range component of a first channel signal S1, and outputs a first low-range component S1?. A second low pass filter LPF interrupts a high-range component of a second channel signal S2, and outputs a second low-range component S2?. A predictor predicts the S2? from the S1?, and outputs a prediction parameter composed of a delay time difference t and an amplitude ratio g. first channel encoder encodes the S1. A prediction parameter encoder encodes the prediction parameter. The encoded parameters of the encoded parameter of the S1 and the prediction parameter are then outputted.

Type: Grant

Filed: October 30, 2006

Date of Patent: February 7, 2012

Assignee: Panasonic Corporation

Inventors: Michiyo Goto, Koji Yoshida, Hiroyuki Ehara
Methods and apparatus for improving high frequency reconstruction of audio and speech signals

Patent number: 8112284

Abstract: The present invention proposes a new method and a new apparatus for enhancement of audio source coding systems utilizing high frequency reconstruction (HFR). It utilizes a detection mechanism on the encoder side to assess what parts of the spectrum will not be correctly reproduced by the HFR method in the decoder. Information on this is efficiently coded and sent to the decoder, where it is combined with the output of the HFR unit.

Type: Grant

Filed: November 19, 2008

Date of Patent: February 7, 2012

Assignee: Coding Technologies AB

Inventors: Kristofer Kjörling, Per Ekstrand, Holger Hörich
ENHANCING PERCEPTUAL PERFORMANCE OF SBR AND RELATED HFR CODING METHODS BY ADAPTIVE NOISE-FLOOR ADDITION AND NOISE SUBSTITUTION LIMITING

Publication number: 20120029927

Abstract: Methods and an apparatus for enhancement of source coding systems utilizing high frequency reconstruction (HFR) are introduced. The problem of insufficient noise contents is addressed in a reconstructed highband, by using Adaptive Noise-floor Addition. New methods are also introduced for enhanced performance by means of limiting unwanted noise, interpolation and smoothing of envelope adjustment amplification factors. The methods and apparatus used are applicable to both speech coding and natural audio coding systems.

Type: Application

Filed: September 12, 2011

Publication date: February 2, 2012

Inventors: Lars G. LILJERYD, Kristofer Kjoerling, Per Ekstrand, Frederik Henn
Encoding device and decoding device

Patent number: 8108222

Abstract: An encoding device (200) includes an MDCT unit (202) that transforms an input signal in a time domain into a frequency spectrum including a lower frequency spectrum, a BWE encoding unit (204) that generates extension data which specifies a higher frequency spectrum at a higher frequency than the lower frequency spectrum, and an encoded data stream generating unit (205) that encodes to output the lower frequency spectrum obtained by the MDCT unit (202) and the extension data obtained by the BWE encoding unit (204). The BWE encoding unit (204) generates as the extension data (i) a first parameter which specifies a lower subband which is to be copied as the higher frequency spectrum from among a plurality of the lower subbands which form the lower frequency spectrum obtained by the MDCT unit (202) and (ii) a second parameter which specifies a gain of the lower subband after being copied.

Type: Grant

Filed: July 15, 2010

Date of Patent: January 31, 2012

Assignee: Panasonic Corporation

Inventors: Mineo Tsushima, Takeshi Norimatsu, Kosuke Nishio, Naoya Tanaka
Slot position coding of OTT syntax of spatial audio coding application

Patent number: 8103514

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with either a fixed number of bits or a variable number of bits based on the data structure type.

Type: Grant

Filed: October 7, 2010

Date of Patent: January 24, 2012

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O Oh, Yang-Won Jung
Method and system for aligning windows to extract peak feature from a voice signal

Patent number: 8103512

Abstract: Disclosed is a method capable of adaptively aligning windows to extract features according to the types and characteristics of voice signals. To this end, window lengths based on the window update points in a corresponding order are determined by employing the concept of a higher order peak, and windows are aligned according to window lengths. When the windows are aligned according to such a manner, the start and end points of each window is known, so that it becomes possible to easily extract and analyze peak feature information.

Type: Grant

Filed: January 23, 2007

Date of Patent: January 24, 2012

Assignee: Samsung Electronics Co., Ltd

Inventor: Hyun-Soo Kim
Slot position coding of syntax of spatial audio application

Patent number: 8103513

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with either a fixed number of bits or a variable number of bits based on the data structure type.

Type: Grant

Filed: August 20, 2010

Date of Patent: January 24, 2012

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O Oh, Yang-Won Jung
Subband coding apparatus and method of coding subband

Patent number: 8103516

Abstract: A subband coding apparatus carries out subband coding which prevents deterioration in coding performance and improves audio quality of decoded signals. The subband coding apparatus includes a low-band coding section (103) to code a low-band spectrum (S13). A low-band decoding section (106) decodes a low-band coded data (S14) and outputs a decoded low-band spectrum (S18) to a high-band coding section (107). A spectrum rearranging section (105) rearranges to make each frequency component of a high-band spectrum (S16) in reverse order on the frequency axis and outputs a modified high-band spectrum (S17) after rearranging to a high-band coding section (107). The high-band coding section (107) uses the decoded low-band spectrum (S18) output from the low-band decoding section (106) to code the modified high-band spectrum (S17) output from the spectrum rearranging section (105).

Type: Grant

Filed: November 29, 2006

Date of Patent: January 24, 2012

Assignee: Panasonic Corporation

Inventor: Masahiro Oshikiri
Multi-channel audio encoding and decoding

Patent number: 8099292

Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.

Type: Grant

Filed: November 11, 2010

Date of Patent: January 17, 2012

Assignee: Microsoft Corporation

Inventors: Naveen Thumpudi, Wei-Ge Chen
Audio signal processing

Patent number: 8099293

Abstract: An audio system for processing two channels of audio input to provide more than two output channels. The input may be conventional stereo material or compressed audio signal data. The audio processing includes separating the input signals into frequency bands and processing the frequency bands according to processes which may differ from band to band. The audio processing includes no processing of L?R signals.

Type: Grant

Filed: August 13, 2008

Date of Patent: January 17, 2012

Assignee: Bose Corporation

Inventor: Abhijit Kulkarni
Universal container for audio data

Patent number: 8095375

Abstract: Storing audio data encoded in any of a plurality of different audio encoding formats is enabled by parametrically defining the underlying format in which the audio data is encoded, in audio format and packet table chunks. A flag can be used to manage storage of the size of the audio data portion of the file, such that premature termination of an audio recording session does not result in an unreadable corrupted file. This capability can be enabled by initially setting the flag to a value that does not correspond to a valid audio data size and that indicates that the last chunk in the file contains the audio data. State information for the audio data, to effectively denote a version of the file, and a dependency indicator for dependent metadata, may be maintained, where the dependency indicator indicates the state of the audio data on which the metadata is dependent.

Type: Grant

Filed: April 25, 2008

Date of Patent: January 10, 2012

Assignee: Apple Inc.

Inventors: William G. Stewart, James E. McCartney, Douglas S. Wyatt
Entropy encoding and decoding using direct level and run-length/level context-adaptive arithmetic coding/decoding modes

Patent number: 8090574

Abstract: An encoder performs context-adaptive arithmetic encoding of transform coefficient data. For example, an encoder switches between coding of direct levels of quantized transform coefficient data and run-level coding of run lengths and levels of quantized transform coefficient data. The encoder can determine when to switch between coding modes based on a pre-determined switch point or by counting consecutive coefficients having a predominant value (e.g., zero). A decoder performs corresponding context-adaptive arithmetic decoding.

Type: Grant

Filed: October 19, 2010

Date of Patent: January 3, 2012

Assignee: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
Scalable coding apparatus and scalable coding method

Patent number: 8086452

Abstract: A scalable coding apparatus is provided to suppress deterioration of a quality of a coded signal in a normal frame next to a frame compensated for the occurrence of a data loss. The scalable coding apparatus is provided with a core-layer coding section (11) to carry out core-layer coding for the n-th frame input audio signal, an ordinary coding section (121) to generate expanding-layer ordinary-coding layer L2(n) by carrying out ordinary-coding of an expanding layer for the input audio signal, a deterioration-compensation coding section (123) to generate an expanding-layer-deterioration coding data L2?(n) by carrying out compensation for quality deterioration of coded audio in a current frame due to a past frame loss, a judging section (125) to determine whether either the expanding-layer ordinary-coding data L2(n) or the expanding-layer deterioration-coding data L2?(n) should be output from the expanding-layer coding section (12) as expanding-layer coding data of the current frame.

Type: Grant

Filed: November 29, 2006

Date of Patent: December 27, 2011

Assignee: Panasonic Corporation

Inventor: Koji Yoshida
Method and apparatus for non-overlapped transforming of an audio signal, method and apparatus for adaptively encoding audio signal with the transforming, method and apparatus for inverse non-overlapped transforming of an audio signal, and method and apparatus for adaptively decoding audio signal with the inverse transforming

Patent number: 8086446

Abstract: A method and apparatus for transforming an audio signal, a method and apparatus for adaptively encoding an audio signal, a method and apparatus for inversely transforming an audio signal, and a method and apparatus for adaptively decoding an audio signal. The method of transforming an audio signal includes determining a transform unit into which the audio signal in a time domain is to be transformed into an audio signal in a frequency domain, and transforming the audio signal into an audio signal in the frequency domain according to the determined transform units using a window coefficient other than 0. Accordingly, it is possible to minimize distortion of the audio signal when encoding the audio signal even at a high bit rate while increasing efficiency of compression.

Type: Grant

Filed: December 7, 2005

Date of Patent: December 27, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Eunmi Oh, Junghoe Kim, Boris Kudryashov, Konstantin Osipov
Time slot position coding of multiple frame types

Patent number: 8082158

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with either a fixed number of bits or a variable number of bits based on the data structure type.

Type: Grant

Filed: October 14, 2010

Date of Patent: December 20, 2011

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O Oh, Yang-Won Jung
Enhancing perceptual performance of SBR and related HFR coding methods by adaptive noise-floor addition and noise substitution limiting

Patent number: RE43189

Abstract: Methods and an apparatus for enhancement of source coding systems utilizing high frequency reconstruction (HFR) are introduced. The problem of insufficient noise contents is addressed in a reconstructed highband, by using Adaptive Noise-floor Addition. New methods are also introduced for enhanced performance by means of limiting unwanted noise, interpolation and smoothing of envelope adjustment amplification factors. The methods and apparatus used are applicable to both speech coding and natural audio coding systems.

Type: Grant

Filed: January 26, 2000

Date of Patent: February 14, 2012

Assignee: Dolby International AB

Inventors: Lars G. Liljeryd, Kristofer Kjoerling, Per Ekstrand, Frederik Henn

prev … 3 4 5 6 7 8 9 10 11 … next