With Content Reduction Encoding Patents (Class 704/501)

Method and apparatus to quantize/dequantize frequency amplitude data and method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize frequency amplitude data

Patent number: 7805314

Abstract: A method and apparatus to quantize/dequantize frequency amplitude data and a method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize the frequency amplitude data. The method includes calculating and quantizing power of frequency amplitudes for each of a plurality of bands constituting an audio frame, normalizing frequency amplitude data for each of the bands using the quantized power, and quantizing a first one of even-numbered or odd-numbered data among the normalized frequency amplitude data.

Type: Grant

Filed: June 21, 2006

Date of Patent: September 28, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Hosang Sung, Sangwook Kim, Rakesh Taori, Kangeun Lee, Shihwa Lee
Audio encoding device and audio encoding method

Patent number: 7797162

Abstract: There is provided an audio encoding device capable of generating an appropriate monaural signal from a stereo signal while suppressing the lowering of encoding efficiency of the monaural signal. In a monaural signal generation unit (101) of this device, an inter-channel prediction/analysis unit (201) obtains a prediction parameter based on a delay difference and an amplitude ratio between a first channel audio signal and a second channel audio signal; an intermediate prediction parameter generation unit (202) obtains an intermediate parameter of the prediction parameter (called intermediate prediction parameter) so that the monaural signal generated finally is an intermediate signal of the first channel audio signal and the second channel audio signal; and a monaural signal calculation unit (203) calculates a monaural signal by using the intermediate prediction parameter.

Type: Grant

Filed: December 26, 2005

Date of Patent: September 14, 2010

Assignee: Panasonic Corporation

Inventors: Koji Yoshida, Michiyo Goto
Slot position coding for non-guided spatial audio coding

Patent number: 7792668

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with either a fixed number of bits or a variable number of bits based on the data structure type as indicated by the data structure type indicator.

Type: Grant

Filed: August 30, 2006

Date of Patent: September 7, 2010

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O. Oh, Yang Won Jung
Optimized multiple coding method

Patent number: 7792679

Abstract: The invention relates to the compression coding of digital signals such as multimedia signals (audio or video), and more particularly a method for multiple coding, wherein several encoders each comprising a series of functional blocks receive an input signal in parallel. Accordingly, a method is provided in which, a) the functional blocks forming each encoder are identified, along with one or several functions carried out of each block, b) functions which are common to various encoders are itemized and c) said common functions are carried out definitively for a part of at least all of the encoders within at least one same calculation module.

Type: Grant

Filed: November 24, 2004

Date of Patent: September 7, 2010

Assignee: France Telecom

Inventors: David Virette, Claude Lamblin, Abdellatif Benjelloun Touimi
Entropy coding with compact codebooks

Patent number: 7788106

Abstract: The present invention is based on the finding that an efficient code for encoding information values can be derived, when two or more information values are grouped in a tuple in a tuple order and when an encoding rule is used, that assigns the same code word to tuples having identical information values in different orders and that does derive an order information, indicating the tuple order, and when the code word is output in association with the order information.

Type: Grant

Filed: October 14, 2005

Date of Patent: August 31, 2010

Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.

Inventors: Ralph Sperschneider, Jürgen Herre, Karsten Linzmeier, Johannes Hilpert
Slot position coding of syntax of spatial audio application

Patent number: 7783493

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with either a fixed number of bits or a variable number of bits based on the data structure type.

Type: Grant

Filed: August 30, 2006

Date of Patent: August 24, 2010

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O. Oh, Yang Won Jung
Time slot position coding

Patent number: 7783494

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with either a fixed number of bits or a variable number of bits based on the data structure type.

Type: Grant

Filed: August 30, 2006

Date of Patent: August 24, 2010

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O. Oh, Yang Won Jung
Encoding device and decoding device

Patent number: 7783496

Abstract: An encoding device (200) includes an MDCT unit (202) that transforms an input signal in a time domain into a frequency spectrum including a lower frequency spectrum, a BWE encoding unit (204) that generates extension data which specifies a higher frequency spectrum at a higher frequency than the lower frequency spectrum, and an encoded data stream generating unit (205) that encodes to output the lower frequency spectrum obtained by the MDCT unit (202) and the extension data obtained by the BWE encoding unit (204). The BWE encoding unit (204) generates as the extension data (i) a first parameter which specifies a lower subband which is to be copied as the higher frequency spectrum from among a plurality of the lower subbands which form the lower frequency spectrum obtained by the MDCT unit (202) and (ii) a second parameter which specifies a gain of the lower subband after being copied.

Type: Grant

Filed: February 12, 2009

Date of Patent: August 24, 2010

Assignee: Panasonic Corporation

Inventors: Mineo Tsushima, Takeshi Norimatsu, Kosuke Nishio, Naoya Tanaka
Method and apparatus for encoding and decoding multi-channel audio signal using virtual source location information

Patent number: 7783495

Abstract: Provided is a method and apparatus for encoding/decoding a multi-channel audio signal. The apparatus for encoding a multi-channel audio signal includes a frame converter for converting the multi-channel audio signal into a framed audio signal; means for downmixing the framed audio signal; means for encoding the downmixed audio signal; a source location information estimator for estimating source location information from the framed multi-channel audio signal; means for quantizing the estimated source location information; and means for multiplexing the encoded audio signal and the quantized source location information, to generate an encoded multi-channel audio signal.

Type: Grant

Filed: July 8, 2005

Date of Patent: August 24, 2010

Assignees: Electronics and Telecommunications Research Institute, Seoul National University Industry Foundation

Inventors: Jeong II Seo, Han Gil Moon, Seung Kwon Beack, Kyeong Ok Kang, In Seon Jang, Koeng Mo Sung, Min Soo Hahn, Jin Woo Hong
Slot position coding of residual signals of spatial audio coding application

Patent number: 7765104

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with either a fixed number of bits or a variable number of bits based on the data structure type.

Type: Grant

Filed: August 30, 2006

Date of Patent: July 27, 2010

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O. Oh, Yang Won Jung
Slot position coding of TTT syntax of spatial audio coding application

Patent number: 7761303

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with a fixed number of bits or a variable number of bits based on the data structure type.

Type: Grant

Filed: August 30, 2006

Date of Patent: July 20, 2010

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O. Oh, Yang Won Jung
Signal processing using pilot based coding

Patent number: 7756702

Abstract: An apparatus for processing a signal and method thereof are disclosed. Data coding and entropy coding are performed with interconnection, and grouping is used to enhance coding efficiency. The present invention includes the steps of obtaining a pilot reference value corresponding to a plurality of data and a pilot difference value corresponding to the pilot reference value and obtaining the data using the pilot reference value and the pilot difference value.

Type: Grant

Filed: October 4, 2006

Date of Patent: July 13, 2010

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Hyen-O Oh, Dong Soo Kim, Jae Hyun Lim, Yang-Won Jung, Hyo Jin Kim
Apparatus, method, and medium for processing audio signal using correlation between bands

Patent number: 7756715

Abstract: Apparatus, method, and medium for processing an audio signal using a correlation between bands are provided. The apparatus includes an encoding unit encoding an input audio signal and a decoding unit decoding the encoded input audio signal.

Type: Grant

Filed: November 17, 2005

Date of Patent: July 13, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Junghoe Kim, Dohyung Kim, Sihwa Lee
Scalable coder and decoder performing amplitude flattening for error spectrum estimation

Patent number: 7752052

Abstract: A down-sampler 101 down-samples the sampling rate of an input signal from sampling rate FH to sampling rate FL. A base layer coder 102 encodes the sampling rate FL acoustic signal. A local decoder 103 decodes coding information output from base layer coder 102. An up-sampler 104 raises the sampling rate of the decoded signal to FH. A subtracter 106 subtracts the decoded signal from the sampling rate FH acoustic signal. An enhancement layer coder 107 encodes the signal output from subtracter 106 using a decoding result parameter output from local decoder 103.

Type: Grant

Filed: April 28, 2003

Date of Patent: July 6, 2010

Assignee: Panasonic Corporation

Inventor: Masahiro Oshikiri
Adaptive residual audio coding

Patent number: 7751572

Abstract: An audio signal having at least two channels can be efficiently down-mixed into a downmixe signal and a residual signal, when the down-mixing rule used depends on a spatial parameter that is derived from the audio signal and that is post-processed by a limiter to apply a certain limit to the derived spatial parameter with the aim of avoiding instabilities during the up-mixing or down-mixing process. By having a down-mixing rule that dynamically depends on parameters describing an interrelation between the audio channels, one can assure that the energy within the down-mixed residual signal is as minimal as possible, which is advantageous in the view of coding efficiency. By post processing the spatial parameter with a limiter prior to using it in the down-mixing, one can avoid instabilities in the down- or up-mixing, which otherwise could result in a disturbance of the spatial perception of the encoded or decoded audio signal.

Type: Grant

Filed: October 11, 2005

Date of Patent: July 6, 2010

Assignees: Dolby International AB, Koninklijke Philips Electronics N.V.

Inventors: Lars Villemoes, Francois Philippus Myburg
Compression into a fixed size buffer

Patent number: 7751635

Abstract: The method compresses image data into a fixed sized memory. The image is encoded in scans, where the scans are ordered from a perceptually most significant scan to a perceptually least significant scan. The scans also have an attribute 210 determining whether a scan is active or inactive. The method comprises an encoding, transferring and setting step. The encoding step 326, 348 encodes active scans of image data into scan bit-stream data. The transferring step 330, 352 transfers the encoded scan bit-stream data to the fixed size memory. The setting step 452 sets, if the fixed size memory becomes full, the attribute of a perceptually least significant scan to inactive.

Type: Grant

Filed: January 22, 2004

Date of Patent: July 6, 2010

Assignee: Canon Kabushiki Kaisha

Inventor: James David Clark
Method and apparatus for speech decoding by evaluating a noise level based on gain information

Patent number: 7747432

Abstract: A high quality speech is reproduced with a small data amount in speech coding and decoding for performing compression coding and decoding of a speech signal to a digital signal. In speech coding method according to a code-excited linear prediction (CELP) speech coding, a noise level of a speech in a concerning coding period is evaluated by using a code or coding result of at least one of spectrum information, power information, and pitch information, and various excitation codebooks are used based on an evaluation result.

Type: Grant

Filed: October 29, 2007

Date of Patent: June 29, 2010

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventor: Tadashi Yamaura
Channel signal concealment in multi-channel audio systems

Patent number: 7747448

Abstract: A parametric model is used for error concealment. The model filter allows for recovering signal components of original audio channel signals that now are lost or erroneous from signal components of at least one other audio channel. During error-free reception of valid frames, the parameters of that model will be derived and stored. In case of frame loss or frame error affecting the multi-channel information, a conjecture of the missing information is recovered by applying the model, using the stored parameters. In case of several subsequent lost or erroneous frames, it is possible either to use the parameters derived during the last valid frame or to use parameters derived from the recovered multi-channel information of the respective previous invalid frame. Furthermore, if there are long sequences of lost frames, it can be beneficial to apply some gradual muting of the model parameters, which essentially results in a gradual attenuation of the recovered multi-channel information.

Type: Grant

Filed: December 16, 2004

Date of Patent: June 29, 2010

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventor: Stefan Bruhn
Method and apparatus for speech encoding by evaluating a noise level based on gain information

Patent number: 7747433

Abstract: A high quality speech is reproduced with a small data amount in speech coding and decoding for performing compression coding and decoding of a speech signal to a digital signal. In speech coding method according to a code-excited linear prediction (CELP) speech coding, a noise level of a speech in a concerning coding period is evaluated by using a code or coding result of at least one of spectrum information, power information, and pitch information, and various excitation codebooks are used based on an evaluation result.

Type: Grant

Filed: October 29, 2007

Date of Patent: June 29, 2010

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventor: Tadashi Yamaura
Method and apparatus for speech encoding by evaluating a noise level based on pitch information

Patent number: 7742917

Abstract: A high quality speech is reproduced with a small data amount in speech coding and decoding for performing compression coding and decoding of a speech signal to a digital signal. In speech coding method according to a code-excited linear prediction (CELP) speech coding, a noise level of a speech in a concerning coding period is evaluated by using a code or coding result of at least one of spectrum information, power information, and pitch information, and various excitation codebooks are used based on an evaluation result.

Type: Grant

Filed: October 29, 2007

Date of Patent: June 22, 2010

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventor: Tadashi Yamaura
Spectral enhancing method and device

Patent number: 7742927

Abstract: The present invention relates to a spectral enhancement method and to an apparatus carrying out this method. The method of the invention enhanced the spectral content of a signal having an incomplete spectrum including a first spectral frequency band, the method comprising the following stages: at least one spectral content transposition of said first frequency band into a second spectral frequency band not included in said spectrum for the purpose of generating a transposed spectrum signal having a spectrum limited to said second spectral frequency band, shaping the spectrum of the transposed spectrum signal for the purpose of producing an enhanced signal, combining an incomplete spectrum signal and the enhanced signal for the purpose of producing an enhanced spectrum signal, characterized in that said spectral content is subject to a stage of whitening.

Type: Grant

Filed: April 12, 2001

Date of Patent: June 22, 2010

Assignees: France Telecom, Telediffusion de France

Inventors: Pierrick Philippe, Patrice Collen
MULTI-STAGING RECURSIVE AUDIO FRAME-BASED RESAMPLING AND TIME MAPPING

Publication number: 20100153122

Abstract: A multi-stage recursive sample rate converter (“SRC”) typically embodied as digital signal processor provides for an efficient structure for converting digital audio samples at one frequency, such as 48 kHz, to another frequency, such as 44.1 kHz. A parameter codebook comprising memory stores parameters used at a plurality of stages by the SRC. For each stage, a controller coordinates the SRC to use the appropriate set of parameters from the codebook, process an input audio sample stream, and store the intermediate results in a buffer. The controller then causes the intermediate results to be processed again as input to the SRC in a subsequent stage of processing using a different set of parameters. The process is repeated until all stages are completed, and the final results are the output digital audio data stream at the desired sampling rate.

Type: Application

Filed: December 15, 2008

Publication date: June 17, 2010

Inventors: Zhicheng Lancelot Wang, Jianguang Jiang
Selection of coding models for encoding an audio signal

Patent number: 7739120

Abstract: The invention relates to a method of selecting a respective coding model for encoding consecutive sections of an audio signal, wherein at least one coding model optimized for a first type of audio content and at least one coding model optimized for a second type of audio content are available for selection. In general, the coding model is selected for each section based on signal characteristics indicating the type of audio content in the respective section. For some remaining sections, such a selection is not viable, though. For these sections, the selection carried out for respectively neighboring sections is evaluated statistically. The coding model for the remaining sections is then selected based on these statistical evaluations.

Type: Grant

Filed: May 17, 2004

Date of Patent: June 15, 2010

Assignee: Nokia Corporation

Inventor: Jari Mäkinen
Method and apparatus for time scaling of a signal

Patent number: 7734473

Abstract: A decoder receives (501) a bitstream comprising an encoded mono signal and stereo data. A time scale processor (503) generates a time scaled mono signal. A time-to frequency processor generates frequency sample blocks of the time scaled signal, the block length being fixed and independent of the time scaling. A parametric stereo decoder (509) generates a stereo signal for the frequency sample blocks and these are converted to the time domain by a frequency-to-time processor (511). A synchronization processor (515) synchronizes the stereo data with the time scaled signal by determining a time association between a parameter value and a frequency sample block. The parameter value and time association is used to determine synchronized stereo parameter values for that and other frequency sample blocks. The invention is particularly suitable for low complexity generation of time scaled stereo signals from MPEG-4 encoded signals.

Type: Grant

Filed: January 14, 2005

Date of Patent: June 8, 2010

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Erik Gosuinus Petrus Schuijers, Andreas Johannes Gerrits, Arnoldus Werner Johannes Oomen
System and method for suppressing silence data in a network environment

Patent number: 7733793

Abstract: A method for suppressing data is provided that includes receiving a first packet communicated by an end user and identifying a difference in a comfort noise level associated with a second packet received as compared to the first packet. The second packet may be communicated without a data payload in cases where the difference in comfort noise level associated with the second packet as compared to the first packet is below a predetermined threshold.

Type: Grant

Filed: December 10, 2003

Date of Patent: June 8, 2010

Assignee: Cisco Technology, Inc.

Inventors: Alain Charles Brainos, II, Mohammad A. Ahmed-Khan, Malcolm M. Smith
Audio coding

Patent number: 7729903

Abstract: The central idea of the present invention is that the prior procedure, namely interpolation relative to the filter coefficients and the amplification value, for obtaining interpolated values for the intermediate audio values starting from the nodes has to be dismissed. Coding containing less audible artifacts can be obtained by not interpolating the amplification value, but rather taking the power limit derived from the masking threshold, for each node, i.e. for each parameterization to be transferred, and then performing the interpolation between these power limits of neighboring nodes, such as, for example, a linear interpolation. On both the coder and the decoder side, an amplification value can then be calculated from the intermediate power limit determined such that the quantizing noise caused by quantization, which has a constant frequency before post-filtering on the decoder side, is below the power limit or corresponds thereto after post-filtering.

Type: Grant

Filed: July 27, 2006

Date of Patent: June 1, 2010

Inventors: Gerald Schuller, Stefan Wabnik, Marc Gayer
Constrained filter encoding of polyphonic signals

Patent number: 7725324

Abstract: Signals of different channels are combined into one mono signal. A set of adaptive filters, preferably one for each channel, is derived in a respective filter adaptation unit. When an adaptive filter is applied to the mono signal it reconstructs the signal of the respective channel under a perceptual constraint. The perceptual constraint is a gain and/or shape constraint. The gain constraint allows the preservation of the relative energy between the channels while the shape constraint allows more stability by avoiding unnecessary filtering of spectral nulls. The transmitted parameters are the mono signal, in encoded form, and the parameters of the adaptive filters, preferably also encoded. The receiver reconstructs the signal of the different channels by applying the adaptive filters and possibly some additional post-processing.

Type: Grant

Filed: December 15, 2004

Date of Patent: May 25, 2010

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Stefan Bruhn, Ingemar Johansson, Anisse Taleb, Patrik Sandgren
Device and process for encoding audio data

Patent number: 7725323

Abstract: An MPEG-1 layer 3 audio encoder, including a scalefactor generator for determining first scalefactors for encoding a block of audio data if a temporal masking transient is not detected in said block of audio data; and for selecting the maximum of said scalefactors for encoding said block of audio data if a temporal masking transient is detected in said block of audio data to enable greater compression of said audio data. Increases in quantization error, due to use of the maximum scalefactor are pre-masked or post-masked by the temporal masking transient. In cases where the last portion of a block includes a temporal masking transient that masks the preceding portions of the block, the maximum scalefactor is only used to encode the block if the resulting increase in quantization error is less than 30% of the quantization error for the block.

Type: Grant

Filed: September 14, 2004

Date of Patent: May 25, 2010

Assignee: STMicroelectronics Asia Pacific Pte. Ltd.

Inventors: Kabi Prakash Padhi, Sudhir Kumar Kasargod, Sapna George
Individual channel shaping for BCC schemes and the like

Patent number: 7720230

Abstract: At an audio encoder, cue codes are generated for one or more audio channels, wherein an envelope cue code is generated by characterizing a temporal envelope in an audio channel. At an audio decoder, E transmitted audio channel(s) are decoded to generate C playback audio channels, where C>E?1. Received cue codes include an envelope cue code corresponding to a characterized temporal envelope of an audio channel corresponding to the transmitted channel(s). One or more transmitted channel(s) are upmixed to generate one or more upmixed channels. One or more playback channels are synthesized by applying the cue codes to the one or more upmixed channels, wherein the envelope cue code is applied to an upmixed channel or a synthesized signal to adjust a temporal envelope of the synthesized signal based on the characterized temporal envelope such that the adjusted temporal envelope substantially matches the characterized temporal envelope.

Type: Grant

Filed: December 7, 2004

Date of Patent: May 18, 2010

Assignees: Agere Systems, Inc., Fraunhofer-Gesellschaft zur Forderung der angewandten Forschung e.V.

Inventors: Eric Allamanche, Sascha Disch, Christof Faller, Juergen Herre
Audio coding

Patent number: 7716042

Abstract: Coding an audio signal of a sequence of audio values into a coded signal includes determining first and second listening thresholds for first and second blocks of audio values of the sequence of audio values; calculating versions of first second parameterizations of the parameterizable filter such that the transfer function thereof roughly corresponds to the inverse of the magnitude of the first and second listening thresholds, respectively; filtering a predetermined block of audio values of the sequence of audio values with the parameterizable filter using a predetermined parameterization which depends on the version of the second parameterization to obtain a block of filtered audio values corresponding to the predetermined block which is quantized; forming a difference between the versions of the first and second parameterizations; integrating information on, inter alias, the difference into the coded signal.

Type: Grant

Filed: July 27, 2006

Date of Patent: May 11, 2010

Inventors: Gerald Schuller, Stefan Wabnik, Jens Hirschfeld, Manfred Lutzky
Method for processing a digital video signal

Patent number: 7705912

Abstract: A method of decoding audio data, encoded in multiple DIF blocks in a Digital Video (DV) data stream, and outputting said audio data as a PCM frame, includes fetching a single Digital Interface Frame (DIF) block from the DV data stream. A first byte in the single DIF block is de-shuffled to determine its index (n) in the PCM frame. Each byte in the in the single DIF block is de-shuffled to determine its respective index (n) in the PCM frame. The de-shuffled data is written into the PCM frame for output if the present DIF block is the last in the present DV frame. Subsequent DIF blocks in the DV frame are processed in the manner described above.

Type: Grant

Filed: March 8, 2004

Date of Patent: April 27, 2010

Assignee: STMicroelectronics Asia Pacific Pte, Ltd.

Inventors: Jianhua Sun, Sapna George
Image processing apparatus and method

Patent number: 7706583

Abstract: There is disclosed an image processing apparatus which implements an image process suitable for an effect for compositing or switching images, and an image process that allows smooth high-speed playback of even an image in motion. The apparatus has band segmentation means for segmenting an image signal into a plurality of frequency band components, and image composition means for, after the band segmentation means segments input first and second image signals, outputting a third image signal by replacing image data for respective segmented band components. When image data, which are recorded while being segmented into a plurality of frequency band components, are composited for respective band, and the composite image data is output, image data obtained by compositing a plurality of image frequency components is decoded and played back in a high-speed playback mode.

Type: Grant

Filed: November 6, 2003

Date of Patent: April 27, 2010

Assignee: Canon Kabushiki Kaisha

Inventor: Hirofumi Takei
Bitrate scalable speech coding and decoding apparatus and method

Patent number: 7702504

Abstract: A coding apparatus including a base layer, a speech quality enhancement layer, and a multiplexer. The base layer filters an input speech signal using linear prediction coding and generates an excitation signal corresponding to the filtered speech signal through a fixed codebook search and an adaptive codebook search. The speech quality enhancement layer searches a fixed codebook using parameters obtained through the fixed codebook search in the base layer, or searches the fixed codebook using a target signal, which is obtained by removing a contribution of a fixed codebook of the base layer and a signal which is obtained by synthesizing and filtering a previous fixed codebook of the speech quality enhancement layer from a target signal for the fixed codebook search of the base layer. The multiplexer multiplexes signals generated by the base layer and the at least one speech quality enhancement layer.

Type: Grant

Filed: July 9, 2004

Date of Patent: April 20, 2010

Assignee: Samsung Electronics Co., Ltd

Inventors: Chang-yong Son, Kang-eun Lee, Sang-won Kang, Sang-hyun Chi
Automated audio sub-band comparison

Patent number: 7698144

Abstract: Automated testing of audio performance of applications across platforms is provided for via capture of audio data. The audio data can include, inter alia, output sounds from a sound card or pre-rendered buffer data. The audio data is processed to produce descriptive data including data describing the audio data at least a first resolution and a second resolution. This descriptive data is used to compare data samples and describe the degree of similarity of two or more data samples. This comparison enables a determination as to whether the audio performance is satisfactory.

Type: Grant

Filed: January 11, 2006

Date of Patent: April 13, 2010

Assignee: Microsoft Corporation

Inventors: Gershon Parent, Karen Elaine Stevens, Shanon Isaac Drone
Hybrid multi-channel/cue coding/decoding of audio signals

Patent number: 7693721

Abstract: Part of the spectrum of two or more input signals is encoded using conventional coding techniques, while encoding the rest of the spectrum using binaural cue coding (BCC). In BCC coding, spectral components of the input signals are downmixed and BCC parameters (e.g., inter-channel level and/or time differences) are generated. In a stereo implementation, after converting the left and right channels to the frequency domain, pairs of left- and right-channel spectral components are downmixed to mono. The mono components are then converted back to the time domain, along with those left- and right-channel spectral components that were not downmixed, to form hybrid stereo signals, which can then be encoded using conventional coding techniques. For playback, the encoded bitstream is decoded using conventional decoding techniques. BCC synthesis techniques may then apply the BCC parameters to synthesize an auditory scene based on the mono components as well as the unmixed stereo components.

Type: Grant

Filed: December 10, 2007

Date of Patent: April 6, 2010

Assignee: Agere Systems Inc.

Inventors: Frank Baumgarte, Peter Kroon
Method for generating encoded audio signal and method for processing audio signal

Patent number: 7693706

Abstract: A method for generating an encoded audio signal, and a method for processing the same during the multi-channel audio coding are disclosed. The present invention provides the method for generating an encoded audio signal comprising: generating basic spatial information including basic configuration information requisite for a multi-channel audio coding process and basic data corresponding to the basic configuration information; and generating extension spatial information including extension configuration information selectively required for the multi-channel audio coding process and extension data corresponding to the extension configuration information.

Type: Grant

Filed: July 28, 2006

Date of Patent: April 6, 2010

Assignee: LG Electronics Inc.

Inventors: Hyen-O Oh, Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyo Jin Kim, Yang-Won Jung
Acoustic signal encoding device, and acoustic signal decoding device

Patent number: 7689428

Abstract: An acoustic signal encoding device for down-mixing at different ratios to encode a multichannel signal with a small number of channels, and an acoustic signal decoding device for decoding the signal encoded by the acoustic signal encoding device. In these devices, weighting means (103) in the acoustic signal encoding device (100) weights input signals of two channels individually according to a down-mixing coefficient thereby to calculate the level difference of the signals of two channels weighted by a level difference calculation unit (104). A separating unit (202) in the acoustic signal decoding device (200) separates the down-mixed signals into signals of two channels with the level difference information weighted.

Type: Grant

Filed: October 13, 2005

Date of Patent: March 30, 2010

Assignee: Panasonic Corporation

Inventors: Yoshiaki Takagi, Naoya Tanaka
STREAM SYNTHESIZING DEVICE, DECODING UNIT AND METHOD

Publication number: 20100063828

Abstract: To provide an enhanced true-to-life atmosphere enjoyed in multipoint connecting, and reduce a calculation load at a multipoint connection unit, as well.

Type: Application

Filed: October 16, 2008

Publication date: March 11, 2010

Inventors: Tomokazu Ishikawa, Takeshi Norimatsu, Takashi Katayama
Method for scale-factor estimation in an audio encoder

Patent number: 7676360

Abstract: A method, system and computer program product for computationally efficient estimation of the scale factors of one or more frequency bands in an encoder. These scale factors are dependant on a plurality of variables. One of the variables is approximated according to embodiments of the invention. This reduces the complexity of the estimation of scale factors, especially in digital signal processors.

Type: Grant

Filed: February 24, 2006

Date of Patent: March 9, 2010

Assignee: Sasken Communication Technologies Ltd.

Inventors: Sachin Ghanekar, Ravindra Chaugule
Scalable lossless audio codec and authoring tool

Patent number: 7668723

Abstract: An audio codec losslessly encodes audio data into a sequence of analysis windows in a scalable bitstream. This is suitably done by separating the audio data into MSB and LSB portions and encoding each with a different lossless algorithm. An authoring tool compares the buffered payload to an allowed payload for each window and selectively scales the losslessly encoded audio data, suitably the LSB portion, in the non-conforming windows to reduce the encoded payload, hence buffered payload. This approach satisfies the media bit rate and buffer capacity constraints without having to filter the original audio data, reencode or otherwise disrupt the lossless bitstream.

Type: Grant

Filed: August 14, 2007

Date of Patent: February 23, 2010

Assignee: DTS, Inc.

Inventor: Zoran Fejzo
Multi parametrisation based multi-channel reconstruction

Patent number: 7668722

Abstract: A multi-channel synthesizer for generating at least three output channels using an input signal having at least one base channel, the base channel being derived from the original multi-channel signal, the input signal further including at least two different up-mixing parameters, and an up-mixer mode indication indicating, in a first state that a first up-mixing rule is to be performed, and, indicating, in a second state, that a different second up-mixing rule is to be performed, uses an up-mixer for up-mixing the at least one base channel using the at least two different up-mixing parameters based on the first or the second up-mixing rule in response to the up-mixer mode indication so that the at least three output channels are obtained.

Type: Grant

Filed: November 29, 2005

Date of Patent: February 23, 2010

Assignees: Coding Technologies AB, Koninklijke Philips Electronics N.V.

Inventors: Lars Villemoes, Kristofer Kjoerling, Heiko Purnhagen, Jonas Roeden, Jeroen Breebaart, Gerard Hotho
Lossless audio coding/decoding method and apparatus

Patent number: 7660720

Abstract: A lossless audio coding and/or decoding method and apparatus are provided. The coding method includes: mapping the audio signal in the frequency domain having an integer value into a bit-plane signal with respect to the frequency; obtaining a most significant bit and a Golomb parameter for each bit-plane; selecting a binary sample on a bit-plane to be coded in the order from the most significant bit to the least significant bit and from a lower frequency component to a higher frequency component; calculating the context of the selected binary sample by using significances of already coded bit-planes for each of a plurality of frequency lines existing in the vicinity of a frequency line to which the selected binary sample belongs; selecting a probability model by using the obtained Golomb parameter and the calculated contexts; and lossless-coding the binary sample by using the selected probability model.

Type: Grant

Filed: March 10, 2005

Date of Patent: February 9, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Ennmi Oh, Junghoe Kim, Miao Lei, Shihwa Lee, Sangwook Kim
Communication device, signal encoding/decoding method

Patent number: 7653539

Abstract: There is provided a communication device for effectively encoding an audio/music signal while maintaining a predetermined quality by controlling the transmission bit rate of the transmission side considering the use environment of the reception side. In this device, a transmission mode decision unit (101) detects an environment noise contained in the background of the audio/music signal in the input signal and decides the transmission mode controlling the transmission bit rate of the signal transmitted from a communication terminal device (150), which is a communication terminal of the partner side, according to the environment noise level. A signal decoding unit (103) decodes encoded information transmitted from the communication terminal device (150) via a transmission path (110) and outputs the obtained signal as an output signal.

Type: Grant

Filed: February 22, 2005

Date of Patent: January 26, 2010

Assignee: Panasonic Corporation

Inventors: Tomofumi Yamanashi, Kaoru Sato, Toshiyuki Morii
System, method, and apparatus for fast quantization in perceptual audio coders

Patent number: 7650277

Abstract: A technique to encode an audio signal based on a perceptual model. In one example embodiment, this is accomplished by shaping quantization noise in the spectral lines on a band-by-band basis using their local gains. The noise shaped spectral lines are then fitted within a predetermined bit rate to form an encoded bit stream.

Type: Grant

Filed: September 25, 2003

Date of Patent: January 19, 2010

Assignee: Ittiam Systems (P) Ltd.

Inventors: Vinod Prakash, Ashok I Magadum
System and method for transmitting and receiving wideband speech signals

Patent number: 7649856

Abstract: The system for transmitting and receiving a wideband speech signal includes an A/D converter for receiving an analog speech signal to convert it into a digital speech signal, a transmitter analysis filter for receiving the digital speech signal and dividing it into a baseband signal and an enhancement residual band signal, a standard baseband encoder for accepting the baseband signal and coding it using an ITU-T encoder, an additional baseband encoder for reducing standard coding distortion in the baseband signal, an enhancement residual band encoder for coding a signal obtained by removing the coded baseband signal from the original digital speech signal, and an IP network interface for multiplexing the coded standard and additional baseband signals and enhancement residual band signal.

Type: Grant

Filed: December 18, 2003

Date of Patent: January 19, 2010

Assignee: Electronics and Telecommunications Research Institute

Inventors: Ho-Sang Sung, Dae-Hwan Hwang, Dae-Hee Youn, Hong-Goo Kang, Young-Cheol Park, Ki-Seung Lee, Sung-Kyo Jung, Kyung-Tae Kim
Apparatus and methods for encoding digital audio data with a reduced bit rate

Patent number: 7647222

Abstract: A method and an apparatus for encoding digital audio data with reduced bit rates, the apparatus having a provider of psycho-acoustically quantized digital audio data with a bit rate being higher than the reduced bit rate. The apparatus further has an identifier for identifying a frequency band according to a selection criterion, the selection criterion being such that an impact on the quality of the digital audio data when the data in the identified frequency band is replaced by generated noise is smaller than the impact on the quality of the digital audio data, which would arise when the data in a different frequency band is replaced by generated noise. The apparatus further has a replacer for replacing data in the identified frequency band of the digital audio data by a noise synthesis parameter.

Type: Grant

Filed: April 24, 2007

Date of Patent: January 12, 2010

Assignee: Nero AG

Inventors: Ivan Dimkovic, Gian Carlo Pascutto
Cue-based audio coding/decoding

Patent number: 7644003

Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.

Type: Grant

Filed: September 8, 2004

Date of Patent: January 5, 2010

Assignee: Agere Systems Inc.

Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
CUE-BASED AUDIO CODING/DECODING

Publication number: 20090319281

Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.

Type: Application

Filed: August 27, 2009

Publication date: December 24, 2009

Applicant: AGERE SYSTEMS INC.

Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
DIFFUSE SOUND SHAPING FOR BCC SCHEMES AND THE LIKE

Publication number: 20090319282

Abstract: In one embodiment, C input audio channels are encoded to generate E transmitted audio channel(s), where one or more cue codes are generated for two or more of the C input channels, and the C input channels are downmixed to generate the E transmitted channel(s), where C>E?1. One or more of the C input channels and the E transmitted channel(s) are analyzed to generate a flag indicating whether or not a decoder of the E transmitted channel(s) should perform envelope shaping during decoding of the E transmitted channel(s). In one implementation, envelope shaping adjusts a temporal envelope of a decoded channel generated by the decoder to substantially match a temporal envelope of a corresponding transmitted channel.

Type: Application

Filed: August 31, 2009

Publication date: December 24, 2009

Applicant: AGERE SYSTEMS INC.

Inventors: Eric Allamanche, Sascha Disch, Christof Faller, Juergen Herre
Subband synthesis filtering process and apparatus

Patent number: 7636660

Abstract: A subband synthesis filtering apparatus for M sets of signals is provided. Each set of signals includes N subband sample signals. The apparatus includes a processor for processing the ith set of signals among the M sets of signals, wherein i is an integer index ranging from 0 to (M?1). The processor includes a DCT converting module and a generating module. The DCT converting module converts the N subband sample signals of the ith set of signals into N converted vectors. If i is an odd number, the (2j?1)th subband sample signal among the N subband sample signals is multiplied by negative one in the converting module, wherein j is an integer index ranging from 1 to (N/2). The generating module generates N pulse code modulation signals based on the N converted vectors.

Type: Grant

Filed: June 15, 2006

Date of Patent: December 22, 2009

Assignee: Quanta Computer Inc.

Inventors: Chih-Wei Hung, Chih-Hsien Chang, Hsien-Ming Tsai

prev … 6 7 8 9 10 11 12 13 14 … next