Using Spectral Analysis, E.g., Transform Vocoders, Subband Vocoders, Perceptual Audio Coders, Psychoacoustically Based Lossy Encoding, Etc., E.g., Mpeg Audio, Dolby Ac-3, Etc. (epo) Patents (Class 704/E19.01)

E Subclasses

Blocking, i.e., grouping of samples in time, choice of analysis window, overlap factor (epo) (Class 704/E19.011)

Detection of transients and attacks for time/frequency resolution switching (EPO) (Class 704/E19.012)

Noise substitution, i.e., substituting nontonal spectral components by noisy source (epo) (Class 704/E19.013)

Spectral prediction for pre-echo prevention; temporal noise shaping (tns), e.g., in mpeg2 or mpeg4, etc. (epo) (Class 704/E19.014)

Quantization or dequantization of spectral components (epo) (Class 704/E19.015)

Using subband decomposition (epo) (Class 704/E19.018)

Subband vocoders (EPO) (Class 704/E19.019)

Using orthogonal transformation (epo) (Class 704/E19.02)

Using wavelet decomposition (EPO) (Class 704/E19.021)

Automated plan synthesis and action dispatch

Patent number: 11960926

Abstract: An artificial intelligence, AI, planning controller control the timing of when a plan (16) to accomplish a task (14) is synthesized. The AI planning controller in this regard determines a quiescent phase (20) during which values of at least some predicates describing a state of the system (12) will remain stable. The AI planning controller then controls artificial intelligence planning to synthesize the plan (16) during at least some of the quiescent phase (20).

Type: Grant

Filed: September 13, 2018

Date of Patent: April 16, 2024

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Swarup Kumar Mohalik, Senthamiz Selvi Arumugam, Chakri Padala
Audio signal compression method and apparatus using deep neural network-based multilayer structure and training method thereof

Patent number: 11881227

Abstract: A method, executed by a processor for compressing an audio signal in multiple layers, may comprise: (a) restoring, in a highest layer, an input audio signal as a first signal; (b) restoring, in at least one intermediate layer, a signal obtained by subtracting an upsampled signal, which is obtained by upsampling the audio signal restored in the highest layer or an immediately previous intermediate layer, from the input audio signal as a second signal; and (c) restoring, in a lowest layer, a signal obtained by subtracting an upsampled signal, which is obtained by upsampling the audio signal restored in an intermediate layer immediately before the lowest layer, from the input audio signal as a third signal, wherein the first signal, the second signal, and the third signal are combined to output a final restoration audio signal.

Type: Grant

Filed: January 13, 2023

Date of Patent: January 23, 2024

Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESERCH INSTITUTE, INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY

Inventors: In Seon Jang, Seung Kwon Beack, Jong Mo Sung, Tae Jin Lee, Woo Taek Lim, Byeong Ho Cho, Hong Goo Kang, Ji Hyun Lee, Chan Woo Lee, Hyung Seob Lim
Digital filterbank for spectral envelope adjustment

Patent number: 11735198

Abstract: An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.

Type: Grant

Filed: August 30, 2021

Date of Patent: August 22, 2023

Assignee: Dolby International AB

Inventor: Per Ekstrand
Encoding parameter adjustment method and apparatus, device, and storage medium

Patent number: 11715481

Abstract: An encoding parameter adjustment method is performed at a computer device. The method includes: obtaining a first audio signal, and determining a psychoacoustic masking threshold within a service frequency band in the first audio signal; obtaining a second audio signal, and determining a background environmental noise estimation value of the frequency within the service frequency band in the second audio signal; determining a masking tag corresponding to the service frequency band according to the psychoacoustic masking threshold of the first audio signal and the background environmental noise estimation value of the second audio signal; determining a masking rate of the service frequency band according to the masking tag corresponding to the frequency within the service frequency band; determining a first reference bit rate according to the masking rate of the service frequency band; and configuring an encoding bit rate of an audio encoder based on the first reference bit rate.

Type: Grant

Filed: July 6, 2021

Date of Patent: August 1, 2023

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor: Junbin Liang
Method of processing residual signal for audio coding, and audio processing apparatus

Patent number: 11508385

Abstract: Disclosed is a method of processing a residual signal for audio coding and an audio coding apparatus. The method learns a feature map of a reference signal through a residual signal learning engine including a convolutional layer and a neural network and performs learning based on a result obtained by mapping a node of an output layer of the neural network and a quantization level of index of the residual signal.

Type: Grant

Filed: November 18, 2019

Date of Patent: November 22, 2022

Assignee: Electronics and Telecommunications Research Institute

Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Hui Yong Kim
Decoding apparatus, encoding apparatus, and methods and programs therefor

Patent number: 11430464

Abstract: A decoding apparatus includes: a bandwidth extending part 25 obtaining a decoded extended frequency spectrum sequence by arranging samples based on K samples included in a frequency-domain sample sequence obtained by decoding, on a higher side than the frequency-domain sample sequence; and a fricative sound adjustment releasing part 23 obtaining, if inputted information indicating whether a hissing sound or not indicates being a hissing sound, what is obtained by exchanging all or a part of a low-side frequency sample sequence existing on a lower side than a predetermined frequency in the decoded extended frequency spectrum sequence for all or a part of a high-side frequency sample sequence existing on a higher side than the predetermined frequency in the decoded extended frequency spectrum sequence as an adjusted frequency spectrum sequence, the number of all or the part of the high-side frequency spectrum sequence being the same as the number of all or the part of the low-side frequency spectrum sequence.

Type: Grant

Filed: December 3, 2018

Date of Patent: August 30, 2022

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Ryosuke Sugiura, Yutaka Kamamoto, Takehiro Moriya
Audio signal decoder, method for decoding an audio signal and computer program using cascaded audio object processing stages

Patent number: 8958566

Abstract: An audio signal decoder for providing an upmix signal representation in dependence on a downmix signal representation and an object-related parametric information includes an object separator configured to decompose the downmix signal representation, to provide a first audio information describing a first set of one or more audio objects of a first audio object type and a second audio information describing a second set of one or more audio objects of a second audio object type, in dependence on the downmix signal representation and using at least a part of the object-related parametric information.

Type: Grant

Filed: December 22, 2011

Date of Patent: February 17, 2015

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Oliver Hellmuth, Cornelia Falch, Juergen Herre, Johannes Hilpert, Leon Terentiv, Falko Ridderbusch
Method for reduction of aliasing introduced by spectral envelope adjustment in real-valued filterbanks

Patent number: 8606587

Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.

Type: Grant

Filed: July 18, 2012

Date of Patent: December 10, 2013

Assignee: Dolby International AB

Inventors: Kristofer Kjorling, Lars Villemoes
Parametric multi-channel audio representation

Patent number: 8498422

Abstract: Multi-channel audio signals are coded into a monaural audio signal and information allowing to recover the multi-channel audio signal from the monaural audio signal and the information. The information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multi-channel audio signal. The second frequency region is a portion of the first frequency region and thus is a sub-range of the first frequency region. The information is multi-layered enabling a scaling of the decoding quality versus bit rate.

Type: Grant

Filed: April 22, 2003

Date of Patent: July 30, 2013

Assignee: Koninklijke Philips N.V.

Inventors: Arnoldus Werner Johannes Oomen, Erik Gosuinus Petrus Schuijers, Dirk Jeroen Breebaart, Steven Leonardus Josephus Dimphina Elisabeth Van De Par
Method and apparatus for spectral cross coherence

Patent number: 8428897

Abstract: The present invention relates to a machine implemented method for spectral analysis that determines a measure of cross coherence between application of two spectral estimation filters to data; and identifies a spectral feature of the measure of cross coherence. One example embodiment of the present invention provides a complete statistical summary of the joint dependence of the Bartlett and Capon power spectral statistics, showing that the coupling is expressible via a 2×2 complex Wishart matrix, where the degree coupling is determined by a single measure of cross coherence defined herein. This measure of coherence leads to a new two-dimensional algorithm capable of yielding significantly better resolution than the Capon algorithm, often commensurate with but at times exceeding finite sample based MUSIC.

Type: Grant

Filed: April 3, 2009

Date of Patent: April 23, 2013

Assignee: Massachusetts Institute of Technology

Inventor: Christ D. Richmond
Apparatus and method for generating a synthesis audio signal using a patching control signal

Patent number: 8386268

Abstract: An apparatus for generating a synthesis audio signal using a patching control signal has a first converter, a spectral domain patch generator, a high frequency reconstruction manipulator and a combiner. The first converter is configured for converting a time portion of an audio signal into a spectral representation. The spectral domain patch generator is configured for performing a plurality of different spectral domain patching algorithms, wherein each patching algorithm generates a modified spectral representation having spectral components in an upper frequency band derived from corresponding spectral components in a core frequency band of the audio signal.

Type: Grant

Filed: May 13, 2011

Date of Patent: February 26, 2013

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Frederik Nagel, Markus Multrus, Jeremie Lecomte, Stefan Bayer, Guillaume Fuchs, Johannes Hilpert, Julien Robilliard
ENCODING METHOD AND APPARATUS, AND DECODING METHOD AND APPARATUS

Publication number: 20130030795

Abstract: An encoding method of an encoder is provided. The encoder generates first MDCT coefficients by transforming an input signal, and generates MDCT indices by quantizing the first MDCT coefficients. The encoder generates second MDCT coefficients by dequantizing the MDCT indices, and calculates MDCT residual coefficients using differences between the first MDCT coefficients and the second MDCT coefficients. The encoder generates a residual index by encoding the MDCT residual coefficients, and generates gain indices corresponding to gains from the first MDCT coefficients and the second MDCT coefficients.

Type: Application

Filed: March 31, 2011

Publication date: January 31, 2013

Inventors: Jongmo Sung, Hyun Woo Kim, Hyun Joo Bae
AUDIO COMMUNICATION DEVICE, METHOD FOR OUTPUTTING AN AUDIO SIGNAL, AND COMMUNICATION SYSTEM

Publication number: 20130024191

Abstract: An audio communication device comprises an input connectable to a narrowband audio signal source. The input can receive a narrowband audio signal having a first bandwidth. An extraction unit is connected to the input and arranged to extract a plurality of narrowband parameters from the narrowband audio signal. An extrapolation unit is connected to receive the plurality of narrowband parameters and arranged to generate a plurality of wideband parameters from the plurality of narrowband parameters. The extrapolation unit comprises one or more adaptive neuro-fuzzy inference system modules. The device further comprises a synthesis unit connected to receive the plurality of wideband parameters and arranged to generate, using the wideband parameters, a synthesized wideband audio signal having a second bandwidth wider than the first bandwidth.

Type: Application

Filed: April 12, 2010

Publication date: January 24, 2013

Applicant: FREESCALE Semiconductor, Inc.

Inventors: Robert Krutsch, Radu D. Pralea
AUDIO SIGNAL CODING AND DECODING METHOD AND DEVICE

Publication number: 20130018660

Abstract: Embodiments of the present invention provide an audio signal coding and decoding method and device. The coding method includes: dividing a frequency band of an audio signal into a plurality of sub-bands, and quantifying a sub-band normalization factor of each sub-band; determining signal bandwidth of bit allocation according to the quantified sub-band normalization factor, or according to the quantified sub-band normalization factor and bit rate information; allocating bits for a sub-band within the determined signal bandwidth; and coding a spectrum coefficient of the audio signal according to the bits allocated for each sub-band. According to embodiments of the present invention, during coding and decoding, signal bandwidth of bit allocation is determined according to the quantified sub-band normalization factor and bit rate information. In this manner, the determined signal bandwidth is effectively coded and decoded by centralizing the bits, and audio quality is improved.

Type: Application

Filed: June 25, 2012

Publication date: January 17, 2013

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Fengyan QI, Zexin LIU, Lei MIAO
SPEECH PROCESSING APPARATUS, SPEECH PROCESSING METHOD AND PROGRAM

Publication number: 20130006618

Abstract: The present invention relates to a speech processing apparatus, a speech processing method and a program which, when multichannel audio signals are downmixed and coded, prevent delay and an increase in the computation amount upon decoding of the audio signals. An inverse multiplexing unit (101) acquires coded data on which a BC parameter is multiplexed. An uncorrelated frequency-time transform unit (102) performs IMDCT transform and IMDST transform of frequency spectrum coefficients of a monaural signal (XM) obtained from this coded data to generate the monaural signal XM) which is a time domain signal and a signal (XD?) which is substantially uncorrelated with this monaural signal (XM). The stereo synthesis unit (103) generates a stereo signal by synthesizing the monaural signal (XM) and the signal (XD?) using the BC parameter. The present invention is applicable to, for example, a speech processing apparatus which decodes a downmixed and coded stereo signal.

Type: Application

Filed: March 8, 2011

Publication date: January 3, 2013

Inventors: Yasuhiro Toguri, Shiro Suzuki, Jun Matsumoto, Yuuji Maeda, Yuuki Matsumura
Method for reduction of aliasing introduced by spectral envelope adjustment in real-valued filterbanks

Patent number: 8346566

Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.

Type: Grant

Filed: August 31, 2010

Date of Patent: January 1, 2013

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Lars Villemoes
VECTOR QUANTIZATION DEVICE, VOICE CODING DEVICE, VECTOR QUANTIZATION METHOD, AND VOICE CODING METHOD

Publication number: 20120278067

Abstract: Provided are a vector quantization device, a voice coding device, a vector quantization method, and a voice coding method which enable a reduction in the calculation amount of voice codec without deterioration of voice quality. In the vector quantization device, a first reference vector calculation unit (201) calculates a first reference vector by multiplying a target vector (x) by an auditory weighting LPC synthesis filter (H), and a second reference vector calculation unit (202) calculates a second reference vector by multiplying an element of the first reference vector by a filter having a high pass characteristic. A polarity preliminary selection unit (205) generates a polar vector by disposing a unit pulse having a positive or negative polarity, which is selected on the basis of the polarity of an element of the second reference vector, in the position of said element.

Type: Application

Filed: December 13, 2010

Publication date: November 1, 2012

Applicant: PANASONIC CORPORATION

Inventor: Toshiyuki Morii
METHOD AND A DECODER FOR ATTENUATION OF SIGNAL REGIONS RECONSTRUCTED WITH LOW ACCURACY

Publication number: 20120278085

Abstract: The embodiments of the present invention improves conventional attenuation schemes by replacing constant attenuation with an adaptive attenuation scheme that allows more aggressive attenuation, without introducing audible change of signal frequency characteristics.

Type: Application

Filed: December 15, 2011

Publication date: November 1, 2012

Applicant: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)

Inventors: Sebastian Näslund, Volodya Grancharov, Erik Norvell
VOICE TRANSFORMATION WITH ENCODED INFORMATION

Publication number: 20120239387

Abstract: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.

Type: Application

Filed: March 17, 2011

Publication date: September 20, 2012

Applicant: International Business Corporation

Inventors: Shay Ben-David, Ron Hoory, Zvi Kons, David Nahamoo
Method, medium, and apparatus encoding and/or decoding extension data for surround

Patent number: 8270617

Abstract: A method, medium, and apparatus encoding and/or decoding an audio signal to surround data. While encoding spatial information, which can up-mix an audio signal to a surround signal, to extension data, a length of a payload corresponding to the spatial information is encoded and a payload of the spatial information is decoded using the length of the payload. Accordingly, compatibility of the spatial information can be provided, and the spatial information can be transmitted by effectively embedding the spatial information.

Type: Grant

Filed: July 12, 2007

Date of Patent: September 18, 2012

Assignee: Samsung Electronics Co., Ltd.

Inventors: Jung-hoe Kim, Eun-mi Oh
TRANSFORM CODER AND TRANSFORM CODING METHOD

Publication number: 20120136653

Abstract: A transform coding apparatus includes an input scale factor calculating section that calculates an input scale factor having a predetermined number of scale factors associated with an input spectrum as an element, and a codebook that stores a plurality of scale factor candidates having a predetermined number of elements and outputs one scale factor candidate. The transform coding apparatus also includes an error calculating section that calculates an error on a per element basis, a weighted error calculating section that determines a weight on a per element basis and calculates a sum of products of the error and the weight to calculate a weighted error, and a searching section that searches for a scale factor candidate that minimizes the weighted error in the codebook.

Type: Application

Filed: February 7, 2012

Publication date: May 31, 2012

Applicant: PANASONIC CORPORATION

Inventors: Masahiro OSHIKIRI, Tomofumi YAMANASHI
METHOD AND APPARATUS FOR ENCODING AND DECODING AUDIO SIGNAL USING LAYERED SINUSOIDAL PULSE CODING

Publication number: 20120095754

Abstract: Provided are a method and an apparatus for encoding and decoding an audio signal. A method for encoding an audio signal includes receiving a transformed audio signal, dividing the transformed audio signal into a plurality of subbands, performing a first sinusoidal pulse coding operation on the subbands, determining a performance region of a second sinusoidal pulse coding operation among the subbands on the basis of coding information of the first sinusoidal pulse coding operation, and performing the second sinusoidal pulse coding operation on the determined performance region, wherein the first sinusoidal pulse coding operation is performed variably according to the coding information. Accordingly, it is possible to further improve the quality of a synthesized signal by considering the sinusoidal pulse coding of a lower layer when encoding or decoding an audio signal in an upper layer by a layered sinusoidal pulse coding scheme.

Type: Application

Filed: May 19, 2010

Publication date: April 19, 2012

Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventors: Mi-Suk Lee, Heesik Yang, Hyun-Woo Kim, Jongmo Sung, Hyun-Joo Bae, Byung-Sun Lee
Apparatus and method for encoding and decoding signal for high frequency bandwidth extension

Publication number: 20120065965

Abstract: An apparatus and method for encoding and decoding a signal for high frequency bandwidth extension are provided. An encoding apparatus may down-sample a time domain input signal, may core-encode the down-sampled time domain input signal, may transform the core-encoded time domain input signal to a frequency domain input signal, and may perform bandwidth extension encoding using a basic signal of the frequency domain input signal.

Type: Application

Filed: September 12, 2011

Publication date: March 15, 2012

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Ki Hyun Choo, Eun Mi Oh, Ho Sang Sung
Method for reduction of aliasing introduced by spectral envelope adjustment in real-valued filterbanks

Patent number: 8108209

Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.

Type: Grant

Filed: May 26, 2009

Date of Patent: January 31, 2012

Assignee: Coding Technologies Sweden AB

Inventors: Kristofer Kjoerling, Lars Villemoes
Energy Envelope Perceptual Correction for High Band Coding

Publication number: 20120016668

Abstract: In accordance with an embodiment, A method of encoding an audio bitstream at an encoder includes encoding an original low band signal at the encoder by using a closed loop analysis-by-synthesis approach to obtain a coded low band signal, encoding an original high band signal at the encoder by using an open loop energy matching approach to obtain coded high band energy envelopes, comparing an energy of the coded low band signal with an energy of a corresponding original low band signal for a subframe, and generating an indication flag that indicates whether an energy envelope perceptual correction is needed for the subframe based on comparing the energy.

Type: Application

Filed: July 19, 2011

Publication date: January 19, 2012

Applicant: FutureWei Technologies, Inc.

Inventor: Yang Gao
APPARATUS, METHOD AND COMPUTER PROGRAM FOR GENERATING A REPRESENTATION OF A BANDWIDTH-EXTENDED SIGNAL ON THE BASIS OF AN INPUT SIGNAL REPRESENTATION USING A COMBINATION OF A HARMONIC BANDWIDTH-EXTENSION AND A NON-HARMONIC BANDWIDTH-EXTENSION

Publication number: 20120010880

Abstract: An apparatus for generating a representation of a bandwidth-extended signal on the basis of an input signal representation includes a phase vocoder configured to obtain values of a spectral domain representation of a first patch of the bandwidth-extended signal on the basis of the input signal representation. The apparatus also includes a value copier configured to copy a set of values of the spectral domain representation of the first patch, which values are provided by the phase vocoder, to obtain a set of values of a spectral domain representation of a second patch, wherein the second patch is associated with higher frequencies than the first patch. The apparatus is configured to obtain the representation of the bandwidth-extended signal using the values of the spectral domain representation of the first patch and the values of the spectral domain representation of the second patch.

Type: Application

Filed: April 1, 2010

Publication date: January 12, 2012

Inventors: Frederik Nagel, Max Neuendorf, Nikolaus Rettelbach, Jeremie Lecomte, Markus Multrus, Bernhard Grill, Sascha Disch
Method and apparatus for non-overlapped transforming of an audio signal, method and apparatus for adaptively encoding audio signal with the transforming, method and apparatus for inverse non-overlapped transforming of an audio signal, and method and apparatus for adaptively decoding audio signal with the inverse transforming

Patent number: 8086446

Abstract: A method and apparatus for transforming an audio signal, a method and apparatus for adaptively encoding an audio signal, a method and apparatus for inversely transforming an audio signal, and a method and apparatus for adaptively decoding an audio signal. The method of transforming an audio signal includes determining a transform unit into which the audio signal in a time domain is to be transformed into an audio signal in a frequency domain, and transforming the audio signal into an audio signal in the frequency domain according to the determined transform units using a window coefficient other than 0. Accordingly, it is possible to minimize distortion of the audio signal when encoding the audio signal even at a high bit rate while increasing efficiency of compression.

Type: Grant

Filed: December 7, 2005

Date of Patent: December 27, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Eunmi Oh, Junghoe Kim, Boris Kudryashov, Konstantin Osipov
Method And Arrangement For Processing Of Audio Signals

Publication number: 20110282656

Abstract: Method and decoder for processing of audio signals. The method and decoder relate to deriving a processed vector {circumflex over (d)} by applying a post-filter directly on a vector d comprising quantized MDCT domain coefficients of a time segment of an audio signal. The post-filter is configured to have a transfer function H which is a compressed version of the envelope of the vector d. A signal waveform is reconstructed by performing an inverse MDCT transform on the processed vector {circumflex over (d)}.

Type: Application

Filed: May 10, 2011

Publication date: November 17, 2011

Inventors: Volodya GRANCHAROV, Sigurdur Sverrisson
DECODER FOR AUDIO SIGNAL INCLUDING GENERIC AUDIO AND SPEECH FRAMES

Publication number: 20110218799

Abstract: A method for decoding audio frames includes producing a first frame of coded audio samples, producing at least a portion of a second frame of coded audio samples, generating audio gap filler samples based on parameters representative of a weighted segment of the first frame of coded audio samples or a weighted segment of the portion of the second frame of coded audio samples, and forming a sequence including the audio gap filler samples and the portion of the second frame of coded audio samples.

Type: Application

Filed: September 9, 2010

Publication date: September 8, 2011

Applicant: MOTOROLA, INC.

Inventors: Udar Mittal, Joanthan A. Gibbs, James P. Ashley
Band based audio coding and decoding apparatuses, methods, and recording media for scalability

Patent number: 8015017

Abstract: Audio coding and decoding apparatuses and methods which support fine granularity scalability (FGS) using harmonic information of a high-band audio signal or wideband error audio signal when performing wideband audio coding and decoding, and recording mediums on which the methods are stored. The audio coding method includes detecting harmonics of a high-band audio signal or wideband error audio signal of an input audio signal; determining an order of the detected harmonics; and coding the detected harmonics based on the determined order.

Type: Grant

Filed: January 24, 2006

Date of Patent: September 6, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Hosang Sung, Rakesh Taori, Kangeun Lee
SOUND PICKUP APPARATUS, PORTABLE COMMUNICATION APPARATUS, AND IMAGE PICKUP APPARATUS

Publication number: 20110200205

Abstract: A sound pickup apparatus includes: a microphone array including at least three microphones, wherein a first pair of microphones in which two of the at least three microphones are aligned on a first axis, and a second pair of microphones in which two of the at least three microphones are aligned on a second axis; a first null signal generator which outputs a first null signal based on a differential output of the first pair of microphones; a second null signal generator which outputs a second null signal based on a differential output of the second pair of microphones; and a combiner which generates a target signal based on the first null signal and the second null signal, the target signal having a directional characteristic in which the lowest sensitivity is formed in a direction to a line along which the first null surface meets the second null surface.

Type: Application

Filed: February 17, 2010

Publication date: August 18, 2011

Applicant: PANASONIC CORPORATION

Inventor: Toshimichi Tokuda
Apparatus and a Method for Generating Bandwidth Extension Output Data

Publication number: 20110202352

Abstract: An apparatus for generating bandwidth extension output data for an audio signal has a noise floor measurer, a signal energy characterizer and a processor. The audio signal has components in a first frequency band and components in a second frequency band, the bandwidth extension output data are adapted to control a synthesis of the components in the second frequency band. The noise floor measurer measures noise floor data of the second frequency band for a time portion of the audio signal. The signal energy characterizer derives energy distribution data, the energy distribution data characterizing an energy distribution in a spectrum of the time portion of the audio signal. The processor combines the noise floor data and the energy distribution data to obtain the bandwidth extension output data.

Type: Application

Filed: January 11, 2011

Publication date: August 18, 2011

Inventors: Max Neuendorf, Bernhard Grill, Ulrich Kraemer, Markus Multrus, Harald Popp, Nikolaus Rettelbach, Frederik Nagel, Markus Lohwasser, Marc Gayer, Manuel Jander, Virgilio Bacigalupo
SYSTEM AND METHOD FOR ENCODING AND DECODING PULSE INDICES

Publication number: 20110184733

Abstract: Methods, and corresponding codec-containing devices are provided that have source coding schemes for encoding a component of an excitation. In some cases, the source coding scheme is an enumerative source coding scheme, while in other cases the source coding scheme is an arithmetic source coding scheme. In some cases, the source coding schemes are applied to encode a fixed codebook component of the excitation for a codec employing codebook excited linear prediction, for example an AMR-WB (Adaptive Multi-Rate-Wideband) speech codec.

Type: Application

Filed: January 22, 2010

Publication date: July 28, 2011

Applicant: RESEARCH IN MOTION LIMITED

Inventors: Xiang YU, Dake HE, En-hui YANG
TIME WARP ACTIVATION SIGNAL PROVIDER, AUDIO SIGNAL ENCODER, METHOD FOR PROVIDING A TIME WARP ACTIVATION SIGNAL, METHOD FOR ENCODING AN AUDIO SIGNAL AND COMPUTER PROGRAMS

Publication number: 20110178795

Abstract: An audio encoder has a window function controller, a windower, a time warper with a final quality check functionality, a time/frequency converter, a TNS stage or a quantizer encoder, the window function controller, the time warper, the TNS stage or an additional noise filling analyzer are controlled by signal analysis results obtained by a time warp analyzer or a signal classifier. Furthermore, a decoder applies a noise filling operation using a manipulated noise filling estimate depending on a harmonic or speech characteristic of the audio signal.

Type: Application

Filed: January 11, 2011

Publication date: July 21, 2011

Inventors: Stefan Bayer, Sascha Disch, Ralf Geiger, Guillaume Fuchs, Max Neuendorf, Gerald Schuller, Bernd Edler
Advanced methods for interpolation and parameter signalling

Patent number: 7974847

Abstract: A parameter calculator calculates lower resolution parametric information and interpolation information. On a decoder-side, an upmixer is used for generating the output channels. The upmixer uses high resolution parametric information generated by a parameter interpolator using the low resolution parametric information and decoder-side derived interpolation information or encoder-generated interpolation information for selecting one of a plurality of different interpolation characteristics.

Type: Grant

Filed: November 22, 2005

Date of Patent: July 5, 2011

Assignee: Coding Technologies AB

Inventors: Kristofer Kjoerling, Heiko Purnhagen, Jonas Engdegard, Jonas Roeden
SPECTRAL SMOOTHING DEVICE, ENCODING DEVICE, DECODING DEVICE, COMMUNICATION TERMINAL DEVICE, BASE STATION DEVICE, AND SPECTRAL SMOOTHING METHOD

Publication number: 20110137643

Abstract: Disclosed is a spectral smoothing device with a structure whereby smoothing is performed after a nonlinear conversion has been performed for a spectrum calculated from an audio signal, and with which the amount of processing calculation is significantly reduced while maintaining excellent audio quality. With this spectral smoothing device, a sub band division unit (102) divides an input spectrum into multiple sub bands; a representative value calculation unit (103) calculates a representative value for each sub band using an arithmetic mean and a geometric mean; with respect to each representative value, a nonlinear conversion unit (104) performs a nonlinear conversion the characteristic of which is further emphasized as the value increases; and a smoothing unit (105) that smoothes the representative value which has undergone the nonlinear conversion for each sub band, at the frequency domain.

Type: Application

Filed: August 7, 2009

Publication date: June 9, 2011

Inventors: Tomofumi Yamanashi, Masahiro Oshikiri, Toshiyuki Morii, Hiroyuki Ehara
APPARATUS FOR ENCODING AND DECODING OF INTEGRATED SPEECH AND AUDIO

Publication number: 20110119054

Abstract: Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal, may include: a module selection unit to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit to encode the input signal according to a selection of the module selection unit and to generate a speech bitstream; an audio encoding unit to encode the input signal according to the selection of the module selection unit and to generate an audio bitstream; and a bitstream generation unit to generate an output bitstream from the speech encoding unit or the audio encoding unit according to the selection of the module selection unit.

Type: Application

Filed: July 14, 2009

Publication date: May 19, 2011

Inventors: Tae Jin Lee, Seung Kwon Beack, Minje Kim, Dae Young Jang, Kyeongok Kang, Jin Woo Hong, Hochong Park, Young-Cheol Park
APPARATUS AND METHOD FOR ENCODING AND DECODING ENHANCEMENT LAYER

Publication number: 20110106532

Abstract: Provided is a method and apparatus for encoding and decoding an enhancement layer to reduce quantization error in a G.711 codec. Exponent indices of additional mantissa information of each sample are calculated based upon exponent information of each sample in a frame. A process of allocating 1 bit to each sample with a current exponent index is repeated, the exponent index starting from the maximum value while decreasing by 1 at every repetition until the total number of bits allocated to the samples is equal to the total number of available bits in the frame. And the most significant bits, as many as the number of bits allocated to each sample, are extracted from the additional mantissa information of each sample in the frame.

Type: Application

Filed: August 18, 2008

Publication date: May 5, 2011

Inventors: Jongmo Sung, Do-Young Kim
TRANSFORM CODING OF SPEECH AND AUDIO SIGNALS

Publication number: 20110035212

Abstract: In a method of perceptual transform coding of audio signals in a telecommunication system, performing the steps of determining transform coefficients representative of a time to frequency transformation of a time segmented input audio signal; determining a spectrum of perceptual sub-bands for said input audio signal based on said determined transform coefficients; determining masking thresholds for each said sub-band based on said determined spectrum; computing scale factors for each said sub-band based on said determined masking thresholds, and finally adapting said computed scale factors for each said sub-band to prevent energy loss for perceptually relevant sub-bands.

Type: Application

Filed: August 26, 2008

Publication date: February 10, 2011

Applicant: Telefonaktiebolaget L M Ericsson (publ)

Inventors: Manuel Briand, Anisse Taleb
Audio encoding with different coding frame lengths

Patent number: 7860709

Abstract: The invention relates to a method for supporting an encoding of an audio signal, wherein at least one section of the audio signal is to be encoded with a coding model that allows the use of different coding frame lengths. In order to enable a simple selection of the respectively best suited coding frame length, it is proposed that at least one control parameter is determined based on signal characteristics of the audio signal. The control parameter is then used for limiting the options of possible coding frame lengths for the at least one section. The invention relates equally to a module 10,11 in which this method is implemented, to a device 1 and a system comprising such a module 10,11, and to a software program product including a software code for realizing the proposed method.

Type: Grant

Filed: May 13, 2005

Date of Patent: December 28, 2010

Assignee: Nokia Corporation

Inventor: Jari Mäkinen
ENCODING DEVICE, DECODING DEVICE, AND METHOD THEREOF

Publication number: 20100262421

Abstract: Provided is an encoding device which improves the sound quality of a stereo signal while maintaining a low bit rate. The encoding device includes: an LP inverse filter (121) which LP-inverse-filterS a left signal L(n) by using an inverse quantization linear prediction coefficient AdM(z) of a monaural signal; a T/F conversion unit (122) which converts the left sound source signal Le(n) from a temporal region to a frequency region; an inverse quantizer (123) which inverse-quantizes encoded information Mqe; spectrum division units (124, 125) which divide a high-frequency component of the sound source signal Mde(f) and the left signal Le(f) into a plurality of bands; and scale factor calculation units (126, 127) which calculate scale factors ai and ssi by using a monaural sound source signal Mdeh,i(f), a left sound source signal Leh,i(f), Mdeh,i(f), and right sound source signal Reh,i(f) of each divided band.

Type: Application

Filed: November 4, 2008

Publication date: October 14, 2010

Applicant: PANASONIC CORPORATION

Inventors: Kok Seng Chong, Koji Yoshida, Masahiro Oshikiri
ENCODER AND DECODER

Publication number: 20100250244

Abstract: There is provided an encoder capable of improving inter-channel prediction (ICP) performance in scalable stereo sound encoding using an ICP. In the encoder, ICP analysis units (113, 114, 115) use, as reference signal candidates, a frequency coefficient (sL?(f)) in the low-band portion of a side residual signal, a frequency coefficient (mM,i(f)) in each sub-band portion of a monaural residual signal, and a frequency coefficient (mL(f)) in the low-band portion of the monaural residual signal, respectively, and perform an ICP analysis between the respective these candidates and a frequency coefficient (sM,i(f)) in each sub-band portion of the side residual signal to generate first, second, and third ICP coefficients.

Type: Application

Filed: October 31, 2008

Publication date: September 30, 2010

Applicant: PANASONIC CORPORATION

Inventors: Haishan Zhong, Zongxian Liu, Kok Seng Chong, Koji Yoshida
TRANSCODING METHOD, TRANSCODING DEVICE AND COMMUNICATION APPARATUS

Publication number: 20100185440

Abstract: The embodiments of a transcoding method, a transcoding device, and a communication apparatus are provided. The embodiment of a method includes: receiving a bit stream input from a sending end; determining an attribute of discontinuous transmission (DTX) used by a receiving end and a frame type of the input bit stream; and transcoding the input bit stream in a corresponding processing manner according to a determination result. Thereby, a corresponding transcoding operation is performed on the input bit stream according to the attribute of DTX used by the receiving end and the frame type of the input bit stream. In such a manner, input bit streams of various types can be processed, and the input bit streams can be correspondingly transcoded according to the requirements of the receiving end. Therefore, the average computational complexity and peak computational complexity can be effectively decreased without decreasing the quality of the synthesized speech.

Type: Application

Filed: January 21, 2010

Publication date: July 22, 2010

Inventors: Changchun Bao, Hao Xu, Fanrong Tang, Xiangyu Hu
METHOD AND APPARATUS FOR ADAPTIVE SUB-BAND ALLOCATION OF SPECTRAL COEFFICIENTS

Publication number: 20100161320

Abstract: An apparatus and method for adaptive sub-band allocation of spectral coefficients are disclosed. The sizes of sub-bands are determined according to the distribution of spectral coefficients transformed from an input speech/audio signal to perform more elaborate quantization in units of sub-bands. Thus, quantization noise of the spectral coefficients is reduced, and sound quality in a frequency region is enhanced, thereby improving the quality of the signal.

Type: Application

Filed: September 9, 2009

Publication date: June 24, 2010

Inventors: Hyun Woo KIM, Hyun Joo BAE, Byung Sun LEE
SAMPLING RATE CONVERSION APPARATUS, CODING APPARATUS, DECODING APPARATUS AND METHODS THEREOF

Publication number: 20100161321

Abstract: A coding apparatus reduces a circuit scale and the amount of coding processing calculation. A frequency domain conversion section performs a frequency analysis of the signal sampled at a sampling rate Fx with an analysis length of 2·Na and calculates first spectrum S1(k)(0?k<Na). A band extension section extends the effective frequency band of first spectrum S1(k) to 0?k<Nb so that a new spectrum can be assigned to the extended area following to the frequency k=Na of first spectrum S1(k). An extended spectrum assignment section assigns extended spectrum S1?(k)(Na?k<Nb) input to the extended frequency band from the outside. A spectral information specification section outputs information necessary to specify extended spectrum S1?(k) out of the spectrum given from the extended spectrum assignment section as a code.

Type: Application

Filed: February 18, 2010

Publication date: June 24, 2010

Applicant: PANASONIC CORPORATION

Inventor: Masahiro OSHIKIRI
Method and Related Device for Simplifying Psychoacoustic Analysis with Spectral Flatness Characteristic Values

Publication number: 20100145682

Abstract: The present invention applies spectral flatness characteristic values to simplify psychoacoustic analysis of a sound signal. If the sound signal comprises a plurality of frames, the present invention calculates the energy of the sound signal in a frequency domain, calculates a plurality of spectral flatness, and decides to use a short-block or a long-block Modified Discrete Cosine Transform accordingly. If the sound signal comprises left and right channel signals, the present invention performs psychoacoustic analysis on the sound signal to count energy of the left and right channel signals in a frequency domain, counts spectral flatness of the left and right channel signals, and decides to use middle/side transform or left and right channel encoding to transform the left and right channel signals accordingly.

Type: Application

Filed: March 27, 2009

Publication date: June 10, 2010

Inventor: Yi-Lun Ho
CODING/DECODING OF DIGITAL AUDIO SIGNALS

Publication number: 20100121646

Abstract: The invention relates to the coding/decoding of a signal into several sub-bands, in which at least a first and a second sub-bands which are adjacent are transform coded (601, 602). In particular, in order to apply a perceptual weighting, in the transformed domain, to at least the second sub-band, the method comprises:—determining at least one frequency masking threshold (606) to be applied on the second sub-band; and normalizing said masking threshold in order to provide a spectral continuity between the above-mentioned first and second sub-bands. An advantageous application of the invention involves a perceptual weighting of the high-frequency band in the TDAC transform coding of a hierarchical encoder according to standard G.729.1.

Type: Application

Filed: January 30, 2008

Publication date: May 13, 2010

Applicant: France Telecom

Inventors: Stéphane Ragot, Cyril Gukllaume
ADAPTIVE SOUND SOURCE VECTOR QUANTIZATION DEVICE AND ADAPTIVE SOUND SOURCE VECTOR QUANTIZATION METHOD

Publication number: 20100063804

Abstract: Provided is an adaptive sound source vector quantization device which can always perform a pitch cycle search with a resolution appropriate for any section of the pitch cycle search range of a second sub-frame when a pitch cycle search range of the second sub-frame changes in accordance with a pitch cycle of a first sub-frame. The device includes a first pitch cycle instruction unit (111), a search range calculation unit (112), and a second pitch cycle instruction unit (113). The first pitch cycle instruction unit (111) successively instructs pitch cycle search candidates in a predetermined search range having a search resolution which transits over a predetermined pitch cycle candidate for the first sub-frame. The search range calculation unit (112) calculates a predetermined range before and after the pitch cycle of the first sub-frame as the pitch cycle search range for the second sub-frame, if the predetermined range includes the predetermined pitch cycle search candidate.

Type: Application

Filed: February 29, 2008

Publication date: March 11, 2010

Applicant: PANASONIC CORPORATION

Inventors: Kaoru Sato, Toshiyuki Morii
Voice processing apparatus and method

Publication number: 20100023321

Abstract: Character extraction section extracts character amounts, pertaining to a prosody of voice, from a voice signal sequentially in a time-serial manner. Difference value calculation calculates a difference value between each of the extracted character amounts and a reference value. Processing values, corresponding to the individual character amounts, are generated in accordance with the respective difference values, and a voice processing section controls the individual character amounts of the voice signal in accordance with the processing values corresponding to the character amounts and thereby generates an output signal having a prosody changed from the prosody of the voice signal.

Type: Application

Filed: July 22, 2009

Publication date: January 28, 2010

Applicant: Yamaha Corporation

Inventor: Yasuo Yoshioka
CODING OF TRANSITIONAL SPEECH FRAMES FOR LOW-BIT-RATE APPLICATIONS

Publication number: 20090319263

Abstract: Systems, methods, and apparatus for low-bit-rate coding of transitional speech frames are disclosed.

Type: Application

Filed: October 30, 2008

Publication date: December 24, 2009

Applicant: QUALCOMM Incorporated

Inventors: Alok Kumar Gupta, Sharath Manjunath

1 2 next