Using Spectral Analysis, E.g., Transform Vocoders, Subband Vocoders, Perceptual Audio Coders, Psychoacoustically Based Lossy Encoding, Etc., E.g., Mpeg Audio, Dolby Ac-3, Etc. (epo) Patents (Class 704/E19.01)
  • Patent number: 11960926
    Abstract: An artificial intelligence, AI, planning controller control the timing of when a plan (16) to accomplish a task (14) is synthesized. The AI planning controller in this regard determines a quiescent phase (20) during which values of at least some predicates describing a state of the system (12) will remain stable. The AI planning controller then controls artificial intelligence planning to synthesize the plan (16) during at least some of the quiescent phase (20).
    Type: Grant
    Filed: September 13, 2018
    Date of Patent: April 16, 2024
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Swarup Kumar Mohalik, Senthamiz Selvi Arumugam, Chakri Padala
  • Patent number: 11881227
    Abstract: A method, executed by a processor for compressing an audio signal in multiple layers, may comprise: (a) restoring, in a highest layer, an input audio signal as a first signal; (b) restoring, in at least one intermediate layer, a signal obtained by subtracting an upsampled signal, which is obtained by upsampling the audio signal restored in the highest layer or an immediately previous intermediate layer, from the input audio signal as a second signal; and (c) restoring, in a lowest layer, a signal obtained by subtracting an upsampled signal, which is obtained by upsampling the audio signal restored in an intermediate layer immediately before the lowest layer, from the input audio signal as a third signal, wherein the first signal, the second signal, and the third signal are combined to output a final restoration audio signal.
    Type: Grant
    Filed: January 13, 2023
    Date of Patent: January 23, 2024
    Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESERCH INSTITUTE, INDUSTRY-ACADEMIC COOPERATION FOUNDATION, YONSEI UNIVERSITY
    Inventors: In Seon Jang, Seung Kwon Beack, Jong Mo Sung, Tae Jin Lee, Woo Taek Lim, Byeong Ho Cho, Hong Goo Kang, Ji Hyun Lee, Chan Woo Lee, Hyung Seob Lim
  • Patent number: 11735198
    Abstract: An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.
    Type: Grant
    Filed: August 30, 2021
    Date of Patent: August 22, 2023
    Assignee: Dolby International AB
    Inventor: Per Ekstrand
  • Patent number: 11715481
    Abstract: An encoding parameter adjustment method is performed at a computer device. The method includes: obtaining a first audio signal, and determining a psychoacoustic masking threshold within a service frequency band in the first audio signal; obtaining a second audio signal, and determining a background environmental noise estimation value of the frequency within the service frequency band in the second audio signal; determining a masking tag corresponding to the service frequency band according to the psychoacoustic masking threshold of the first audio signal and the background environmental noise estimation value of the second audio signal; determining a masking rate of the service frequency band according to the masking tag corresponding to the frequency within the service frequency band; determining a first reference bit rate according to the masking rate of the service frequency band; and configuring an encoding bit rate of an audio encoder based on the first reference bit rate.
    Type: Grant
    Filed: July 6, 2021
    Date of Patent: August 1, 2023
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventor: Junbin Liang
  • Patent number: 11508385
    Abstract: Disclosed is a method of processing a residual signal for audio coding and an audio coding apparatus. The method learns a feature map of a reference signal through a residual signal learning engine including a convolutional layer and a neural network and performs learning based on a result obtained by mapping a node of an output layer of the neural network and a quantization level of index of the residual signal.
    Type: Grant
    Filed: November 18, 2019
    Date of Patent: November 22, 2022
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Hui Yong Kim
  • Patent number: 11430464
    Abstract: A decoding apparatus includes: a bandwidth extending part 25 obtaining a decoded extended frequency spectrum sequence by arranging samples based on K samples included in a frequency-domain sample sequence obtained by decoding, on a higher side than the frequency-domain sample sequence; and a fricative sound adjustment releasing part 23 obtaining, if inputted information indicating whether a hissing sound or not indicates being a hissing sound, what is obtained by exchanging all or a part of a low-side frequency sample sequence existing on a lower side than a predetermined frequency in the decoded extended frequency spectrum sequence for all or a part of a high-side frequency sample sequence existing on a higher side than the predetermined frequency in the decoded extended frequency spectrum sequence as an adjusted frequency spectrum sequence, the number of all or the part of the high-side frequency spectrum sequence being the same as the number of all or the part of the low-side frequency spectrum sequence.
    Type: Grant
    Filed: December 3, 2018
    Date of Patent: August 30, 2022
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Ryosuke Sugiura, Yutaka Kamamoto, Takehiro Moriya
  • Patent number: 8958566
    Abstract: An audio signal decoder for providing an upmix signal representation in dependence on a downmix signal representation and an object-related parametric information includes an object separator configured to decompose the downmix signal representation, to provide a first audio information describing a first set of one or more audio objects of a first audio object type and a second audio information describing a second set of one or more audio objects of a second audio object type, in dependence on the downmix signal representation and using at least a part of the object-related parametric information.
    Type: Grant
    Filed: December 22, 2011
    Date of Patent: February 17, 2015
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Oliver Hellmuth, Cornelia Falch, Juergen Herre, Johannes Hilpert, Leon Terentiv, Falko Ridderbusch
  • Patent number: 8606587
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.
    Type: Grant
    Filed: July 18, 2012
    Date of Patent: December 10, 2013
    Assignee: Dolby International AB
    Inventors: Kristofer Kjorling, Lars Villemoes
  • Patent number: 8498422
    Abstract: Multi-channel audio signals are coded into a monaural audio signal and information allowing to recover the multi-channel audio signal from the monaural audio signal and the information. The information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multi-channel audio signal. The second frequency region is a portion of the first frequency region and thus is a sub-range of the first frequency region. The information is multi-layered enabling a scaling of the decoding quality versus bit rate.
    Type: Grant
    Filed: April 22, 2003
    Date of Patent: July 30, 2013
    Assignee: Koninklijke Philips N.V.
    Inventors: Arnoldus Werner Johannes Oomen, Erik Gosuinus Petrus Schuijers, Dirk Jeroen Breebaart, Steven Leonardus Josephus Dimphina Elisabeth Van De Par
  • Patent number: 8428897
    Abstract: The present invention relates to a machine implemented method for spectral analysis that determines a measure of cross coherence between application of two spectral estimation filters to data; and identifies a spectral feature of the measure of cross coherence. One example embodiment of the present invention provides a complete statistical summary of the joint dependence of the Bartlett and Capon power spectral statistics, showing that the coupling is expressible via a 2×2 complex Wishart matrix, where the degree coupling is determined by a single measure of cross coherence defined herein. This measure of coherence leads to a new two-dimensional algorithm capable of yielding significantly better resolution than the Capon algorithm, often commensurate with but at times exceeding finite sample based MUSIC.
    Type: Grant
    Filed: April 3, 2009
    Date of Patent: April 23, 2013
    Assignee: Massachusetts Institute of Technology
    Inventor: Christ D. Richmond
  • Patent number: 8386268
    Abstract: An apparatus for generating a synthesis audio signal using a patching control signal has a first converter, a spectral domain patch generator, a high frequency reconstruction manipulator and a combiner. The first converter is configured for converting a time portion of an audio signal into a spectral representation. The spectral domain patch generator is configured for performing a plurality of different spectral domain patching algorithms, wherein each patching algorithm generates a modified spectral representation having spectral components in an upper frequency band derived from corresponding spectral components in a core frequency band of the audio signal.
    Type: Grant
    Filed: May 13, 2011
    Date of Patent: February 26, 2013
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Frederik Nagel, Markus Multrus, Jeremie Lecomte, Stefan Bayer, Guillaume Fuchs, Johannes Hilpert, Julien Robilliard
  • Publication number: 20130030795
    Abstract: An encoding method of an encoder is provided. The encoder generates first MDCT coefficients by transforming an input signal, and generates MDCT indices by quantizing the first MDCT coefficients. The encoder generates second MDCT coefficients by dequantizing the MDCT indices, and calculates MDCT residual coefficients using differences between the first MDCT coefficients and the second MDCT coefficients. The encoder generates a residual index by encoding the MDCT residual coefficients, and generates gain indices corresponding to gains from the first MDCT coefficients and the second MDCT coefficients.
    Type: Application
    Filed: March 31, 2011
    Publication date: January 31, 2013
    Inventors: Jongmo Sung, Hyun Woo Kim, Hyun Joo Bae
  • Publication number: 20130024191
    Abstract: An audio communication device comprises an input connectable to a narrowband audio signal source. The input can receive a narrowband audio signal having a first bandwidth. An extraction unit is connected to the input and arranged to extract a plurality of narrowband parameters from the narrowband audio signal. An extrapolation unit is connected to receive the plurality of narrowband parameters and arranged to generate a plurality of wideband parameters from the plurality of narrowband parameters. The extrapolation unit comprises one or more adaptive neuro-fuzzy inference system modules. The device further comprises a synthesis unit connected to receive the plurality of wideband parameters and arranged to generate, using the wideband parameters, a synthesized wideband audio signal having a second bandwidth wider than the first bandwidth.
    Type: Application
    Filed: April 12, 2010
    Publication date: January 24, 2013
    Applicant: FREESCALE Semiconductor, Inc.
    Inventors: Robert Krutsch, Radu D. Pralea
  • Publication number: 20130018660
    Abstract: Embodiments of the present invention provide an audio signal coding and decoding method and device. The coding method includes: dividing a frequency band of an audio signal into a plurality of sub-bands, and quantifying a sub-band normalization factor of each sub-band; determining signal bandwidth of bit allocation according to the quantified sub-band normalization factor, or according to the quantified sub-band normalization factor and bit rate information; allocating bits for a sub-band within the determined signal bandwidth; and coding a spectrum coefficient of the audio signal according to the bits allocated for each sub-band. According to embodiments of the present invention, during coding and decoding, signal bandwidth of bit allocation is determined according to the quantified sub-band normalization factor and bit rate information. In this manner, the determined signal bandwidth is effectively coded and decoded by centralizing the bits, and audio quality is improved.
    Type: Application
    Filed: June 25, 2012
    Publication date: January 17, 2013
    Applicant: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Fengyan QI, Zexin LIU, Lei MIAO
  • Publication number: 20130006618
    Abstract: The present invention relates to a speech processing apparatus, a speech processing method and a program which, when multichannel audio signals are downmixed and coded, prevent delay and an increase in the computation amount upon decoding of the audio signals. An inverse multiplexing unit (101) acquires coded data on which a BC parameter is multiplexed. An uncorrelated frequency-time transform unit (102) performs IMDCT transform and IMDST transform of frequency spectrum coefficients of a monaural signal (XM) obtained from this coded data to generate the monaural signal XM) which is a time domain signal and a signal (XD?) which is substantially uncorrelated with this monaural signal (XM). The stereo synthesis unit (103) generates a stereo signal by synthesizing the monaural signal (XM) and the signal (XD?) using the BC parameter. The present invention is applicable to, for example, a speech processing apparatus which decodes a downmixed and coded stereo signal.
    Type: Application
    Filed: March 8, 2011
    Publication date: January 3, 2013
    Inventors: Yasuhiro Toguri, Shiro Suzuki, Jun Matsumoto, Yuuji Maeda, Yuuki Matsumura
  • Patent number: 8346566
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.
    Type: Grant
    Filed: August 31, 2010
    Date of Patent: January 1, 2013
    Assignee: Dolby International AB
    Inventors: Kristofer Kjoerling, Lars Villemoes
  • Publication number: 20120278067
    Abstract: Provided are a vector quantization device, a voice coding device, a vector quantization method, and a voice coding method which enable a reduction in the calculation amount of voice codec without deterioration of voice quality. In the vector quantization device, a first reference vector calculation unit (201) calculates a first reference vector by multiplying a target vector (x) by an auditory weighting LPC synthesis filter (H), and a second reference vector calculation unit (202) calculates a second reference vector by multiplying an element of the first reference vector by a filter having a high pass characteristic. A polarity preliminary selection unit (205) generates a polar vector by disposing a unit pulse having a positive or negative polarity, which is selected on the basis of the polarity of an element of the second reference vector, in the position of said element.
    Type: Application
    Filed: December 13, 2010
    Publication date: November 1, 2012
    Applicant: PANASONIC CORPORATION
    Inventor: Toshiyuki Morii
  • Publication number: 20120278085
    Abstract: The embodiments of the present invention improves conventional attenuation schemes by replacing constant attenuation with an adaptive attenuation scheme that allows more aggressive attenuation, without introducing audible change of signal frequency characteristics.
    Type: Application
    Filed: December 15, 2011
    Publication date: November 1, 2012
    Applicant: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)
    Inventors: Sebastian Näslund, Volodya Grancharov, Erik Norvell
  • Publication number: 20120239387
    Abstract: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.
    Type: Application
    Filed: March 17, 2011
    Publication date: September 20, 2012
    Applicant: International Business Corporation
    Inventors: Shay Ben-David, Ron Hoory, Zvi Kons, David Nahamoo
  • Patent number: 8270617
    Abstract: A method, medium, and apparatus encoding and/or decoding an audio signal to surround data. While encoding spatial information, which can up-mix an audio signal to a surround signal, to extension data, a length of a payload corresponding to the spatial information is encoded and a payload of the spatial information is decoded using the length of the payload. Accordingly, compatibility of the spatial information can be provided, and the spatial information can be transmitted by effectively embedding the spatial information.
    Type: Grant
    Filed: July 12, 2007
    Date of Patent: September 18, 2012
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jung-hoe Kim, Eun-mi Oh
  • Publication number: 20120136653
    Abstract: A transform coding apparatus includes an input scale factor calculating section that calculates an input scale factor having a predetermined number of scale factors associated with an input spectrum as an element, and a codebook that stores a plurality of scale factor candidates having a predetermined number of elements and outputs one scale factor candidate. The transform coding apparatus also includes an error calculating section that calculates an error on a per element basis, a weighted error calculating section that determines a weight on a per element basis and calculates a sum of products of the error and the weight to calculate a weighted error, and a searching section that searches for a scale factor candidate that minimizes the weighted error in the codebook.
    Type: Application
    Filed: February 7, 2012
    Publication date: May 31, 2012
    Applicant: PANASONIC CORPORATION
    Inventors: Masahiro OSHIKIRI, Tomofumi YAMANASHI
  • Publication number: 20120095754
    Abstract: Provided are a method and an apparatus for encoding and decoding an audio signal. A method for encoding an audio signal includes receiving a transformed audio signal, dividing the transformed audio signal into a plurality of subbands, performing a first sinusoidal pulse coding operation on the subbands, determining a performance region of a second sinusoidal pulse coding operation among the subbands on the basis of coding information of the first sinusoidal pulse coding operation, and performing the second sinusoidal pulse coding operation on the determined performance region, wherein the first sinusoidal pulse coding operation is performed variably according to the coding information. Accordingly, it is possible to further improve the quality of a synthesized signal by considering the sinusoidal pulse coding of a lower layer when encoding or decoding an audio signal in an upper layer by a layered sinusoidal pulse coding scheme.
    Type: Application
    Filed: May 19, 2010
    Publication date: April 19, 2012
    Applicant: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventors: Mi-Suk Lee, Heesik Yang, Hyun-Woo Kim, Jongmo Sung, Hyun-Joo Bae, Byung-Sun Lee
  • Publication number: 20120065965
    Abstract: An apparatus and method for encoding and decoding a signal for high frequency bandwidth extension are provided. An encoding apparatus may down-sample a time domain input signal, may core-encode the down-sampled time domain input signal, may transform the core-encoded time domain input signal to a frequency domain input signal, and may perform bandwidth extension encoding using a basic signal of the frequency domain input signal.
    Type: Application
    Filed: September 12, 2011
    Publication date: March 15, 2012
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ki Hyun Choo, Eun Mi Oh, Ho Sang Sung
  • Patent number: 8108209
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.
    Type: Grant
    Filed: May 26, 2009
    Date of Patent: January 31, 2012
    Assignee: Coding Technologies Sweden AB
    Inventors: Kristofer Kjoerling, Lars Villemoes
  • Publication number: 20120016668
    Abstract: In accordance with an embodiment, A method of encoding an audio bitstream at an encoder includes encoding an original low band signal at the encoder by using a closed loop analysis-by-synthesis approach to obtain a coded low band signal, encoding an original high band signal at the encoder by using an open loop energy matching approach to obtain coded high band energy envelopes, comparing an energy of the coded low band signal with an energy of a corresponding original low band signal for a subframe, and generating an indication flag that indicates whether an energy envelope perceptual correction is needed for the subframe based on comparing the energy.
    Type: Application
    Filed: July 19, 2011
    Publication date: January 19, 2012
    Applicant: FutureWei Technologies, Inc.
    Inventor: Yang Gao
  • Publication number: 20120010880
    Abstract: An apparatus for generating a representation of a bandwidth-extended signal on the basis of an input signal representation includes a phase vocoder configured to obtain values of a spectral domain representation of a first patch of the bandwidth-extended signal on the basis of the input signal representation. The apparatus also includes a value copier configured to copy a set of values of the spectral domain representation of the first patch, which values are provided by the phase vocoder, to obtain a set of values of a spectral domain representation of a second patch, wherein the second patch is associated with higher frequencies than the first patch. The apparatus is configured to obtain the representation of the bandwidth-extended signal using the values of the spectral domain representation of the first patch and the values of the spectral domain representation of the second patch.
    Type: Application
    Filed: April 1, 2010
    Publication date: January 12, 2012
    Inventors: Frederik Nagel, Max Neuendorf, Nikolaus Rettelbach, Jeremie Lecomte, Markus Multrus, Bernhard Grill, Sascha Disch
  • Patent number: 8086446
    Abstract: A method and apparatus for transforming an audio signal, a method and apparatus for adaptively encoding an audio signal, a method and apparatus for inversely transforming an audio signal, and a method and apparatus for adaptively decoding an audio signal. The method of transforming an audio signal includes determining a transform unit into which the audio signal in a time domain is to be transformed into an audio signal in a frequency domain, and transforming the audio signal into an audio signal in the frequency domain according to the determined transform units using a window coefficient other than 0. Accordingly, it is possible to minimize distortion of the audio signal when encoding the audio signal even at a high bit rate while increasing efficiency of compression.
    Type: Grant
    Filed: December 7, 2005
    Date of Patent: December 27, 2011
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Eunmi Oh, Junghoe Kim, Boris Kudryashov, Konstantin Osipov
  • Publication number: 20110282656
    Abstract: Method and decoder for processing of audio signals. The method and decoder relate to deriving a processed vector {circumflex over (d)} by applying a post-filter directly on a vector d comprising quantized MDCT domain coefficients of a time segment of an audio signal. The post-filter is configured to have a transfer function H which is a compressed version of the envelope of the vector d. A signal waveform is reconstructed by performing an inverse MDCT transform on the processed vector {circumflex over (d)}.
    Type: Application
    Filed: May 10, 2011
    Publication date: November 17, 2011
    Inventors: Volodya GRANCHAROV, Sigurdur Sverrisson
  • Publication number: 20110218799
    Abstract: A method for decoding audio frames includes producing a first frame of coded audio samples, producing at least a portion of a second frame of coded audio samples, generating audio gap filler samples based on parameters representative of a weighted segment of the first frame of coded audio samples or a weighted segment of the portion of the second frame of coded audio samples, and forming a sequence including the audio gap filler samples and the portion of the second frame of coded audio samples.
    Type: Application
    Filed: September 9, 2010
    Publication date: September 8, 2011
    Applicant: MOTOROLA, INC.
    Inventors: Udar Mittal, Joanthan A. Gibbs, James P. Ashley
  • Patent number: 8015017
    Abstract: Audio coding and decoding apparatuses and methods which support fine granularity scalability (FGS) using harmonic information of a high-band audio signal or wideband error audio signal when performing wideband audio coding and decoding, and recording mediums on which the methods are stored. The audio coding method includes detecting harmonics of a high-band audio signal or wideband error audio signal of an input audio signal; determining an order of the detected harmonics; and coding the detected harmonics based on the determined order.
    Type: Grant
    Filed: January 24, 2006
    Date of Patent: September 6, 2011
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Hosang Sung, Rakesh Taori, Kangeun Lee
  • Publication number: 20110200205
    Abstract: A sound pickup apparatus includes: a microphone array including at least three microphones, wherein a first pair of microphones in which two of the at least three microphones are aligned on a first axis, and a second pair of microphones in which two of the at least three microphones are aligned on a second axis; a first null signal generator which outputs a first null signal based on a differential output of the first pair of microphones; a second null signal generator which outputs a second null signal based on a differential output of the second pair of microphones; and a combiner which generates a target signal based on the first null signal and the second null signal, the target signal having a directional characteristic in which the lowest sensitivity is formed in a direction to a line along which the first null surface meets the second null surface.
    Type: Application
    Filed: February 17, 2010
    Publication date: August 18, 2011
    Applicant: PANASONIC CORPORATION
    Inventor: Toshimichi Tokuda
  • Publication number: 20110202352
    Abstract: An apparatus for generating bandwidth extension output data for an audio signal has a noise floor measurer, a signal energy characterizer and a processor. The audio signal has components in a first frequency band and components in a second frequency band, the bandwidth extension output data are adapted to control a synthesis of the components in the second frequency band. The noise floor measurer measures noise floor data of the second frequency band for a time portion of the audio signal. The signal energy characterizer derives energy distribution data, the energy distribution data characterizing an energy distribution in a spectrum of the time portion of the audio signal. The processor combines the noise floor data and the energy distribution data to obtain the bandwidth extension output data.
    Type: Application
    Filed: January 11, 2011
    Publication date: August 18, 2011
    Inventors: Max Neuendorf, Bernhard Grill, Ulrich Kraemer, Markus Multrus, Harald Popp, Nikolaus Rettelbach, Frederik Nagel, Markus Lohwasser, Marc Gayer, Manuel Jander, Virgilio Bacigalupo
  • Publication number: 20110184733
    Abstract: Methods, and corresponding codec-containing devices are provided that have source coding schemes for encoding a component of an excitation. In some cases, the source coding scheme is an enumerative source coding scheme, while in other cases the source coding scheme is an arithmetic source coding scheme. In some cases, the source coding schemes are applied to encode a fixed codebook component of the excitation for a codec employing codebook excited linear prediction, for example an AMR-WB (Adaptive Multi-Rate-Wideband) speech codec.
    Type: Application
    Filed: January 22, 2010
    Publication date: July 28, 2011
    Applicant: RESEARCH IN MOTION LIMITED
    Inventors: Xiang YU, Dake HE, En-hui YANG
  • Publication number: 20110178795
    Abstract: An audio encoder has a window function controller, a windower, a time warper with a final quality check functionality, a time/frequency converter, a TNS stage or a quantizer encoder, the window function controller, the time warper, the TNS stage or an additional noise filling analyzer are controlled by signal analysis results obtained by a time warp analyzer or a signal classifier. Furthermore, a decoder applies a noise filling operation using a manipulated noise filling estimate depending on a harmonic or speech characteristic of the audio signal.
    Type: Application
    Filed: January 11, 2011
    Publication date: July 21, 2011
    Inventors: Stefan Bayer, Sascha Disch, Ralf Geiger, Guillaume Fuchs, Max Neuendorf, Gerald Schuller, Bernd Edler
  • Patent number: 7974847
    Abstract: A parameter calculator calculates lower resolution parametric information and interpolation information. On a decoder-side, an upmixer is used for generating the output channels. The upmixer uses high resolution parametric information generated by a parameter interpolator using the low resolution parametric information and decoder-side derived interpolation information or encoder-generated interpolation information for selecting one of a plurality of different interpolation characteristics.
    Type: Grant
    Filed: November 22, 2005
    Date of Patent: July 5, 2011
    Assignee: Coding Technologies AB
    Inventors: Kristofer Kjoerling, Heiko Purnhagen, Jonas Engdegard, Jonas Roeden
  • Publication number: 20110137643
    Abstract: Disclosed is a spectral smoothing device with a structure whereby smoothing is performed after a nonlinear conversion has been performed for a spectrum calculated from an audio signal, and with which the amount of processing calculation is significantly reduced while maintaining excellent audio quality. With this spectral smoothing device, a sub band division unit (102) divides an input spectrum into multiple sub bands; a representative value calculation unit (103) calculates a representative value for each sub band using an arithmetic mean and a geometric mean; with respect to each representative value, a nonlinear conversion unit (104) performs a nonlinear conversion the characteristic of which is further emphasized as the value increases; and a smoothing unit (105) that smoothes the representative value which has undergone the nonlinear conversion for each sub band, at the frequency domain.
    Type: Application
    Filed: August 7, 2009
    Publication date: June 9, 2011
    Inventors: Tomofumi Yamanashi, Masahiro Oshikiri, Toshiyuki Morii, Hiroyuki Ehara
  • Publication number: 20110119054
    Abstract: Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal, may include: a module selection unit to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit to encode the input signal according to a selection of the module selection unit and to generate a speech bitstream; an audio encoding unit to encode the input signal according to the selection of the module selection unit and to generate an audio bitstream; and a bitstream generation unit to generate an output bitstream from the speech encoding unit or the audio encoding unit according to the selection of the module selection unit.
    Type: Application
    Filed: July 14, 2009
    Publication date: May 19, 2011
    Inventors: Tae Jin Lee, Seung Kwon Beack, Minje Kim, Dae Young Jang, Kyeongok Kang, Jin Woo Hong, Hochong Park, Young-Cheol Park
  • Publication number: 20110106532
    Abstract: Provided is a method and apparatus for encoding and decoding an enhancement layer to reduce quantization error in a G.711 codec. Exponent indices of additional mantissa information of each sample are calculated based upon exponent information of each sample in a frame. A process of allocating 1 bit to each sample with a current exponent index is repeated, the exponent index starting from the maximum value while decreasing by 1 at every repetition until the total number of bits allocated to the samples is equal to the total number of available bits in the frame. And the most significant bits, as many as the number of bits allocated to each sample, are extracted from the additional mantissa information of each sample in the frame.
    Type: Application
    Filed: August 18, 2008
    Publication date: May 5, 2011
    Inventors: Jongmo Sung, Do-Young Kim
  • Publication number: 20110035212
    Abstract: In a method of perceptual transform coding of audio signals in a telecommunication system, performing the steps of determining transform coefficients representative of a time to frequency transformation of a time segmented input audio signal; determining a spectrum of perceptual sub-bands for said input audio signal based on said determined transform coefficients; determining masking thresholds for each said sub-band based on said determined spectrum; computing scale factors for each said sub-band based on said determined masking thresholds, and finally adapting said computed scale factors for each said sub-band to prevent energy loss for perceptually relevant sub-bands.
    Type: Application
    Filed: August 26, 2008
    Publication date: February 10, 2011
    Applicant: Telefonaktiebolaget L M Ericsson (publ)
    Inventors: Manuel Briand, Anisse Taleb
  • Patent number: 7860709
    Abstract: The invention relates to a method for supporting an encoding of an audio signal, wherein at least one section of the audio signal is to be encoded with a coding model that allows the use of different coding frame lengths. In order to enable a simple selection of the respectively best suited coding frame length, it is proposed that at least one control parameter is determined based on signal characteristics of the audio signal. The control parameter is then used for limiting the options of possible coding frame lengths for the at least one section. The invention relates equally to a module 10,11 in which this method is implemented, to a device 1 and a system comprising such a module 10,11, and to a software program product including a software code for realizing the proposed method.
    Type: Grant
    Filed: May 13, 2005
    Date of Patent: December 28, 2010
    Assignee: Nokia Corporation
    Inventor: Jari Mäkinen
  • Publication number: 20100262421
    Abstract: Provided is an encoding device which improves the sound quality of a stereo signal while maintaining a low bit rate. The encoding device includes: an LP inverse filter (121) which LP-inverse-filterS a left signal L(n) by using an inverse quantization linear prediction coefficient AdM(z) of a monaural signal; a T/F conversion unit (122) which converts the left sound source signal Le(n) from a temporal region to a frequency region; an inverse quantizer (123) which inverse-quantizes encoded information Mqe; spectrum division units (124, 125) which divide a high-frequency component of the sound source signal Mde(f) and the left signal Le(f) into a plurality of bands; and scale factor calculation units (126, 127) which calculate scale factors ai and ssi by using a monaural sound source signal Mdeh,i(f), a left sound source signal Leh,i(f), Mdeh,i(f), and right sound source signal Reh,i(f) of each divided band.
    Type: Application
    Filed: November 4, 2008
    Publication date: October 14, 2010
    Applicant: PANASONIC CORPORATION
    Inventors: Kok Seng Chong, Koji Yoshida, Masahiro Oshikiri
  • Publication number: 20100250244
    Abstract: There is provided an encoder capable of improving inter-channel prediction (ICP) performance in scalable stereo sound encoding using an ICP. In the encoder, ICP analysis units (113, 114, 115) use, as reference signal candidates, a frequency coefficient (sL?(f)) in the low-band portion of a side residual signal, a frequency coefficient (mM,i(f)) in each sub-band portion of a monaural residual signal, and a frequency coefficient (mL(f)) in the low-band portion of the monaural residual signal, respectively, and perform an ICP analysis between the respective these candidates and a frequency coefficient (sM,i(f)) in each sub-band portion of the side residual signal to generate first, second, and third ICP coefficients.
    Type: Application
    Filed: October 31, 2008
    Publication date: September 30, 2010
    Applicant: PANASONIC CORPORATION
    Inventors: Haishan Zhong, Zongxian Liu, Kok Seng Chong, Koji Yoshida
  • Publication number: 20100185440
    Abstract: The embodiments of a transcoding method, a transcoding device, and a communication apparatus are provided. The embodiment of a method includes: receiving a bit stream input from a sending end; determining an attribute of discontinuous transmission (DTX) used by a receiving end and a frame type of the input bit stream; and transcoding the input bit stream in a corresponding processing manner according to a determination result. Thereby, a corresponding transcoding operation is performed on the input bit stream according to the attribute of DTX used by the receiving end and the frame type of the input bit stream. In such a manner, input bit streams of various types can be processed, and the input bit streams can be correspondingly transcoded according to the requirements of the receiving end. Therefore, the average computational complexity and peak computational complexity can be effectively decreased without decreasing the quality of the synthesized speech.
    Type: Application
    Filed: January 21, 2010
    Publication date: July 22, 2010
    Inventors: Changchun Bao, Hao Xu, Fanrong Tang, Xiangyu Hu
  • Publication number: 20100161320
    Abstract: An apparatus and method for adaptive sub-band allocation of spectral coefficients are disclosed. The sizes of sub-bands are determined according to the distribution of spectral coefficients transformed from an input speech/audio signal to perform more elaborate quantization in units of sub-bands. Thus, quantization noise of the spectral coefficients is reduced, and sound quality in a frequency region is enhanced, thereby improving the quality of the signal.
    Type: Application
    Filed: September 9, 2009
    Publication date: June 24, 2010
    Inventors: Hyun Woo KIM, Hyun Joo BAE, Byung Sun LEE
  • Publication number: 20100161321
    Abstract: A coding apparatus reduces a circuit scale and the amount of coding processing calculation. A frequency domain conversion section performs a frequency analysis of the signal sampled at a sampling rate Fx with an analysis length of 2·Na and calculates first spectrum S1(k)(0?k<Na). A band extension section extends the effective frequency band of first spectrum S1(k) to 0?k<Nb so that a new spectrum can be assigned to the extended area following to the frequency k=Na of first spectrum S1(k). An extended spectrum assignment section assigns extended spectrum S1?(k)(Na?k<Nb) input to the extended frequency band from the outside. A spectral information specification section outputs information necessary to specify extended spectrum S1?(k) out of the spectrum given from the extended spectrum assignment section as a code.
    Type: Application
    Filed: February 18, 2010
    Publication date: June 24, 2010
    Applicant: PANASONIC CORPORATION
    Inventor: Masahiro OSHIKIRI
  • Publication number: 20100145682
    Abstract: The present invention applies spectral flatness characteristic values to simplify psychoacoustic analysis of a sound signal. If the sound signal comprises a plurality of frames, the present invention calculates the energy of the sound signal in a frequency domain, calculates a plurality of spectral flatness, and decides to use a short-block or a long-block Modified Discrete Cosine Transform accordingly. If the sound signal comprises left and right channel signals, the present invention performs psychoacoustic analysis on the sound signal to count energy of the left and right channel signals in a frequency domain, counts spectral flatness of the left and right channel signals, and decides to use middle/side transform or left and right channel encoding to transform the left and right channel signals accordingly.
    Type: Application
    Filed: March 27, 2009
    Publication date: June 10, 2010
    Inventor: Yi-Lun Ho
  • Publication number: 20100121646
    Abstract: The invention relates to the coding/decoding of a signal into several sub-bands, in which at least a first and a second sub-bands which are adjacent are transform coded (601, 602). In particular, in order to apply a perceptual weighting, in the transformed domain, to at least the second sub-band, the method comprises:—determining at least one frequency masking threshold (606) to be applied on the second sub-band; and normalizing said masking threshold in order to provide a spectral continuity between the above-mentioned first and second sub-bands. An advantageous application of the invention involves a perceptual weighting of the high-frequency band in the TDAC transform coding of a hierarchical encoder according to standard G.729.1.
    Type: Application
    Filed: January 30, 2008
    Publication date: May 13, 2010
    Applicant: France Telecom
    Inventors: Stéphane Ragot, Cyril Gukllaume
  • Publication number: 20100063804
    Abstract: Provided is an adaptive sound source vector quantization device which can always perform a pitch cycle search with a resolution appropriate for any section of the pitch cycle search range of a second sub-frame when a pitch cycle search range of the second sub-frame changes in accordance with a pitch cycle of a first sub-frame. The device includes a first pitch cycle instruction unit (111), a search range calculation unit (112), and a second pitch cycle instruction unit (113). The first pitch cycle instruction unit (111) successively instructs pitch cycle search candidates in a predetermined search range having a search resolution which transits over a predetermined pitch cycle candidate for the first sub-frame. The search range calculation unit (112) calculates a predetermined range before and after the pitch cycle of the first sub-frame as the pitch cycle search range for the second sub-frame, if the predetermined range includes the predetermined pitch cycle search candidate.
    Type: Application
    Filed: February 29, 2008
    Publication date: March 11, 2010
    Applicant: PANASONIC CORPORATION
    Inventors: Kaoru Sato, Toshiyuki Morii
  • Publication number: 20100023321
    Abstract: Character extraction section extracts character amounts, pertaining to a prosody of voice, from a voice signal sequentially in a time-serial manner. Difference value calculation calculates a difference value between each of the extracted character amounts and a reference value. Processing values, corresponding to the individual character amounts, are generated in accordance with the respective difference values, and a voice processing section controls the individual character amounts of the voice signal in accordance with the processing values corresponding to the character amounts and thereby generates an output signal having a prosody changed from the prosody of the voice signal.
    Type: Application
    Filed: July 22, 2009
    Publication date: January 28, 2010
    Applicant: Yamaha Corporation
    Inventor: Yasuo Yoshioka
  • Publication number: 20090319263
    Abstract: Systems, methods, and apparatus for low-bit-rate coding of transitional speech frames are disclosed.
    Type: Application
    Filed: October 30, 2008
    Publication date: December 24, 2009
    Applicant: QUALCOMM Incorporated
    Inventors: Alok Kumar Gupta, Sharath Manjunath