Transformation Patents (Class 704/203)

Orthogonal functions (Class 704/204)

Audio equipment including means for de-noising a speech signal by fractional delay filtering, in particular for a “hands-free” telephony system

Patent number: 8682658

Abstract: The equipment comprises two microphones, sampling means, and de-noising means. The de-noising means are non-frequency noise reduction means comprising a combiner having an adaptive filter performing an iterative search seeking to cancel the noise picked up by one of the microphones on the basis of a noise reference given by the other microphone sensor. The adaptive filter is a fractional delay filter modeling a delay that is shorter than the sampling period. The equipment also has voice activity detector means delivering a signal representative of the presence or the absence of speech from the user of the equipment. The adaptive filter receives this signal as input so as to enable it to act selectively: i) either to perform an adaptive search for the parameters of the filter in the absence of speech; ii) or else to “freeze” those parameters of the filter in the presence of speech.

Type: Grant

Filed: May 18, 2012

Date of Patent: March 25, 2014

Assignee: Parrot

Inventors: Guillaume Vitte, Michael Herve
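The voice-activity gating described in the abstract above can be illustrated with a minimal sketch. It assumes a generic normalized LMS (NLMS) FIR canceller standing in for the patent's fractional delay filter; the function name `vad_gated_nlms` and the parameters `taps`, `mu`, and `eps` are hypothetical and do not come from the patent.

```python
def vad_gated_nlms(primary, reference, speech_active, taps=4, mu=0.5, eps=1e-8):
    """Subtract a filtered copy of `reference` from `primary`.

    The filter weights adapt only while `speech_active[n]` is False and
    stay frozen while it is True. Generic NLMS, used here as a stand-in
    for the patent's fractional delay filter (assumption).
    """
    w = [0.0] * taps
    out = []
    for n in range(len(primary)):
        # Most recent `taps` reference samples, zero-padded at the start.
        x = [reference[n - k] if n - k >= 0 else 0.0 for k in range(taps)]
        noise_estimate = sum(wk * xk for wk, xk in zip(w, x))
        e = primary[n] - noise_estimate  # de-noised output sample
        out.append(e)
        if not speech_active[n]:
            # Adaptive search in the absence of speech.
            power = sum(xk * xk for xk in x) + eps
            w = [wk + mu * e * xk / power for wk, xk in zip(w, x)]
        # Otherwise the weights are left untouched ("frozen").
    return out, w
```

If the primary noise is the reference delayed by half a sample (approximated by averaging two neighbouring samples), a short filter like this one should settle near the weights `[0.5, 0.5, 0, 0]`, which is one way a few taps can model a delay shorter than the sampling period.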
Method for time scaling of a sequence of input signal values

Patent number: 8676584

Abstract: The invention relates to a digital signal processing technique that changes the length of an audio signal and, thus, effectively its play-out speed. This is used for frame rate conversion or sound effects in music production. Time scaling may further be used for fast forward or slow-motion audio play-out. According to said method, the waveform similarity overlap add approach is modified such that a maximized similarity is determined among similarity measures of sub-sequence pairs, each comprising a sub-sequence to-be-matched from an input window and a matching sub-sequence from a search window, wherein said sub-sequence pairs comprise at least two sub-sequence pairs of which a first pair comprises a first sub-sequence to-be-matched and a second pair comprises a different second sub-sequence to-be-matched. The input window allows for finding sub-sequence pairs with higher similarity than with a WSOLA approach based on a single sub-sequence to-be-matched. This results in less perceivable artefacts.

Type: Grant

Filed: June 22, 2009

Date of Patent: March 18, 2014

Assignee: Thomson Licensing

Inventor: Markus Schlosser
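The similarity search over several sub-sequences to-be-matched, as described in the abstract above, might be sketched as follows. Normalized cross-correlation is assumed as the similarity measure, and the names `similarity` and `best_pair` are hypothetical; the patent may use a different measure and search layout.

```python
import math


def similarity(segment, template):
    """Normalized cross-correlation; 0.0 if either vector has no energy."""
    dot = sum(a * b for a, b in zip(segment, template))
    norm = math.sqrt(sum(a * a for a in segment) * sum(b * b for b in template))
    return dot / norm if norm > 0 else 0.0


def best_pair(signal, templates, search_start, search_len):
    """Return (template_index, offset) of the most similar pair.

    `templates` are the candidate sub-sequences to-be-matched (taken from
    the input window); each is compared against every position in the
    search window. Plain WSOLA corresponds to a single template.
    """
    best = (None, None)
    best_score = float("-inf")
    for t_index, template in enumerate(templates):
        for offset in range(search_len):
            start = search_start + offset
            segment = signal[start:start + len(template)]
            if len(segment) < len(template):
                break  # ran past the end of the signal
            score = similarity(segment, template)
            if score > best_score:
                best, best_score = (t_index, offset), score
    return best
```

Because the maximum is taken over all templates, the result can only be as good as or better than the single-template search, which is the intuition behind the claim of fewer perceivable artefacts.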
Pre-echo attenuation in a digital audio signal

Patent number: 8676365

Abstract: A method is provided for attenuating pre-echoes in a digital audio signal generated from a transform encoding, comprising, upon decoding and for a current frame of said digital audio signal: defining a concatenated signal from at least the reconstructed signal of the current frame, dividing said concatenated signal into subunits of samples having a predetermined length, calculating the time envelope of the concatenated signal, detecting the transition of the time envelope towards a high-energy area, determining the low-energy sub-units preceding a subunit in which a transition has been detected, and an attenuation step in said determined subunits. The attenuation is carried out according to an attenuation factor calculated for each of the determined subunits, based on the time envelope of the concatenated signal. The invention also relates to a device for implementing said method, and to a decoder including such a device.

Type: Grant

Filed: September 15, 2009

Date of Patent: March 18, 2014

Assignee: Orange

Inventors: Balazs Kovesi, Stéphane Ragot
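The steps listed in the abstract above (sub-unit split, time envelope, transition detection, attenuation of the preceding low-energy sub-units) can be sketched as below. The detection threshold `ratio` and the gain rule, which pulls each earlier sub-unit down to the lowest energy seen before the transition, are illustrative assumptions rather than the patented attenuation factor.

```python
def attenuate_pre_echo(samples, subunit_len, ratio=10.0):
    """Return (processed_samples, transition_index or None).

    The time envelope is taken as the energy of each sub-unit. A
    transition is flagged where a sub-unit's energy exceeds `ratio`
    times that of its predecessor (hypothetical rule).
    """
    n_sub = len(samples) // subunit_len
    bounds = [(i * subunit_len, (i + 1) * subunit_len) for i in range(n_sub)]
    energy = [sum(s * s for s in samples[a:b]) for a, b in bounds]
    out = list(samples)
    for i in range(1, n_sub):
        if energy[i] > ratio * max(energy[i - 1], 1e-12):
            floor = min(energy[:i])  # quietest sub-unit before the attack
            for j in range(i):
                if energy[j] > 0:
                    # Per-sub-unit attenuation factor derived from the envelope.
                    gain = min(1.0, (floor / energy[j]) ** 0.5)
                    for k in range(*bounds[j]):
                        out[k] = samples[k] * gain
            return out, i
    return out, None
```

Sub-units from the transition onward are left untouched, so the attack itself keeps its original level.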
AUTOMATIC CONVERSION OF SPEECH INTO SONG, RAP OR OTHER AUDIBLE EXPRESSION HAVING TARGET METER OR RHYTHM

Publication number: 20140074459

Abstract: Captured vocals may be automatically transformed using advanced digital signal processing techniques that provide captivating applications, and even purpose-built devices, in which mere novice user-musicians may generate, audibly render and share musical performances. In some cases, the automated transformations allow spoken vocals to be segmented, arranged, temporally aligned with a target rhythm, meter or accompanying backing tracks and pitch corrected in accord with a score or note sequence. Speech-to-song music applications are one such example. In some cases, spoken vocals may be transformed in accord with musical genres such as rap using automated segmentation and temporal alignment techniques, often without pitch correction. Such applications, which may employ different signal processing and different automated transformations, may nonetheless be understood as speech-to-rap variations on the theme.

Type: Application

Filed: March 29, 2013

Publication date: March 13, 2014

Applicant: Smule, Inc.

Inventor: Smule, Inc.
Tone determination device and method

Patent number: 8670980

Abstract: A tone determination device, which determines the tonality of an input signal, is capable of reducing calculation complexity. Therein a frequency conversion unit (101) converts the frequency of an input signal; a downsampling unit (102) carries out shortening processing which shortens the vector series length of the frequency-converted signal; a constancy determination unit (107) determines the constancy of the input signal; depending on the constancy of the input signal, a vector selection unit (104) selects either the vector series of the post-frequency conversion signal or the vector series after the shortening of the vector series length; a correlation analysis unit (105) uses the vector series selected by the vector selection unit (104) to obtain correlations; and a tone determination unit (106) uses the correlations to determine the tonality of the input signal.

Type: Grant

Filed: October 26, 2010

Date of Patent: March 11, 2014

Assignee: Panasonic Corporation

Inventor: Kaoru Satoh
System and method for ranking a posting

Patent number: 8670968

Abstract: A method for training a ranking application. The method includes ranking the help postings to create an initial ranking using initial parameter values, and storing user interactions with the help postings to obtain stored interactions. Simulations are performed using the stored interactions to generate revised parameter values for the ranking application. Performing the simulations includes calculating relevance values from the stored interactions, creating a test posting, assigning, to the test posting, an initial score and a relevance value randomly selected from the relevance values to generate a test ranking, and simulating user interactions with the test ranking to generate simulated rankings. The simulated rankings are analyzed to obtain revised parameter values. The method further includes ranking, using the revised parameter values, the help postings to generate a revised ranking, and displaying the help postings in the forum according to the revised ranking.

Type: Grant

Filed: August 31, 2012

Date of Patent: March 11, 2014

Assignee: Intuit Inc.

Inventors: Igor A. Podgorny, Floyd J. Morgan, Derek Szydlowski
Audio signal encoding method, audio signal decoding method, encoding device, decoding device, audio signal processing system, audio signal encoding program, and audio signal decoding program

Patent number: 8666754

Abstract: When a frame immediately preceding an encoding target frame to be encoded by a first encoding unit operating under a linear predictive coding scheme is encoded by a second encoding unit operating under a coding scheme different from the linear predictive coding scheme, the encoding target frame can be encoded under the linear predictive coding scheme by initializing the internal state of the first encoding unit. Therefore, encoding processing performed under a plurality of coding schemes including the linear predictive coding scheme and a coding scheme different from the linear predictive coding scheme can be realized.

Type: Grant

Filed: March 5, 2013

Date of Patent: March 4, 2014

Assignee: NTT DoCoMo, Inc.

Inventors: Kosuke Tsujino, Kei Kikuiri, Nobuhiko Naka
Apparatus and method of encoding and decoding signals

Patent number: 8645126

Abstract: A method of encoding an audio signal, where signals including two or more channel signals are downmixed to a mono signal, the mono signal is divided into a low-frequency signal and a high-frequency signal, the low-frequency signal is encoded through algebraic code excited linear prediction (ACELP) or transform coded excitation (TCX), and the high-frequency signal is encoded using the low-frequency signal. In a method of decoding an audio signal, a low-frequency signal encoded through ACELP or TCX is decoded, a high-frequency signal is decoded using the low-frequency signal, the low-frequency signal and the high-frequency signal are combined to generate a mono signal, and the mono signal is upmixed by decoding spatial parameters regarding signals including two or more channel signals.

Type: Grant

Filed: March 26, 2013

Date of Patent: February 4, 2014

Assignee: Samsung Electronics Co., Ltd

Inventors: Ho-sang Sung, Eun-mi Oh, Jung-hoe Kim, Ki-hyun Choo, Mi-young Kim
SPEECH ENHANCEMENT TO IMPROVE SPEECH INTELLIGIBILITY AND AUTOMATIC SPEECH RECOGNITION

Publication number: 20140025374

Abstract: The present invention provides a system and method to enhance speech intelligibility and improve the detection rate of an automatic speech recognizer in noisy environments. The present invention reduces an acoustically coupled loudspeaker signal from a plurality of microphone signals to enhance a near end user speech signal. A decision unit checks a system configuration parameter to determine if the cleaned speech is intended for human communication and/or Automatic Speech Recognition (ASR). A formant emphasis filter and a spectrum band reconstruction unit are used to further enhance the speech quality and improve the ASR recognition rate. The present invention can also apply to devices which have a foreground microphone(s) and a background microphone(s).

Type: Application

Filed: July 21, 2013

Publication date: January 23, 2014

Inventor: Xia Lou
Method and apparatus for encoding and decoding audio/speech signal

Patent number: 8630863

Abstract: Provided is a method of encoding an audio/speech signal, the method including determining a variable length of a frame, that is, a processing unit of an input signal in accordance with a position of an attack in the input signal; transforming each frame of the input signal to a frequency domain and dividing the frame into a plurality of sub frequency bands; and, if a signal of a sub frequency band is determined to be encoded in the frequency domain, encoding the signal of the sub frequency band in the frequency domain, and if the signal of the sub frequency band is determined to be encoded in a time domain, inverse transforming the signal of the sub frequency band to the time domain and encoding the inverse transformed signal in the time domain. According to the present invention, the audio/speech signal may be efficiently encoded by controlling time resolution and frequency resolution.

Type: Grant

Filed: October 15, 2007

Date of Patent: January 14, 2014

Assignee: SAMSUNG Electronics Co., Ltd.

Inventors: Chang-yong Son, Eun-mi Oh, Jung-hoe Kim, Ho-sang Sung, Kang-eun Lee, Ki-hyun Choo
Method and apparatus for transforming between different filter bank domains

Patent number: 8620671

Abstract: Filter banks may have different structures and different individual output signal domains. Often a translation between different filter bank domains is desirable. Usually, mapping matrices are used that, however, vary over frequency. This requires a significant amount of lookup tables. A method for transforming first data frames of a first filter bank domain to second data frames of a different second filter bank domain, comprises steps of transcoding sub-bands of the first filter bank domain into sub-bands of an intermediate domain that corresponds to said second filter bank domain but has warped phase, and transcoding the sub-bands of the intermediate domain to sub-bands of the second filter bank domain, wherein a phase correction is performed on the sub-bands of the intermediate domain.

Type: Grant

Filed: February 19, 2009

Date of Patent: December 31, 2013

Assignee: Thomson Licensing

Inventors: Peter Jax, Sven Kordon
Method and apparatus to extract important spectral component from audio signal and low bit-rate audio signal coding and/or decoding method and apparatus using the same

Patent number: 8615391

Abstract: A method and apparatus to extract an audio signal having an important spectral component (ISC) and a low bit-rate audio signal coding/decoding method using the method and apparatus to extract the ISC. The method of extracting the ISC includes calculating perceptual importance including an SMR (signal-to-mask ratio) value of transformed spectral audio signals by using a psychoacoustic model, selecting spectral signals having a masking threshold value smaller than that of the spectral audio signals using the SMR value as first ISCs, and extracting a spectral peak from the audio signals selected as the ISCs according to a predetermined weighting factor to select second ISCs. Accordingly, the perceptually important spectral components can be efficiently coded so as to obtain high sound quality at a low bit-rate.

Type: Grant

Filed: July 6, 2006

Date of Patent: December 24, 2013

Assignee: SAMSUNG Electronics Co., Ltd.

Inventors: Junghoe Kim, Eunmi Oh, Konstantin Osipov, Boris Kudryashov
SPEECH DECODER, SPEECH ENCODER, SPEECH DECODING METHOD, SPEECH ENCODING METHOD, STORAGE MEDIUM FOR STORING SPEECH DECODING PROGRAM, AND STORAGE MEDIUM FOR STORING SPEECH ENCODING PROGRAM

Publication number: 20130339010

Abstract: A speech decoder includes a demultiplexing unit, a low frequency band decoding unit, a band splitting filter bank unit, a coded sequence analysis unit, a coded sequence decoding/dequantization unit, a high frequency band generation unit, low frequency band time envelope calculation units that acquire a plurality of low frequency band time envelopes, a time envelope calculation unit that calculates high frequency band time envelopes using time envelope information and the plurality of low frequency band time envelopes, a time envelope adjustment unit that adjusts the time envelope of high frequency band components using the time envelopes obtained by the time envelope calculation unit, and a band synthesis filter bank unit.

Type: Application

Filed: August 16, 2013

Publication date: December 19, 2013

Applicant: NTT DOCOMO, Inc

Inventors: Kei KIKUIRI, Atsushi YAMAGUCHI
Quantization after linear transformation combining the audio signals of a sound scene, and related coder

Patent number: 8612220

Abstract: The invention relates to a method for quantifying components, wherein certain components are each determined based on a plurality of audio signals and can be calculated by the application of a linear conversion on the audio signals, said method comprising: determining a quantification function to be applied to the components by testing a condition relative to an audio signal and depending on a comparison made between a psycho-acoustic masking threshold relative to the audio signal and a value determined based on the reverse linear conversion and quantification errors of the components by the function.

Type: Grant

Filed: July 1, 2008

Date of Patent: December 17, 2013

Assignee: France Telecom

Inventors: Adil Mouhssine, Abdellatif Benjelloun Touimi, Pierre Duhamel
APPARATUS AND METHOD FOR ENCODING AND DECODING AN AUDIO SIGNAL USING AN ALIGNED LOOK-AHEAD PORTION

Publication number: 20130332148

Abstract: An apparatus for encoding an audio signal having a stream of audio samples has: a windower for applying a prediction coding analysis window to the stream of audio samples to obtain windowed data for a prediction analysis and for applying a transform coding analysis window to the stream of audio samples to obtain windowed data for a transform analysis, wherein the transform coding analysis window is associated with audio samples within a current frame of audio samples and with audio samples of a predefined portion of a future frame of audio samples being a transform-coding look-ahead portion, wherein the prediction coding analysis window is associated with at least the portion of the audio samples of the current frame and with audio samples of a predefined portion of the future frame being a prediction coding look-ahead portion, wherein the transform coding look-ahead portion and the prediction coding look-ahead portion are identically to each other or are different from each other by less than 20%; and an enc

Type: Application

Filed: August 14, 2013

Publication date: December 12, 2013

Inventors: Emmanuel RAVELLI, Ralf GEIGER, Markus SCHNELL, Guillaume FUCHS, Vesa RUOPPILA, Tom BAECKSTROEM, Bernhard GRILL, Christian HELMRICH
Method for reduction of aliasing introduced by spectral envelope adjustment in real-valued filterbanks

Patent number: 8606587

Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.

Type: Grant

Filed: July 18, 2012

Date of Patent: December 10, 2013

Assignee: Dolby International AB

Inventors: Kristofer Kjorling, Lars Villemoes
Signal classification method and device, and encoding and decoding methods and devices

Patent number: 8600765

Abstract: Embodiments of the present invention provide a signal classification method and device, and encoding and decoding methods and devices. The encoding method includes: dividing a current frame into a low-frequency band signal and a high-frequency band signal; attenuating the high-frequency band signal or a to-be-encoded characteristic parameter of the high-frequency band signal according to an energy attenuation value of the low-frequency band signal, where the energy attenuation value indicates energy attenuation of the low-frequency band signal caused by encoding of the low-frequency band signal; and encoding the attenuated high-frequency band signal or the attenuated to-be-encoded characteristic parameter of the high-frequency band signal. The technical solutions according to the embodiments of the present invention can improve the effect of combining the low-frequency band signal and the high-frequency band signal at the decoder.

Type: Grant

Filed: December 27, 2012

Date of Patent: December 3, 2013

Assignee: Huawei Technologies Co., Ltd.

Inventors: Zexin Liu, Lei Miao, Anisse Taleb
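The encoder-side attenuation in the abstract above can be sketched as follows. The sketch assumes the "energy attenuation value" is the ratio of decoded to original low-band energy and that the high band is scaled by the square root of that ratio; both the definition and the function names are hypothetical.

```python
def energy(x):
    """Sum of squared samples."""
    return sum(v * v for v in x)


def attenuate_high_band(low_original, low_decoded, high):
    """Scale `high` by the energy loss the low band suffered in coding.

    Returns (attenuated_high_band, gain). The gain is capped at 1.0 so
    the high band is never amplified (assumption).
    """
    e_orig = energy(low_original)
    if e_orig == 0:
        return list(high), 1.0
    ratio = min(1.0, energy(low_decoded) / e_orig)
    gain = ratio ** 0.5  # energy ratio -> amplitude gain
    return [gain * v for v in high], gain
```

Matching the high band to the level the low band actually reaches after coding is what lets the two bands combine smoothly at the decoder.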
Efficient Encoding/Decoding of Audio Signals

Publication number: 20130317811

Abstract: A method for encoding of an audio signal comprises performing (214) of a transform of the audio signal. An energy offset is selected (216) for each of the first subbands. An energy measure of a first reference band within a low band of an encoding of a synthesis signal is obtained (212). The first high band is encoded (220) by providing quantization indices representing a respective scalar quantization of a spectrum envelope in the first subbands of the first high band relative to the energy measure of the first reference band by use of the selected energy offset. An encoder apparatus comprises means for carrying out the steps of the method. Corresponding decoder methods and apparatuses are also described.

Type: Application

Filed: February 9, 2011

Publication date: November 28, 2013

Applicant: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)

Inventors: Volodya Grancharov, Erik Norvell, Sigurdur Sverrisson
METHOD AND DEVICE FOR BANDWIDTH EXTENSION

Publication number: 20130317812

Abstract: Method and device of extending a signal band of a voice or audio signal are provided. The bandwidth extension method includes the steps of: performing a modified discrete cosine transform (MDCT) process on an input signal to generate a first transform signal; generating a second transform signal and a third transform signal on the basis of the first transform signal; generating normalized components and energy components of the first transform signal, the second transform signal, and the third transform signal therefrom; generating an extended normalized component from the normalized components and generating an extended energy component from the energy components; generating an extended transform signal on the basis of the extended normalized component and the extended energy component; and performing an inverse MDCT (IMDCT) process on the extended transform signal.

Type: Application

Filed: February 8, 2012

Publication date: November 28, 2013

Applicant: LG Electronics Inc.

Inventors: Gyu Hyeok Jeong, Young Han Lee, Hye Jeong Jeon, Hong Kook Kim, In Gyu Kang, Lag Young Kim
Speech content based packet loss concealment

Patent number: 8589166

Abstract: Systems and methods are described for performing packet loss concealment (PLC) to mitigate the effect of one or more lost frames within a series of frames that represent a speech signal. In accordance with the exemplary systems and methods, PLC is performed by searching a codebook of speech-related parameter profiles to identify content that is being spoken and by selecting a profile associated with the identified content for use in predicting or estimating speech-related parameter information associated with one or more lost frames of a speech signal. The predicted/estimated speech-related parameter information is then used to synthesize one or more frames to replace the lost frame(s) of the speech signal.

Type: Grant

Filed: September 21, 2010

Date of Patent: November 19, 2013

Assignee: Broadcom Corporation

Inventor: Robert W. Zopf
Spatial synthesis of multichannel audio signals

Patent number: 8583424

Abstract: A method and associated device are provided for spatial synthesis of a sum signal to obtain at least two output signals, the sum signal as well as the spatialization parameters being output from a parametric coding by matrixing of an original multi-channel signal. The method comprises: decorrelation of the sum signal to obtain a decorrelated signal; applying a synthesis matrix, whose coefficients depend on the spatialization parameters, to the decorrelated signal and to the sum signal to obtain said output signals, wherein for at least one range of value of at least one spatialization parameter, the coefficients of the synthesis matrix are determined according to a criterion of minimizing a quantitative function, relating to the quantity of decorrelated signal in each of the output signals obtained by applying the synthesis matrix.

Type: Grant

Filed: June 16, 2009

Date of Patent: November 12, 2013

Assignee: France Telecom

Inventors: Florent Jaillet, David Virette
SOURCE SEPARATION BY INDEPENDENT COMPONENT ANALYSIS IN CONJUNCTION WITH SOURCE DIRECTION INFORMATION

Publication number: 20130297296

Abstract: Methods and apparatus for signal processing are disclosed. Source separation can be performed to extract source signals from mixtures of source signals by way of independent component analysis. Source direction information is utilized in the separation process, and independent component analysis techniques described herein use multivariate probability density functions to preserve the alignment of frequency bins in the source separation process. It is emphasized that this abstract is provided to comply with the rules requiring an abstract that will allow a searcher or other reader to quickly ascertain the subject matter of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims.

Type: Application

Filed: May 4, 2012

Publication date: November 7, 2013

Applicant: Sony Computer Entertainment Inc.

Inventors: Jaekwon Yoo, Ruxin Chen
Apparatus and method for encoding a multi-channel audio signal

Patent number: 8577045

Abstract: An encoding apparatus comprises a frame processor (105) which receives a multi channel audio signal comprising at least a first audio signal from a first microphone (101) and a second audio signal from a second microphone (103). An ITD processor (107) then determines an inter time difference between the first audio signal and the second audio signal and a set of delays (109, 111) generates a compensated multi channel audio signal from the multi channel audio signal by delaying at least one of the first and second audio signals in response to the inter time difference signal. A combiner (113) then generates a mono signal by combining channels of the compensated multi channel audio signal and a mono signal encoder (115) encodes the mono signal. The inter time difference may specifically be determined by an algorithm based on determining cross correlations between the first and second audio signals.

Type: Grant

Filed: September 9, 2008

Date of Patent: November 5, 2013

Assignee: Motorola Mobility LLC

Inventor: Jonathan A. Gibbs
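The chain in the abstract above (cross-correlation based inter time difference, delay compensation, mono combination) can be sketched like this. An integer-sample lag search and a simple average as the combiner are assumed; the function names are hypothetical.

```python
def estimate_itd(left, right, max_lag):
    """Lag in samples by which `right` trails `left` (negative if it leads).

    Picks the lag that maximizes the raw cross-correlation.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for n in range(len(left)):
            m = n + lag
            if 0 <= m < len(right):
                score += left[n] * right[m]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def delay(signal, d):
    """Delay `signal` by `d` >= 0 samples, keeping the original length."""
    keep = max(len(signal) - d, 0)
    return ([0.0] * min(d, len(signal))) + list(signal[:keep])


def downmix(left, right, max_lag):
    """Align the two channels, then average them into one mono signal."""
    lag = estimate_itd(left, right, max_lag)
    if lag > 0:      # right is late: hold back left
        left = delay(left, lag)
    elif lag < 0:    # left is late: hold back right
        right = delay(right, -lag)
    return [(a + b) / 2 for a, b in zip(left, right)], lag
```

Aligning before summing avoids the comb-filter colouration that a plain sum of time-shifted microphone signals would produce.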
Postfilter for layered codecs

Patent number: 8571852

Abstract: A scalable decoder device (50) for signals representing audio comprises a primary decoder (21) connected to an input (40). The primary decoder (21) is arranged to provide a primary decoded signal (23) based on received parameters (4). A primary postfilter (31) is connected to the primary decoder (23) to provide a primary postfiltered signal (32). A secondary enhancement decoder (45) is connected to the input (40) and arranged to provide a secondary decoded enhancement signal (44). The device further comprises a combiner arrangement (55), arranged for combining the primary postfiltered signal (32) and a signal (53) based on the secondary decoded enhancement signal (44) into an output signal (6) to be provided at an output (6). The combining is made with an adaptable strength relation between contributions from the two signals. A method for decoding coded signals representing audio operates in analogy with the scalable decoder device (50).

Type: Grant

Filed: December 14, 2007

Date of Patent: October 29, 2013

Assignee: Telefonaktiebolaget L M Ericsson (publ)

Inventor: Stefan Bruhn
Multi-mode method and an apparatus for processing a signal

Patent number: 8566107

Abstract: Disclosed is a method of processing a signal, which includes receiving at least one of a first signal and a second signal, receiving mode information, and decoding the at least one of the first signal and the second signal using at least one of a first coding scheme and a second coding scheme according to the mode information. The mode information is information for indicating that a prescribed mode corresponds to one of at least three modes. The method includes detecting when a restricted mode change occurs and changing at least one mode when detecting a restricted mode change.

Type: Grant

Filed: October 15, 2008

Date of Patent: October 22, 2013

Assignees: LG Electronics Inc., Intellectual Discovery Co., Ltd.

Inventors: Hyen-O Oh, Hong Goo Kang, Chang Heon Lee, Sang Wook Shin, Yang Won Jung
SIGNAL ANALYZER, SIGNAL ANALYZING METHOD, SIGNAL SYNTHESIZER, SIGNAL SYNTHESIZING, WINDOWER, TRANSFORMER AND INVERSE TRANSFORMER

Publication number: 20130268264

Abstract: The present disclosure relates to a signal analyzer for processing an overlapped input signal frame comprising 2N subsequent input signal values. The signal analyzer comprises: a windower adapted to window the overlapped input signal frame to obtain a windowed signal, wherein the windower is adapted to zero M+N/2 subsequent input signal values of the overlapped input signal frame, wherein M is equal or greater than 1 and smaller than N/2; and a transformer adapted to transform the remaining 3N/2−M subsequent windowed signal values of the windowed signal using N−M sets of transform parameters to obtain a transformed-domain signal comprising N−M transformed-domain signal values.

Type: Application

Filed: April 15, 2013

Publication date: October 10, 2013

Applicant: Huawei Technologies Co., Ltd.

Inventors: Anisse Taleb, Fengyan Qi, Chen Hu
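The sample counts in the abstract above follow from simple bookkeeping: a frame of 2N values with M+N/2 of them zeroed leaves 2N minus (M+N/2), that is 3N/2 minus M, values. The sketch below checks this, assuming the zeroed run sits at the end of the frame and leaving out the window shape itself; the function name is hypothetical.

```python
def zero_tail(frame, m):
    """Zero M + N/2 trailing values of a 2N-sample frame.

    Returns (windowed_frame, remaining_count). Placing the zeros at the
    end of the frame is an assumption made for illustration only.
    """
    two_n = len(frame)
    n = two_n // 2
    if two_n % 4 != 0:
        raise ValueError("frame length 2N must make N/2 an integer")
    if not 1 <= m < n / 2:
        raise ValueError("M must satisfy 1 <= M < N/2")
    zeros = m + n // 2
    remaining = two_n - zeros  # equals 3N/2 - M
    return list(frame[:remaining]) + [0.0] * zeros, remaining
```

For N = 8 and M = 2, six values are zeroed and ten remain, and the transformer would then produce N minus M, that is six, transformed-domain values.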
SYSTEMS AND METHODS FOR AUTOMATED SPEECH AND SPEAKER CHARACTERIZATION

Publication number: 20130262097

Abstract: Systems and methods utilize individually selected modulation spectral features for speech and speaker characterization. The method involves construction of a sparse feature space and a method of finding the approximately best feature subset for attributing a specific characteristic of speech or speaker. The current selection method is based on the Kolmogorov-Smirnov statistical test applied to individual features. The characterization task can be defined empirically and no a-priori theory is necessary to explain characteristic attribution processes. Experimental results indicate that employment of selected modulation spectral features works better than the current state-of-the-art at least in some instances of speech characterization task, e.g. prediction of speaker personality traits, as it is evident from the official results of Interspeech'2012 Speaker Personality Recognition Challenge.

Type: Application

Filed: March 29, 2013

Publication date: October 3, 2013

Inventor: Aliaksei Ivanou
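Per-feature selection with the Kolmogorov-Smirnov test, as mentioned in the abstract above, can be sketched as follows. The sketch ranks features by the two-sample KS statistic between two classes and keeps the top few; the dictionary layout, the function names, and the use of the raw statistic rather than a p-value are assumptions.

```python
def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: the largest gap between
    the empirical distribution functions of samples `a` and `b`."""
    a, b = sorted(a), sorted(b)
    i = j = 0
    d = 0.0
    while i < len(a) and j < len(b):
        v = min(a[i], b[j])
        while i < len(a) and a[i] <= v:
            i += 1
        while j < len(b) and b[j] <= v:
            j += 1
        d = max(d, abs(i / len(a) - j / len(b)))
    return d


def select_features(class0, class1, top_k):
    """Rank features by how differently they are distributed in the two
    classes and return the names of the `top_k` most separating ones.

    `class0` and `class1` map feature name -> list of observed values.
    """
    scores = {name: ks_statistic(class0[name], class1[name]) for name in class0}
    ranked = sorted(scores, key=lambda name: scores[name], reverse=True)
    return ranked[:top_k]
```

Each feature is scored on its own, which matches the abstract's point that the test is applied to individual features and needs no prior theory about which ones should matter.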
Efficient design of MDCT / IMDCT filterbanks for speech and audio coding applications

Patent number: 8548815

Abstract: A more efficient encoder/decoder is provided in which an N-point MDCT transform is mapped into smaller sized N/2-point DCT-IV and/or DCT-II transforms with isolated pre-multiplications which can be moved to a prior or subsequent windowing stage. That is, the windowing operations may be merged with first/last stage multiplications in the core MDCT/IMDCT functions, respectively, thus reducing the total number of multiplications. Additionally, the MDCT may be systematically decimated by factor of 2 by utilizing a uniformly scaled 5-point DCT-II core function as opposed to the DCT-IV or FFT cores used in many existing MDCT designs in audio codecs. The modified windowing stage merges factors from a transform stage and windowing stage to obtain piece-wise symmetric windowing factors, which can be represented by a sub-set of the piece-wise symmetric windowing factors to save storage space. Such features offer appreciable reduction in complexity and less memory usage than the prior art.

Type: Grant

Filed: September 18, 2008

Date of Patent: October 1, 2013

Assignee: QUALCOMM Incorporated

Inventors: Ravi Kiran Chivukula, Yuriy Reznik
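The mapping of an MDCT onto a half-size DCT-IV, which the abstract above builds on, can be checked numerically with the sketch below. It uses the standard folding identity and direct O(N^2) sums; it does not reproduce the patent's merged windowing factors or its 5-point DCT-II core. Note that the abstract counts N as the frame length, whereas `n` in the code is the number of output coefficients (half the frame length).

```python
import math


def mdct_direct(x):
    """Unnormalized MDCT of 2n inputs, straight from the definition."""
    n = len(x) // 2
    return [
        sum(x[t] * math.cos(math.pi / n * (t + 0.5 + n / 2) * (k + 0.5))
            for t in range(2 * n))
        for k in range(n)
    ]


def dct_iv(u):
    """Unnormalized DCT-IV of n inputs."""
    n = len(u)
    return [
        sum(u[t] * math.cos(math.pi / n * (t + 0.5) * (k + 0.5))
            for t in range(n))
        for k in range(n)
    ]


def mdct_via_dct_iv(x):
    """Fold the 2n inputs (a, b, c, d) into n values, then apply a DCT-IV.

    Folding rule: (-reverse(c) - d, a - reverse(b)).
    Requires len(x) to be a multiple of 4.
    """
    n = len(x) // 2
    h = n // 2
    a, b, c, d = x[:h], x[h:n], x[n:n + h], x[n + h:]
    folded = [-cr - dv for cr, dv in zip(reversed(c), d)]
    folded += [av - br for av, br in zip(a, reversed(b))]
    return dct_iv(folded)
```

For any input whose length is a multiple of four, `mdct_direct(x)` and `mdct_via_dct_iv(x)` should agree to within rounding error, while the folded route feeds only half as many values into the cosine transform.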
Encoding device, decoding device, and method thereof for specifying a band of a great error

Patent number: 8543392

Abstract: Disclosed is an encoding device which can accurately specify a band having a large error among all the bands by using a small calculation amount. A first position identifier uses a first layer error conversion coefficient indicating an error of a decoding signal for an input signal so as to search for a band having a large error in a relatively wide bandwidth in all the bands of the input signal and generates first position information indicating the identified band. A second position identifier searches for a target frequency band having a large error in a relatively narrow bandwidth in the band identified by the first position identifier and generates second position information indicating the identified target frequency band. An encoder encodes a first layer decoding error conversion coefficient contained in the target frequency band.

Type: Grant

Filed: February 29, 2008

Date of Patent: September 24, 2013

Assignee: Panasonic Corporation

Inventors: Masahiro Oshikiri, Tomofumi Yamanashi, Toshiyuki Morii
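
The coarse-then-fine search described in the abstract above (patent 8543392) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented method: the error measure (sum of squared coefficients), the non-overlapping wide bands, and the names `find_large_error_band`, `wide_width`, and `narrow_width` are all hypothetical.

```python
def find_large_error_band(err_coeffs, wide_width, narrow_width):
    """Return (first_pos, second_pos): start of the wide band with the
    largest error, and start of the narrow band with the largest error
    inside that wide band."""

    def energy(start, width):
        # Assumed error measure: energy of the error coefficients.
        return sum(c * c for c in err_coeffs[start:start + width])

    # Stage 1: coarse search over non-overlapping wide bands.
    wide_starts = range(0, len(err_coeffs) - wide_width + 1, wide_width)
    first_pos = max(wide_starts, key=lambda s: energy(s, wide_width))

    # Stage 2: fine search, restricted to the winning wide band.
    narrow_starts = range(first_pos, first_pos + wide_width - narrow_width + 1)
    second_pos = max(narrow_starts, key=lambda s: energy(s, narrow_width))
    return first_pos, second_pos
```

Restricting the fine search to one wide band is what keeps the calculation amount small: only a handful of narrow positions are evaluated instead of every position in the spectrum.
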
SPEECH SIGNAL ENCODING METHOD AND SPEECH SIGNAL DECODING METHOD

Publication number: 20130246054

Abstract: A speech signal encoding method and a speech signal decoding method are provided. The speech signal encoding method includes the steps of specifying an analysis frame in an input signal; generating a modified input based on the analysis frame; applying a window to the modified input; generating a transform coefficient by performing an MDCT (Modified Discrete Cosine Transform) on the modified input to which the window has been applied; and encoding the transform coefficient. The modified input includes the analysis frame and a self replication of all or a part of the analysis frame.

Type: Application

Filed: November 23, 2011

Publication date: September 19, 2013

Applicant: LG Electronics Inc.

Inventors: Gyu Hyeok Jeong, Jong Ha Lim, Hye Jeong Jeon, In Gyu Kang, Lag Young Kim
Adaptive frequency prediction for encoding or decoding an audio signal

Patent number: 8532983

Abstract: In one embodiment, a method of transceiving an audio signal is disclosed. The method includes providing low band spectral information having a plurality of spectrum coefficients and predicting a high band extended spectral fine structure from the low band spectral information for at least one subband, where the high band extended spectral fine structure is made of a plurality of spectrum coefficients. The predicting includes preparing the spectrum coefficients of the low band spectral information, defining prediction parameters for the high band extended spectral fine structure and index ranges of the prediction parameters, and determining possible best indices of the prediction parameters, where determining includes minimizing a prediction error between a reference subband in high band and a predicted subband that is selected and composed from an available low band. The possible best indices of the prediction parameters are transmitted.

Type: Grant

Filed: September 4, 2009

Date of Patent: September 10, 2013

Assignee: Huawei Technologies Co., Ltd.

Inventor: Yang Gao
Selective bandwidth extension for encoding/decoding audio/speech signal

Patent number: 8532998

Abstract: A method of receiving an audio signal includes measuring a periodicity of the audio signal to determine a checked periodicity. At least one best available subband is determined. At least one extended subband is composed, wherein composing includes reducing a ratio of composed harmonic components to composed noise components if the checked periodicity is lower than a threshold, and scaling a magnitude of the at least one extended subband based on a spectral envelope on the audio signal.

Type: Grant

Filed: September 4, 2009

Date of Patent: September 10, 2013

Assignee: Huawei Technologies Co., Ltd.

Inventor: Yang Gao
METHOD AND APPARATUS FOR ENCODING AND DECODING HIGH FREQUENCY SIGNAL

Publication number: 20130226566

Abstract: Provided are a method and apparatus for encoding and decoding a high frequency signal by using a low frequency signal. The high frequency signal can be encoded by extracting a coefficient by linearly predicting a high frequency signal, and encoding the coefficient, generating a signal by using the extracted coefficient and a low frequency signal, and encoding the high frequency signal by calculating a ratio between the high frequency signal and an energy value of the generated signal. Also, the high frequency signal can be decoded by decoding a coefficient, which is extracted by linearly predicting a high frequency signal, and a low frequency signal, and generating a signal by using the decoded coefficient and the decoded low frequency signal, and adjusting the generated signal by decoding a ratio between the generated signal and an energy value of the high frequency signal.

Type: Application

Filed: April 8, 2013

Publication date: August 29, 2013

Applicant: Samsung Electronics Co., Ltd

Inventor: Samsung Electronics Co., Ltd
APPARATUS AND METHOD OF ENCODING AND DECODING SIGNALS

Publication number: 20130226565

Abstract: A method of encoding an audio signal, where signals including two or more channel signals are downmixed to a mono signal, the mono signal is divided into a low-frequency signal and a high-frequency signal, the low-frequency signal is encoded through algebraic code excited linear prediction (ACELP) or transform coded excitation (TCX), and the high-frequency signal is encoded using the low-frequency signal. In a method of decoding an audio signal, a low-frequency signal encoded through ACELP or TCX is decoded, a high-frequency signal is decoded using the low-frequency signal, the low-frequency signal and the high-frequency signal are combined to generate a mono signal, and the mono signal is upmixed by decoding spatial parameters regarding signals including two or more channel signals.

Type: Application

Filed: March 26, 2013

Publication date: August 29, 2013

Applicant: SAMSUNG Electronics Co., Ltd.

Inventor: SAMSUNG Electronics Co., Ltd.
Audio coding/decoding method and apparatus using excess quantization information

Patent number: 8521522

Abstract: There is provided an audio coding device which appropriately sets the quantization bit number by a small calculation amount in each stage when coding an input audio signal by performing multi-stage normalization/quantization. A quantization information calculation section determines total quantization information idwl0, based on normalization information idsf, and allocates the total quantization information idwl0 for quantization information idwl1 and quantization information idwl2. At this time, the quantization information calculation section limits the quantization information idwl1 by a limiter lim1, and allocates the total quantization information idwl0 for quantization information idwl1. If the quantization information idwl1 exceeds the limiter lim1, the excess is allocated for the quantization information idwl2. A first normalization section and a first quantization section normalize and quantize a frequency spectrum mdspec1 in the first stage.

Type: Grant

Filed: May 5, 2006

Date of Patent: August 27, 2013

Assignee: Sony Corporation

Inventors: Yuuki Matsumura, Shiro Suzuki, Keisuke Toyama, Mitsuyuki Hatanaka, Yuhki Mitsufuji
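
The limiter-based allocation in the abstract above (patent 8521522) can be sketched in a few lines. The names `idwl0`, `idwl1`, `idwl2`, and `lim1` come from the abstract; the function name `split_quantization_info` and the treatment of the values as plain integers are assumptions, and the derivation of `idwl0` from `idsf` is not shown because the abstract does not specify it.

```python
def split_quantization_info(idwl0, lim1):
    """Split the total quantization information across two stages.

    The first stage receives as much as the limiter allows; any excess
    beyond lim1 is passed on to the second stage.
    """
    idwl1 = min(idwl0, lim1)   # first stage, capped by the limiter
    idwl2 = idwl0 - idwl1      # excess (possibly zero) for the second stage
    return idwl1, idwl2
```

For example, with a total of 10 and a limiter of 6 the first stage gets 6 and the second stage gets the remaining 4; with a total of 4 the limiter is not reached and the second stage gets nothing.
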
Enhanced audio decoder

Patent number: 8515768

Abstract: Methods, systems, and apparatus are presented for decoding an audio signal that includes bandwidth extension data. An audio signal that includes core audio data and bandwidth extension data can be received in a decoder. The core audio data can be associated with a core portion of an audio signal, such as the frequency range below a cutoff frequency, and the bandwidth extension data can be associated with an extended portion of the audio signal, such as a frequency range above the cutoff frequency. The core audio data can be decoded to generate a decoded core audio signal in a time domain representation. Further, an extended portion of the audio signal can be reconstructed in accordance with extension data and decoded core audio signal. Additionally, the decoded core audio signal can be lowpass filtered and the extended portion can be highpass filtered before being combined to generate a decoded output signal.

Type: Grant

Filed: August 31, 2009

Date of Patent: August 20, 2013

Assignee: Apple Inc.

Inventors: Frank Baumgarte, William Stewart, Shyh-Shiaw Kuo
Rounding noise shaping for integer transform based encoding and decoding

Patent number: 8503535

Abstract: An integer-reversible MDCT transformation is split into consecutive lifting steps, each introducing considerable rounding errors to the signal. Without noise shaping the rounding error noise will impact all frequency bins of the transformed signal equally. This is a particular problem for low signal level frequency bins. The invention limits the impact of rounding error noise coming with each lifting step in the integer-reversible transformation on the data rate of a lossless audio codec. The filter coefficients of an adaptive noise shaping filter for transform coefficients are adapted in individual lifting steps according to the current time domain signal characteristics. As an alternative, an auto-regressive pre-filter can be added in front of the lossless transformation, for raising the level of frequency regions with low power to decrease the dominance of rounding errors in these areas. Both processes can be combined to further improve lossless codec compression ratio.

Type: Grant

Filed: November 10, 2008

Date of Patent: August 6, 2013

Assignee: Thomson Licensing

Inventor: Peter Jax
Signal distortion elimination apparatus, method, program, and recording medium having the program recorded thereon

Patent number: 8494845

Abstract: Provided is a signal distortion elimination apparatus comprising: an inverse filter application means that outputs the signal obtained by applying an inverse filter to an observed signal as a restored signal when a predetermined iteration termination condition is met and outputs the signal obtained by applying the inverse filter to the observed signal as an ad-hoc signal when the predetermined iteration termination condition is not met; a prediction error filter calculation means that segments the ad-hoc signal into frames and outputs a prediction error filter of each frame obtained by performing linear prediction analysis of the ad-hoc signal of each frame; an inverse filter calculation means that calculates an inverse filter such that a concatenation of innovation estimates of the respective frames becomes mutually independent among their samples, where the innovation estimate of a single frame (an innovation estimate) is the signal obtained by applying the prediction error filter of the corresponding frame

Type: Grant

Filed: February 16, 2007

Date of Patent: July 23, 2013

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Takuya Yoshioka, Takafumi Hikichi, Masato Miyoshi
Method for detecting audio signal transient and time-scale modification based on same

Patent number: 8489404

Abstract: A method for detecting a transient in an audio signal that has been broken up into frames includes obtaining a time domain feature of the frames and comparing the time domain feature with a predetermined value. If the time domain feature is greater than the predetermined value, the frames are taken as transient and if the time domain feature is less than the predetermined value, the frames are taken as non-transient. The method has a low computational intensity and is thus very suitable for devices with limited processing resources.

Type: Grant

Filed: March 15, 2011

Date of Patent: July 16, 2013

Assignee: Freescale Semiconductor, Inc.

Inventors: Zhongsong Lin, Shidong Shang, Shengjiu Wang
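
The threshold test in the abstract above (patent 8489404) can be sketched as follows. The abstract does not say which time domain feature is used, so mean frame energy is swapped in here purely as an assumption; `frame_energy`, `classify_frames`, `frame_len`, and `threshold` are hypothetical names.

```python
def frame_energy(frame):
    """Assumed time domain feature: mean squared amplitude of the frame."""
    return sum(x * x for x in frame) / len(frame)


def classify_frames(samples, frame_len, threshold, feature=frame_energy):
    """Split samples into non-overlapping frames and flag each frame as
    transient (True) when its feature exceeds the threshold."""
    frames = [
        samples[i:i + frame_len]
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]
    return [feature(f) > threshold for f in frames]
```

Because the feature is computed directly on the samples, no transform is needed, which is consistent with the low computational intensity the abstract claims.
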
Apparatus and method for encoding using signal fragments for redundant transmission of data

Patent number: 8477050

Abstract: A system and method for redundant transmission is provided. In one embodiment, an input signal S is encoded as a list of fragments. Each fragment includes an index value and a projection value. The index points to an entry in a dictionary of signal elements. A repetition factor is assigned to each fragment based on its importance. After a fragment is added, a reconstructed signal is generated by decoding the list of fragments. Encoding terminates once the reconstructed signal is sufficiently close to the original signal S.

Type: Grant

Filed: September 15, 2011

Date of Patent: July 2, 2013

Assignee: Google Inc.

Inventor: Pascal Massimino
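
The encode-until-close loop in the abstract above (patent 8477050) resembles a greedy matching-pursuit decomposition, which is used here as a stand-in; the abstract does not name the selection rule. This sketch assumes unit-norm dictionary elements and omits the repetition factor, since the abstract does not say how importance is measured. `encode_fragments`, `decode_fragments`, `tol`, and `max_fragments` are hypothetical names.

```python
def encode_fragments(signal, dictionary, tol=1e-6, max_fragments=64):
    """Greedily encode signal as a list of (index, projection) fragments."""
    residual = list(signal)
    fragments = []
    for _ in range(max_fragments):
        # Stop once the reconstruction is sufficiently close to the signal.
        if sum(r * r for r in residual) <= tol:
            break
        # Pick the dictionary element with the largest projection magnitude.
        best_idx, best_proj = 0, 0.0
        for idx, atom in enumerate(dictionary):
            proj = sum(r * a for r, a in zip(residual, atom))
            if abs(proj) > abs(best_proj):
                best_idx, best_proj = idx, proj
        if best_proj == 0.0:
            break  # no element reduces the residual any further
        fragments.append((best_idx, best_proj))
        residual = [r - best_proj * a
                    for r, a in zip(residual, dictionary[best_idx])]
    return fragments


def decode_fragments(fragments, dictionary, length):
    """Rebuild the signal by summing the scaled dictionary elements."""
    out = [0.0] * length
    for idx, proj in fragments:
        for n, a in enumerate(dictionary[idx]):
            out[n] += proj * a
    return out
```

Each fragment is independent of the others at decode time, which is what makes it possible to repeat the important ones for redundancy.
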
Quantizer, encoder, and the methods thereof

Patent number: 8473288

Abstract: Disclosed are a quantizer, encoder, and the methods thereof, wherein the computational load is reduced when the values related to the transform coefficients of the principal component analysis transform are quantized when a principal component analysis transform is applied to code stereo.

Type: Grant

Filed: June 18, 2009

Date of Patent: June 25, 2013

Assignee: Panasonic Corporation

Inventors: Toshiyuki Morii, Hiroyuki Ehara, Koji Yoshida
Method and apparatus to facilitate determining signal bounding frequencies

Patent number: 8463412

Abstract: A signal processing platform (300) presents (101) a signal to be processed and identifies (102) signal portions with specific characteristics that are used (103) to automatically determine at least one bounding frequency that can be used to facilitate bandwidth extension for the signal. Identifying these signal portions can comprise identifying signal portions that exhibit at least a predetermined level of energy. The step of determining the bounding frequency can comprise computing a magnitude spectrum for each of the identified signal portions that can be used to determine a corresponding measure of flatness within a pass band as pertains to a corresponding normalized signal portion to thereby provide corresponding vetted signal portions. Determining the bounding frequency can then comprise accumulating the magnitude spectrum for these vetted signal portions and using the resultant accumulation to estimate a corresponding signal envelope.

Type: Grant

Filed: August 21, 2008

Date of Patent: June 11, 2013

Assignee: Motorola Mobility LLC

Inventors: Tenkasi V. Ramabadran, Mark A. Jasiuk
Method and apparatus for concealing lost frame

Patent number: 8457115

Abstract: A method for concealing a lost frame includes: using history signals before the lost frame that corresponds to a lost MDCT coefficient to generate a first synthesized signal when it is detected that the MDCT coefficient is lost; performing fast IMDCT for the first synthesized signal to obtain an IMDCT coefficient corresponding to the lost MDCT coefficient; and using the IMDCT coefficient corresponding to the lost MDCT coefficient and an IMDCT coefficient adjacent to the IMDCT coefficient corresponding to the lost MDCT coefficient to perform TDAC and obtain signals corresponding to the lost frame. An apparatus for concealing a lost frame is also disclosed herein. The method and the apparatus for concealing lost frames in the embodiments of the present invention make full use of the received partial signals to recover high-quality voice signals and improve the QoS.

Type: Grant

Filed: October 27, 2010

Date of Patent: June 4, 2013

Assignee: Huawei Technologies Co., Ltd.

Inventors: Wuzhou Zhan, Dongqi Wang
Encoding device, decoding device, and method thereof

Patent number: 8452588

Abstract: It is possible to improve the quality of a decoding signal in a band spread for estimating a high band from a low band of a decoding signal. A first layer encoder encodes a lower band portion below a predetermined frequency of an input signal so as to generate first layer encoded information. A first layer decoder decodes the first layer encoded information so as to generate a first layer demodulated signal. A second layer encoder divides a high band portion, higher than a predetermined frequency, of an input signal into a plurality of sub-bands and estimates each of the sub-bands from the input signal or the first layer decoded signal by using the estimation result of the sub-band adjacent to the lower band side so as to generate second encoded information including the estimation results of the sub-bands.

Type: Grant

Filed: March 13, 2009

Date of Patent: May 28, 2013

Assignee: Panasonic Corporation

Inventors: Tomofumi Yamanashi, Masahiro Oshikiri
Encoder, decoder, and the methods therefor

Patent number: 8452587

Abstract: Provided is an encoder which can decode a high-quality stereo signal while keeping the amount of information in the bit allocation information to a minimum when a scalable coding technique is used for a stereo signal. In the encoder, a principal component analysis (PCA) converter converts the left signal and the right signal of the stereo signal and generates the main signal of the first layer and the sub-signal of the first layer. In the first layer to the M-th layer (where M is a natural number, 2 or greater), an adaptive residual encoder compares the importance of the main signal of the m-th layer, where m is a natural number from 1 to M, and the importance of the sub-signal of the m-th layer, selects the signal having the higher importance, encodes the selected signal, and generates the encoded data of the m-th layer.

Type: Grant

Filed: May 29, 2009

Date of Patent: May 28, 2013

Assignee: Panasonic Corporation

Inventors: Zongxian Liu, Kok Seng Chong
Audio encoding device, audio decoding device, audio encoding method, and audio decoding method

Patent number: 8447597

Abstract: In an encoding process, a CPU transforms an audio signal from the real-time domain to the frequency domain, and transforms the signal into spectra consisting of MDCT coefficients. The CPU separates the audio signal into several frequency bands, and performs bit shifting in each band such that the MDCT coefficients can be expressed with pre-configured numbers of bits. The CPU re-quantizes the MDCT coefficients at a precision differing for each band, and transmits the values acquired thereby and shift bit numbers as encoded data. Meanwhile, in a decoding process, a CPU receives encoded data and inverse re-quantizes and inverse bit shifts the data, thereby restoring the MDCT coefficients. Furthermore, the CPU transforms the data from frequency domain to the real-time domain by using the inverse MDCT, and restores and outputs the audio signal.

Type: Grant

Filed: October 1, 2007

Date of Patent: May 21, 2013

Assignee: Casio Computer Co., Ltd.

Inventor: Hiroyasu Ide
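
The per-band bit shifting in the abstract above (patent 8447597) can be sketched as follows, assuming integer MDCT coefficients and a signed representation with a pre-configured bit width per band. The function names `encode_band` and `decode_band`, and the choice to shift magnitudes rather than two's-complement values, are assumptions made for illustration.

```python
def encode_band(coeffs, bits):
    """Find the smallest right shift that makes every coefficient in the
    band fit into a signed integer of `bits` bits, and apply it."""
    limit = (1 << (bits - 1)) - 1          # largest representable magnitude
    peak = max((abs(c) for c in coeffs), default=0)
    shift = 0
    while (peak >> shift) > limit:
        shift += 1
    # Shift magnitudes so positive and negative values round the same way.
    shifted = [(abs(c) >> shift) * (1 if c >= 0 else -1) for c in coeffs]
    return shift, shifted


def decode_band(shift, shifted):
    """Undo the shift; the low bits dropped by the encoder are lost."""
    return [(abs(v) << shift) * (1 if v >= 0 else -1) for v in shifted]
```

The shift count is sent alongside the shifted values, so the decoder can restore the original scale; the precision lost in each band is bounded by that band's shift.
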
Factorization of overlapping transforms into two block transforms

Patent number: 8447591

Abstract: An audio encoder/decoder uses a combination of an overlap windowing transform and block transform that have reversible implementations to provide a reversible, integer-integer form of a lapped transform. The reversible lapped transform permits both lossy and lossless transform domain coding of an audio signal having variable subframe sizes.

Type: Grant

Filed: May 30, 2008

Date of Patent: May 21, 2013

Assignee: Microsoft Corporation

Inventor: Sanjeev Mehrotra
Method and apparatus for adaptive sub-band allocation of spectral coefficients

Patent number: 8438012

Abstract: An apparatus and method for adaptive sub-band allocation of spectral coefficients are disclosed. The sizes of sub-bands are determined according to the distribution of spectral coefficients transformed from an input speech/audio signal to perform more elaborate quantization in units of sub-bands. Thus, quantization noise of the spectral coefficients is reduced, and sound quality in a frequency region is enhanced, thereby improving the quality of the signal.

Type: Grant

Filed: September 9, 2009

Date of Patent: May 7, 2013

Assignee: Electronics and Telecommunications Research Institute

Inventors: Hyun Woo Kim, Hyun Joo Bae, Byung Sun Lee
Method and apparatus for estimating high-band energy in a bandwidth extension system

Patent number: 8433582

Abstract: A method (100) includes receiving (101) an input digital audio signal comprising a narrow-band signal. The input digital audio signal is processed (102) to generate a processed digital audio signal. A high-band energy level corresponding to the input digital audio signal is estimated (103) based on a transition-band of the processed digital audio signal within a predetermined upper frequency range of a narrow-band bandwidth. A high-band digital audio signal is generated (104) based on the high-band energy level and an estimated high-band spectrum corresponding to the high-band energy level.

Type: Grant

Filed: February 1, 2008

Date of Patent: April 30, 2013

Assignee: Motorola Mobility LLC

Inventors: Tenkasi V. Ramabadran, Mark A. Jasiuk
Decoder for audio signal including generic audio and speech frames

Patent number: 8428936

Abstract: A method for decoding audio frames includes producing a first frame of coded audio samples, producing at least a portion of a second frame of coded audio samples, generating audio gap filler samples based on parameters representative of a weighted segment of the first frame of coded audio samples or a weighted segment of the portion of the second frame of coded audio samples, and forming a sequence including the audio gap filler samples and the portion of the second frame of coded audio samples.

Type: Grant

Filed: September 9, 2010

Date of Patent: April 23, 2013

Assignee: Motorola Mobility LLC

Inventors: Udar Mittal, Jonathan A. Gibbs, James P. Ashley
