Psychoacoustic Patents (Class 704/200.1)
  • Patent number: 8620645
    Abstract: A decoder arrangement comprising a receiver input for parameters of frame-based coded signals and a decoder arranged to provide frames of decoded audio signals based on the parameters. The receiver input and/or the decoder is arranged to establish a time difference between the occasion when parameters of a first frame is available at the receiver input and the occasion when a decoded audio signal of the first frame is available at an output of the decoder, which time difference corresponds to at least one frame. A postfilter is connected to the output of the decoder and to the receiver input. The postfilter is arranged to provide a filtering of the frames of decoded audio signals into an output signal in response to parameters of a respective subsequent frame.
    Type: Grant
    Filed: December 14, 2007
    Date of Patent: December 31, 2013
    Assignee: Telefonaktiebolaget L M Ericsson (publ)
    Inventor: Stefan Bruhn
  • Patent number: 8620674
    Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.
    Type: Grant
    Filed: January 31, 2013
    Date of Patent: December 31, 2013
    Assignee: Microsoft Corporation
    Inventors: Naveen Thumpudi, Wei-Ge Chen
  • Publication number: 20130346070
    Abstract: Disclosed is an acoustic communication method that includes filtering an audio signal to attenuate a high frequency section of the audio signal; generating a residual signal which corresponds to a difference between the audio signal and the filtered signal; generating a psychoacoustic mask for the audio signal based on a predetermined psychoacoustic model; generating a psychoacoustic spectrum mask by combining the residual signal with the psychoacoustic mask; generating an acoustic communication signal by modulating digital data according to the acoustic signal spectrum mask; and combining the acoustic communication signal with the filtered signal.
    Type: Application
    Filed: August 27, 2013
    Publication date: December 26, 2013
    Applicant: Samsung Electronics Co., Ltd.
    Inventors: Hee-Won Jung, Jun-Ho Koh, Sang-Mook Lee, Gi-Sang Lee, Sergey Zhidkov
  • Patent number: 8615391
    Abstract: An method and apparatus to extract an audio signal having an important spectral component (ISC) and a low bit-rate audio signal coding/decoding method using the method and apparatus to extract the ISC. The method of extracting the ISC includes calculating perceptual importance including an SMR (signal-to-mask ratio) value of transformed spectral audio signals by using a psychoacoustic model, selecting spectral signals having a masking threshold value smaller than that of the spectral audio signals using the SMR value as first ISCs, and extracting a spectral peak from the audio signals selected as the ISCs according to a predetermined weighting factor to select second ISCs. Accordingly, the perceptual important spectral components can be efficiently coded so as to obtain high sound quality at a low bit-rate.
    Type: Grant
    Filed: July 6, 2006
    Date of Patent: December 24, 2013
    Assignee: SAMSUNG Electronics Co., Ltd.
    Inventors: Junghoe Kim, Eunmi Oh, Konstantin Osipov, Boris Kudryashov
  • Patent number: 8615390
    Abstract: The invention relates to transform coding/decoding of a digital audio signal represented by a succession of frames, using windows of different lengths. For the coding within the meaning of the invention, it is sought to detect (51) a particular event, such as an attack, in a current frame (Ti); and, at least if said particular event is detected at the start of the current frame (53), a short window (54) is directly applied in order to code (56) the current frame (Ti) without applying a transition window. Thus, the coding has a reduced delay in relation to the prior art. In addition, an ad hoc processing is applied during decoding in order to compensate for the direct passage from a long window to a short window during coding.
    Type: Grant
    Filed: December 18, 2007
    Date of Patent: December 24, 2013
    Assignee: France Telecom
    Inventors: Balazs Kovesi, David Virette, Pierrick Philippe
  • Patent number: 8612219
    Abstract: An SBR encoder includes a filter bank that receives an input signal, a time/frequency grid generator that controls a number of bits of various parameters, a parameter calculator that calculates various parameters, a parameter coding unit that encodes the parameters, an upper-limit number-of-bit storage unit that stores an upper limit of the number of bit of encoded data of high-frequency component finally generated in a high-pass encoding process, and a number-of-bit controller. The number-of-bit controller controls the high-pass encoding process by preferentially encoding a parameter having a large influence to sound quality and not encoding a parameter having a small influence to the sound quality relative to a plurality of parameters, so that the number of bits of the encoded data of high-frequency component finally generated in the high-pass encoding process becomes equal to or less than the upper limit to be stored in the upper-limit number-of-bit storage unit.
    Type: Grant
    Filed: October 11, 2007
    Date of Patent: December 17, 2013
    Assignee: Fujitsu Limited
    Inventors: Yoshiteru Tsuchinaga, Masanao Suzuki, Miyuki Shirakawa, Takashi Makiuchi
  • Patent number: 8612225
    Abstract: A voice recognition device that recognizes a voice of an input voice signal, comprises a voice model storage unit that stores in advance a predetermined voice model having a plurality of detail levels, the plurality of detail levels being information indicating a feature property of a voice for the voice model; a detail level selection unit that selects a detail level, closest to a feature property of an input voice signal, from the detail levels of the voice model stored in the voice model storage unit; and a parameter setting unit that sets parameters for recognizing the voice of an input voice according to the detail level selected by the detail level selection unit.
    Type: Grant
    Filed: February 26, 2008
    Date of Patent: December 17, 2013
    Assignee: NEC Corporation
    Inventors: Takayuki Arakawa, Ken Hanazawa, Masanori Tsujikawa
  • Patent number: 8612220
    Abstract: The invention relates to a method for quantifying components, wherein certain components are each determined based on a plurality of audio signals and can be calculated by the application of a linear conversion on the audio signals, said method comprising: determining a quantification function to be applied to the components by testing a condition relative to an audio signal and depending on a comparison made between a psycho-acoustic masking threshold relative to the audio signal and a value determined based on the reverse linear conversion and quantification errors of the components by the function.
    Type: Grant
    Filed: July 1, 2008
    Date of Patent: December 17, 2013
    Assignee: France Telecom
    Inventors: Adil Mouhssine, Abdellatif Benjelloun Touimi, Pierre Duhamel
  • Patent number: 8606571
    Abstract: The present technology provides noise reduction of an acoustic signal using a configurable classification threshold which provides a sophisticated level of control to balance the tradeoff between positional robustness and noise reduction robustness. The configurable classification threshold corresponds to a configurable spatial region, such that signals arising from sources within the configurable spatial region are preserved, and signals arising from sources outside it are rejected. In embodiments, the configurable classification threshold can be automatically and dynamically adjusted in real-time based on evaluated environmental conditions surrounding an audio device implementing the noise reduction techniques described herein.
    Type: Grant
    Filed: July 15, 2010
    Date of Patent: December 10, 2013
    Assignee: Audience, Inc.
    Inventors: Mark Every, Carlo Murgia
  • Patent number: 8589155
    Abstract: Methods of encoding a signal using a perceptual model are described in which a signal to mask ratio parameter within the perceptual model is tuned. The signal to mask ratio parameter is tuned based on a function of the bitrate of the part of the signal which has already been encoded and the target bitrate for the encoding process. The tuned signal to mask ratio parameter is used to compute a masking threshold for the signal which is then used to quantize the signal.
    Type: Grant
    Filed: July 31, 2012
    Date of Patent: November 19, 2013
    Assignee: Cambridge Silicon Radio Ltd.
    Inventors: Esfandiar Zavarehei, David Hargreaves
  • Patent number: 8583423
    Abstract: Method and arrangement for processing of a speech quality estimate, which involve adaption of a speech quality estimate based on information related to the bandwidth of a reference signal used when determining said speech quality estimate, such that the adapted speech quality estimate is independent of the bandwidth of the reference signal. The method and arrangement enable objective speech quality measurements or assessments to be performed on a unified bandwidth scale, independent of the bandwidth of a reference signal, which allows e.g. a more relevant comparison of communication systems and/or equipment, such as e.g. codecs.
    Type: Grant
    Filed: May 16, 2011
    Date of Patent: November 12, 2013
    Assignee: Telefonaktiebolaget L M Ericsson (Publ)
    Inventor: Volodya Grancharov
  • Patent number: 8583443
    Abstract: Disclosed is a recording and reproducing apparatus comprising: an apparatus main body; and a remote controller to perform remote control of the apparatus main body, wherein the remote controller comprises: a key operating section to receive a key operation by a user; a sound information inputting section to input sound information; and a transmitting section to transmit sound data based on the sound information to the apparatus main body, and the apparatus main body comprises: a recording section to record input content data on a recording medium; a reproducing section to reproduce the content data; a receiving section to receive the sound data; a sound information recording section to record the sound data so as to be associated with a piece of the content data; and a sound information outputting section to reproduce the sound data to output the reproduced sound data.
    Type: Grant
    Filed: April 10, 2008
    Date of Patent: November 12, 2013
    Assignee: Funai Electric Co., Ltd.
    Inventor: Masayuki Misawa
  • Patent number: 8571875
    Abstract: A method, medium, and apparatus encoding and/or decoding a multichannel audio signal. The method includes detecting the type of spatial extension data included in an encoding result of an audio signal, if the spatial extension data is data indicating a core audio object type related to a technique of encoding core audio data, detecting the core audio object type; decoding core audio data by using a decoding technique according to the detected core audio object type, if the spatial extension data is residual coding data, decoding the residual coding data by using the decoding technique according to the core audio object type, and up-mixing the decoded core audio data by using the decoded residual coding data. According to the method, the core audio data and residual coding data may be decoded by using an identical decoding technique, thereby reducing complexity at the decoding end.
    Type: Grant
    Filed: October 11, 2007
    Date of Patent: October 29, 2013
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Jung-hoe Kim, Eun-mi Oh
  • Patent number: 8571877
    Abstract: An apparatus for providing an upmix signal representation on the basis of a downmix signal representation and an object-related parametric information, which are included in a bitstream representation of an audio content, in independence on a user-specified rendering matrix, the apparatus has a distortion limiter configured to obtain a modified rendering matrix using a linear combination of a user-specified rendering matrix in a target rendering matrix in dependence on a linear combination parameter. The apparatus also has a signal processor configured to obtain the upmix signal representation on the basis of the downmix signal representation and the object-related parametric information using the modified rendering matrix. The apparatus is also configured to evaluate a bitstream element representing the linear combination parameter in order to obtain the linear combination parameter.
    Type: Grant
    Filed: May 18, 2012
    Date of Patent: October 29, 2013
    Assignees: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung e.V., Dolby International AB
    Inventors: Jonas Engdegard, Heiko Purnhagen, Juergen Herre, Cornelia Falch, Oliver Hellmuth, Leon Terentiv
  • Patent number: 8566082
    Abstract: A method for determining a speech quality measure of an output speech signal with respect to an input speech signal, wherein the input signal passes through a signal path of a data transmission system resulting in the output signal, includes the steps of pre-processing the output signal; determining at least one of an interruption rate of the pre-processed output signal and a measure for an intensity of musical tones present in the pre-processed output signal; and determining the speech quality measure from at least one of the interruption rate and the measure for the intensity of the musical tones.
    Type: Grant
    Filed: September 11, 2008
    Date of Patent: October 22, 2013
    Assignees: Deutsche Telekom AG, France Telecom
    Inventors: Vincent Barriac, Nicolas Cote, Valerie Gautier-Turbin, Sebastian Moeller, Alexander Raake, Marcel Waeltermann, Ulrich Heute, Kirstin Scholz
  • Patent number: 8566107
    Abstract: Disclosed is a method of processing a signal, which includes receiving at least one of a first signal and a second signal, receiving mode information, and decoding the at least one of the first signal and the second signal using at least one of a first coding scheme and a second coding scheme according to the mode information. The mode information is information for indicating that a prescribed mode corresponds to one of at least three modes. The method includes detecting when a restricted mode change occurs and changing at least one mode when detecting a restricted mode change.
    Type: Grant
    Filed: October 15, 2008
    Date of Patent: October 22, 2013
    Assignees: LG Electronics Inc., Intellectual Discovery Co., Ltd.
    Inventors: Hyen-O Oh, Hong Goo Kang, Chang Heon Lee, Sang Wook Shin, Yang Won Jung
  • Patent number: 8560328
    Abstract: A decoding device is capable of flexibly calculating high-band spectrum data with a high accuracy in accordance with an encoding band selected by an upper-node layer of the encoding side. In this device: a first layer decoder decodes first layer encoded information to generate a first layer decoded signal; a second layer decoder decodes second layer encoded information to generate a second layer decoded signal; a spectrum decoder performs a band extension process by using the second layer decoded signal and the first layer decoded signal up-sampled in an up-sampler so as to generate an all-band decoded signal; and a switch outputs the first layer decoded signal or the all-band decoded signal according to the control information generated in a controller.
    Type: Grant
    Filed: December 14, 2007
    Date of Patent: October 15, 2013
    Assignee: Panasonic Corporation
    Inventors: Tomofumi Yamanashi, Masahiro Oshikiri
  • Patent number: 8560303
    Abstract: Provided are an apparatus and method for visualizing multichannel audio signals. The apparatus includes a spatial audio decoding unit for receiving a downmix signal of a time domain, converting the downmix signal into a signal of a frequency domain to output a frequency domain downmix signal, and synthesizing a multichannel audio signal based on the spatial parameter and the downmix signal; and a multichannel visualizing unit for creating visualization information of the multichannel audio signal based on the frequency domain downmix signal and the spatial parameter.
    Type: Grant
    Filed: February 5, 2007
    Date of Patent: October 15, 2013
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung-Kwon Beack, Dae-Young Jang, Jeong-II Seo, Kyeong-Ok Kang, Jin-Woo Hong, Jin-Woong Kim
  • Patent number: 8554549
    Abstract: A voice encoding device accurately encodes a spectrum shape of a signal having a strong tonality such as a vowel. The device includes: a sub-band divider which divides a first layer error conversion coefficient to be encoded into M sub-bands so as to generate M sub-band conversion coefficients; a shape vector encoder which performs encoding on each of the M sub-band conversion coefficients so as to obtain M shape encoded information and calculates a target gain of each of the M sub-band conversion coefficients; a gain vector former which forms one gain vector by using M target gains; a gain vector encoder which encodes the gain vector so as to obtain gain encoded information; and a multiplexer which multiplexes the shape encoded information with the gain encoded information.
    Type: Grant
    Filed: February 29, 2008
    Date of Patent: October 8, 2013
    Assignee: Panasonic Corporation
    Inventors: Masahiro Oshikiri, Toshiyuki Morii, Tomofumi Yamanashi
  • Publication number: 20130253917
    Abstract: The present document relates to the design of anti-aliasing and/or anti-imaging filters for resamplers using rational resampling factors. In particular, the present document relates to a method for designing such filters having a reduced number of filter coefficients or an increased perceptual performance, as well as to the filters designed using such method. A method for designing a filter (102) configured to reduce imaging and/or aliasing of an output audio signal (113) at an output sampling rate (fsout) is described. The output audio signal (113) is a resampled version of an input audio signal (110) at an input sampling rate (fsin). The ratio of the output sampling rate (fsout) and the input sampling rate (fsin) is a rational number N/M. The filter (102) operates at an upsampled sampling rate which equals N times the input sampling rate (fsin).
    Type: Application
    Filed: December 9, 2011
    Publication date: September 26, 2013
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Wolfgang Schildbach
  • Patent number: 8543385
    Abstract: The present proposes new methods and an apparatus for enhancement of source coding systems utilizing high frequency reconstruction (HFR). It addresses the problem of insufficient noise contents in a reconstructed highband, by Adaptive Noise-floor Addition. It also introduces new methods for enhanced performance by means of limiting unwanted noise, interpolation and smoothing of envelope adjustment amplification factors. The present invention is applicable to both speech coding and natural audio coding systems.
    Type: Grant
    Filed: April 30, 2012
    Date of Patent: September 24, 2013
    Assignee: Dolby International AB
    Inventors: Lars G. Liljeryd, Kristofer Kjoerlink, Per Ekstrand, Frederik Henn
  • Patent number: 8543389
    Abstract: The invention relates to the coding/decoding of a signal into several sub-bands, in which at least a first and a second sub-bands which are adjacent are transform coded (601, 602). In particular, in order to apply a perceptual weighting, in the transformed domain, to at least the second sub-band, the method comprises:—determining at least one frequency masking threshold (606) to be applied on the second sub-band; and normalizing said masking threshold in order to provide a spectral continuity between the above-mentioned first and second sub-bands. An advantageous application of the invention involves a perceptual weighting of the high-frequency band in the TDAC transform coding of a hierarchical encoder according to standard G.729.1.
    Type: Grant
    Filed: January 30, 2008
    Date of Patent: September 24, 2013
    Assignee: France Telecom
    Inventors: Stéphane Ragot, Cyril Guillaume
  • Patent number: 8543390
    Abstract: A multi-channel signal enhancement system reinforces signal content and improves the signal-to-noise ratio of a multi-channel signal. The system detects, tracks, and reinforces non-stationary periodic signal components of a multi-channel signal. The periodic signal components of the signal may represent vowel sounds or other voiced sounds. The system may detect, track, or attenuate quasi-stationary signal components in the multi-channel signal.
    Type: Grant
    Filed: August 31, 2007
    Date of Patent: September 24, 2013
    Assignee: QNX Software Systems Limited
    Inventors: Rajeev Nongpiur, Phillip Hetherington
  • Patent number: 8543387
    Abstract: Disclosed herein is a pitch estimation apparatus and associated methods for estimating a fundamental frequency of an audio signal from a fundamental frequency probability density function by modeling the audio signal as a weighted mixture of a plurality of tone models corresponding respectively to harmonic structures of individual fundamental frequencies, so that the fundamental frequency probability density function of the audio signal is given as a distribution of respective weights of the plurality of the tone models.
    Type: Grant
    Filed: August 31, 2007
    Date of Patent: September 24, 2013
    Assignee: Yamaha Corporation
    Inventors: Masataka Goto, Takuya Fujishima, Keita Arimoto
  • Patent number: 8538766
    Abstract: An audio decoder for decoding a multi-audio-object signal having an audio signal of a first type and an audio signal of a second type encoded therein is described, the multi-audio-object signal having a downmix signal and side information, the side information having level information of the audio signals of the first and second types in a first predetermined time/frequency resolution, and a residual signal specifying residual level values in a second predetermined time/frequency resolution, the audio decoder having a processor for computing prediction coefficients based on the level information; and an up-mixer for up-mixing the downmix signal based on the prediction coefficients and the residual signal to obtain a first up-mix audio signal approximating the audio signal of the first type and/or a second up-mix audio signal approximating the audio signal of the second type.
    Type: Grant
    Filed: January 23, 2013
    Date of Patent: September 17, 2013
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung e.V.
    Inventors: Oliver Hellmuth, Johannes Hilpert, Leon Terentiv, Cornelia Falch, Andreas Hoelzer, Juergen Herre
  • Patent number: 8535236
    Abstract: An apparatus for analyzing a sound signal is based on an ear model for deriving, for a number of inner hair cells, an estimate for a time-varying concentration of transmitter substance inside a cleft between an inner hair cell and an associated auditory nerve from the sound signal so that an estimated inner hair cell cleft contents map over time is obtained. This map is analyzed by means of a pitch analyzer to obtain a pitch line over time, the pitch line indicating a pitch of the sound signal for respective time instants. A rhythm analyzer is operative for analyzing envelopes of estimates for selected inner hair cells, the inner hair cells being selected in accordance with the pitch line, so that segmentation instants are obtained, wherein a segmentation instant indicates an end of the preceding note or a start of a succeeding note. Thus, a human-related and reliable sound signal analysis can be obtained.
    Type: Grant
    Filed: March 19, 2004
    Date of Patent: September 17, 2013
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.
    Inventors: Thorsten Heinz, Andreas Brueckmann, Juergen Herre
  • Patent number: 8538746
    Abstract: A method of providing a quality measure for an output voice signal generated to reproduce an input voice signal, the method comprising: partitioning the input and output signals into frames; for each frame of the input signal, determining a disturbance relative to each of a plurality of frames of the output signal; determining a subset of the determined disturbances comprising one disturbance for each input frame such that a sum of the disturbances in the subset set is a minimum; and using the set of disturbances to provide the measure of quality.
    Type: Grant
    Filed: September 27, 2012
    Date of Patent: September 17, 2013
    Assignee: AudioCodes Ltd.
    Inventors: Ilan D. Shallom, Nitay Shiran, Felix Flomen
  • Patent number: 8532982
    Abstract: A method and apparatus to encode and decode an audio/speech signal is provided. An inputted audio signal or speech signal may be transformed into at least one of a high frequency resolution signal and a high temporal resolution signal. The signal may be encoded by determining an appropriate resolution, the encoded signal may be decoded, and thus the audio signal, the speech signal, and a mixed signal of the audio signal and the speech signal may be processed.
    Type: Grant
    Filed: July 14, 2009
    Date of Patent: September 10, 2013
    Assignee: SAMSUNG Electronics Co., Ltd.
    Inventors: Eun Mi Oh, Jung Hoe Kim, Ki Hyun Choo, Ho Sang Sung, Mi Young Kim
  • Patent number: 8527264
    Abstract: A method for determining mantissa bit allocation of frequency domain audio data to be encoded, including by performing adaptive low frequency compensation on each frequency band of a set of low frequency bands of the data.
    Type: Grant
    Filed: August 17, 2012
    Date of Patent: September 3, 2013
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Arijit Biswas, Vinay Melkote, Michael Schug, Grant Allen Davidson, Mark Stuart Vinton
  • Patent number: 8521518
    Abstract: An acoustic communication method and device are provided that filter an audio signal to attenuate a high frequency section of the audio signal. A residual signal is generated that corresponds to a difference between the audio signal and the filtered signal. A psychoacoustic mask is generated for the audio signal based on a predetermined psychoacoustic model. A psychoacoustic spectrum mask is generated by combining the residual signal with the psychoacoustic mask, an acoustic communication signal is generating by modulating digital data according to the acoustic signal spectrum mask, the acoustic communication signal is combined with the filtered signal, and radiating, by a speaker, the combined acoustic communication signal and the filtered signal in a form of sound waves.
    Type: Grant
    Filed: December 10, 2010
    Date of Patent: August 27, 2013
    Assignee: Samsung Electronics Co., Ltd
    Inventors: Hee-Won Jung, Jun-Ho Koh, Sang-Mook Lee, Gi-Sang Lee, Sergey Zhidkov
  • Patent number: 8515741
    Abstract: Presented herein are system(s), method(s), and apparatus for reducing on-chip memory requirements for audio decoding. In one embodiment, there is presented a method for decoding encoded audio signals. The method comprises fetching a first one or more tables from an off-chip memory; loading the first one or more tables to an on-chip memory; applying a first function to the encoded audio signals using the first one or more tables; fetching a second one or more tables from an off-chip memory after applying the first function; loading the second one or more tables to an on-chip memory; and applying a second function to the encoded audio signals, using the second one or more tables.
    Type: Grant
    Filed: June 18, 2004
    Date of Patent: August 20, 2013
    Assignee: Broadcom Corporation
    Inventor: Srinivasa Mpr
  • Patent number: 8515767
    Abstract: Codebook indices for a scalable speech and audio codec may be efficiently encoded based on anticipated probability distributions for such codebook indices. A residual signal from a Code Excited Linear Prediction (CELP)-based encoding layer may be obtained, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal. The residual signal may be transformed at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum. The transform spectrum is divided into a plurality of spectral bands, where each spectral band having a plurality of spectral lines. A plurality of different codebooks are then selected for encoding the spectral bands, where each codebook is associated with a codebook index. A plurality of codebook indices associated with the selected codebooks are then encoded together to obtain a descriptor code that more compactly represents the codebook indices.
    Type: Grant
    Filed: November 3, 2008
    Date of Patent: August 20, 2013
    Assignee: QUALCOMM Incorporated
    Inventor: Yuriy Reznik
  • Patent number: 8498860
    Abstract: A modulation device including: a modulation unit for modulating a carrier in an audible sound range by an encoded transmission signal to generate a modulated signal; a masker sound generation unit for generating a masker signal outputted as a masker sound for making the modulated signal harder to hear when transmitted with the modulated signal; and an acoustic signal generation unit for inserting the masker signal in the modulated signal to generate an acoustic signal.
    Type: Grant
    Filed: October 2, 2006
    Date of Patent: July 30, 2013
    Assignee: NTT DoCoMo, Inc.
    Inventor: Hosei Matsuoka
  • Patent number: 8498876
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.
    Type: Grant
    Filed: July 18, 2012
    Date of Patent: July 30, 2013
    Assignee: Dolby International AB
    Inventors: Kristofer Kjorling, Lars Villemoes
  • Patent number: 8494840
    Abstract: The invention relates to audio signal processing and speech enhancement. In accordance with one aspect, the invention combines a high-quality audio program that is a mix of speech and non-speech audio with a lower-quality copy of the speech components contained in the audio program for the purpose of generating a high-quality audio program with an increased ratio of speech to non-speech audio such as may benefit the elderly, hearing impaired or other listeners. Aspects of the invention are particularly useful for television and home theater sound, although they may be applicable to other audio and sound applications. The invention relates to methods, apparatus for performing such methods, and to software stored on a computer-readable medium for causing a computer to perform such methods.
    Type: Grant
    Filed: February 12, 2008
    Date of Patent: July 23, 2013
    Assignee: Dolby Laboratories Licensing Corporation
    Inventor: Hannes Muesch
  • Patent number: 8494846
    Abstract: A method for generating background noise and a noise processing apparatus are provided in order to improve user experience. The method includes: if an obtained signal frame is a noise frame, a high band noise encoding parameter is obtained from the noise frame; weighting and/or smoothing is performed on the high band noise encoding parameter to obtain a second high band noise encoding parameter; and a high band background noise signal is generated according to the second high band noise encoding parameter. A noise processing apparatus is also provided.
    Type: Grant
    Filed: September 20, 2010
    Date of Patent: July 23, 2013
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Jinliang Dai, Libin Zhang
  • Patent number: 8489390
    Abstract: A method is provided in one example and includes receiving data propagating in a network environment and separating the data into one or more fields. At least some of the fields are evaluated in order to identify nouns and noun phrases within the fields. The method also includes identifying selected words within the nouns and noun phrases based on a whitelist and a blacklist. The whitelist includes a plurality of designated words to be tagged and the blacklist includes a plurality of rejected words that are not to be tagged. A resultant composite is generated for the selected nouns and noun phrases that are tagged. The resultant composite is incorporated into the whitelist if the resultant composite is approved.
    Type: Grant
    Filed: September 30, 2009
    Date of Patent: July 16, 2013
    Assignee: Cisco Technology, Inc.
    Inventors: Thangavelu Arumugam, Satish K. Gannu, Virgil N. Mihailovici, Ashutosh A. Malegaonkar, Christian Posse, Sonali M. Sambhus, Nitasha Walia, Kui Zhang
  • Patent number: 8489406
    Abstract: A stereo encoding method and apparatus are provided, so as to reduce distortion caused by delay adjustment. The stereo encoding method includes: extracting a current interchannel delay of a stereo signal and a previous delay adjacent to the current interchannel delay; performing adjustment frame judgment according to characteristics of the current stereo signal when the current delay and the previous delay are different; and performing delay adjustment on the stereo signal by using the current interchannel delay if it is judged that a frame where the current delay occurs is an adjustment frame.
    Type: Grant
    Filed: August 12, 2011
    Date of Patent: July 16, 2013
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Wenhai Wu, Yue Lang, Lei Miao, Zexin Liu, Chen Hu, Qing Zhang
  • Patent number: 8484019
    Abstract: The present invention teaches a new audio coding system that can code both general audio and speech signals well at low bit rates. A proposed audio coding system comprises linear prediction unit for filtering an input signal based on an adaptive filter; a transformation unit for transforming a frame of the filtered input signal into a transform domain; and a quantization unit for quantizing the transform domain signal. The quantization unit decides, based on input signal characteristics, to encode the transform domain signal with a model-based quantizer or a non-model-based quantizer. Preferably, the decision is based on the frame size applied by the transformation unit.
    Type: Grant
    Filed: December 30, 2008
    Date of Patent: July 9, 2013
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Per Hedelin, Pontus Carlsson, Jonas Samuelsson, Michael Schug
  • Patent number: 8473302
    Abstract: Provided are parametric audio encoding and decoding apparatuses and methods thereof. In the parametric audio encoding method, an audio signal is segmented into a plurality of segments. At least one sine wave is extracted from each of the segments, and the extracted sine waves are connected. It is determined whether an extracted sine wave is a birth sine wave. If the extracted sine wave is a birth sine wave, a bit stream is generated by encoding the phase of the birth sine wave on the basis of the frequency of the birth sine wave, wherein the number of bits allocated to encode the phase of the birth sine wave is adjusted according to the frequency of the birth sine wave.
    Type: Grant
    Filed: July 10, 2008
    Date of Patent: June 25, 2013
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Geon-hyoung Lee, Jong-hoon Jeong, Nam-suk Lee
  • Publication number: 20130151241
    Abstract: A method for embedding digital information into an audio signal, is provided. The method includes dividing the digital information into low-priority data and high-priority data; dividing the audio signal into first and second signal parts; embedding at least one echo signal into the first signal part; embedding a communication signal modulated with low-priority data, which has a spectrum according to psychoacoustic analysis of the second signal part, into the second signal part; and combining the embedded first and second signal parts.
    Type: Application
    Filed: December 6, 2012
    Publication date: June 13, 2013
    Applicant: Samsung Electronics Co., Ltd.
    Inventor: Samsung Electronics Co., Ltd.
  • Patent number: 8463602
    Abstract: There is disclosed an encoding device capable of improving similarity between the high frequency band spectrum of the original signal and a new spectrum to be generated while realizing a low bit rate when encoding a wide-band signal spectrum. The encoding device has sub-band amplitude calculation units (122, 128) for calculating the amplitude of the respective sub-bands for the high frequency band spectrum obtained from the wide-band signal. A search unit (124) and a gain codebook (125) select some sub-bands from a plurality of sub-bands and only the gain of the selected sub-bands is subjected to encoding. An interpolation unit (126) expresses the gain of the sub-band not selected, by mutually interpolating the selected gains.
    Type: Grant
    Filed: May 17, 2005
    Date of Patent: June 11, 2013
    Assignee: Panasonic Corporation
    Inventor: Masahiro Oshikiri
  • Patent number: 8463605
    Abstract: A method of processing an audio signal is disclosed. The present invention includes receiving downmix information, object information and mix information, generating and transferring multi-channel information using at least one of the downmix information, the object information and the mix information, and selectively generating and transferring either first gain information or extra multi-channel information including second gain information in accordance with a decoding mode using at least one of the object information and the mix information.
    Type: Grant
    Filed: January 7, 2008
    Date of Patent: June 11, 2013
    Assignee: LG Electronics Inc.
    Inventors: Hyen-O Oh, Yang Won Jung
  • Patent number: 8457953
    Abstract: In a method of smoothing background noise in a telecommunication speech session; receiving and decoding S1O a signal representative of a speech session, the signal comprising both a speech component and a background noise component. Subsequently, determining LPC parameters S20 and an excitation signal S30 for the received signal. Thereafter, synthesizing and outputting (S40) an output signal based on the determined LPC parameters and excitation signal. In addition, modifying S35 the determined excitation signal by reducing power and spectral fluctuations of the excitation signal to provide a smoothed output signal.
    Type: Grant
    Filed: February 13, 2008
    Date of Patent: June 4, 2013
    Assignee: Telefonaktiebolaget LM Ericsson (Publ)
    Inventor: Stefan Bruhn
  • Patent number: 8457956
    Abstract: An audio signal is conveyed more efficiently by transmitting or recording a baseband of the signal with an estimated spectral envelope and a noise-blending parameter derived from a measure of the signal's noise-like quality. The signal is reconstructed by translating spectral components of the baseband signal to frequencies outside the baseband, adjusting phase of the regenerated components to maintain phase coherency, adjusting spectral shape according to the estimated spectral envelope, and adding noise according to the noise-blending parameter. Preferably, the transmitted or recorded signal also includes an estimated temporal envelope that is used to adjust the temporal shape of the reconstructed signal.
    Type: Grant
    Filed: August 31, 2012
    Date of Patent: June 4, 2013
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Michael Mead Truman, Mark Stuart Vinton
  • Patent number: 8452586
    Abstract: Components of a method and system that allow identification of music from the song or sound using only the sound of the audio being played. A system built using the method and device components disclosed processes inputs sent from a mobile phone over a telephone or data connection, though inputs might be sent through any variety of computers, communications equipment, or consumer audio devices over any of their associated audio or data networks.
    Type: Grant
    Filed: December 2, 2009
    Date of Patent: May 28, 2013
    Assignee: Soundhound, Inc.
    Inventors: Aaron Master, Timothy P. Stonehocker
  • Patent number: 8447617
    Abstract: There is provided a method or a device for extending a bandwidth of a first band speech signal to generate a second band speech signal wider than the first band speech signal and including the first band speech signal. The method comprises receiving a segment of the first band speech signal having a low cut off frequency and a high cut off frequency; determining the high cut off frequency of the segment; determining whether the segment is voiced or unvoiced; if the segment is voiced, applying a first bandwidth extension function to the segment to generate a first bandwidth extension in high frequencies; if the segment is unvoiced, applying a second bandwidth extension function to the segment to generate a second bandwidth extension in the high frequencies; using the first bandwidth extension and the second bandwidth extension to extend the first band speech signal beyond the high cut off frequency.
    Type: Grant
    Filed: March 15, 2010
    Date of Patent: May 21, 2013
    Assignee: Mindspeed Technologies, Inc.
    Inventors: Norbert Rossello, Fabien Klein
  • Patent number: 8447620
    Abstract: An audio encoder for encoding an audio signal has a first coding branch, the first coding branch comprising a first converter for converting a signal from a time domain into a frequency domain. Furthermore, the audio encoder has a second coding branch comprising a second time/frequency converter. Additionally, a signal analyzer for analyzing the audio signal is provided. The signal analyzer, on the hand, determines whether an audio portion is effective in the encoder output signal as a first encoded signal from the first encoding branch or as a second encoded signal from a second encoding branch. On the other hand, the signal analyzer determines a time/frequency resolution to be applied by the converters when generating the encoded signals. An output interface includes, in addition to the first encoded signal and the second encoded signal, a resolution information identifying the resolution used by the first time/frequency converter and used by the second time/frequency converter.
    Type: Grant
    Filed: April 6, 2011
    Date of Patent: May 21, 2013
    Assignees: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V., Voiceage Corporation
    Inventors: Max Neuendorf, Stefan Bayer, Jérémie Lecomte, Guillaume Fuchs, Julien Robilliard, Nikolaus Rettelbach, Frederik Nagel, Ralf Geiger, Markus Multrus, Bernhard Grill, Philippe Gournay, Redwan Salami
  • Patent number: 8446947
    Abstract: A method for encoding a digital signal into a scalable bitstream comprising quantizing the digital signal, and encoding the quantized signal to form a core-layer bitstream, performing an error mapping based on the digital signal and the core-layer bitstream to remove information that has been encoded into the core-layer bitstream, resulting in an error signal, bit-plane coding the error signal based on perceptual information of the digital signal, resulting in an enhancement-layer bitstream, wherein the perceptual information of the digital signal is determined using a perceptual model, and multiplexing the core-layer bitstream and the enhancement-layer bitstream, thereby generating the scalable bitstream.
    Type: Grant
    Filed: October 6, 2004
    Date of Patent: May 21, 2013
    Assignee: Agency for Science, Technology and Research
    Inventors: Rongshan Yu, Xiao Lin, Susanto Rahardja
  • Patent number: 8442836
    Abstract: Embodiments of the invention provides a method and device for assigning bitrates to a plurality of channels in a scalable audio encoding/truncation process. Different bitrates are assigned to different channels in the scalable audio encoding/truncation process.
    Type: Grant
    Filed: January 31, 2008
    Date of Patent: May 14, 2013
    Assignee: Agency for Science, Technology and Research
    Inventors: Te Li, Susanto Rahardja, Haibin Huang