Psychoacoustic Patents (Class 704/200.1)

Audio stream notification and processing

Patent number: 7945440

Abstract: Various embodiments provide techniques for allowing an application to opt out of system default audio stream behavior, as well as techniques for notifying applications on a computing device that a communication audio stream has been initiated. The techniques may differentiate between communication-related audio streams and audio streams that are not communication-related. In some embodiments, an application may register to receive notification that a communication stream has been initiated. The application may be configured to comply with system default audio stream handling policies, or it can perform custom behavior in response to the audio stream notification. In some embodiments, an application may register for filtered or unfiltered notification. In a filtered notification scenario, an application is notified that a communication stream has been initiated when an audio stream associated with the application has not already been modified in response to the initiation of a different communication stream.

Type: Grant

Filed: June 26, 2008

Date of Patent: May 17, 2011

Assignee: Microsoft Corporation

Inventors: Elliot H. Omiya, Noel R. Cross, Adeel A. Aslam, Lawrence W. Osterman
Computer communications using acoustic signals

Patent number: 7941480

Abstract: A method of communicating with an electronic device. The method includes providing an electronic device having an audible sound receiving and generating sub-system including a microphone, transmitting from a source at least one acoustic signal encoded with information, receiving said at least one acoustic signal by said microphone and determining a spatial position, distance or movement of the microphone relative to the source, responsive to the received at least one signal.

Type: Grant

Filed: November 18, 2008

Date of Patent: May 10, 2011

Assignee: BeepCard Inc.

Inventors: Alon Atsmon, Amit Antebi, Nathan Altman, Zvi Lev, Moshe Cohen
Methods and devices for low-frequency emphasis during audio compression based on ACELP/TCX

Patent number: 7933769

Abstract: In a method and device for low-frequency emphasis, where the spectrum of a sound signal is transformed in a frequency domain and comprises transform coefficients grouped in a number of blocks, a maximum energy for one block having a position index is calculated. Also, a factor having a position index smaller than the position index of the block with maximum energy is calculated for each block. For each block, an energy of the block is calculated, the factor is computed from the calculated maximum energy and the computed energy of the block, and a gain is determined from the factor and applied to the transform coefficients of the block.

Type: Grant

Filed: February 15, 2007

Date of Patent: April 26, 2011

Assignee: Voiceage Corporation

Inventor: Bruno Bessette
Computationally efficient audio coder

Patent number: 7930170

Abstract: The present invention provides a computationally efficient technique for compression encoding of an audio signal, and further provides a technique to enhance the sound quality of the encoded audio signal. This is accomplished by including more accurate attack detection and a computationally efficient quantization technique. The improved audio coder converts the input audio signal to a digital audio signal. The audio coder then divides the digital audio signal into larger frames having a long-block frame length and partitions each of the frames into multiple short-blocks. The audio coder then computes short-block audio signal characteristics for each of the partitioned short-blocks based on changes in the input audio signal.

Type: Grant

Filed: July 31, 2001

Date of Patent: April 19, 2011

Assignee: Sasken Communication Technologies Limited

Inventors: K. P. P. Kalyan Chakravarthy, Navaneetha K Ruthramoorthy, Pushkar P Patwardhan, Bishwarup Molndal
Apparatus and method for controlling audio-frame division

Patent number: 7930185

Abstract: To alleviate degradation of sound quality which may be caused by pre-echoes and bit starvation. An acoustic analyzer analyzes an audio signal to calculate perceptual entropy indicating how many bits are required for quantization. A coded bit count monitor monitors the number of coded bits produced from the audio signal and calculates the number of available bits for the current frame. Based on the combination of the perceptual entropy and the number of available bits, a frame division number determiner determines a division number N for dividing a frame of the audio signal into N blocks. An orthogonal transform processor divides a frame by the determined division number and subjects each divided block of the audio signal to an orthogonal transform process, thereby obtaining orthogonal transform coefficients. A quantizer quantizes the orthogonal transform coefficients on a divided block basis.

Type: Grant

Filed: March 3, 2008

Date of Patent: April 19, 2011

Assignee: Fujitsu Limited

Inventors: Yoshiteru Tsuchinaga, Masanao Suzuki, Miyuki Shirakawa, Takashi Makiuchi
Multi-channel audio encoding/decoding with parametric compression/decompression and weight factors

Patent number: 7930171

Abstract: The invention includes several techniques and tools, which can be used in combination or separately. For example, an audio encoder can encode information directly using coding processes that include a windowed overlapped transform, a selective multi-channel transform, scalar quantization and entropy encoding. The audio encoder can also encode information parametrically according to a parametric compression mode that accounts for audibility of distortion according to an auditory model. A corresponding audio decoder can decode first information directly and second information according to the parametric mode.

Type: Grant

Filed: July 23, 2007

Date of Patent: April 19, 2011

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Ming-Chieh Lee, Naveen Thumpudi
Scalable audio coding

Patent number: 7921007

Abstract: The invention relates to an audio encoder and decoder and methods for audio encoding and decoding. In a preferred encoder embodiment an audio signal is encoded by deterministic encoder means to form a first encoded signal part. A spectrum of the audio signal is determined and represented by an excitation pattern, i.e. spectral values corresponding to human auditory filters, as a second encoded signal part. A masking curve is also extracted based on the excitation pattern, thus improving encoding efficiency in terms of bit rate. In a preferred decoder the first encoded signal part is decoded by deterministic decoder means. A noise generator uses the decoded first signal part together with the second signal part, i.e. the excitation pattern for the original audio signal, to generate a noise signal. The noise signal is then added to the first decoded signal part to form an output audio signal. At the decoder side the masking curve is also extracted based on the second encoded signal part, i.e.

Type: Grant

Filed: July 25, 2005

Date of Patent: April 5, 2011

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Steven Leonardus Josephus Dimphina Elisabeth Van de Par, Valery Stephanovich Kot, Nicolle Hanneke Van Schijndel
Quality improvement techniques in an audio encoder

Patent number: 7917369

Abstract: An audio encoder implements multi-channel coding decision, band truncation, multi-channel rematrixing, and header reduction techniques to improve quality and coding efficiency. In the multi-channel coding decision technique, the audio encoder dynamically selects between joint and independent coding of a multi-channel audio signal via an open-loop decision based upon (a) energy separation between the coding channels, and (b) the disparity between excitation patterns of the separate input channels. In the band truncation technique, the audio encoder performs open-loop band truncation at a cut-off frequency based on a target perceptual quality measure. In multi-channel rematrixing technique, the audio encoder suppresses certain coefficients of a difference channel by scaling according to a scale factor, which is based on current average levels of perceptual quality, current rate control buffer fullness, coding mode, and the amount of channel separation in the source.

Type: Grant

Filed: April 18, 2007

Date of Patent: March 29, 2011

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Automatic measurement of audio presence and level by direct processing of an MPEG data stream

Patent number: 7912226

Abstract: Automatic measurements are made of audio presence and level in an audio signal by direct processing of an MPEG data stream representing the audio signal, without reconstructing the audio signal. Sub-band data is extracted from the data stream, and the extracted sub-band data is dequantized and denormalized. An audio level for the dequantized and denormalized sub-band data is measured without reconstructing the audio signal. Channel characteristics are used in measuring the audio level of the sub-band data, wherein the channel characteristics are used to weight the measured levels. The measured levels are compared against at least one threshold to determine whether an alarm should be triggered.

Type: Grant

Filed: September 12, 2003

Date of Patent: March 22, 2011

Assignee: The DIRECTV Group, Inc.

Inventors: Thomas H. James, Jeffrey D. Carpenter
Adapting masking thresholds for encoding a low frequency transient signal in audio data

Patent number: 7899677

Abstract: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds calculated for the 8 short blocks corresponding to the long block are calculated. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.

Type: Grant

Filed: November 24, 2009

Date of Patent: March 1, 2011

Assignee: Apple Inc.

Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
Method for dynamically adjusting the spectral content of an audio signal

Patent number: 7899192

Abstract: A method for dynamically adjusting the spectral content of an audio signal, which increases the harmonic content of said audio signal, said method comprising translating an encoded digital signal into data bands, creating a psychoacoustic model to identify sections of said data bands that are deficient in harmonic quality, analyzing the fundamental frequency and amplitude of said harmonically deficient data bands, creating additional higher order harmonics for said harmonically deficient data bands, adding said higher order harmonics back to said encoded digital signal to form a newly enhanced signal, inverse filtering said newly enhanced signal, and converting said inverse filtered signal to an analog waveform for consumption by the listener.

Type: Grant

Filed: February 20, 2007

Date of Patent: March 1, 2011

Inventors: J. Craig Oxford, Patrick Taylor, D. Michael Shields
Audio encoding system

Patent number: 7895034

Abstract: Provided are, among other things, systems, methods and techniques for encoding an audio signal, in which is obtained a sampled audio signal which has been divided into frames. The location of a transient within one of the frames is identified, and transform data samples are generated by performing multi-resolution filter bank analysis on the frame data, including filtering at different resolutions for different portions of the frame that includes the transient. Quantization data are generated by quantizing the transform data samples using variable numbers of bits based on a psychoacoustical model, and the quantization data are grouped into variable-length segments based on magnitudes of the quantization data. A code book is assigned to each of the variable-length segments, and the quantization data in each of the variable-length segments are encoded using the code book assigned to such variable-length segment.

Type: Grant

Filed: January 31, 2007

Date of Patent: February 22, 2011

Assignee: Digital Rise Technology Co., Ltd.

Inventor: Yuli You
Method for quantizing speech and audio through an efficient perceptually relevant search of multiple quantization patterns

Patent number: 7873514

Abstract: A method and apparatus is disclosed herein for quantizing data using a perceptually relevant search of multiple quantization patterns. In one embodiment, the method comprises performing a perceptually relevant search of multiple quantization patterns in which one of a plurality of prototype patterns and its associated permutation are selected to quantize the target vector, each prototype pattern in the plurality of prototype patterns being capable of directing quantization across the vector; converting the one prototype pattern, the associated permutation and quantization information resulting from both to a plurality of bits by an encoder; and transferring the bits as part of a bit stream.

Type: Grant

Filed: August 7, 2007

Date of Patent: January 18, 2011

Assignee: NTT DoCoMo, Inc.

Inventor: Sean A. Ramprashad
Adaptive rate control algorithm for low complexity AAC encoding

Patent number: 7873510

Abstract: A system and method for adaptive rate control in audio processing is provided. The process could include receiving uncompressed audio data from an input and generating MDCT spectrum for each frame of the uncompressed audio data using a filterbank. The process could also include estimating masking thresholds for current frame to be encoded based on the MDCT spectrum. The masking thresholds reflect a bit budget for the current frame. The process could also include performing quantization of the current frame based on the masking thresholds. After the quantization of the current frame, the bit budget for next frame is updated for estimating the masking thresholds of the next frame. The process could also include encoding the quantized audio data.

Type: Grant

Filed: April 26, 2007

Date of Patent: January 18, 2011

Assignee: STMicroelectronics Asia Pacific Pte. Ltd.

Inventors: Evelyn Kurniawati, Sapna George
Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic

Patent number: 7873511

Abstract: An audio encoder, an audio decoder or an audio processor includes a filter for generating a filtered audio signal, the filter having a variable warping characteristic, the characteristic being controllable in response to a time-varying control signal, the control signal indicating a small or no warping characteristic or a comparatively high warping characteristic. Furthermore, a controller is connected for providing the time-varying control signal, which depends on the audio signal. The filtered audio signal can be introduced to an encoding processor having different encoding algorithms, one of which is a coding algorithm adapted to a specific signal pattern. Alternatively, the filter is a post-filter receiving a decoded audio signal.

Type: Grant

Filed: June 30, 2006

Date of Patent: January 18, 2011

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Juergen Herre, Bernhard Grill, Markus Multrus, Stefan Bayer, Ulrich Kraemer, Jens Hirschfeld, Stefan Wabnik, Gerald Schuller
System and Method for Frequency Domain Audio Post-processing Based on Perceptual Masking

Publication number: 20110002266

Abstract: In an embodiment, a method of frequency domain post-processing is disclosed. The method includes applying adaptive modification gain factor to each frequency coefficient, and determining gain factors based on Local Masking Magnitude and Local Masked Magnitude.

Type: Application

Filed: May 4, 2010

Publication date: January 6, 2011

Applicant: GH Innovation, Inc.

Inventor: Yang Gao
Detection system for segment including specific sound signal, method and program for the same

Patent number: 7860714

Abstract: The present invention is a detection system of a segment including specific sound signal which detects a segment in a stored sound signal similar to a reference sound signal, including: a reference signal spectrogram division portion which divides a reference signal spectrogram into spectrograms of small-regions; a small-region reference signal spectrogram coding portion which encodes the small-region reference signal spectrogram to a reference signal small-region code; a small-region stored signal spectrogram coding portion which encodes a small-region stored signal spectrogram to a stored signal small-region code; a similar small-region spectrogram detection portion which detects a small-region spectrogram similar to the small-region reference signal spectrograms based on a degree of similarity of a code; and a degree of segment similarity calculation portion which uses a degree of small-region similarity and calculates a degree of similarity between the segment of the stored signal and the reference signal

Type: Grant

Filed: July 1, 2005

Date of Patent: December 28, 2010

Assignee: Nippon Telegraph and Telephone Corporation

Inventors: Hidehisa Nagano, Takayuki Kurozumi, Kunio Kashino
Multi-channel audio encoding and decoding with different window configurations

Patent number: 7860720

Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.

Type: Grant

Filed: May 15, 2008

Date of Patent: December 28, 2010

Assignee: Microsoft Corporation

Inventors: Naveen Thumpudi, Wei-Ge Chen
Method and apparatus for a thin audio codec

Patent number: 7848922

Abstract: An apparatus and method for encoding and decoding a voice signal. The apparatus includes an encoder configured to generate an output bitstream signal from an input voice signal. The output bitstream signal is associated with at least a first standard of a first plurality of CELP voice compression standards. Additionally, the apparatus includes a decoder configured to generate an output voice signal from an input bitstream signal. The input bitstream signal is associated with at least a first standard of a second plurality of CELP voice compression standards. The CELP encoder includes a plurality of codec-specific encoder modules. Additionally, the CELP encoder includes a plurality of generic encoder modules. The CELP decoder includes a plurality of codec-specific decoder modules. Additionally, the CELP decoder includes a plurality of generic decoder modules.

Type: Grant

Filed: August 2, 2007

Date of Patent: December 7, 2010

Inventors: Marwan A. Jabri, Nicola Chong-White, Jianwei Wang
Perceptual, scalable audio compression

Patent number: 7835904

Abstract: The perceptual scalable audio coding/decoding technique lies in the use of a psychoacoustic mask to guide residue coding in enhancement layer coders. At the encoder, a psychoacoustic mask is calculated for the enhancement layer coders or is simply extracted from the coded base layer bitstream. One can also decode the coded base layer bitstream into the audio waveform, and calculate the psychoacoustic mask from the decoded base layer waveform. Furthermore, a predictive technology can be used to refine the psychoacoustic mask derived from the base layer bitstream to form a more accurate psychoacoustic mask of the enhancement layer. In addition, one can calculate the enhancement layer psychoacoustic mask from the original audio, and send the difference between the enhancement layer psychoacoustic mask and the base layer psychoacoustic mask as side information to the decoder. This psychoacoustic mask may then be used for the perceptual coding and decoding of the residue.

Type: Grant

Filed: March 3, 2006

Date of Patent: November 16, 2010

Assignee: Microsoft Corp.

Inventors: Jin Li, James Johnston, Wai Yip Chan
Method and apparatus for low bit rate encoding and decoding

Patent number: 7835907

Abstract: An apparatus and method of low bit rate encoding and reproducing. The method includes transforming input audio signals in a time domain into spectral signals in a frequency domain, extracting important-spectrum components from the spectral signals in the frequency domain, and quantizing the important-spectrum components, extracting residual-spectrum components other than the important-spectrum components from the spectral signals in the frequency domain, and calculating and quantizing a noise level of the residual-spectrum components, and encoding the quantized important-spectrum components and the quantized noise level losslessly, and outputting encoded bitstreams.

Type: Grant

Filed: December 21, 2005

Date of Patent: November 16, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Junghoe Kim, Eunmi Oh, Boris Kudryashov, Konstantin Osipov
Encoding and decoding a set of signals

Patent number: 7835918

Abstract: An encoding device (1) and method convert a set of signals (l, r) into a dominant signal (m) containing most signal energy, a residual signal (s) containing a remainder of the signal energy, and signal parameters (IID, ICC) associated with the conversion. The dominant signal (m) and selected parts of the residual signal (s) are encoded. Selecting parts of the residual signal involves a residual signal (s?) passing perceptually relevant parts of the residual signal (s), attenuating perceptually less relevant parts of the residual signal and suppressing least relevant parts of the residual signal. An associated decoding device (2) and method decode the encoded dominant signal and the encoded residual signal so as to produce a decoded dominant signal (m?u) and a decoded residual signal (s?mod) respectively. A synthetic residual signal (s?Syn) is derived from the decoded dominant signal (m?u) and is attenuated so as to produce an attenuated synthetic residual signal (S?Syn,mod).

Type: Grant

Filed: October 31, 2005

Date of Patent: November 16, 2010

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Francois Philippus Myburg, Dirk Jeroen Breebaart, Erik Gosuinus Petrus Schuijers
Optimized fidelity and reduced signaling in multi-channel audio encoding

Patent number: 7822617

Abstract: The invention provides an efficient technique for encoding a multi-channel audio signal. The invention relies on the principle of encoding (S1) a signal representation of one or more of the multiple channels in a first encoding process, and encoding another signal representation of one or more channels in a second, filter-based encoding process. A basic idea according to the invention is to select (S2), for the second encoding process, a combination of i) frame division configuration of an overall encoding frame into a set of sub-frames, and ii) filter length for each sub-frame, according to a predetermined criterion. The second signal representation is then encoded (S3) in each sub-frame of the overall encoding frame according to the selected combination. The possibility to select frame division configuration and at the same time adjust the filter length for each sub-frame provides added degrees of freedom, and generally results in improved performance.

Type: Grant

Filed: February 22, 2006

Date of Patent: October 26, 2010

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Anisse Taleb, Stefan Andersson
Method, medium, and system for music retrieval using modulation spectrum

Patent number: 7818167

Abstract: An audio information retrieval method, medium, and system that can rapidly retrieve audio information, even in noisy environments, by extracting a modulation spectrum that is robust against noise, converting features of the extracted modulation spectrum into hash bits, and using a hash table. The audio information retrieval method may include extracting a modulation spectrum from audio data of a compressed domain, converting the extracted modulation spectrum into fingerprint bits, arranging the fingerprint bits in a form of a hash table, converting a received query into an address by a hash function corresponding to the query, and retrieving the audio information by referring to the hash table.

Type: Grant

Filed: August 29, 2006

Date of Patent: October 19, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Hyoung Gook Kim, Ki Wan Eom, Ji Yeun Kim, Yuan Yuan She, Xuan Zhu
Multi-channel encoder

Patent number: 7813513

Abstract: There is described a method of encoding input signals (CHI to CH3; 400 to 450) in a multi-channel encoder (5; 15) to generate corresponding output data comprising down-mix output signals (610, 620) together with complementary parametric data (600). The method includes a first step of down-mixing input signals (CHI to CH3; 400 to 450) to generate the corresponding down-mix output signals (610, 620), and a second step of processing the input signals (CHI to CH3; 400 to 450) during down-mixing to generate said parametric data (600) complementary to the down-mix output signals (610, 620). Processing of the input signals (CHI to CH3; 400 to 450) involves including information in the down-mix signals (610, 620) which is useable during subsequent decoding of the down-mix output signals (610, 620) and the parametric data (600) to determine at least some parameter data and thereby enabling representations of the input signals (CHI to CH3; 400 to 450) to be subsequently regenerated.

Type: Grant

Filed: March 25, 2005

Date of Patent: October 12, 2010

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Gerard H. Hotho, Dirk J. Breebaart, Evgeny A. Verbitskiy, Albertus C. Den Brinker
Method and system for processing messages within the framework of an integrated message system

Patent number: 7809117

Abstract: A method and system for processing messages within the framework of an integrated message system. Recipients of messages in an integrated messaging system are provided with an authentic impression of the received message. In a first step, a message received within the framework of an integrated messaging system is automatically translated. Language detection and dictation system is provided. The message contents of the incoming message as well as its segments and parameters are simultaneously utilized to generate additional information regarding the sender and the information, which is suitable to give the recipient an impression of the received message in the most authentic form possible.

Type: Grant

Filed: October 14, 2005

Date of Patent: October 5, 2010

Assignee: Deutsche Telekom AG

Inventors: Fred Runge, Christel Mueller, Heiko-Armin Schōnebeck, Frank Niedermueller, Jin Liu, Marian Trinkel
METHOD AND APPARATUS FOR PROCESSING AUDIO AND SPEECH SIGNALS

Publication number: 20100250242

Abstract: A method and device for processing signals representing speech or audio via a plurality of filters that approximate behaviors of the basilar membrane of human cochlea. Each of the plurality of filters is formed from a mother filter via the dilation and a shift in time and has the similar impulse response of the basilar membrane to the frequency band for which the filter represents. Any process can be conducted and any feature can be extracted in the domain of the filters' outputs for applications, such as noise reduction, speech synthesis, coding, and speech and speaker recognition.

Type: Application

Filed: March 26, 2009

Publication date: September 30, 2010

Inventor: Qi Li
System and method for measurement of perceivable quantization noise in perceptual audio coders

Patent number: 7797155

Abstract: A technique for computing perceptual noise in an audio signal that is computationally efficient. In one example embodiment, the technique includes computing perceptual noise in an input audio signal. The steps involve pre-computing NER (noise-to-excitation ratio) values associated with critical bands within a frame by zeroing out associated spectral coefficient values before the quantization loop, and also assuming bands with lower spectral energy than the band under consideration are zeroed out during quantization. When a critical band is zeroed out during quantization, the associated NER values which have been pre-computed are used in computing an overall perceptual distortion of the frame.

Type: Grant

Filed: November 9, 2006

Date of Patent: September 14, 2010

Assignee: Ittiam Systems (P) Ltd.

Inventors: Preethi Konda, Ameet Kalagi
Time-scale modification of data-compressed audio information

Patent number: 7792681

Abstract: A data-compressed audio waveform is temporally modified without requiring complete decompression of the audio signal. Packets of compressed audio data are first unpacked, to remove scaling that was applied in the formation of the packets. The unpacked data is then temporally modified, using one of a number of different approaches. This modification takes place while the audio information remains in a data-compressed format. New packets are then assembled from the modified data, to produce a data-compressed output stream that can be subsequently processed in a conventional manner to reproduce the desired sound. The assembly of the new packets employs a technique for inferring an auditory model from the original packets, to requantize the data in the output packets.

Type: Grant

Filed: October 12, 2006

Date of Patent: September 7, 2010

Assignee: Interval Licensing LLC

Inventors: Michele M. Covell, Malcolm Slaney, Arthur Rothstein
Slot position coding for non-guided spatial audio coding

Patent number: 7792668

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with either a fixed number of bits or a variable number of bits based on the data structure type as indicated by the data structure type indicator.

Type: Grant

Filed: August 30, 2006

Date of Patent: September 7, 2010

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O. Oh, Yang Won Jung
Combined audio coding minimizing perceptual distortion

Patent number: 7788090

Abstract: An audio encoder in which two or more preferably different encoders cooperate to generate a joint encoded audio signal. Encoding parameters of the two or more encoders are optimized in response to a measure of distortion of the joint encoded audio signal in accordance with a predetermined criterion. The distortion. measure is preferably a perceptual distortion measure. In one encoder embodiment comprising a sinusoidal and a waveform encoder, a constant total bit rate for each audio frame is distributed between the two encoders so as to minimize perceptual distortion for both the first and the second encoder. Other embodiments consider a set of encoding parameters that is larger than only those that minimize the perceptual distortion of the first encoder. In some embodiments, perceptual distortion may be minimized by optimizing encoding via optimizing entire encoding templates, i.e. a complex set of encoding parameters, for the separate encoders.

Type: Grant

Filed: September 2, 2005

Date of Patent: August 31, 2010

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Steven Leonardus Josephus Dimphina Elisabeth Van De Par, Nicolle Hanneke Van Schijndel, Valery Stephanovich Kot, Richard Heusdens
Support of a multichannel audio extension

Patent number: 7787632

Abstract: The invention relates to methods and units supporting a multichannel audio extension. In order to allow an efficient extension requiring a low computational complexity, it is proposed that at an encoding end, at least state information is provided as side information for a provided mono audio signal (M) generated out of a multichannel audio signal. The state information indicates for each of a plurality of frequency bands how a predetermined or equally provided gain value is to be applied in the frequency domain to the mono audio signal (M) for obtaining first and a second channel signals (L,R) of a reconstructed multichannel audio signal.

Type: Grant

Filed: March 21, 2003

Date of Patent: August 31, 2010

Assignee: Nokia Corporation

Inventor: Juha Ojanpera
QUANTIZATION AFTER LINEAR TRANSFORMATION COMBINING THE AUDIO SIGNALS OF A SOUND SCENE, AND RELATED CODER

Publication number: 20100198585

Abstract: The invention relates to a method for quantifying components, wherein certain components are each determined based on a plurality of audio signals and can be calculated by the application of a linear conversion on the audio signals, said method comprising: determining a quantification function to be applied to the components by testing a condition relative to an audio signal and depending on a comparison made between a psycho-acoustic masking threshold relative to the audio signal and a value determined based on the reverse linear conversion and quantification errors of the components by the function.

Type: Application

Filed: July 1, 2008

Publication date: August 5, 2010

Applicant: France Telecom

Inventors: Adil Mouhssine, Abdellatif Benjelloun Touimi, Pierre Duhamel
SEGMENTING AUDIO SIGNALS INTO AUDITORY EVENTS

Publication number: 20100185439

Abstract: In one aspect, the invention divides an audio signal into auditory events, each of which tends to be perceived as separate and distinct, by calculating the spectral content of successive time blocks of the audio signal, calculating the difference in spectral content between successive time blocks of the audio signal, and identifying an auditory event boundary as the boundary between successive time blocks when the difference in the spectral content between such successive time blocks exceeds a threshold. In another aspect, the invention generates a reduced-information representation of an audio signal by dividing an audio signal into auditory events, each of which tends to be perceived as separate and distinct, and formatting and storing information relating to the auditory events. Optionally, the invention may also assign a characteristic to one or more of the auditory events. Auditory events may be determined according to the first aspect of the invention or by another method.

Type: Application

Filed: March 16, 2010

Publication date: July 22, 2010

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Brett G. Crockett
Slot position coding of TTT syntax of spatial audio coding application

Patent number: 7761303

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with a fixed number of bits or a variable number of bits based on the data structure type.

Type: Grant

Filed: August 30, 2006

Date of Patent: July 20, 2010

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O. Oh, Yang Won Jung
Method and apparatus for disturbing the radiated voice signal by attenuation and masking

Patent number: 7761292

Abstract: A method and apparatus to disturb a voice signal by attenuating and masking the voice signal are provided. The method includes; receiving a voice signal from a wired or wireless network; obtaining a masked voice signal by dividing the received voice signal into a plurality of segments of the same size; outputting the received voice signal and receiving a feedback signal of the output voice signal; obtaining an attenuated voice signal by performing a first sound attenuation operation on the feedback signal; and combining the attenuated voice signal and the masked voice signal and outputting the result of the combination as disturbing sound.

Type: Grant

Filed: September 28, 2006

Date of Patent: July 20, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Attiia Ferencz, Jun-il Sohn, Kwon-ju Yi, Yong-beom Lee, Sang-ryong Kim
Method and apparatus for encoding/decoding digital signal

Patent number: 7752041

Abstract: A method and an apparatus for encoding/decoding a digital signal are provided. First, a digital input signal is transformed into samples to remove redundant information among signals. Then, a lookup table corresponding to a characteristic of the input signal is selected among a plurality of lookup tables that indicate different numbers of bits allocated for each quantization unit depending on different characteristics of input signals, and the number of bits allocated for each quantization unit is acquired from the selected lookup table. Next, a distribution of samples within each quantization unit is divided into a predetermined number of sections, and the samples are linearly quantized using the allocated number of bits on a section-by-section basis. Thereafter, a bitstream comprised of frames is produced from the quantized samples and predetermined side information so that information about a frame length is stored in the end of frame.

Type: Grant

Filed: May 26, 2005

Date of Patent: July 6, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Dohyung Kim, Junghoe Kim, Shihwa Lee, Sangwook Kim, Yangseock Seo
AUDIO ENCODING APPARATUS

Publication number: 20100169080

Abstract: An audio encoding apparatus that encodes audio signals of a plurality of channels, includes an adaptive bit allocation control unit that adaptively controls a number of encoding bits assigned to the audio signal of each channel in accordance with perceptual entropy of the audio signal of each of the channels, a fixed bit allocation control unit that fixedly controls the number of encoding bits assigned to the audio signal of each of the channels in predetermined allocations, and a channel encoding unit that encodes the audio signal of each of the channels based on the number of adaptive allocation bits assigned by the adaptive bit allocation control unit and the number of fixed allocation bits assigned by the fixed bit allocation control unit.

Type: Application

Filed: December 10, 2009

Publication date: July 1, 2010

Applicant: FUJITSU LIMITED

Inventors: Yoshiteru Tsuchinaga, Miyuki Shirakawa, Masanao Suzuki
PSYCHOACOUSTIC TIME ALIGNMENT

Publication number: 20100169079

Abstract: A method of providing a quality measure for an output voice signal generated to reproduce an input voice signal, the method comprising: partitioning the input and output signals into frames; for each frame of the input signal, determining a disturbance relative to each of a plurality of frames of the output signal; determining a subset of the determined disturbances comprising one disturbance for each input frame such that a sum of the disturbances in the subset set is a minimum; and using the set of disturbances to provide the measure of quality.

Type: Application

Filed: December 30, 2008

Publication date: July 1, 2010

Applicant: AUDIOCODES LTD.

Inventors: Ilan Shallom, Nitay Shiran
Broadcast router having a serial digital audio data stream decoder

Patent number: 7747447

Abstract: A bi-phase decoder suitable for use in a broadcast router and an associated method for extracting subframes of digital audio data from a stream of digital audio data. Logical circuitry within the bi-phase decoder extracts subframes of the digital audio data by constructing a transition window from an estimated bit time, sampling the stream of digital audio data using a fast clock and applying the sampled stream of digital audio data to the transition window to identify transitions indicative of preambles of the subframes of digital audio data.

Type: Grant

Filed: June 20, 2003

Date of Patent: June 29, 2010

Assignee: Thomson Licensing

Inventors: Carl Christensen, Lynn Howard Arbuckle
DEVICE AND METHOD FOR GENERATING A COMPLEX SPECTRAL REPRESENTATION OF A DISCRETE-TIME SIGNAL

Publication number: 20100161319

Abstract: A filter bank device for generating a complex spectral representation of a discrete-time signal includes a generator for generating a block-wise real spectral representation, which, for example, implements an MDCT, to obtain temporally successive blocks of real spectral coefficients. The output values of this spectral conversion device are fed to a post-processor for post-processing the block-wise real spectral representation to obtain an approximated complex spectral representation having successive blocks, each block having a set of complex approximated spectral coefficients, wherein a complex approximated spectral coefficient can be represented by a first partial spectral coefficient and by a second partial spectral coefficient, wherein at least one of the first and second partial spectral coefficients is determined by combining at least two real spectral coefficients.

Type: Application

Filed: March 4, 2010

Publication date: June 24, 2010

Inventors: Bernd EDLER, Stefan GEYERSBERGER
Spectral enhancing method and device

Patent number: 7742927

Abstract: The present invention relates to a spectral enhancement method and to an apparatus carrying out this method. The method of the invention enhanced the spectral content of a signal having an incomplete spectrum including a first spectral frequency band, the method comprising the following stages: at least one spectral content transposition of said first frequency band into a second spectral frequency band not included in said spectrum for the purpose of generating a transposed spectrum signal having a spectrum limited to said second spectral frequency band, shaping the spectrum of the transposed spectrum signal for the purpose of producing an enhanced signal, combining an incomplete spectrum signal and the enhanced signal for the purpose of producing an enhanced spectrum signal, characterized in that said spectral content is subject to a stage of whitening.

Type: Grant

Filed: April 12, 2001

Date of Patent: June 22, 2010

Assignees: France Telecom, Telediffusion de France

Inventors: Pierrick Philippe, Patrice Collen
Method and apparatus to encode and decode multi-channel audio signals

Patent number: 7742912

Abstract: An encoder (100) for encoding a multi-channel audio signal comprises a prediction processor (101) for generating two residual signals for two signal components of the multi-channel signal by linear prediction which is associated with psycho-acoustic characteristics and which specifically uses psycho-acoustic prediction filters; a rotation processor (105) for rotating the combined signal of the two residual signals to generate a main signal and a side signal, in which the energy of the main signal is maximized and the energy of the side signal is minimized; an encoding processor (109) for encoding the main and preferably the side signal; and an output processor (111) for generating an output signal data, prediction parameters and rotation parameters.

Type: Grant

Filed: June 14, 2005

Date of Patent: June 22, 2010

Assignee: Koninklijke Philips Electronics N.V.

Inventor: Albertus Cornelis Den Brinker
System and method for processing audio frames

Patent number: 7739105

Abstract: In accordance with a specific implementation of the disclosure, a stream of audio frames is received and compressed using psycho-acoustical processing. The signal-to-mask ratio table generated by the psycho-acoustical algorithm is updated using only a portion of the received audio frames.

Type: Grant

Filed: June 13, 2003

Date of Patent: June 15, 2010

Assignee: VIXS Systems, Inc.

Inventor: Hong Zeng
Method and Related Device for Simplifying Psychoacoustic Analysis with Spectral Flatness Characteristic Values

Publication number: 20100145682

Abstract: The present invention applies spectral flatness characteristic values to simplify psychoacoustic analysis of a sound signal. If the sound signal comprises a plurality of frames, the present invention calculates the energy of the sound signal in a frequency domain, calculates a plurality of spectral flatness, and decides to use a short-block or a long-block Modified Discrete Cosine Transform accordingly. If the sound signal comprises left and right channel signals, the present invention performs psychoacoustic analysis on the sound signal to count energy of the left and right channel signals in a frequency domain, counts spectral flatness of the left and right channel signals, and decides to use middle/side transform or left and right channel encoding to transform the left and right channel signals accordingly.

Type: Application

Filed: March 27, 2009

Publication date: June 10, 2010

Inventor: Yi-Lun Ho
Method and system to identify, quantify, and display acoustic transformational structures in speech

Publication number: 20100145681

Abstract: The invention for processing speech that is described herein measures the periodic changes of multiple acoustic features in a digitized utterance without regard for lexical, sublexical, or prosodic features. These measurements of periodic, simultaneous changes of multiple acoustic features are assembled into transformational structures. Various types of transformational structures are identified, quantified, and displayed by the invention. The invention is useful for the study of such speaker characteristics as cognitive, emotional, linguistic, and behavioral functioning, and may be employed in the study of other phenomena of interest to the user.

Type: Application

Filed: December 8, 2008

Publication date: June 10, 2010

Inventor: Daniel M. Begel
Reduced complexity recursive least square lattice structure adaptive filter by means of limited recursion of the backward and forward error prediction squares

Patent number: 7734466

Abstract: A method for reducing a computational complexity of an m-stage adaptive filter is provided by updating recursively forward and backward error prediction square terms for a first portion of a length of the adaptive filter, and keeping the updated forward and backward error prediction square terms constant for a second portion of the length of the adaptive filter.

Type: Grant

Filed: April 7, 2006

Date of Patent: June 8, 2010

Assignee: Motorola, Inc.

Inventors: David L. Barron, Kyle K. Iwai, James B. Piket
Audio coding

Patent number: 7729903

Abstract: The central idea of the present invention is that the prior procedure, namely interpolation relative to the filter coefficients and the amplification value, for obtaining interpolated values for the intermediate audio values starting from the nodes has to be dismissed. Coding containing less audible artifacts can be obtained by not interpolating the amplification value, but rather taking the power limit derived from the masking threshold, for each node, i.e. for each parameterization to be transferred, and then performing the interpolation between these power limits of neighboring nodes, such as, for example, a linear interpolation. On both the coder and the decoder side, an amplification value can then be calculated from the intermediate power limit determined such that the quantizing noise caused by quantization, which has a constant frequency before post-filtering on the decoder side, is below the power limit or corresponds thereto after post-filtering.

Type: Grant

Filed: July 27, 2006

Date of Patent: June 1, 2010

Inventors: Gerald Schuller, Stefan Wabnik, Marc Gayer
ENHANCING COPYRIGHT REVENUE GENERATION

Publication number: 20100131417

Abstract: A method and system for enhancing copyright revenue generation is disclosed. The method includes receiving a copyrighted media recording, creating an independent work of authorship by generating a simulation of the copyrighted media recording, such that the independent work of authorship is entitled to a copyright and utilizing the simulation in place of the copyrighted media recording, such that use of the independent work of authorship may be entitled to copyright royalties thereon as opposed to requiring copyright royalties for use of the copyrighted media recording.

Type: Application

Filed: November 25, 2008

Publication date: May 27, 2010

Inventor: Hank RISAN
Device and process for encoding audio data

Patent number: 7725323

Abstract: An MPEG-1 layer 3 audio encoder, including a scalefactor generator for determining first scalefactors for encoding a block of audio data if a temporal masking transient is not detected in said block of audio data; and for selecting the maximum of said scalefactors for encoding said block of audio data if a temporal masking transient is detected in said block of audio data to enable greater compression of said audio data. Increases in quantization error, due to use of the maximum scalefactor are pre-masked or post-masked by the temporal masking transient. In cases where the last portion of a block includes a temporal masking transient that masks the preceding portions of the block, the maximum scalefactor is only used to encode the block if the resulting increase in quantization error is less than 30% of the quantization error for the block.

Type: Grant

Filed: September 14, 2004

Date of Patent: May 25, 2010

Assignee: STMicroelectronics Asia Pacific Pte. Ltd.

Inventors: Kabi Prakash Padhi, Sudhir Kumar Kasargod, Sapna George

prev … 6 7 8 9 10 11 12 13 14 … next