Psychoacoustic Patents (Class 704/200.1)

Speech Masking and Cancelling and Voice Obscuration

Publication number: 20120053931

Abstract: A non-acoustic sensor is used to measure a user's speech and then broadcasts an obscuring acoustic signal diminishing the user's vocal acoustic output intensity and/or distorting the voice sounds making them unintelligible to persons nearby.

Type: Application

Filed: February 1, 2011

Publication date: March 1, 2012

Applicant: Lawrence Livermore National Security, LLC

Inventor: John F. Holzrichter
Broadband frequency translation for high frequency regeneration

Patent number: 8126709

Abstract: An audio signal is conveyed more efficiently by transmitting or recording a baseband of the signal with an estimated spectral envelope and a noise-blending parameter derived from a measure of the signal's noise-like quality. The signal is reconstructed by translating spectral components of the baseband signal to frequencies outside the baseband, adjusting phase of the regenerated components to maintain phase coherency, adjusting spectral shape according to the estimated spectral envelope, and adding noise according to the noise-blending parameter. Preferably, the transmitted or recorded signal also includes an estimated temporal envelope that is used to adjust the temporal shape of the reconstructed signal.

Type: Grant

Filed: February 24, 2009

Date of Patent: February 28, 2012

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Michael Mead Truman, Mark Stuart Vinton
Method and apparatus for introducing information into a data stream and method and apparatus for encoding an audio signal

Patent number: 8117027

Abstract: Techniques for introducing information into a data stream first obtains the spectral values of the short-term spectrum of the audio signal. Separately, information to be introduced are combined with a spread sequence obtaining a spread information signal, whereupon a spectral representation of the spread information is generated, then weighted with an established psychoacoustic maskable noise energy to generate a weighted information signal, wherein energy of the introduced information is substantially equal to or below the psychoacoustic masking threshold. The weighted information signal and the spectral values of the short-term spectrum of the audio signal are then summed and afterwards processed again to obtain a processed data stream including audio information and information to be introduced.

Type: Grant

Filed: September 25, 2008

Date of Patent: February 14, 2012

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Christian Neubauer, Juergen Herre, Karlheinz Brandenburg, Eric Allamanche
SYSTEM AND METHOD FOR AUTOMATIC DETECTION OF ABNORMAL STRESS PATTERNS IN UNIT SELECTION SYNTHESIS

Publication number: 20120035917

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for detecting and correcting abnormal stress patterns in unit-selection speech synthesis. A system practicing the method detects incorrect stress patterns in selected acoustic units representing speech to be synthesized, and corrects the incorrect stress patterns in the selected acoustic units to yield corrected stress patterns. The system can further synthesize speech based on the corrected stress patterns. In one aspect, the system also classifies the incorrect stress patterns using a machine learning algorithm such as a classification and regression tree, adaptive boosting, support vector machine, and maximum entropy. In this way a text-to-speech unit selection speech synthesizer can produce more natural sounding speech with suitable stress patterns regardless of the stress of units in a unit selection database.

Type: Application

Filed: August 6, 2010

Publication date: February 9, 2012

Applicant: AT&T Intellectual Property I, L.P.

Inventors: Yeon-Jun KIM, Mark Charles BEUTNAGEL, Alistair D. CONKIE, Ann K. SYRDAL
Slot position coding of syntax of spatial audio application

Patent number: 8103513

Abstract: Spatial information associated with an audio signal is encoded into a bitstream, which can be transmitted to a decoder or recorded to a storage media. The bitstream can include different syntax related to time, frequency and spatial domains. In some embodiments, the bitstream includes one or more data structures (e.g., frames) that contain ordered sets of slots for which parameters can be applied. The data structures can be fixed or variable. The data structure can include position information that can be used by a decoder to identify the correct slot for which a given parameter set is applied. The slot position information can be encoded with either a fixed number of bits or a variable number of bits based on the data structure type.

Type: Grant

Filed: August 20, 2010

Date of Patent: January 24, 2012

Assignee: LG Electronics Inc.

Inventors: Hee Suk Pang, Dong Soo Kim, Jae Hyun Lim, Hyen O Oh, Yang-Won Jung
Method and system for aligning windows to extract peak feature from a voice signal

Patent number: 8103512

Abstract: Disclosed is a method capable of adaptively aligning windows to extract features according to the types and characteristics of voice signals. To this end, window lengths based on the window update points in a corresponding order are determined by employing the concept of a higher order peak, and windows are aligned according to window lengths. When the windows are aligned according to such a manner, the start and end points of each window is known, so that it becomes possible to easily extract and analyze peak feature information.

Type: Grant

Filed: January 23, 2007

Date of Patent: January 24, 2012

Assignee: Samsung Electronics Co., Ltd

Inventor: Hyun-Soo Kim
SOUND MASKING SYSTEM AND MASKING SOUND GENERATION METHOD

Publication number: 20120016665

Abstract: In a masking sound generation apparatus, a CPU analyzes a speech utterance speed of a received sound signal. Then, the CPU copies the received sound signal into a plurality of sound signals and performs the following processing on each of the sound signals. Namely, the CPU divides each of the sound signals into frames on the basis of a frame length determined on the basis of the speech utterance speed. Reverse process is performed on each of the frames to replace a waveform of the frame with a reverse waveform, and a windowing process is performed to achieve a smooth connection between the frames. Then, the CPU randomly rearranges the order of the frames and mixes the plurality of sound signals to generate a masking sound signal.

Type: Application

Filed: September 22, 2011

Publication date: January 19, 2012

Applicant: YAMAHA CORPORATION

Inventors: Atsuko Ito, Yasushi Shimizu, Akira Miki, Masato Hata
Signal decoding apparatus

Patent number: 8099291

Abstract: A signal decoding apparatus that can suppress any large unusual sounds to provide decoded signals of improved audibility even when the number of hierarchical layers to be used in the decoding process varies due to a packet loss or the like in communication utilizing a scalable encoding/decoding technique. In the signal decoding apparatus, a gain adjusting part (2308) adjusts, based on a control of a decoding control part (2301), the gain of a basic layer decoded signal outputted from a basic layer decoding part (2302). A gain adjusting part (2309) adjusts, based on a control of the decoding control part (2301), the gain of a first expansion layer decoded signal outputted from a first expansion layer decoding part (2303). A gain adjusting part (2310) adjusts, based on a control of the decoding control part (2301), the gain of a second expansion layer decoded signal outputted from a second expansion layer decoding part (2304).

Type: Grant

Filed: July 25, 2005

Date of Patent: January 17, 2012

Assignee: Panasonic Corporation

Inventors: Tomofumi Yamanashi, Kaoru Sato, Toshiyuki Morii
Temporally accurate watermarking system and method of operation

Patent number: 8099285

Abstract: A system for identifying audio data is provided. The system includes a transform system receiving left channel audio data and right channel audio data and generating a plurality of frequency bins of left channel magnitude data, left channel phase data, right channel magnitude data and right channel phase data. A watermarking system receives watermarking data and modifies predetermined frequency bins of the left channel phase data and the right channel phase data to encode the watermarking data. A magnitude system receives the left channel magnitude data and the right channel magnitude data and increases the left channel magnitude data and the right channel magnitude data for one or more of the predetermined frequency bins to a threshold level if the left channel magnitude data and the right channel magnitude data for the corresponding frequency bin is less than the threshold level.

Type: Grant

Filed: December 13, 2007

Date of Patent: January 17, 2012

Assignee: DTS, Inc.

Inventors: Brandon Smith, Jeffrey Thompson, Aaron Warner
Sound encoder and sound encoding method for generating a second layer decoded signal based on a degree of variation in a first layer decoded signal

Patent number: 8099275

Abstract: A sound encoder having an improved quantization performance while suppressing an increase of the bit rate to a lowest level. In a second layer encoder, a standard deviation calculator calculates a standard deviation ?c of a first layer decoding spectrum after decoding a scale factor ratio multiplication and outputs the standard deviation ?c to a selector. The selector selects a linear transform function as a function for a nonlinear transform of a residual spectrum according to the standard deviation ?c A nonlinear transform function selects one of prepared nonlinear transform functions #1 to #N according to a result of the selection by the selector, and outputs the selected one to an inverse transformer. The inverse transformer subjects an inverse transform (expansion) to a residual spectrum candidate that is stored in a residual spectrum code book using the nonlinear transform function outputted from the nonlinear transform function and outputs the result to an adder.

Type: Grant

Filed: October 25, 2005

Date of Patent: January 17, 2012

Assignee: Panasonic Corporation

Inventor: Masahiro Oshikiri
Multi-channel audio encoding and decoding

Patent number: 8099292

Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.

Type: Grant

Filed: November 11, 2010

Date of Patent: January 17, 2012

Assignee: Microsoft Corporation

Inventors: Naveen Thumpudi, Wei-Ge Chen
Method and apparatus for creating a unique audio signature

Patent number: 8086445

Abstract: A method and apparatus for creating a signature of a sampled work in real-time is disclosed herein. Unique signatures of an unknown audio work are created by segmenting a file into segments having predetermined segment and hop sizes. The signature then may be compared against reference signatures. One aspect may be characterized in that the hop size of the sampled work signature is less than the hop size of reference signatures. A method for identifying an unknown audio work is also disclosed.

Type: Grant

Filed: June 10, 2009

Date of Patent: December 27, 2011

Assignee: Audible Magic Corporation

Inventors: Erling H. Wold, Thomas L. Blum, Douglas F. Keislar, James A. Wheaton
Packet loss concealment for sub-band predictive coding based on extrapolation of sub-band audio waveforms

Patent number: 8078458

Abstract: A technique is described for concealing the effect of a lost frame in a series of frames representing an encoded audio signal in a sub-band predictive coding system. In accordance with the technique, a first synthesized sub-band audio signal is synthesized, wherein synthesizing the first synthesized sub-band audio signal comprises performing waveform extrapolation based on a stored first sub-band decoded audio signal. A second synthesized sub-band audio signal is also synthesized, wherein synthesizing the second synthesized sub-band audio signal comprises performing waveform extrapolation based on the stored second sub-band decoded audio signal. The first synthesized sub-band audio signal and the second synthesized sub-band audio signal are combined to generate a synthesized full-band output audio signal corresponding to a lost frame.

Type: Grant

Filed: May 29, 2009

Date of Patent: December 13, 2011

Assignee: Broadcom Corporation

Inventors: Robert W. Zopf, Jes Thyssen, Juin-Hwey Chen
Apparatus and method for automatic classification/identification of similar compressed audio files

Patent number: 8073684

Abstract: An audio file is divided into frames in the time domain and each frame is compressed, according to a psycho-acoustic algorithm, into file in the frequency domain. Each frame is divided into sub-bands and each sub-band is further divided into split sub-bands. The spectral energy over each split sub-band is averaged for all frames. The resulting quantity for each split sub-band provides a parameter. The set of parameters can be compared to a corresponding set of parameters generated from a different audio file to determine whether the audio files are similar. In order to provide for the higher sensitivity of the auditory response, the comparison of individual split sub-bands of the lower order sub-bands can be performed. Selected constants can be used in the comparison process to improve further the sensitivity of the comparison. In the side-information generated by the psycho-acoustic compression, data related to the rhythm, i.e., related percussive effects, is present.

Type: Grant

Filed: April 25, 2003

Date of Patent: December 6, 2011

Assignee: Texas Instruments Incorporated

Inventor: Prabindh Sundareson
QUALITY EVALUATION METHOD AND QUALITY EVALUATION APPARATUS

Publication number: 20110282654

Abstract: Quality of industrial products is evaluated by evaluating non-stationary operation sound, which is a kind of operation sound, from an aspect of tone, using closely simulated evaluation levels of evaluation of non-stationary sound by used of a human sense of hearing. An operation sound of a conforming product sample is converted into sound waveform data by a sound collecting unit, and the sound waveform data is input into a computer via an A-D converter, and then converted into psychoacoustic parameters. Data of pseudo conforming products is additionally obtained from the psychoacoustic parameters of a plurality of conforming product samples by making use of deviation in the data of the conforming product samples. Threshold data is obtained using thresholds and masking data for evaluation calculated from psychoacoustic parameters of data of the conforming product samples and the pseudo conforming products by a statistical technique.

Type: Application

Filed: May 16, 2011

Publication date: November 17, 2011

Inventors: Yohei TAKECHI, Yutaka Omori
Adapting masking thresholds for encoding a low frequency transient signal in audio data

Patent number: 8060375

Abstract: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds calculated for the 8 short blocks corresponding to the long block are calculated. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.

Type: Grant

Filed: January 12, 2011

Date of Patent: November 15, 2011

Assignee: Apple Inc.

Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
Method, medium, and apparatus encoding/decoding audio data with extension data

Patent number: 8055500

Abstract: A method, medium, and apparatus encoding/decoding audio data in which audio data is hierarchically encoded, and at least one extension data of the audio data is encoded using at least one encoding method, and decoding is performed in the same manner, thereby ensuring fine grain scalability (FGS) and unlimited extendibility of the audio data.

Type: Grant

Filed: October 12, 2006

Date of Patent: November 8, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Junghoe Kim, Eunmi Oh
System and method for detecting emotions at different steps in a communication

Patent number: 8054964

Abstract: The system and method establish a communication with a first party. A first emotion is detected during a first step during the communication. Steps in the communication can include: establishing a call, waiting in a queue, talking with an agent, communicating with an IVR system, and the like. A second emotion is detected at a different step in the communication. Alternatively, the first and second emotions are detected in two different but related communications. The detection of the first and/or second emotion determines how to process the communication. Examples of processing a communication can include routing the communication differently, selecting a different script for an agent, selecting a different agent, and the like. In addition, the communication can also be processed differently based on other additional parameters.

Type: Grant

Filed: April 30, 2009

Date of Patent: November 8, 2011

Assignee: Avaya Inc.

Inventors: Andrew D. Flockhart, Eugene P. Mathews, John Z. Taylor
Audio experience for a communications device user

Patent number: 8054948

Abstract: A system and associated methods provide an audio experience such that a user spatially perceives one or more audio events. One particular method set forth involves obtaining audio events and presenting the audio events so that an audio experience is provided. According to one embodiment of the method, upon obtaining audio events, the audio events are associated with one or more corresponding audio components. Thereafter, the audio experience is determined based on the audio events and associated audio components. The audio experience is then presented such that the user may spatially perceive the audio events.

Type: Grant

Filed: June 28, 2007

Date of Patent: November 8, 2011

Assignee: Sprint Communications Company L.P.

Inventors: Michael A. Gailloux, Michael W. Kanemoto
Speech and method for identifying perceptual features

Patent number: 8046218

Abstract: A system and method for phone detection. The system includes a microphone configured to receive a speech signal in an acoustic domain and convert the speech signal from the acoustic domain to an electrical domain, and a filter bank coupled to the microphone and configured to receive the converted speech signal and generate a plurality of channel speech signals corresponding to a plurality of channels respectively. Additionally, the system includes a plurality of onset enhancement devices configured to receive the plurality of channel speech signals and generate a plurality of onset enhanced signals. Each of the plurality of onset enhancement devices is configured to receive one of the plurality of channel speech signals, enhance one or more onsets of one or more signal pulses for the received one of the plurality of channel speech signals, and generate one of the plurality of onset enhanced signals.

Type: Grant

Filed: September 18, 2007

Date of Patent: October 25, 2011

Assignee: The Board of Trustees of the University of Illinois

Inventors: Jont B. Allen, Marion Regnier
Method and apparatus for encoding/decoding audio data with scalability

Patent number: 8046234

Abstract: Method and apparatus for encoding/decoding audio data with scalability are provided. The method includes slicing audio data so that sliced audio data corresponds to a plurality of layers, obtaining scale band information and coding band information corresponding to each of the plurality of layers, coding additional information containing scale factor information and coding model information based on scale band information and coding band information corresponding to a first layer, obtaining quantized samples by quantizing audio data corresponding to the first layer with reference to the scale factor information, coding the obtained plurality of quantized samples in units of symbols in order from a symbol formed with most significant bits (MSB) down to a symbol formed with least significant bits (LSB) by referring to the coding model information, and repeatedly performing the steps with increasing the ordinal number of the layer one by one every time, until coding for the plurality of layers is finished.

Type: Grant

Filed: December 16, 2003

Date of Patent: October 25, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Jung-hoe Kim, Sang-wook Kim, Eun-mi Oh
Low complexity decoder for complex transform coding of multi-channel sound

Patent number: 8046214

Abstract: A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.

Type: Grant

Filed: June 22, 2007

Date of Patent: October 25, 2011

Assignee: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
Enhancing perceptual performance of SBR and related HFR coding methods by adaptive noise-floor addition and noise substitution limiting

Patent number: 8036880

Abstract: Methods and an apparatus for enhancement of source coding systems utilizing high frequency reconstruction (HFR) are introduced. The problem of insufficient noise contents is addressed in a reconstructed highband, by using Adaptive Noise-floor Addition. New methods are also introduced for enhanced performance by means of limiting unwanted noise, interpolation and smoothing of envelope adjustment amplification factors. The methods and apparatus used are applicable to both speech coding and natural audio coding systems.

Type: Grant

Filed: June 24, 2009

Date of Patent: October 11, 2011

Assignee: Coding Technologies Sweden AB

Inventors: Lars G. Liljeryd, Kristofer Kjoerling, Per Ekstrand, Fredrik Henn
Enhancing perceptual performance of SBR and related HFR coding methods by adaptive noise-floor addition and noise substitution limiting

Patent number: 8036882

Abstract: Methods and an apparatus for enhancement of source coding systems utilizing high frequency reconstruction (HFR) are introduced. The problem of insufficient noise contents is addressed in a reconstructed highband, by using Adaptive Noise-floor Addition. New methods are also introduced for enhanced performance by means of limiting unwanted noise, interpolation and smoothing of envelope adjustment amplification factors. The methods and apparatus used are applicable to both speech coding and natural audio coding systems.

Type: Grant

Filed: June 24, 2009

Date of Patent: October 11, 2011

Assignee: Coding Technologies Sweden AB

Inventors: Lars G. Liljeryd, Kristofer Kjoerling, Per Ekstrand, Fredrik Henn
Enhancing perceptual performance of SBR and related HFR coding methods by adaptive noise-floor addition and noise substitution limiting

Patent number: 8036881

Abstract: Methods and an apparatus for enhancement of source coding systems utilizing high frequency reconstruction (HFR) are introduced. The problem of insufficient noise contents is addressed in a reconstructed highband, by using Adaptive Noise-floor Addition. New methods are also introduced for enhanced performance by means of limiting unwanted noise, interpolation and smoothing of envelope adjustment amplification factors. The methods and apparatus used are applicable to both speech coding and natural audio coding systems.

Type: Grant

Filed: June 24, 2009

Date of Patent: October 11, 2011

Assignee: Coding Technologies Sweden AB

Inventors: Lars G. Liljeryd, Kristofer Kjoerling, Per Ekstrand, Fredrik Henn
Binaural multi-channel decoder in the context of non-energy conserving upmix rules

Patent number: 8027479

Abstract: A multi-channel decoder for generating a binaural signal from a downmix signal using upmix rule information on an energy-error introducing upmix rule for calculating a gain factor based on the upmix rule information and characteristics of head related transfer function based filters corresponding to upmix channels. The one or more gain factors are used by a filter processor for filtering the downmix signal so that an energy corrected binaural signal having a left binaural channel and a right binaural channel is obtained.

Type: Grant

Filed: September 1, 2006

Date of Patent: September 27, 2011

Assignee: Coding Technologies AB

Inventor: Lars Villemoes
Scalable encoding apparatus, scalable decoding apparatus, and methods thereof

Patent number: 8019597

Abstract: A scalable encoding apparatus capable of reducing the bit rates of encoded parameters and also capable of efficiently encoding audio signals in which a plurality of harmonic structures are coexistent. In the apparatus, an MDCT analyzer MDCT analyzes an audio signal for converting/encoding processes. A pitch frequency converter determines an inverse of a pitch period to calculate a pitch frequency. A selector selects spectra located at frequencies that are integral multiples of the pitch frequency, and a second layer encoder encodes the selected spectra.

Type: Grant

Filed: October 26, 2005

Date of Patent: September 13, 2011

Assignee: Panasonic Corporation

Inventor: Masahiro Oshikiri
Energy shaping apparatus and energy shaping method

Patent number: 8019614

Abstract: A temporal processing apparatus includes: a splitter splitting an audio signal, included in the sub-band domain, into diffuse signals indicating reverberating components and direct signals indicating non-reverberating components; a downmix unit generating a downmix signal by downmixing the direct signals; BPFs respectively generating a bandpass downmix signal and bandpass diffuse signals; normalization processing units respectively generating a normalized downmix signal and normalized diffuse signals; a scale computation processing unit computing, on a predetermined time slot basis, a scale factor indicating the magnitude of energy of the normalized downmix signal with respect to energy of the normalized diffuse signals; a calculating unit generating scale diffuse signals; a HPF generating high-pass diffuse signals; an adding unit generating addition signals; and a synthesis filter bank performing synthesis filter processing on the addition signals and transforming the addition signals into the time domains.

Type: Grant

Filed: August 31, 2006

Date of Patent: September 13, 2011

Assignee: Panasonic Corporation

Inventors: Yoshiaki Takagi, Kok Seng Chong, Takeshi Norimatsu, Shuji Miyasaka, Akihisa Kawamura, Kojiro Ono, Tomokazu Ishikawa
Frequency compensation for perceptual speech analysis

Patent number: 8014999

Abstract: The invention provides a softscaled frequency compensation function that allows the evaluation of a first quality measure indicating a global impact of all distortions in an audio transmission system, including linear frequency response distortions and second quality measure that only lakes into account the impact of linear frequency response distortions. The softscaled frequency compensation function is derived from a softscaled ratio between a time integrated output and a time integrated input power density functions. The first quality measure is derived from the difference loudness density function as function of time and frequency, using the frequency compensated input loudness density function and the gain compensated output loudness density function both as a function of time and frequency, in the same manner as carried out in ITU standard P.862.

Type: Grant

Filed: September 20, 2005

Date of Patent: September 6, 2011

Assignee: Nederlandse Organisatie voor toegepast - natuurwetenschappelijk Onderzoek TNO

Inventor: John Gerard Beerends
Scalable encoder, scalable decoder, and scalable encoding method

Patent number: 8010349

Abstract: A scalable encoder enabling improvement of the encoding efficiency in the second layer and improvement of the quality of the original signal decoded using the encoding signal in the second layer. A predictive coefficient encoder of the scalable encoder has a predictive coefficient codebook where candidates of the predictive coefficient are recorded. After searching the predictive coefficient codebook, the scale factor of the first layer decoded signal inputted from a scale factor calculator is multiplied, and a predictive coefficient which most approximates the multiplication result to the scale factor of the original signal inputted from the scale factor calculator is determined and encoded, and the coded code is inputted to a multiplexer.

Type: Grant

Filed: October 11, 2005

Date of Patent: August 30, 2011

Assignee: Panasonic Corporation

Inventor: Masahiro Oshikiri
Method and apparatus for switching speech or audio signals

Patent number: 8000968

Abstract: A method and an apparatus for switching speech or audio signals, wherein the method for switching speech or audio signals includes when switching of a speech or audio, weighting a first high frequency band signal of a current frame of speech or audio signal and a second high frequency band signal of the previous M frame of speech or audio signals to obtain a processed first high frequency band signal, where M is greater than or equal to 1, and synthesizing the processed first high frequency band signal and a first low frequency band signal of the current frame of speech or audio signal into a wide frequency band signal. In this way, speech or audio signals with different bandwidths can be smoothly switched, thus improving the quality of audio signals received by a user.

Type: Grant

Filed: April 26, 2011

Date of Patent: August 16, 2011

Assignee: Huawei Technologies Co., Ltd.

Inventors: Zexin Liu, Lei Miao, Chan Hu, Wenhai Wu, Yue Lang, Qing Zhang
Packet loss concealment for sub-band predictive coding based on extrapolation of sub-band audio waveforms

Patent number: 8000960

Abstract: A technique is described for concealing the effect of a lost frame in a series of frames representing an encoded audio signal in a sub-band predictive coding system. In accordance with the technique, a first synthesized sub-band audio signal is synthesized, wherein synthesizing the first synthesized sub-band audio signal comprises performing waveform extrapolation based on a stored first sub-band decoded audio signal. A second synthesized sub-band audio signal is also synthesized, wherein synthesizing the second synthesized sub-band audio signal comprises performing waveform extrapolation based on the stored second sub-band decoded audio signal. The first synthesized sub-band audio signal and the second synthesized sub-band audio signal are combined to generate a synthesized full-band output audio signal corresponding to a lost frame.

Type: Grant

Filed: August 15, 2007

Date of Patent: August 16, 2011

Assignee: Broadcom Corporation

Inventors: Juin-Hwey Chen, Robert W. Zopf, Jes Thyssen
Acoustic coding of an enhancement frame having a shorter time length than a base frame

Patent number: 7996233

Abstract: A downsampler 101 converts input data having a sampling rate 2*FH to a sampling rate 2*FL which is lower than the sampling rate 2*FH. A base layer coder 102 encodes the input data having the sampling rate 2*FL in predetermined base frame units. A local decoder 103 decodes a first coded code. An upsampler 104 increases the sampling rate of the decoded signal to 2*FH. A subtractor 106 subtracts the decoded signal from the input signal and regards the subtraction result as a residual signal. A frame divider 107 divides the residual signal into enhancement frames having a shorter time length than that of the base frame. An enhancement layer coder 108 encodes the residual signal divided into the enhancement frames and outputs a second coded code obtained by this coding to a multiplexer 109.

Type: Grant

Filed: August 12, 2003

Date of Patent: August 9, 2011

Assignee: Panasonic Corporation

Inventor: Masahiro Oshikiri
Prosody conversion

Patent number: 7996222

Abstract: A contour for a syllable (or other speech segment) in a voice undergoing conversion is transformed. The transform of that contour is then used to identify one or more source syllable transforms in a codebook. Information regarding the context and/or linguistic features of the contour being converted can also be compared to similar information in the codebook when identifying an appropriate source transform. Once a codebook source transform is selected, an inverse transformation is performed on a corresponding codebook target transform to yield an output contour. The corresponding codebook target transform represents a target voice version of the same syllable represented by the selected codebook source transform. The output contour may be further processed to improve conversion quality.

Type: Grant

Filed: September 29, 2006

Date of Patent: August 9, 2011

Assignee: Nokia Corporation

Inventors: Jani K. Nurminen, Elina Helander
Method and an apparatus for processing a signal

Patent number: 7991621

Abstract: An apparatus for processing an encoded signal and method thereof are disclosed, by which an audio signal can be compressed and reconstructed in higher efficiency. An audio signal processing method includes the steps of identifying whether a type of an audio signal is a music using first type information, if the type of the audio signal is not the music signal, identifying whether the type of the audio signal is a speech signal or a mixed signal using second type information, and if the type of the audio signal is determined as either the speech signal or the mixed signal, reconstructing the audio signal according to a coding scheme applied per frame using coding identification information. If the type of the audio signal is the music signal, the first type information is received only. If the type of the audio signal is the speech signal or the mixed signal, both of the first type information and the second type information are received.

Type: Grant

Filed: July 2, 2009

Date of Patent: August 2, 2011

Assignees: LG Electronics Inc., Industry-Academic Cooperation Foundation, Yonsei University

Inventors: Hyen O Oh, Jeong Ook Song, Chang Heon Lee, Yang Won Jung, Hong Goo Kang
Methods for watermarking media data

Patent number: 7983441

Abstract: Methods are provided for encoding watermark information into media data containing a series of digital samples in a sample domain. The method involves: dividing the series of digital samples into a plurality of sections in the sample domain, each section comprising a corresponding plurality of samples; processing the corresponding plurality of samples in each section to obtain a single energy value associated with each section; grouping the sections into groups, each group containing three or more sections; assigning a nominal bit value to each group according to a bit assignment rule, the bit assignment rule based on the energy values of the sections in the group; and assigning a watermark bit value to each group.

Type: Grant

Filed: October 18, 2007

Date of Patent: July 19, 2011

Assignee: Destiny Software Productions Inc.

Inventors: Steven Erik Vestergaard, Che-Wai Tsui
Methods and devices for switching between sound signal coding modes at a coder and for producing target signals at a decoder

Patent number: 7979271

Abstract: Methods and devices are used for switching between sound signal coding modes and for producing from a decoded target signal, an overlap-add target signal in a current frame coded according to a first mode. On a coder side, switching is at the junction between a previous frame coded according to a first coding mode and a current frame coded according to a second coding mode, a sound signal is filtered through a weighting filter to produce a weighted signal in the current frame, and a windowed zero-input response of the weighting filter is removed from the weighted signal. On a decoder side, a current frame of the target signal is first windowed, a left portion of a resulting window is skipped, and then a windowed zero-input response of the weighting filter is added to the decoded target signal to reconstruct the overlap-add target signal.

Type: Grant

Filed: February 18, 2005

Date of Patent: July 12, 2011

Assignee: Voiceage Corporation

Inventor: Bruno Bessette
Method and apparatus for encoding/decoding MPEG-4 BSAC audio bitstream having ancillary information

Patent number: 7974840

Abstract: A method of and an apparatus for encoding/decoding an MPEG-4 bit sliced arithmetic coding (BSAC) audio bitstream having ancillary information. A time domain audio signal is converted to a frequency domain audio signal and quantized. A number of data bits is counted and a number of available bits per layer is obtained. The number of available bits per layer is modified considering the size of ancillary information. Actual audio data is encoded in units of layers and ancillary information is embedded in the encoded bitstream. A header is decoded and a layer structure of an audio bitstream is calculated to determine the size of the ancillary information as a difference between a size of data up to a top layer and a size of a frame. The ancillary information is extracted to improve meta data and sound quality of audio contents.

Type: Grant

Filed: November 24, 2004

Date of Patent: July 5, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Junghoe Kim, Shihwa Lee, Sangwook Kim, Eunmi Oh, Dohyung Kim
Data reproduction device

Patent number: 7970602

Abstract: A data reproduction device is provided for achieving seamless reproduction of a stream where a validity of a bandwidth extension function is switched in the stream. The data reproduction device includes an input frequency obtainment unit analyzing header information Hdr and obtaining an input frequency FSin, which is the frequency of basic data, an output frequency determination unit performing predetermined processing based on the input frequency FSin and determining an output frequency FSout, which is the sampling frequency of a decoded frame Fdata, and a decoding unit (2003) which, if the SBR function is valid in a frame to be decoded, decodes sample data at the input frequency FSin and extends the bandwidth of the sampling frequency up to the output frequency FSout, while if the SBR function is not valid in the frame, upsamples the decoding result obtained at the input frequency FSin to the output frequency FSout.

Type: Grant

Filed: February 24, 2006

Date of Patent: June 28, 2011

Assignee: Panasonic Corporation

Inventors: Tadamasa Toma, Yoshinori Matsui, Shinya Kadono
Method And Apparatus For The Detection Of Impulsive Noise In Transmitted Speech Signals For Use In Speech Quality Assessment

Publication number: 20110153313

Abstract: A method and apparatus for performing speech quality assessment in a speech communication system (such as, for example, a VoIP communication system) which detects and measures the presence of impulsive noise is provided. Specifically, in one illustrative embodiment, an autoregressive (AR) model of speech (and, in particular, of the excitation of the vocal tract) is advantageously employed to estimate a short-term variance of the speech excitation, and the standard deviation of the speech excitation (i.e., the square root of the variance) is then advantageously compared to a predetermined threshold to identify whether impulsive noise is present. Then, based on a statistic analysis of any such identified impulsive noise, a speech quality assessment is generated.

Type: Application

Filed: December 17, 2009

Publication date: June 23, 2011

Applicant: Alcatel-Lucent USA Inc.

Inventor: Walter Etter
METHOD FOR DYNAMICALLY ADJUSTING THE SPECTRAL CONTENT OF AN AUDIO SIGNAL

Publication number: 20110153314

Abstract: A method for dynamically adjusting the spectral content of an audio signal, which increases the harmonic content of said audio signal, said method comprising translating an encoded digital signal into data bands, creating a psychoacoustic model to identify sections of said data bands that are deficient in harmonic quality, analyzing the fundamental frequency and amplitude of said harmonically deficient data bands, creating additional higher order harmonics for said harmonically deficient data bands, adding said higher order harmonics back to said encoded digital signal to form a newly enhanced signal, inverse filtering said newly enhanced signal, and converting said inverse filtered signal to an analog waveform for consumption by the listener.

Type: Application

Filed: February 28, 2011

Publication date: June 23, 2011

Inventors: J. Craig Oxford, Patrick Taylor, D. Michael Shields
Reduced number of channels decoding

Patent number: 7965848

Abstract: An intermediate channel representation of a multi-channel signal can be reconstructed highly efficient and with high fidelity, when upmix parameters for upmixing a transmitted downmix signal to the intermediate channel representation are derived that allow for an upmix using the same upmixing algorithms as within the multi-channel reconstruction. This can be achieved when a parameter re-calculator is used to derive the upmix parameters that takes into account also parameters having information on channels that are not included in the intermediate channel representation.

Type: Grant

Filed: August 11, 2006

Date of Patent: June 21, 2011

Assignees: Dolby International AB, Koninklijke Philips Electronics N.V.

Inventors: Lars Villemoes, Kristofer Kjoerling, Jeroen Breebaart
Method and apparatus for generating a number of output audio channels

Patent number: 7966191

Abstract: An audio encoder (109) has a hierarchical encoding structure and generates a data stream comprising one or more audio channels as well as parametric audio encoding data. The encoder (109) comprises an encoding structure processor (305) which inserts decoder tree structure data into the data stream. The decoder tree structure data comprises at least one data value indicative of a channel split characteristic for an audio channel at a hierarchical layer of the hierarchical decoder structure and may specifically specify the decoder tree structures to be applied by a decoder. A decoder (115) comprises a receiver (401) which receives the data stream and a decoder structure processor (405) for generating the hierarchical decoder structure in response to the decoder tree structure data. A decode processor (403) then generates output audio channels from the data stream using the hierarchical decoder structure.

Type: Grant

Filed: July 7, 2006

Date of Patent: June 21, 2011

Assignees: Koninklijke Philips Electronics N.V., Coding Technologies AB

Inventors: Erik Gosuinus Petrus Schuijers, Gerard Herman Hotho, Heiko Purnhagen, Wolfgang Alexander Schildbach, Holger Horich, Hans Magnus Kristofer Kjorling, Karl Jonas Roden
DEVICE AND METHOD FOR ACOUSTIC COMMUNICATION

Publication number: 20110144979

Abstract: Disclosed is an acoustic communication method that includes filtering an audio signal to attenuate a high frequency section of the audio signal; generating a residual signal which corresponds to a difference between the audio signal and the filtered signal; generating a psychoacoustic mask for the audio signal based on a predetermined psychoacoustic model; generating a psychoacoustic spectrum mask by combining the residual signal with the psychoacoustic mask; generating an acoustic communication signal by modulating digital data according to the acoustic signal spectrum mask; and combining the acoustic communication signal with the filtered signal.

Type: Application

Filed: December 10, 2010

Publication date: June 16, 2011

Applicant: Samsung Electronics Co., Ltd.

Inventors: Hee-Won Jung, Jun-Ho Koh, Sang-Mook Lee, Gi-Sang Lee, Sergey Zhidkov
Apparatus and method for processing multi-channel audio signal using space information

Patent number: 7961889

Abstract: An apparatus for and a method of processing a multi-channel audio signal using space information. The apparatus includes: a main coding unit down mixing a multi-channel audio signal by applying space information to surround components included in the multi-channel audio signal, generating side information using the multi-channel audio signal or a stereo signal of a down-mixed result, coding the stereo signal and the side information, and transmitting the coded result as a coding signal; and a main decoding unit receiving the coding signal, decoding the stereo signal and the side information using the received coding signal, up mixing the decoded stereo signal using the decoded side information, and restoring the multi-channel audio signal.

Type: Grant

Filed: August 25, 2005

Date of Patent: June 14, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Junghoe Kim, Sangchul Ko, Shihwa Lee, Eunmi Oh, Miao Lei
Method for high quality audio transcoding

Patent number: 7962333

Abstract: A method and apparatus for a voice transcoder that converts a bitstream representing frames of data encoded according to a first voice compression standard to a bitstream representing frames of data according to a second voice compression standard using perceptual weighting that uses tuned weighting factors, such that the bitstream of a second voice compression standard to produce a higher quality decoded voice signal than a comparable tandem transcoding solution.

Type: Grant

Filed: August 2, 2007

Date of Patent: June 14, 2011

Assignee: Onmobile Global Limited

Inventors: Marwan A. Jabri, Jianwei Wang, Nicola Chong-White, Michael Ibrahim
Dual-transform coding of audio signals

Patent number: 7953595

Abstract: Methods, devices, and systems for coding and decoding audio are disclosed. At least two transforms are applied on an audio signal, each with different transform periods for better resolutions at both low and high frequencies. The transform coefficients are selected and combined such that the data rate remains similar as a single transform. The transform coefficients may be coded with a fast lattice vector quantizer. The quantizer has a high rate quantizer and a low rate quantizer. The high rate quantizer includes a scheme to truncate the lattice. The low rate quantizer includes a table based searching method. The low rate quantizer may also include a table based indexing scheme. The high rate quantizer may further include Huffman coding for the quantization indices of transform coefficients to improve the quantizing/coding efficiency.

Type: Grant

Filed: October 18, 2006

Date of Patent: May 31, 2011

Assignee: Polycom, Inc.

Inventors: Minjie Xie, Peter Chu
Enhancing perceptual performance of SBR and related HFR coding methods by adaptive noise-floor addition and noise substitution limiting

Patent number: RE43189

Abstract: Methods and an apparatus for enhancement of source coding systems utilizing high frequency reconstruction (HFR) are introduced. The problem of insufficient noise contents is addressed in a reconstructed highband, by using Adaptive Noise-floor Addition. New methods are also introduced for enhanced performance by means of limiting unwanted noise, interpolation and smoothing of envelope adjustment amplification factors. The methods and apparatus used are applicable to both speech coding and natural audio coding systems.

Type: Grant

Filed: January 26, 2000

Date of Patent: February 14, 2012

Assignee: Dolby International AB

Inventors: Lars G. Liljeryd, Kristofer Kjoerling, Per Ekstrand, Frederik Henn
Speech coding apparatus and speech decoding apparatus

Patent number: RE43190

Abstract: A speech coding apparatus comprises a repetition period pre-selecting unit for generating a plurality of candidates for the repetition period of a driving excitation source by multiplying the repetition period of an adaptive excitation source by a plurality of constant numbers, respectively, and for pre-selecting a predetermined number of candidates from all the candidates generated. A driving excitation source coding unit provides both excitation source location information and excitation source polarity information that minimize a coding distortion, for each of the predetermined number of candidates, and provides an evaluation value associated with the minimum coding distortion for each of the predetermined number of candidates.

Type: Grant

Filed: January 28, 2010

Date of Patent: February 14, 2012

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventors: Hirohisa Tasaki, Tadashi Yamaura
Speech coding apparatus and speech decoding apparatus

Patent number: RE43209

Abstract: A speech coding apparatus comprises a repetition period pre-selecting unit for generating a plurality of candidates for the repetition period of a driving excitation source by multiplying the repetition period of an adaptive excitation source by a plurality of constant numbers, respectively, and for pre-selecting a predetermined number of candidates from all the candidates generated. A driving excitation source coding unit provides both excitation source location information and excitation source polarity information that minimize a coding distortion, for each of the predetermined number of candidates, and provides an evaluation value associated with the minimum coding distortion for each of the predetermined number of candidates.

Type: Grant

Filed: January 28, 2010

Date of Patent: February 21, 2012

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventors: Hirohisa Tasaki, Tadashi Yamaura

prev … 5 6 7 8 9 10 11 12 13 … next