Cross-correlation Patents (Class 704/218)

Time reversed audio subframe error concealment

Patent number: 12293766

Abstract: A method and a decoder device of generating a concealment audio subframe of an audio signal are provided. The method comprises generating frequency spectra on a subframe basis where consecutive subframes of the audio signal have a property that an applied window shape of first subframe of the consecutive subframes is a mirrored version or a time reversed version of a second subframe of the consecutive subframes. Peaks of a signal spectrum of a previously received audio signal are detected for a concealment subframe, and a phase of each of the peaks is estimated. A time reversed phase adjustment is derived based on the estimated phase and applied to the peaks of the signal spectrum to form time reversed phase adjusted peaks.

Type: Grant

Filed: March 18, 2024

Date of Patent: May 6, 2025

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Erik Norvell, Chamran Moradi Ashour
Audio encoder for encoding an audio signal, method for encoding an audio signal and computer program under consideration of a detected peak spectral region in an upper frequency band

Patent number: 12014747

Abstract: An audio encoder for encoding an audio signal having a lower frequency band and an upper frequency band includes: a detector for detecting a peak spectral region in the upper frequency band of the audio signal; a shaper for shaping the lower frequency band using shaping information for the lower band and for shaping the upper frequency band using at least a portion of the shaping information for the lower band, wherein the shaper is configured to additionally attenuate spectral values in the detected peak spectral region in the upper frequency band; and a quantizer and coder stage for quantizing a shaped lower frequency band and a shaped upper frequency band and for entropy coding quantized spectral values from the shaped lower frequency band and the shaped upper frequency band.

Type: Grant

Filed: April 27, 2023

Date of Patent: June 18, 2024

Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.

Inventors: Markus Multrus, Christian Neukam, Markus Schnell, Benjamin Schubert
Audio encoder for encoding an audio signal, method for encoding an audio signal and computer program under consideration of a detected peak spectral region in an upper frequency band

Patent number: 11682409

Abstract: An audio encoder for encoding an audio signal having a lower frequency band and an upper frequency band includes: a detector for detecting a peak spectral region in the upper frequency band of the audio signal; a shaper for shaping the lower frequency band using shaping information for the lower band and for shaping the upper frequency band using at least a portion of the shaping information for the lower band, wherein the shaper is configured to additionally attenuate spectral values in the detected peak spectral region in the upper frequency band; and a quantizer and coder stage for quantizing a shaped lower frequency band and a shaped upper frequency band and for entropy coding quantized spectral values from the shaped lower frequency band and the shaped upper frequency band.

Type: Grant

Filed: September 17, 2020

Date of Patent: June 20, 2023

Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.

Inventors: Markus Multrus, Christian Neukam, Markus Schnell, Benjamin Schubert
Frequency domain parameter sequence generating method, encoding method, decoding method, frequency domain parameter sequence generating apparatus, encoding apparatus, decoding apparatus, program, and recording medium

Patent number: 10504533

Abstract: The present invention reduces encoding distortion in frequency domain encoding compared to conventional techniques, and obtains LSP parameters that correspond to quantized LSP parameters for the preceding frame and are to be used in time domain encoding from coefficients equivalent to linear prediction coefficients resulting from frequency domain encoding. When p is an integer equal to or greater than 1, a linear prediction coefficient sequence which is obtained by linear prediction analysis of audio signals in a predetermined time segment is represented as a[1], a[2], . . . , a[p], and ?[1], ?[2], . . . , ?[p] are a frequency domain parameter sequence derived from the linear prediction coefficient sequence a[1], a[2], . . . , a[p], an LSP linear transformation unit (300) determines the value of each converted frequency domain parameter {tilde over (?)}?[i] (i=1, 2, . . . , p) in a converted frequency domain parameter sequence {tilde over (?)}w[1], {tilde over (?)}w[2], . . .

Type: Grant

Filed: April 30, 2019

Date of Patent: December 10, 2019

Assignees: NIPPON TELEGRAPH AND TELEPHONE CORPORATION, The University of Tokyo

Inventors: Takehiro Moriya, Yutaka Kamamoto, Noboru Harada, Hirokazu Kameoka, Ryosuke Sugiura
Recovering independent waveforms from input bitstream data

Patent number: 9917706

Abstract: An example device in accordance with an aspect of the present disclosure includes a non-linear compensator, an interpolator, a demultiplexer, and a demodulator. The non-linear compensator is to correct a non-linearity of the input bitstream data to obtain linearized bitstream data. The interpolator is to convert a non-constant sample rate of the linearized bitstream data to obtain constant sample rate bitstream data. The demultiplexer is to demultiplex the constant sample rate bitstream data into a first waveform and a second waveform. The demodulator is to demodulate the first waveform and the second waveform.

Type: Grant

Filed: May 27, 2015

Date of Patent: March 13, 2018

Assignee: Hewlett-Packard Development Company, L.P.

Inventor: David E Smith
Real-time network attack detection and mitigation infrastructure

Patent number: 9692774

Abstract: The invention features systems and methods for detecting and mitigating network attacks in a Voice-Over-IP (VoIP) network. A server is configured to receive information related to a mitigation action for a call. The information can include a complexity level for administering an audio challenge-response test to the call and an identification of the call. The server also generates i) a routing label based on the identification of the call, and ii) a script defining a plurality of variables that store identifications of a plurality of altered sound files for the audio challenge-response test. Each altered sound file is randomly selected by the server subject to one or more constraints associated with the complexity level. The server is further configured to transmit the script to a guardian module and the routing label to a gateway.

Type: Grant

Filed: March 3, 2016

Date of Patent: June 27, 2017

Assignee: SONUS NETWORKS, INC.

Inventors: David Lapsley, Miri Mansur, Jonathan Klotzbach, Ti-yuan Dean Shu, Sri Chary, Joby Joseph, Mark Topham, Wassim Matragi, Kenneth Dumble
System and method for processor wake-up based on sensor data

Patent number: 9349386

Abstract: A system for processor wake-up based on sensor data includes an audio buffer, an envelope buffer, and a processor. The audio buffer is configured to store a first data from a sensor. The first data is generated according to a first sampling rate. The envelope buffer is configured to store a second data, which is derived from the first data according to a second sampling rate, which is less than the first sampling rate. The processor is configured to wake up periodically from an idle state and read the second data from the envelope buffer. If the second data indicates an activity, the processor is configured to read the first data from the audio buffer. If the second data does not indicate an activity, the processor is configured to return to the idle state.

Type: Grant

Filed: March 7, 2013

Date of Patent: May 24, 2016

Assignee: Analog Device Global

Inventors: Robert Adams, Maikael Mortensen
Location determination system and mobile terminal

Patent number: 9026437

Abstract: A location determination system includes a first mobile terminal and a second mobile terminal. The first mobile terminal includes a first processor to acquire a first sound signal, analyze the first sound signal to obtain a first analysis result, and transmit the first analysis result. The second mobile terminal includes a second processor to acquire a second sound signal, analyze the second sound signal to obtain a second analysis result, receive the first analysis result from the first mobile terminal, compare the second analysis result with the first analysis result to obtain a comparison result, and determine whether the first mobile terminal locates in an area in which the second mobile terminal locates, based on the comparison result.

Type: Grant

Filed: March 26, 2012

Date of Patent: May 5, 2015

Assignee: Fujitsu Limited

Inventor: Eiji Hasegawa
Method for estimating a fundamental frequency of a speech signal

Patent number: 9026435

Abstract: The invention provides a method for estimating a fundamental frequency of a speech signal comprising the steps of receiving a signal spectrum of the speech signal, filtering the signal spectrum to obtain a refined signal spectrum, determining a cross-power spectral density using the refined signal spectrum and the signal spectrum, transforming the cross-power spectral density into the time domain to obtain a cross-correlation function, and estimating the fundamental frequency of the speech signal based on the cross-correlation function.

Type: Grant

Filed: May 3, 2010

Date of Patent: May 5, 2015

Assignee: Nuance Communications, Inc.

Inventors: Mohamed Krini, Gerhard Schmidt
Multiple microphone voice activity detector

Patent number: 8954324

Abstract: Voice activity detection using multiple microphones can be based on a relationship between an energy at each of a speech reference microphone and a noise reference microphone. The energy output from each of the speech reference microphone and the noise reference microphone can be determined. A speech to noise energy ratio can be determined and compared to a predetermined voice activity threshold. In another embodiment, the absolute value of the autocorrelation of the speech and noise reference signals are determined and a ratio based on autocorrelation values is determined. Ratios that exceed the predetermined threshold can indicate the presence of a voice signal. The speech and noise energies or autocorrelations can be determined using a weighted average or over a discrete frame size.

Type: Grant

Filed: September 28, 2007

Date of Patent: February 10, 2015

Assignee: QUALCOMM Incorporated

Inventors: Song Wang, Samir Kumar Gupta, Eddie L. T. Choy
Non-spatial speech detection system and method of using same

Patent number: 8935164

Abstract: A non-spatial speech detection system includes a plurality of microphones whose output is supplied to a fixed beamformer. An adaptive beamformer is used for receiving the output of the plurality of microphones and one or more processors are used for processing an output from the fixed beamformer and identifying speech from noise though the use of an algorithm utilizing a covariance matrix.

Type: Grant

Filed: May 2, 2012

Date of Patent: January 13, 2015

Assignee: Gentex Corporation

Inventors: Robert R. Turnbull, Michael A. Bryson
Audio signal bandwidth extension in CELP-based speech coder

Patent number: 8924200

Abstract: A method for decoding an audio signal in a decoder having a CELP-based decoder element including a fixed codebook component, at least one pitch period value, and a first decoder output, wherein a bandwidth of the audio signal extends beyond a bandwidth of the CELP-based decoder element. The method includes obtaining an up-sampled fixed codebook signal by up-sampling the fixed codebook component to a higher sample rate, obtaining an up-sampled excitation signal based on the up-sampled fixed codebook signal and an up-sampled pitch period value, and obtaining a composite output signal based on the up-sampled excitation signal and an output signal of the CELP-based decoder element, wherein the composite output signal includes a bandwidth portion that extends beyond a bandwidth of the CELP-based decoder element.

Type: Grant

Filed: September 28, 2011

Date of Patent: December 30, 2014

Assignee: Motorola Mobility LLC

Inventors: Jonathan A. Gibbs, James P. Ashley, Udar Mittal
Audio signal bandwidth extension in CELP-based speech coder

Patent number: 8868432

Abstract: A method for decoding an audio signal having a bandwidth that extends beyond a bandwidth of a CELP excitation signal in an audio decoder including a CELP-based decoder element. The method includes obtaining a second excitation signal having an audio bandwidth extending beyond the audio bandwidth of the CELP excitation signal, obtaining a set of signals by filtering the second excitation signal with a set of bandpass filters, scaling the set of signals using a set of energy-based parameters, and obtaining a composite output signal by combining the scaled set of signals with a signal based on the audio signal decoded by the CELP-based decoder element.

Type: Grant

Filed: September 28, 2011

Date of Patent: October 21, 2014

Assignee: Motorola Mobility LLC

Inventors: Jonathan A. Gibbs, James P. Ashley, Udar Mittal
Wind noise mitigation

Patent number: 8861745

Abstract: A method of compensating for noise in a receiver having a first receiver unit and a second receiver unit, the method includes receiving a first transmission at the first receiver unit, the first transmission having a first signal component and a first noise component; receiving a second transmission at the second receive unit, the second transmission having a second signal component and a second noise component; determining whether the first noise component and the second noise component are incoherent and; only if it is determined that the first and second noise components are incoherent, processing the first and second transmissions in a first processing path, wherein the first processing path is configured to compensate for incoherent noise.

Type: Grant

Filed: December 1, 2010

Date of Patent: October 14, 2014

Assignee: Cambridge Silicon Radio Limited

Inventors: Kuan-Chieh Yen, Xuejing Sun, Jeffrey S. Chisholm
Decoding method and decoding apparatus therefor

Patent number: 8762158

Abstract: A method and apparatus for generating synthesis audio signals are provided. The method includes decoding a bitstream; splitting the decoded bitstream into n sub-band signals; generating n transformed sub-band signals by transforming the n sub-band signals in a frequency domain; and generating synthesis audio signals by respectively multiplying the n transformed sub-band signals by values corresponding to synthesis filter bank coefficients.

Type: Grant

Filed: August 5, 2011

Date of Patent: June 24, 2014

Assignee: Samsung Electronics Co., Ltd.

Inventors: Hyun-wook Kim, Han-gil Moon, Sang-hoon Lee
Scaled window overlap add for mixed signals

Patent number: 8731913

Abstract: A method for overlap-adding signals useful for performing frame loss concealment (FLC) in an audio decoder as well as in other applications. The method uses a dynamic mix of windows to overlap two signals whose normalized cross-correlation may vary from zero to one. If the overlapping signals are decomposed into a correlated component and an uncorrelated component, they are overlap-added separately using the appropriate window, and then added together. If the overlapping signals are not decomposed, a weighted mix of windows is used. The mix is determined by a measure estimating the amount of cross-correlation between overlapping signals, or the relative amount of correlated to uncorrelated signals.

Type: Grant

Filed: April 13, 2007

Date of Patent: May 20, 2014

Assignee: Broadcom Corporation

Inventors: Robert W. Zopf, Juin-Hwey Chen
Apparatus and method for encoding and decoding multi-channel signal

Patent number: 8666752

Abstract: Provided are an encoding apparatus and a decoding apparatus of a multi-channel signal. The encoding apparatus of the multi-channel signal may process a phase parameter associated with phase information between a plurality of channels constituting the multi-channel signal, based on a characteristic of the multi-channel signal. The encoding apparatus may generate an encoded bitstream with respect to the multi-channel signal using the processed phase parameter and a mono signal extracted from the multi-channel signal.

Type: Grant

Filed: March 17, 2010

Date of Patent: March 4, 2014

Assignee: Samsung Electronics Co., Ltd.

Inventors: Jung-Hoe Kim, Eun Mi Oh
Enhancing speech recognition using visual information

Patent number: 8660842

Abstract: Speech recognition device uses visual information to narrow down the range of likely adaptation parameters even before a speaker makes an utterance. Images of the speaker and/or the environment are collected using an image capturing device, and then processed to extract biometric features and environmental features. The extracted features and environmental features are then used to estimate adaptation parameters. A voice sample may also be collected to refine the adaptation parameters for more accurate speech recognition.

Type: Grant

Filed: March 9, 2010

Date of Patent: February 25, 2014

Assignee: Honda Motor Co., Ltd.

Inventor: Antoine R. Raux
Determining pitch dynamics of an audio signal

Patent number: 8645128

Abstract: A first-pitch metric function based on a first audio sample and a second pitch-metric function based on a second audio sample may be determined. The first and second pitch-metric functions may have either local minima or local maxima that correspond to candidate pitch values of the first and the second audio samples, respectively. The first and the second pitch-metric functions may be transformed to generate a first and a second transformed pitch-metric function, respectively. A correlation function based on a correlation between the first and the second transformed pitch-metric function may also be determined. A lower-dimensionality representation of the correlation function may further be determined. The lower-dimensionality representation may convey information indicative of pitch dynamics between the first and second audio sample. A computing device having a processor and a memory may perform an action based on the information indicative of the pitch dynamics.

Type: Grant

Filed: October 2, 2012

Date of Patent: February 4, 2014

Assignee: Google Inc.

Inventor: Ioannis Agiomyrgiannakis
Automatic marking method for karaoke vocal accompaniment

Patent number: 8626497

Abstract: An automatic marking method for Karaoke vocal accompaniment is provided. In the method, pitch, beat position and volume of a singer are compared with the original pitch, beat position and volume of the theme of a song to generate a score of pitch, a score of beat and a score of emotion respectively, so as to obtain a weighted total score in a weighted marking method. By using the method, the pitch, beat position and volume error of each section of the song sung by the singer can be exactly worked out, and a pitch curve and a volume curve can be displayed, so that the singer can learn which part is sung incorrectly and which part needs to be enhanced. The present invention also has the advantages of dual effects of teaching and entertainment, high practicability and technical advancement.

Type: Grant

Filed: April 7, 2009

Date of Patent: January 7, 2014

Inventor: Wen-Hsin Lin
System and method for tracking sound pitch across an audio signal using harmonic envelope

Patent number: 8620646

Abstract: A system and method may be configured to analyze audio information derived from an audio signal. The system and method may track sound pitch across the audio signal. The tracking of pitch across the audio signal may take into account change in pitch by determining at individual time sample windows in the signal duration an estimated pitch and a representation of harmonic envelope at the estimated pitch. The estimated pitch and the representation of harmonic envelope may then be implemented to determine an estimated pitch for another time sample window in the signal duration with an enhanced accuracy and/or precision.

Type: Grant

Filed: August 8, 2011

Date of Patent: December 31, 2013

Assignee: The Intellisis Corporation

Inventors: David C. Bradley, Rodney Gateau, Daniel S. Goldin, Robert N. Hilton, Nicholas K. Fisher
System and method for adaptive overlap and add length estimation

Patent number: 8576961

Abstract: A method for determining an overlap and add length estimate comprises determining a plurality of correlation values of a plurality of ordered frequency domain samples obtained from a data frame; comparing the correlation values of a first subset of the samples to a first predetermined threshold to determine a first edge sample; comparing the correlation values of a second subset of the samples to a second predetermined threshold to determine a second edge sample; using the first and second edge samples to determine an overlap and add length estimate; and providing the overlap and add length estimate to an overlap and add circuit.

Type: Grant

Filed: June 15, 2009

Date of Patent: November 5, 2013

Assignee: Olympus Corporation

Inventors: Haidong Zhu, Dumitru Mihai Ionescu, Abu Amanullah
Apparatus and method for encoding a multi-channel audio signal

Patent number: 8577045

Abstract: An encoding apparatus comprises a frame processor (105) which receives a multi channel audio signal comprising at least a first audio signal from a first microphone (101) and a second audio signal from a second microphone (103). An ITD processor 107 then determines an inter time difference between the first audio signal and the second audio signal and a set of delays (109, 111) generates a compensated multi channel audio signal from the multi channel audio signal by delaying at least one of the first and second audio signals in response to the inter time difference signal. A combiner (113) then generates a mono signal by combining channels of the compensated multi channel audio signal and a mono signal encoder (115) encodes the mono signal. The inter time difference may specifically be determined by an algorithm based on determining cross correlations between the first and second audio signals.

Type: Grant

Filed: September 9, 2008

Date of Patent: November 5, 2013

Assignee: Motorola Mobility LLC

Inventor: Jonathan A. Gibbs
Speech processing based on time series of maximum values of cross-power spectrum phase between two consecutive speech frames

Patent number: 8566084

Abstract: A speech signal processing system which outputs a speech feature, divides an input speech signal into frames so that each pair of consecutive frames have a frame shift length equal to at least one period of the speech signal and have an overlap equal to at least a predetermined length, applies discrete Fourier transform to each of the frames, calculates a CSP coefficient for the pair, searches a predetermined search range in which a speech wave lags a period equal to at least one period to obtain the maximum value of the CSP coefficient for the pair, and generates time-series data of the maximum CSP coefficient values arranged in the order in which the frames appear. A method and a computer readable article of manufacture for the implementing the same are also provided.

Type: Grant

Filed: June 1, 2011

Date of Patent: October 22, 2013

Assignee: Nuance Communications, Inc.

Inventors: Osamu Ichikawa, Masafumi Nishimura
Preprocessing method, preprocessing apparatus and coding device

Patent number: 8566085

Abstract: The present disclosure relates to coding and decoding technologies, and discloses a preprocessing method, a preprocessing apparatus, and a coding device. The preprocessing method includes: obtaining characteristic information of a current frame signal; identifying whether the current frame signal requires no coding operation of removing LTC according to the characteristic information of the current frame signal and preset information; and if identifying that the current frame signal requires no coding operation of removing LTC, performing the coding operation of removing STC for the current frame signal; and if identifying that the current frame signal requires the coding operation of removing LTC, performing the coding operations of removing both LTC and STC for the current frame signal. Through the technical solution provided herein, the coding operation of removing LTC is performed for only part of the input frame signals.

Type: Grant

Filed: March 15, 2010

Date of Patent: October 22, 2013

Assignee: Huawei Technologies Co., Ltd.

Inventors: Lei Miao, Fengyan Qi, Jianfeng Xu, Dejun Zhang, Qing Zhang
System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain

Patent number: 8548803

Abstract: A system and method may be configured to process an audio signal. The system and method may track pitch, chirp rate, and/or harmonic envelope across the audio signal, may reconstruct sound represented in the audio signal, and/or may segment or classify the audio signal. A transform may be performed on the audio signal to place the audio signal in a frequency chirp domain that enhances the sound parameter tracking, reconstruction, and/or classification.

Type: Grant

Filed: August 8, 2011

Date of Patent: October 1, 2013

Assignee: The Intellisis Corporation

Inventors: David C. Bradley, Daniel S. Goldin, Robert N. Hilton, Nicholas K. Fisher, Rodney Gateau, Derrick R. Roos, Eric Wiewiora
Apparatus and method of providing a quality measure for an output voice signal generated to reproduce an input voice signal

Patent number: 8538746

Abstract: A method of providing a quality measure for an output voice signal generated to reproduce an input voice signal, the method comprising: partitioning the input and output signals into frames; for each frame of the input signal, determining a disturbance relative to each of a plurality of frames of the output signal; determining a subset of the determined disturbances comprising one disturbance for each input frame such that a sum of the disturbances in the subset set is a minimum; and using the set of disturbances to provide the measure of quality.

Type: Grant

Filed: September 27, 2012

Date of Patent: September 17, 2013

Assignee: AudioCodes Ltd.

Inventors: Ilan D. Shallom, Nitay Shiran, Felix Flomen
Voice analysis device, voice analysis method, voice analysis program, and system integration circuit

Patent number: 8478587

Abstract: A sound analysis device comprises: a sound parameter calculation unit operable to acquire an audio signal and calculate a sound parameter for each of partial audio signals, the partial audio signals each being the acquired audio signal in a unit of time; a category determination unit operable to determine, from among a plurality of environmental sound categories, which environmental sound category each of the partial audio signals belongs to, based on a corresponding one of the calculated sound parameters; a section setting unit operable to sequentially set judgement target sections on a time axis as time elapses, each of the judgment target sections including two or more of the units of time, the two or more of the units of time being consecutive; and an environment judgment unit operable to judge, based on a number of partial audio signals in each environmental sound category determined in at least a most recent judgment target section, an environment that surrounds the sound analysis device in at least the

Type: Grant

Filed: March 13, 2008

Date of Patent: July 2, 2013

Assignee: Panasonic Corporation

Inventors: Takashi Kawamura, Ryouichi Kawanishi
Method of deriving a set of features for an audio input signal

Patent number: 8423356

Abstract: The invention describes a method of deriving a set of features (S) of an audio input signal (M), which method comprises identifying a number of first-order features (f1, f2, . . . , ff) of the audio input signal (M), generating a number of correlation values (?1, ?2, . . . , ?I) from at least part of the first-order features (f1, f2, . . . , ff), and compiling the set of features (S) for the audio input signal (M) using the correlation values (?1, ?2, . . . , ?I). The invention further describes a method of classifying an audio input signal (M) into a group, and a method of comparing audio input signals (M, M?) to determine a degree of similarity between the audio input signals (M, M?).

Type: Grant

Filed: October 16, 2006

Date of Patent: April 16, 2013

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Dirk Jeroen Breebaart, Martin Franciscus McKinney
Concealment of transmission error in a digital audio signal in a hierarchical decoding structure

Patent number: 8391373

Abstract: A method is provided for concealing a transmission error in a digital signal chopped into a plurality of successive frames associated with different time intervals in which, on reception, the signal may comprise erased frames and valid frames, the valid frames comprising information relating to the concealment of frame loss. The method is implemented during a hierarchical decoding using a core decoding and a transform-based decoding using windows introducing a time delay of less than a frame with respect to the core decoding. The method includes concealing a first set of missing samples for the erased frame, implemented in a first time interval; a step of concealing a second set of missing samples utilizing information of said valid frame and implemented in a second time interval; and a step of transition between the first and the second set of missing samples to obtain at least part of the missing frame.

Type: Grant

Filed: March 20, 2009

Date of Patent: March 5, 2013

Assignee: France Telecom

Inventors: David Virette, Pierrick Philippe, Balazs Kovesi
Encoder and decoder using inter channel prediction based on optimally determined signals

Patent number: 8374883

Abstract: An encoder improves inter-channel prediction (ICP) performance in scalable stereo sound encoding using an ICP. In the encoder, ICP analysis units use, as reference signal candidates, a frequency coefficient in the low-band portion of a side residual signal, a frequency coefficient in each sub-band portion of a monaural residual signal, and a frequency coefficient in the low-band portion of the monaural residual signal, respectively, and perform an ICP analysis between the these respective candidates and a frequency coefficient in each sub-band portion of the side residual signal to generate first, second, and third ICP coefficients.

Type: Grant

Filed: October 31, 2008

Date of Patent: February 12, 2013

Assignee: Panasonic Corporation

Inventors: Haishan Zhong, Zongxian Liu, Kok Seng Chong, Koji Yoshida
Packet loss concealment based on forced waveform alignment after packet loss

Patent number: 8346546

Abstract: A packet loss concealment method and system is described that attempts to reduce or eliminate destructive interference that can occur when an extrapolated waveform representing a lost segment of a speech or audio signal is merged with a good segment after a packet loss. This is achieved by guiding a waveform extrapolation that is performed to replace the bad segment using a waveform available in the first good segment or segments after the packet loss. In another aspect of the invention, a selection is made between a packet loss concealment method that performs the aforementioned guided waveform extrapolation and one that does not. The selection may be made responsive to determining whether the first good segment or segments after the packet loss are available and also to whether a segment preceding the lost segment and the first good segment following the lost segment are deemed voiced.

Type: Grant

Filed: July 31, 2007

Date of Patent: January 1, 2013

Assignee: Broadcom Corporation

Inventor: Juin-Hwey Chen
Method and apparatus of providing a quality measure for an output voice signal generated to reproduce an input voice signal

Patent number: 8296131

Abstract: A method of providing a quality measure for an output voice signal generated to reproduce an input voice signal, the method comprising: partitioning the input and output signals into frames; for each frame of the input signal, determining a disturbance relative to each of a plurality of frames of the output signal; determining a subset of the determined disturbances comprising one disturbance for each input frame such that a sum of the disturbances in the subset set is a minimum; and using the set of disturbances to provide the measure of quality.

Type: Grant

Filed: December 30, 2008

Date of Patent: October 23, 2012

Assignee: AudioCodes Ltd.

Inventors: Ilan D. Shallom, Nitay Shiran
Apparatus and a method for calculating a number of spectral envelopes

Patent number: 8296159

Abstract: An apparatus calculates a number of spectral envelopes to be derived by a spectral band replication (SBR) encoder, wherein the SBR encoder is adapted to encode an audio signal using a plurality of sample values within a predetermined number of subsequent time portions in an SBR frame extending from an initial time to a final time, the predetermined number of subsequent time portions being arranged in a time sequence given by the audio signal. The apparatus has a decision value calculator for determining a decision value, the decision value measuring a deviation in spectral energy distributions of a pair of neighboring time portions. The apparatus further has a detector for detecting a violation of a threshold by the decision value and a processor for determining a first envelope border between the pair of neighboring time portions when the violation of the threshold is detected.

Type: Grant

Filed: January 11, 2011

Date of Patent: October 23, 2012

Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.

Inventors: Max Neuendorf, Bernhard Grill, Ulrich Kraemer, Markus Multrus, Harald Popp, Nikolaus Rettelbach, Frederik Nagel, Markus Lohwasser, Marc Gayer, Manuel Jander, Virgilio Bacigalupo
Efficient use of phase information in audio encoding and decoding

Patent number: 8255228

Abstract: An efficient encoded representation of a first and a second input audio signal can be derived using correlation information indicating a correlation between the first and the second input audio signals, when a signal characterization information, indicating at least a first or a second, different characteristic of the input audio signal is additionally considered. Phase information indicating a phase relation between the first and the second input audio signals is derived, when the input audio signals have the first characteristic. The phase information and a correlation measure are included into the encoded representation when the input audio signals have the first characteristic, and only the correlation information is included into the encoded representation when the input audio signals have the second characteristic.

Type: Grant

Filed: January 11, 2011

Date of Patent: August 28, 2012

Assignee: Fraunhofer-Gesellschaft zur Foerderung der Angewandten Forschung E.V.

Inventors: Johannes Hilpert, Bernhard Grill, Matthias Neusinger, Julien Robilliard, Maria Luis-Valero
Method of and apparatus for image analysis

Patent number: 8238691

Abstract: A method of and apparatus for image analysis for picture loss detection in fields or frames in video or film content makes use of different correlation characteristics of picture images and non-picture images to detect picture loss. A measure of self correlation of a plurality of image data samples, and a measure of the correlation of the plurality of image data samples with a mean value are determined, and a positive detection of picture loss is based on a comparison between the two correlation measures.

Type: Grant

Filed: September 6, 2006

Date of Patent: August 7, 2012

Assignee: Snell & Wilcox Limited

Inventors: Jonathan Diggins, Martin Weston
System, apparatus, and method for a robust synchronization scheme for digital communication systems

Patent number: 8233576

Abstract: A synchronization sequence (preamble) that is known to the receiver forms as an integral part of packet-based digital communication systems. The first operation in such digital communication systems is the detection of the beginning of a valid signal (packet). A system, apparatus, and method for a scheme to robustly detect the preamble are provided having a hierarchical cross-correlator in combination with a second stage delayed auto-correlator using the output of the cross-correlator as an input to the second stage correlator.

Type: Grant

Filed: December 5, 2006

Date of Patent: July 31, 2012

Assignee: Koninklijke Philips Electronics N.V.

Inventor: Dagnachew Birru
Speech processing apparatus and method

Patent number: 8229739

Abstract: A speech processing apparatus includes a plurality of microphones which receive speech produced by a first sound source to obtain first speech signals for a plurality of channels having one-to-one correspondence with the plurality of microphones, a calculation unit configured to calculate a first characteristic amount indicative of an inter-channel correlation of the first speech signals, a storage unit configured to store in advance a second characteristic amount indicative of an inter-channel correlation of second speech signals for the plurality of channels obtained by receiving speech produced by a second sound source by the plurality of microphones, and a collation unit configured to collate the first characteristic amount with the second characteristic amount to determine whether the first sound source matches with the second sound source.

Type: Grant

Filed: July 21, 2008

Date of Patent: July 24, 2012

Assignee: Kabushiki Kaisha Toshiba

Inventor: Tadashi Amada
Pitch range refinement

Patent number: 8214201

Abstract: A method of refining a pitch period estimation of a signal, the method comprising: for each of a plurality of portions of the signal, scanning over a predefined range of time offsets to find an estimate of the pitch period of the portion within the predefined range of time offsets; identifying the average pitch period of the estimated pitch periods of the portions; determining a refined range of time offsets in dependence on the average pitch period, the refined range of time offsets being narrower than the predefined range of time offsets; and for a subsequent portion of the signal, scanning over the refined range of time offsets to find an estimate of the pitch period of the subsequent portion.

Type: Grant

Filed: November 19, 2008

Date of Patent: July 3, 2012

Assignee: Cambridge Silicon Radio Limited

Inventor: Xuejing Sun
Stereo decoder that conceals a lost frame in one channel using data from another channel

Patent number: 8209168

Abstract: An audio data transmitting/receiving apparatus for realizing a high-quality frame compensation in audio communications. In an audio data transmitting apparatus (10), a delay part (104) subjects multi-channel audio data to a delay process that delays the L-ch encoded data relative to the R-ch encoded data by a predetermined delay amount. A multiplexing part (106) multiplexes the audio data as subjected to the delay process. A transmitting part (108) transmits the audio data as multiplexed. In an audio data receiving apparatus (20), a separating part (114) separates, for each channel, the audio data received from the audio data transmitting apparatus (10). A decoding part (118) decodes, for each channel, the audio data as separated. If there has occurred a loss or error in the audio data as separated, then a frame compensating part (120) uses one of the L-ch and R-ch encoded data to compensate for the loss or error in the other encoded data.

Type: Grant

Filed: May 20, 2005

Date of Patent: June 26, 2012

Assignee: Panasonic Corporation

Inventor: Koji Yoshida
Method, apparatus, and medium for classifying speech signal and method, apparatus, and medium for encoding speech signal using the same

Patent number: 8175869

Abstract: A method, apparatus, and medium for classifying a speech signal and a method, apparatus, and medium for encoding the speech signal using the same are provided. The method for classifying a speech signal includes calculating classification parameters from an input signal having block units, calculating a plurality of classification criteria from the classification parameters, and classifying the level of the input signal using the plurality of classification criteria. The classification parameters include at least one of an energy parameter of the input signal, a cross-correlation parameter between a specific block of a present frame and the input signal, and an integrated cross-correlation parameter obtained by accumulating the cross-correlation parameter.

Type: Grant

Filed: July 5, 2006

Date of Patent: May 8, 2012

Assignee: Samsung Electronics Co., Ltd.

Inventors: Hosang Sung, Rakesh Taori, Kangeun Lee
Speech analysis apparatus, speech analysis method and computer program

Patent number: 8165873

Abstract: A speech analysis apparatus analyzing prosodic characteristics of speech information and outputting a prosodic discrimination result includes an input unit inputting speech information, an acoustic analysis unit calculating relative pitch variation and a discrimination unit performing speech discrimination processing, in which the acoustic analysis unit calculates a current template relative pitch difference, determining whether a difference absolute value between the current template relative pitch difference and a previous template relative pitch difference is equal to or less than a predetermined threshold or not, when the value is not less than the threshold, calculating an adjacent relative pitch difference, and when the adjacent relative pitch difference is equal to or less than a previously set margin value, executing correction processing of adding or subtracting an octave of the current template relative pitch difference to calculate the relative pitch variation by applying the relative pitch differe

Type: Grant

Filed: July 21, 2008

Date of Patent: April 24, 2012

Assignee: Sony Corporation

Inventor: Keiichi Yamada
Seamless audio speed change based on time scale modification

Patent number: 8155972

Abstract: This invention involves time-scale modification of audio signals. The invention describes overlap and add time scale modification with variable input and output buffer sizes. Seamless speed change is achieved by keeping track of previously processed data to avoid discontinuities during playback speed transitions.

Type: Grant

Filed: October 5, 2005

Date of Patent: April 10, 2012

Assignee: Texas Instruments Incorporated

Inventors: Atsuhiro Sakurai, Yoshihide Iwata
Stereo audio encoding device, stereo audio decoding device, and method thereof

Patent number: 8150702

Abstract: Disclosed is a stereo audio encoding device capable of improving a spatial image of a decoded audio in stereo audio encoding. In this device, an original cross correlation calculation unit (101) calculates a mutual relationship coefficient (C1) between the original L channel signal and the original R channel signal. A stereo audio reconfiguration unit (104) subjects the inputted L channel signal and the R channel signal to encoding and decoding so as to generate an L channel reconfigured signal (L?) and an R channel reconfigured signal (R?). A reconfiguration cross correlation calculation unit (105) calculates a cross correlation coefficient (C2) between the L channel reconfigured signal (L?) and the R channel reconfigured signal (R?). A cross correlation comparison unit (106) calculates and outputs a comparison result &agr; between the cross correlation coefficient (C1) and the cross correlation coefficient (C2).

Type: Grant

Filed: August 2, 2007

Date of Patent: April 3, 2012

Assignee: Panasonic Corporation

Inventors: Jiong Zhou, Kok Seng Chong
Apparatus, method, and computer program for comparing audio signals

Patent number: 8150683

Abstract: An apparatus, method, and computer program are capable of receiving and cross-correlating a first audio signal and a second audio signal. This produces a cross-correlated signal, which is used to identify a plurality of parameters associated with at least one of the first and second audio signals. The parameters are used to generate an indicator identifying an extent to which the first and second audio signals match.

Type: Grant

Filed: November 4, 2003

Date of Patent: April 3, 2012

Assignee: STMicroelectronics Asia Pacific Pte., Ltd.

Inventors: Kabi P. Padhi, Sapna George
Systems, methods, and apparatus for computationally efficient, iterative alignment of speech waveforms

Patent number: 8145477

Abstract: Systems, methods, and apparatus described include waveform alignment operations in which a single set of evaluated cosines and sines is used to calculate cross-correlations of two periodic waveforms at two different phase shifts.

Type: Grant

Filed: December 1, 2006

Date of Patent: March 27, 2012

Inventors: Sharath Manjunath, Ananthapadmanabhan A. Kandhadai
Sound source separation system which converges a separation matrix using a dynamic update amount based on a cost function

Patent number: 8131542

Abstract: A system capable of separating sound source signals with high precision while improving a convergence rate and convergence precision. A process of updating a current separation matrix Wk to a next separation matrix Wk+1 such that a next value J(Wk+1) of a cost function is closer to a minimum value J(W0) than a current value J(Wk) is iteratively performed. An update amount ?Wk of the separation matrix is increased as the current value J(Wk) of the cost function is increased and is decreased as a current gradient ?J(Wk)/?W of the cost function is rapid. On the basis of input signals x from a plurality of microphones Mi and an optimal separation matrix W0, it is possible to separate sound source signals y(=W0·x) with high precision while improving a convergence rate and convergence precision.

Type: Grant

Filed: June 5, 2008

Date of Patent: March 6, 2012

Assignee: Honda Motor Co., Ltd.

Inventors: Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino
Two microphone noise reduction system

Patent number: 8131541

Abstract: A two microphone noise reduction system is described. In an embodiment, input signals from each of the microphones are divided into subbands and each subband is then filtered independently to separate noise and desired signals and to suppress non-stationary and stationary noise. Filtering methods used include adaptive decorrelation filtering. A post-processing module using adaptive noise cancellation like filtering algorithms may be used to further suppress stationary and non-stationary noise in the output signals from the adaptive decorrelation filtering and a single microphone noise reduction algorithm may be used to further provide optimal stationary noise reduction performance of the system.

Type: Grant

Filed: April 25, 2008

Date of Patent: March 6, 2012

Assignee: Cambridge Silicon Radio Limited

Inventors: Kuan-Chieh Yen, Rogerio Guedes Alves
Audio time scale modification algorithm for dynamic playback speed control

Patent number: 8078456

Abstract: A modified synchronized overlap add (SOLA) algorithm for performing high-quality, low-complexity audio time scale modification (TSM) is described. The algorithm produces good output audio quality with a very low complexity and without producing additional audible distortion during dynamic change of the audio playback speed. The algorithm may achieve complexity reduction by performing the maximization of normalized cross-correlation using decimated signals. By updating the input buffer and the output buffer in a precise sequence with careful checking of the appropriate array bounds, the algorithm may also achieve seamless audio playback during dynamic speed change with a minimal requirement on memory usage.

Type: Grant

Filed: May 12, 2008

Date of Patent: December 13, 2011

Assignee: Broadcom Corporation

Inventors: Juin-Hwey Chen, Robert W. Zopf
Apparatus, method and computer program product for feature extraction

Patent number: 8073686

Abstract: A feature extraction apparatus includes a spectrum calculating unit that calculates, based on an input speech signal, a frequency spectrum having frequency components obtained at regular intervals on a logarithmic frequency scale for each of frames that are defined by regular time intervals, and thereby generates a time series of the frequency spectrum; a cross-correlation coefficients calculating unit that calculates, for each target frame of the frames, a cross-correlation coefficients between frequency spectra calculated for two different frames that are in vicinity of the target frame and a predetermined frame width apart from each other; and a shift amount predicting unit that predicts a shift amount of the frequency spectra on the logarithmic frequency scale with respect to the predetermined frame width by use of the cross-correlation coefficients.

Type: Grant

Filed: February 5, 2009

Date of Patent: December 6, 2011

Assignee: Kabushiki Kaisha Toshiba

Inventors: Yusuke Kida, Takashi Masuko

1 2 next