Excitation Patterns Patents (Class 704/223)

Dual-pulse excited linear prediction for speech coding

Patent number: 8175870

Abstract: The invention proposed a Dual-Pulse Excitation Model; wherein two pulses of each pair of pulses are always adjacent each other. Only one position index for each pair of pulses needs to be sent to the decoder, which saves bits to code all pulse positions. The magnitudes of each pair of pulses have limited number of patterns. Because the two pulses are adjacent each other, each pair of pulses with different magnitudes can produce different high-pass and/or low-pass effect. Since the magnitudes have enough variation, it is possible to assign the candidate positions of each pair of pulses within a small range in order to save the searching complexity.

Type: Grant

Filed: November 19, 2007

Date of Patent: May 8, 2012

Assignee: Huawei Technologies Co., Ltd.

Inventor: Yang Gao
Method and system for improving speech quality

Patent number: 8165872

Abstract: A method and system for improving speech quality may include estimating at least one component of a distorted portion of a speech signal from at least one component of an undistorted portion of the speech signal and reinforcing the component of the distorted portion based on the estimating. The components may include the pitch, spectral envelope and spectral energy of the speech signal. The undistorted portion of the speech signal may be delayed and the components of the distorted portion may be interpolated from the components of a delayed undistorted portion and a current undistorted portion of the speech signal. The components of the distorted portion of the speech signal may be extrapolated from a current undistorted portion of the speech signal. Components of the distorted portion of the speech signal may be estimated from frequency bands other than the frequency band affected by the distortion.

Type: Grant

Filed: February 1, 2007

Date of Patent: April 24, 2012

Assignee: Broadcom Corporation

Inventors: Wilfrid LeBlanc, Mohammad Zad-Issa
Method and apparatus for layered code-excited linear prediction speech utilizing linear prediction excitation corresponding to optimal gains

Patent number: 8160872

Abstract: A layered code-excited linear prediction (CELP) encoder, an Adaptive Multirate Wideband (AMR-WB) encoder and methods of CELP encoding and decoding. In one embodiment, the encoder includes: (1) a core layer subencoder and (2) at least one enhancement layer subencoder, at least one of the core layer subencoder and the enhancement layer subencoder having first and second adaptive codebooks and configured to retrieve a pitch lag estimate from the second adaptive codebook and perform a closed-loop search of the first adaptive codebook based on the pitch lag estimate.

Type: Grant

Filed: April 3, 2008

Date of Patent: April 17, 2012

Assignee: Texas Instruments Incorporated

Inventor: Jacek P. Stachurski
Speech coding method and apparatus which codes spectrum parameters and an excitation signal

Patent number: 8160871

Abstract: A wideband speech coding apparatus which causes an input speech signal to be represented by spectrum parameters and an excitation signal. The apparatus includes a coding unit configured to select a plurality of pulses from given pulse position candidates, and to code the excitation signal with the selected pulses; an identification unit configured to identify whether the input speech signal is a wideband speech signal or a narrowband speech signal; and a control unit configured to control the coding unit to select a pulse position candidate having a time resolution which is set in advance in accordance with the wideband speech signal, when the identification unit identifies that the input speech signal is the wideband speech signal, and to control the coding unit to lower the time resolution of the pulse position candidate, when the identification unit identifies that the input speech signal is the narrowband speech signal.

Type: Grant

Filed: March 31, 2010

Date of Patent: April 17, 2012

Assignee: Kabushiki Kaisha Toshiba

Inventor: Kimio Miseki
Time warping frames inside the vocoder by modifying the residual

Patent number: 8155965

Abstract: In one embodiment, the present invention comprises a vocoder having at least one input and at least one output, an encoder comprising a filter having at least one input operably connected to the input of the vocoder and at least one output, a decoder comprising a synthesizer having at least one input operably connected to the at least one output of the encoder, and at least one output operably connected to the at least one output of the vocoder, wherein the encoder comprises a memory and the encoder is adapted to execute instructions stored in the memory comprising classifying speech segments and encoding speech segments, and the decoder comprises a memory and the decoder is adapted to execute instructions stored in the memory comprising time-warping a residual speech signal to an expanded or compressed version of the residual speech signal.

Type: Grant

Filed: May 5, 2005

Date of Patent: April 10, 2012

Assignee: QUALCOMM Incorporated

Inventors: Rohit Kapoor, Serafin Diaz Spindola
Speech decoding method and apparatus which generates an excitation signal and a synthesis filter

Patent number: 8155955

Abstract: A speech decoding method which generates an excitation signal and a synthesis filter from coded data and which obtains a speech signal based on the excitation signal and the synthesis filter. The method includes acquiring identification information used for determining whether the speech signal to be decoded is a narrowband signal or a wideband signal; and modifying the excitation signal based on the identification information by controlling strength or presence of emphasis of pitch periodicity with respect to the excitation signal generated from the coded data, so as to generate the speech signal by use of the modified excitation signal and the synthesis filter.

Type: Grant

Filed: March 31, 2010

Date of Patent: April 10, 2012

Assignee: Kabushiki Kaisha Toshiba

Inventor: Kimio Miseki
Method for high quality audio transcoding

Patent number: 8150685

Abstract: A method and apparatus for a voice transcoder that converts a bitstream representing frames of data encoded according to a first voice compression standard to a bitstream representing frames of data according to a second voice compression standard using perceptual weighting that uses tuned weighting factors, such that the bitstream of a second voice compression standard to produce a higher quality decoded voice signal than a comparable tandem transcoding solution.

Type: Grant

Filed: April 29, 2011

Date of Patent: April 3, 2012

Assignee: Onmobile Global Limited

Inventors: Marwan A. Jabri, Jianwei Wang, Nicola Chong-White, Michael Ibrahim
Robot behavior control system and method, and robot apparatus

Patent number: 8145492

Abstract: A behavior control system of a robot for learning a phoneme sequence includes a sound inputting device inputting a phoneme sequence, a sound signal learning unit operable to convert the phoneme sequence into a sound synthesis parameter and to learn or evaluate a relationship between a sound synthesis parameter of a phoneme sequence that is generated by the robot and a sound synthesis parameter used for sound imitation, and a sound synthesizer operable to generate a phoneme sequence based on the sound synthesis parameter obtained by the sound signal learning unit.

Type: Grant

Filed: April 6, 2005

Date of Patent: March 27, 2012

Assignee: Sony Corporation

Inventor: Masahiro Fujita
Conference bridge which detects control information embedded in audio information to prioritize operations

Patent number: 8144854

Abstract: Disclosed herein are methods, systems, and devices for improved audio, video, and data conferencing. The present invention provides a conferencing system comprising a plurality of endpoints communicating data including audio data and control data according to a communication protocol. A local conference endpoint may control or be controlled by a remote conference endpoint. Data comprising control signals may be exchanged between the local endpoint and remote endpoint via various communication protocols. In other embodiments, the present invention provides for improved bridge architecture for controlling functions of conference endpoints including controlling functions of the bridge.

Type: Grant

Filed: March 15, 2005

Date of Patent: March 27, 2012

Assignee: Polycom Inc.

Inventor: Jeffrey Rodman
Method and arrangements for coding audio signals

Patent number: 8135584

Abstract: According to the invention, an excitation signal is generated as a result of sampled excitation values in order to excite an audio synthesis filter, the generated sampled excitation values being continuously stored in an adaptive codebook. A noise generator is provided which continuously generates random sampled values. A sequence of the stored sampled excitation values is selected from the adaptive codebook based on a fed audio fundamental frequency parameter by means of which a time gap between the sequence that is to be selected and the actual time reference is predefined. The excitation signal is generated by mixing the selected sequence with a random sequence encompassing actual random sampled valued of the noise generator.

Type: Grant

Filed: January 31, 2006

Date of Patent: March 13, 2012

Assignee: Siemens Enterprise Communications GmbH & Co. KG

Inventors: Bernd Geiser, Peter Jax, Stefan Schandl, Herve Taddei
Method and device for code conversion between audio encoding/decoding methods and storage medium thereof

Patent number: 8117028

Abstract: When performing audio communication by using different encoding/decoding methods, a code obtained by encoding audio by a certain method is converted into a code decodable by another method with a high audio quality and a small calculation amount. In a code conversion device for converting a first code string into a second code string, an audio decoding circuit acquires a first linear prediction coefficient and excitation signal information from the first code string and drives the filter having the first linear prediction coefficient by the excitation signal obtained from the excitation signal information, thereby creating a first audio signal. A fixed codebook code generation circuit uses the fixed codebook information and minimizes the distance between the second audio signal generated from the information obtained from the second code string and the first audio signal, thereby obtaining the fixed codebook information in the second code string.

Type: Grant

Filed: May 22, 2003

Date of Patent: February 14, 2012

Assignee: NEC Corporation

Inventor: Atsushi Murashima
Audio encoding device and audio encoding method

Patent number: 8112271

Abstract: Provided is an audio encoding device capable of improving performance of an adaptive codebook and improving quality of a decoded audio. In this audio encoding device, an adaptive codebook cuts out a vector specified by a comparator from adaptive code vectors stored in an internal buffer and outputs it to a filter and a switch. The filter performs a predetermined filtering process on the adaptive sound source signal and outputs the obtained adaptive code vector to the switch. According to an instruction from the comparator, the switch outputs the adaptive code vector directly output from the adaptive codebook to a adjuster when the adaptive codebook is searched and outputs the adaptive code vector output from the filter after being subjected to the filtering process to the gain adjuster when a fixed sound source is searched after the adaptive sound source search.

Type: Grant

Filed: August 7, 2007

Date of Patent: February 7, 2012

Assignee: Panasonic Corporation

Inventor: Toshiyuki Morii
Method and device for updating status of synthesis filters

Patent number: 8078459

Abstract: A method and device for updating statuses of synthesis filters are provided. The method includes: exciting a synthesis filter corresponding to a first encoding rate by using an excitation signal of the first encoding rate, outputting reconstructed signal information, and updating status information of the synthesis filter and a synthesis filter corresponding to a second encoding rate. In the present disclosure, the status of the synthesis filter corresponding to the current rate and the statuses of the synthesis filters at other rates are updated. Thus, synchronization between the statuses of the synthesis filters corresponding to different rates at the encoding terminal may be realized, thereby facilitating the consistency of the reconstructed signals of the encoding and decoding terminals when the encoding rate is switched, and improving the quality of the reconstructed signal of the decoding terminal.

Type: Grant

Filed: June 14, 2010

Date of Patent: December 13, 2011

Assignee: Huawei Technologies Co., Ltd.

Inventor: Jinliang Dai
Systems, methods, and apparatus for quantization of spectral envelope representation

Patent number: 8069040

Abstract: A quantizer according to an embodiment is configured to quantize a smoothed value of an input value (e.g., a vector of line spectral frequencies) to produce a corresponding output value, where the smoothed value is based on a scale factor and a quantization error of a previous output value.

Type: Grant

Filed: April 3, 2006

Date of Patent: November 29, 2011

Assignee: QUALCOMM Incorporated

Inventor: Koen Bernard Vos
Method and apparatus for implementing fixed codebooks of speech codecs as common module

Patent number: 8050913

Abstract: A method and apparatus for implementing fixed codebooks as a common module are provided. In the method of implementing fixed codebooks of a plurality of speech codecs as a common module, it is possible to include only a part excluding fixed codebooks in a communication terminal or communication system, support various speech codecs without using a chip with high price and high performance, and reduce a memory space that is occupied by the speech codecs by generating a track of a fixed codebook corresponding to a speech codec based on information on the speech codec among the plurality of speech codecs and selecting a codebook vector corresponding to a target signal among codebook vectors constructed with combinations of pulses represented by the generated track. In addition, it is possible to reduce processing complexity as compared with a case of embodying the common fixed codebook module in software by embodying the common fixed codebook module in hardware.

Type: Grant

Filed: October 31, 2007

Date of Patent: November 1, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Kang-eun Lee, Do-hyung Kim, Chang-yong Son
System enhancement of speech signals

Patent number: 8050914

Abstract: A system enhances speech by detecting a speaker's utterance through a first microphone positioned a first distance from a source of interference. A second microphone may detect the speaker's utterance at a different position. A monitoring device may estimate the power level of a first microphone signal. A synthesizer may synthesize part of the first microphone signal by processing the second microphone signal. The synthesis may occur when power level is below a predetermined level.

Type: Grant

Filed: November 12, 2008

Date of Patent: November 1, 2011

Assignee: Nuance Communications, Inc.

Inventors: Gerhard Uwe Schmidt, Mohamed Krini
Method and device for updating status of synthesis filters

Patent number: 8046216

Abstract: A method and device for updating statuses of synthesis filters are provided. The method includes: exciting a synthesis filter corresponding to a first encoding rate by using an excitation signal of the first encoding rate, outputting reconstructed signal information, and updating status information of the synthesis filter and a synthesis filter corresponding to a second encoding rate. In the present disclosure, the status of the synthesis filter corresponding to the current rate and the statuses of the synthesis filters at other rates are updated. Thus, synchronization between the statuses of the synthesis filters corresponding to different rates at the encoding terminal may be realized, thereby facilitating the consistency of the reconstructed signals of the encoding and decoding terminals when the encoding rate is switched, and improving the quality of the reconstructed signal of the decoding terminal.

Type: Grant

Filed: July 14, 2009

Date of Patent: October 25, 2011

Assignee: Huawei Technologies Co., Ltd.

Inventor: Jinliang Dai
Method and device for adaptive bandwidth pitch search in coding wideband signals

Patent number: 8036885

Abstract: A pitch search method and device for digitally encoding a wideband signal, in particular but not exclusively a speech signal, in view of transmitting, or storing, and synthesizing this wideband sound signal. The new method and device which achieve efficient modeling of the harmonic structure of the speech spectrum uses several forms of low pass filters applied to a pitch codevector, the one yielding higher prediction gain (i.e. the lowest pitch prediction error) is selected and the associated pitch codebook parameters are forwarded.

Type: Grant

Filed: November 17, 2009

Date of Patent: October 11, 2011

Assignee: Voiceage Corp.

Inventors: Bruno Bessette, Redwan Salami, Roch Lefebvre
Scalable encoding device and scalable encoding method

Patent number: 8036390

Abstract: A scalable encoding device prevents sound quality deterioration of a decoded signal, reduces the encoding rate, and reduces the circuit size. The scalable encoding device includes a first layer encoder for generating a monaural signal by using a plurality of channel signals (L channel signal and R channel signal) constituting a stereo signal and encoding the monaural signal to generate a sound source parameter. The scalable encoding device also includes a second layer encoder for generating a first conversion signal by using the channel signal and the monaural signal, generating a synthesis signal by using the sound source parameter and the first conversion signal, and generating a second conversion coefficient index by using the synthesis signal and the first conversion signal.

Type: Grant

Filed: January 30, 2006

Date of Patent: October 11, 2011

Assignee: Panasonic Corporation

Inventors: Michiyo Goto, Koji Yoshida
Identification of the presence of speech in digital audio data

Patent number: 8036884

Abstract: The present invention provides a method, a computer-software-product and an apparatus for enabling a determination of speech related audio data within a record of digital audio data. The method comprises steps for extracting audio features from the record of digital audio data, for classifying one or more subsections of the record of digital audio data, and for marking at least a part of the record of digital audio data classified as speech. The classification of the digital audio data record is performed on the basis of the extracted audio features and with respect to at least one predetermined audio class.

Type: Grant

Filed: February 24, 2005

Date of Patent: October 11, 2011

Assignee: Sony Deutschland GmbH

Inventors: Yin Hay Lam, Josep Maria Sola I Caros
Adaptive postfiltering methods and systems for decoding speech

Patent number: 8032363

Abstract: A method of processing a decoded speech (DS) signal including successive DS frames, each DS frame including DS samples. The method comprises: adaptively filtering the DS signal to produce a filtered signal; gain-scaling the filtered signal with an adaptive gain updated once a DS frame, thereby producing a gain-scaled signal; and performing a smoothing operation to smooth possible waveform discontinuities in the gain-scaled signal.

Type: Grant

Filed: August 9, 2002

Date of Patent: October 4, 2011

Assignee: Broadcom Corporation

Inventors: Juin-Hwey Chen, Jes Thyssen, Chris C Lee
Pulse allocating method in voice coding

Patent number: 8024187

Abstract: A pulse allocating method capable of coding stereophonic voice signals efficiently. In the fixed code note retrievals of this pulse allocating method, for individual subframes, the stereophonic voice signals are compared to judge similarity between channels, and are judged on their characteristics. On the basis of the similarity between the channels and the characteristics of the stereophonic signals, the pulse numbers to be allocated to the individual channels are determined. Pulse retrievals are executed to determine the pulse positions for the individual channels, so that the pulses determined are coded.

Type: Grant

Filed: February 9, 2006

Date of Patent: September 20, 2011

Assignee: Panasonic Corporation

Inventors: Chun Woei Teo, Sua Hong Neo, Koji Yoshida, Michiyo Goto
Method and apparatus for adaptively encoding and decoding high frequency band

Patent number: 8010352

Abstract: Provided are a method and apparatus for encoding and decoding an audio signal. According to the present application, a signal of a high frequency band above a preset frequency band is adaptively encoded or decoded in the time domain or in the frequency domain by using a signal of a low frequency band below the preset frequency band. As such, the sound quality of a high frequency signal is not deteriorate even when an audio signal is encoded or decoded by using a small number of bits and thus coding efficiency may be maximized.

Type: Grant

Filed: June 21, 2007

Date of Patent: August 30, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Chang-yong Son, Eun-mi Oh, Ki-hyun Choo, Jung-hoe Kim
Systems and methods for dynamic normalization to reduce loss in precision for low-level signals

Patent number: 8005671

Abstract: A normalization factor for a current frame of a signal may be determined. The normalization factor may depend on an amplitude of the current frame of the signal. The normalization factor may also depend on values of states after one or more operations were performed on a previous frame of a normalized signal. The current frame of the signal may be normalized based on the normalization factor that is determined. The states' normalization factor may be adjusted based on the normalization factor that is determined.

Type: Grant

Filed: January 31, 2007

Date of Patent: August 23, 2011

Assignee: QUALCOMM Incorporated

Inventors: Vivek Rajendran, Ananthapadmanabhan A. Kandhadai
Low-complexity code excited linear prediction encoding

Patent number: 8000967

Abstract: Information about excitation signals of a first signal encoded by CELP is used to derive a limited set of candidate excitation signals for a second correlated second signal. Preferably, pulse locations of the excitation signals of the first encoded signal are used for determining the set of candidate excitation signals. More preferably, the pulse locations of the set of candidate excitation signals are positioned in the vicinity of the pulse locations of the excitation signals of the first encoded signal. The first and second signals may be multi-channel signals of a common speech or audio signal. However, the first and second signals may also be identical, whereby the coding of the second signal can be utilized for re-encoding at a lower bit rate.

Type: Grant

Filed: March 9, 2005

Date of Patent: August 16, 2011

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventor: Anisse Taleb
Device, method and computer program for analyzing an audio signal

Patent number: 7996212

Abstract: A hardware device for analyzing an audio signal comprises a calculator for calculating a neural activity pattern over time resulting at nerve fibers of an ear model based on the audio signal and a processor for processing the neural activity pattern to obtain a sequence of time information as an analysis representation describing a temporal position of consecutive trajectories, wherein a trajectory includes activity impulses on different nerve fibers based on the same event in the audio signal. A two-dimensional representation of the neural activity pattern is gradually distorted over time, and it is recognized when an approximately straight line is contained in the distorted two-dimensional representation of the neural activity pattern over time. Accordingly, a time information belonging to the trajectory is provided.

Type: Grant

Filed: June 29, 2005

Date of Patent: August 9, 2011

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventor: Frank Klefenz
Speech encoding apparatus and speech encoding method that encode speech signals in a scalable manner, and speech decoding apparatus and speech decoding method that decode scalable encoded signals

Patent number: 7991611

Abstract: An audio encoding device for correcting a component having insufficient encoding capability in a core layer by an extended layer. A core layer encoder encodes an audio signal. An extended layer encoder encodes an encoding residual of the core layer encoder. A characteristic correction inverse filter arranged at a pre-stage of an LPC synthesis filter subjects the component having insufficient encoding capability in the core layer to an inverse characteristic correction process, and a characteristic correction filter arranged at a post-stage of the LPC synthesis filter performs a process for characteristic correction of the synthesis signal inputted from the LPC synthesis filter.

Type: Grant

Filed: October 13, 2006

Date of Patent: August 2, 2011

Assignee: Panasonic Corporation

Inventors: Hiroyuki Ehara, Koji Yoshida
System and methods for concealing errors in data transmission

Patent number: 7979272

Abstract: The present invention provides a frame erasure concealment device and method that is based on reestimating gain parameters for a code excited linear prediction (CELP) coder. During operation, when a frame in a stream of received data is detected as being erased, the coding parameters, especially an adaptive codebook gain gp and a fixed codebook gain gc, of the erased and subsequent frames can be reestimated by a gain matching procedure. By using this technique with the IS-641 speech coder, it has been found that the present invention improves the speech quality under various channel conditions, compared with a conventional extrapolation-based concealment algorithm.

Type: Grant

Filed: October 12, 2007

Date of Patent: July 12, 2011

Assignee: AT&T Intellectual Property II, L.P.

Inventors: Hong-Goo Kang, Hong Kook Kim
Method and system for low bit rate voice encoding and decoding applicable for any reduced bandwidth requirements including wireless

Patent number: 7970607

Abstract: An implementation of the present invention comprises a voice encoder and decoder method and system that uses voice excitation, eliminating the voice/unvoiced pitch tracking, and the first formant up to 2400 Hertz for synchronous and up to 1600 Hertz for asynchronous, does not use pulse code modulation encoding, but uses the zero crossings only of the first formant, frequency dividing by two and sampling at the formant frequency. The resulting combination uses half or less of the bit rate for excitation and the remainder for short-term spectrum analysis. The spectrum could be updated each 20 milliseconds using 49 bits for the spectrum frame and 49 bits for excitation and one frame bit for synchronous Asynchronous operation could be update at 21.25 milliseconds using 49 bits for the spectrum information and 34 bits for excitation with one bit for frame synchronization.

Type: Grant

Filed: February 15, 2008

Date of Patent: June 28, 2011

Assignee: Clyde Holmes

Inventor: Clyde Holmes
Method for high quality audio transcoding

Patent number: 7962333

Abstract: A method and apparatus for a voice transcoder that converts a bitstream representing frames of data encoded according to a first voice compression standard to a bitstream representing frames of data according to a second voice compression standard using perceptual weighting that uses tuned weighting factors, such that the bitstream of a second voice compression standard to produce a higher quality decoded voice signal than a comparable tandem transcoding solution.

Type: Grant

Filed: August 2, 2007

Date of Patent: June 14, 2011

Assignee: Onmobile Global Limited

Inventors: Marwan A. Jabri, Jianwei Wang, Nicola Chong-White, Michael Ibrahim
Method and apparatus to update parameter of error frame

Patent number: 7962835

Abstract: In a method and apparatus to conceal an error in an audio signal, when the current frame has no error and a past frame input prior to the current frame has an error, a parameter for the past frame is generated using a parameter for the current frame and a parameter of a frame out of frames input prior to the past frame and a previously stored parameter is updated with the generated parameter, thereby concealing an error of an audio signal without additional delay and preventing degradation in sound quality in a frame that is input after a frame having an error.

Type: Grant

Filed: September 20, 2007

Date of Patent: June 14, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventors: Ho-sang Sung, Kang-eun Lee, Eun-mi Oh
Voice transcoder

Patent number: 7957963

Abstract: First encoded voice bits are transcoded into second encoded voice bits by dividing the first encoded voice bits into one or more received frames, with each received frame containing multiple ones of the first encoded voice bits. First parameter bits for at least one of the received frames are generated by applying error control decoding to one or more of the encoded voice bits contained in the received frame, speech parameters are computed from the first parameter bits, and the speech parameters are quantized to produce second parameter bits. Finally, a transmission frame is formed by applying error control encoding to one or more of the second parameter bits, and the transmission frame is included in the second encoded voice bits.

Type: Grant

Filed: December 14, 2009

Date of Patent: June 7, 2011

Assignee: Digital Voice Systems, Inc.

Inventor: John C. Hardwick
Fixed codebook searching apparatus and fixed codebook searching method

Patent number: 7957962

Abstract: A fixed codebook searching apparatus which slightly suppresses an increase in the operation amount, even if the filter applied to the excitation pulse has the characteristic that it cannot be represented by a lower triangular matrix and realizes a quasi-optimal fixed codebook search. This fixed codebook searching apparatus is provided with an algebraic codebook that generates a pulse excitation vector; a convolution operation section that convolutes an impulse response of auditory weighted synthesis filter into an impulse response vector that has a value at negative times, to generate a second impulse response vector that has a value at second negative times; a matrix generating section that generates a Toeplitz-type convolution matrix by means of the second impulse response vector; and a convolution operation section that convolutes the matrix generated by matrix generating section into the pulse excitation vector generated by algebraic codebook.

Type: Grant

Filed: February 25, 2009

Date of Patent: June 7, 2011

Assignee: Panasonic Corporation

Inventors: Hiroyuki Ehara, Koji Yoshida
Adaptive filter pitch extraction

Patent number: 7949520

Abstract: An enhancement system extracts pitch from a processed speech signal. The system estimates the pitch of voiced speech by deriving filter coefficients of an adaptive filter and using the obtained filter coefficients to derive pitch. The pitch estimation may be enhanced by using various techniques to condition the input speech signal, such as spectral modification of the background noise and the speech signal, and/or reduction of the tonal noise from the speech signal.

Type: Grant

Filed: December 9, 2005

Date of Patent: May 24, 2011

Assignee: QNX Software Sytems Co.

Inventors: Rajeev Nongpiur, Phillip A. Hetherington
Fixed codebook searching apparatus and fixed codebook searching method

Patent number: 7949521

Abstract: A fixed codebook searching apparatus which slightly suppresses an increase in the operation amount, even if the filter applied to the excitation pulse has the characteristic that it cannot be represented by a lower triangular matrix and realizes a quasi-optimal fixed codebook search. This fixed codebook searching apparatus is provided with an algebraic codebook that generates a pulse excitation vector; a convolution operation section that convolutes an impulse response of auditory weighted synthesis filter into an impulse response vector that has a value at negative times, to generate a second impulse response vector that has a value at second negative times; a matrix generating section that generates a Toeplitz-type convolution matrix by means of the second impulse response vector; and a convolution operation section that convolutes the matrix generated by matrix generating section into the pulse excitation vector generated by algebraic codebook.

Type: Grant

Filed: February 25, 2009

Date of Patent: May 24, 2011

Assignee: Panasonic Corporation

Inventors: Hiroyuki Ehara, Koji Yoshida
Fixed codebook search method and searcher

Patent number: 7941314

Abstract: A fixed codebook search method includes: initializing a counter; searching for pulses and calculating the value of a cost function Qk; initializing the counter if the Qk value increases; increasing the value of the counter if the Qk value does not increase; judging whether the value of the counter is greater than the threshold value; continuing the search process if the value of the counter is not greater than the threshold value; and ending the whole search process if the value of the counter is greater than the threshold value. The present invention reduces the search count and improves the search efficiency.

Type: Grant

Filed: May 11, 2010

Date of Patent: May 10, 2011

Assignee: Huawei Technologies Co., Ltd.

Inventors: Dejun Zhang, Liang Zhang, Lixiong Li, Tinghong Wang, Yue Lang, Wenhai Wu
Scalable audio coding

Patent number: 7921007

Abstract: The invention relates to an audio encoder and decoder and methods for audio encoding and decoding. In a preferred encoder embodiment an audio signal is encoded by deterministic encoder means to form a first encoded signal part. A spectrum of the audio signal is determined and represented by an excitation pattern, i.e. spectral values corresponding to human auditory filters, as a second encoded signal part. A masking curve is also extracted based on the excitation pattern, thus improving encoding efficiency in terms of bit rate. In a preferred decoder the first encoded signal part is decoded by deterministic decoder means. A noise generator uses the decoded first signal part together with the second signal part, i.e. the excitation pattern for the original audio signal, to generate a noise signal. The noise signal is then added to the first decoded signal part to form an output audio signal. At the decoder side the masking curve is also extracted based on the second encoded signal part, i.e.

Type: Grant

Filed: July 25, 2005

Date of Patent: April 5, 2011

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Steven Leonardus Josephus Dimphina Elisabeth Van de Par, Valery Stephanovich Kot, Nicolle Hanneke Van Schijndel
Method and device for updating status of synthesis filters

Patent number: 7921009

Abstract: A method and device for updating statuses of synthesis filters are provided. The method includes: exciting a synthesis filter corresponding to a first encoding rate by using an excitation signal of the first encoding rate, outputting reconstructed signal information, and updating status information of the synthesis filter and a synthesis filter corresponding to a second encoding rate. In the present disclosure, the status of the synthesis filter corresponding to the current rate and the statuses of the synthesis filters at other rates are updated. Thus, synchronization between the statuses of the synthesis filters corresponding to different rates at the encoding terminal may be realized, thereby facilitating the consistency of the reconstructed signals of the encoding and decoding terminals when the encoding rate is switched, and improving the quality of the reconstructed signal of the decoding terminal.

Type: Grant

Filed: September 16, 2010

Date of Patent: April 5, 2011

Assignee: Huawei Technologies Co., Ltd.

Inventor: Jinliang Dai
Method and apparatus for speech data

Patent number: 7912711

Abstract: There is disclosed a speech processing device in which prediction taps for finding prediction values of the speech of high sound quality are extracted from the synthesized sound obtained on affording linear prediction coefficients and residual signals, generated from a preset code, to a speech synthesis filter, speech of high sound quality being higher in sound quality than the synthesized sound, and in which the prediction taps are used along with preset tap coefficients to perform preset predictive calculations to find the prediction values of the speech of high sound quality. The speech of high sound quality is higher in sound quality than the synthesized sound.

Type: Grant

Filed: September 21, 2007

Date of Patent: March 22, 2011

Assignee: Sony Corporation

Inventors: Tetsujiro Kondo, Tsutomu Watanabe, Masaaki Hattori, Hiroto Kimura, Yasuhiro Fujimori
High-frequency bandwidth extension in the time domain

Patent number: 7912729

Abstract: A system extends the high-frequency spectrum of a narrow band audio signal in the time domain. The system extends the harmonics of vowels by introducing a non linearity in a narrow band signal. Extended consonants are generated by a random-noise generator. The system differentiates the vowels from the consonants by exploiting predetermined features of a speech signal.

Type: Grant

Filed: June 4, 2007

Date of Patent: March 22, 2011

Assignee: QNX Software Systems Co.

Inventors: Rajeev Nongpiur, Phillip A. Hetherington
Fixed codebook search method and searcher

Patent number: 7908136

Abstract: A fixed codebook search method includes initializing a counter, searching for pulses and calculating the value of a cost function Qk, initializing the counter if the Qk value increases, increasing the value of the counter if the Qk value does not increase, judging whether the value of the counter is greater than the threshold value, continuing the search process if the value of the counter is not greater than the threshold value, and ending the whole search process if the value of the counter is greater than the threshold value.

Type: Grant

Filed: July 16, 2010

Date of Patent: March 15, 2011

Assignee: Huawei Technologies Co., Ltd.

Inventors: Dejun Zhang, Liang Zhang, Lixiong Li, Tinghong Wang, Yue Lang, Wenhai Wu
Sub-band voice codec with multi-stage codebooks and redundant coding

Patent number: 7904293

Abstract: Techniques and tools related to coding and decoding of audio information are described. For example, redundant coded information for decoding a current frame includes signal history information associated with only a portion of a previous frame. As another example, redundant coded information for decoding a coded unit includes parameters for a codebook stage to be used in decoding the current coded unit only if the previous coded unit is not available. As yet another example, coded audio units each include a field indicating whether the coded unit includes main encoded information representing a segment of an audio signal, and whether the coded unit includes redundant coded information for use in decoding main encoded information.

Type: Grant

Filed: October 9, 2007

Date of Patent: March 8, 2011

Assignee: Microsoft Corporation

Inventors: Tian Wang, Kazuhito Koishida, Hosam A. Khalil, Xiaoqin Sun, Wei-Ge Chen
Scalable encoding device, scalable decoding device, and method thereof

Patent number: 7904292

Abstract: A scalable encoding device for realizing scalable encoding by CELP encoding of a stereo sound signal and improving the encoding efficiency. In this device, an adder and a multiplier obtain an average of a first channel signal CH1 and a second channel signal CH2 as a monaural signal M. A CELP encoder for a monaural signal subjects the monaural signal M to CELP encoding, outputs the obtained encoded parameter to outside, and outputs a synthesized monaural signal M? synthesized by using the encoded parameter to a first channel signal encoder. By using the synthesized monaural signal M? and the second channel signal CH2, the first channel signal encoder subjects the first channel signal CH1 to CELP encoding to minimize the sum of the encoding distortion of the first channel signal CH1 and the encoding distortion of the second channel signal CH2.

Type: Grant

Filed: September 28, 2005

Date of Patent: March 8, 2011

Assignee: Panasonic Corporation

Inventors: Michiyo Goto, Koji Yoshida, Hiroyuki Ehara, Masahiro Oshikiri
Monitoring system with speech recognition

Patent number: 7881939

Abstract: A system for monitoring conditions associated with an individual in a region includes at least one speech input transducer and speech processing software coupled thereto. Results of the speech processing can initiate communications with a displaced communications device such as a telephone or a computer to provide a source of feedback.

Type: Grant

Filed: May 31, 2005

Date of Patent: February 1, 2011

Assignee: Honeywell International Inc.

Inventor: Lee D. Tice
Sound encoder and sound encoding method

Patent number: 7873512

Abstract: Even when a combination of the stegonography technique and prediction encoding is applied to sound encoding, a sound encoder does not cause deterioration in quality of decoded signals. In the device, an encoding section (102) outputs an encoding code (I) to a bit embedding section (104). A function extension encoding section (103) generates an encoding code (J) for information required for extending functions of the sound encoder (100) and outputs it to the bit embedding section (104). The bit embedding section (104) embeds information on the encoding code (J) into a part of bits of the encoding code (I) and outputs the resultant encoding code (I?). A synchronization information generating section (106) generates synchronization information according to the encoding code (I?) after the bit embedding and outputs the synchronization information to the encoding section (102).

Type: Grant

Filed: July 14, 2005

Date of Patent: January 18, 2011

Assignee: Panasonic Corporation

Inventor: Masahiro Oshikiri
Method and a device for source coding

Patent number: 7869993

Abstract: A method and a device for source coding with a time advanced excitation signal. During an encoding process, a source data signal is first divided into consecutive blocks, then a first set of parameters related to a filter describing properties of a first block covering a first time period is extracted, followed by the extraction of a second set of parameters related to an excitation signal for said filter, where said second set of parameters is determined from and describing properties of both the first block and a second block following the first block within a second time period starting later than said first time period and extending outside said first time period.

Type: Grant

Filed: October 4, 2004

Date of Patent: January 11, 2011

Inventor: Pasi S. Ojala
VECTOR QUANTIZATION DEVICE, VECTOR INVERSE QUANTIZATION DEVICE, AND METHOD THEREOF

Publication number: 20110004469

Abstract: Disclosed are a vector quantization device and others capable of adaptively adjusting a vector space of a code vector for quantization of a second stage by using a quantization result of a first stage and improving the quantization accuracy.

Type: Application

Filed: October 16, 2007

Publication date: January 6, 2011

Applicant: PANASONIC CORPORATION

Inventors: Kaoru Sato, Toshiyuki Morii, Tomofumi Yamanashi
Apparatus and method for speech segment detection and system for speech recognition

Patent number: 7860718

Abstract: Provided are an apparatus and method for speech segment detection, and a system for speech recognition. The apparatus is equipped with a sound receiver and an image receiver and includes: a lip motion signal detector for detecting a motion region from image frames output from the image receiver, applying lip motion image feature information to the detected motion region, and detecting a lip motion signal; and a speech segment detector for detecting a speech segment using sound frames output from the sound receiver and the lip motion signal detected from the lip motion signal detector. Since lip motion image information is checked in a speech segment detection process, it is possible to prevent dynamic noise from being misrecognized as speech.

Type: Grant

Filed: December 4, 2006

Date of Patent: December 28, 2010

Assignee: Electronics and Telecommunications Research Institute

Inventors: Soo Jong Lee, Sang Hun Kim, Young Jik Lee, Eung Kyeu Kim
Method and apparatus for a thin audio codec

Patent number: 7848922

Abstract: An apparatus and method for encoding and decoding a voice signal. The apparatus includes an encoder configured to generate an output bitstream signal from an input voice signal. The output bitstream signal is associated with at least a first standard of a first plurality of CELP voice compression standards. Additionally, the apparatus includes a decoder configured to generate an output voice signal from an input bitstream signal. The input bitstream signal is associated with at least a first standard of a second plurality of CELP voice compression standards. The CELP encoder includes a plurality of codec-specific encoder modules. Additionally, the CELP encoder includes a plurality of generic encoder modules. The CELP decoder includes a plurality of codec-specific decoder modules. Additionally, the CELP decoder includes a plurality of generic decoder modules.

Type: Grant

Filed: August 2, 2007

Date of Patent: December 7, 2010

Inventors: Marwan A. Jabri, Nicola Chong-White, Jianwei Wang
Speech coding apparatus and speech decoding apparatus

Patent number: RE43209

Abstract: A speech coding apparatus comprises a repetition period pre-selecting unit for generating a plurality of candidates for the repetition period of a driving excitation source by multiplying the repetition period of an adaptive excitation source by a plurality of constant numbers, respectively, and for pre-selecting a predetermined number of candidates from all the candidates generated. A driving excitation source coding unit provides both excitation source location information and excitation source polarity information that minimize a coding distortion, for each of the predetermined number of candidates, and provides an evaluation value associated with the minimum coding distortion for each of the predetermined number of candidates.

Type: Grant

Filed: January 28, 2010

Date of Patent: February 21, 2012

Assignee: Mitsubishi Denki Kabushiki Kaisha

Inventors: Hirohisa Tasaki, Tadashi Yamaura

prev 1 2 3 4 5 6 7 8 9 … next