Cross-correlation Patents (Class 704/218)

Decimated bisectional pitch refinement

Patent number: 8010350

Abstract: A method and system for refining an estimated pitch period estimate based on a coarse pitch useful for performing frame loss concealment in an audio decoder as well as for other applications. A normalized correlation at the coarse pitch lag is computed and used as the current best candidate. The normalized correlation is then evaluated at the midpoint of the refinement pitch range on either side of the current best candidate. If the normalized correlation at either midpoint is greater than the current best lag, the midpoint with the maximum correlation is selected as the current best lag. After each iteration, the refinement range is decreased by a factor of two and centered on the current best lag. This bisectional search continues until the pitch has been refined to an acceptable tolerance or until the refinement range has been exhausted. During each step of the bisectional pitch refinement, the signal is decimated to reduce the complexity of computing the normalized correlation.

Type: Grant

Filed: April 13, 2007

Date of Patent: August 30, 2011

Assignee: Broadcom Corporation

Inventor: Robert W. Zopf
Channel estimation method and system using linear correlation based interference cancellation (LCIC) combined with decision-feedback-equalization (DFE)

Patent number: 7933366

Abstract: A channel estimation method and system using linear correlation based interference cancellation combined with decision-feedback-equalization (LCIC-DFE) are provided. The channel estimation method includes generating a first correlation sequence by calculating a linear correlation between a baseband sampled complex signal and a locally stored pseudo-noise signal and obtaining a second correlation sequence by iteratively removing inter-path interference from the first correlation sequence and generating a first channel impulse response (CIR) sequence based on the second correlation sequence. And, obtaining a third correlation sequence by removing random-data interference from the second correlation sequence based on the first CIR sequence and a feedback signal and generating a second CIR sequence based on the third correlation sequence.

Type: Grant

Filed: May 4, 2007

Date of Patent: April 26, 2011

Assignee: Samsung Electronics Co., Ltd.

Inventor: Guanghui Liu
Signal Pitch Period Estimation

Publication number: 20100268530

Abstract: A method and apparatus for estimating the pitch period of a signal. The method includes identifying a first candidate pitch period by performing a search only over a first range of potential pitch periods. The method further includes determining a second candidate pitch period by dividing the first candidate pitch period by an integer, wherein the second candidate pitch period is outside the first range of potential pitch periods. The method further includes selecting as the estimate of the pitch period of the signal the smaller of the candidate pitch periods that is such that portions of the signal separated by that candidate pitch period are well correlated.

Type: Application

Filed: April 21, 2009

Publication date: October 21, 2010

Applicant: CAMBRIDGE SILICON RADIO LIMITED

Inventors: Xuejing Sun, Sameer Gadre
Apparatus, method, and medium for processing audio signal using correlation between bands

Patent number: 7756715

Abstract: Apparatus, method, and medium for processing an audio signal using a correlation between bands are provided. The apparatus includes an encoding unit encoding an input audio signal and a decoding unit decoding the encoded input audio signal.

Type: Grant

Filed: November 17, 2005

Date of Patent: July 13, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Junghoe Kim, Dohyung Kim, Sihwa Lee
Methods and apparatus for blind separation of multichannel convolutive mixtures in the frequency domain

Patent number: 7711553

Abstract: A method and apparatus performing blind source separation using frequency-domain normalized multichannel blind deconvolution. Multichannel mixed signals are frames of N samples including r consecutive blocks of M samples. The frames are separated using separating filters in frequency domain in an overlap-save manner by discrete Fourier transform (DFT). The separated signals are then converted back into time domain using inverse DFT applied to a nonlinear function. Cross-power spectra between separated signals and nonlinear-transformed signals are computed and normalized by power spectra of both separated signals and nonlinear-transformed signals to have flat spectra. Time domain constraint is then applied to preserve first L cross-correlations. These alias-free normalized cross-power spectra are further constrained by nonholonomic constraints. Then, natural gradient is computed by convolving alias-free normalized cross-power spectra with separating filters.

Type: Grant

Filed: February 26, 2005

Date of Patent: May 4, 2010

Inventor: Seung Hyon Nam
Auto segmentation based partitioning and clustering approach to robust endpointing

Patent number: 7680657

Abstract: Possible segmentations for an audio signal are scored based on distortions for feature vectors of the audio signal and the total number of segments in the segmentation. The scores are used to select a segmentation and the selected segmentation is used to identify a starting point and an ending point for a speech signal in the audio signal.

Type: Grant

Filed: August 15, 2006

Date of Patent: March 16, 2010

Assignee: Microsoft Corporation

Inventors: Yu Shi, Frank Kao-ping Soong, Jian-Iai Zhou
Method and system for detecting voice activity based on cross-correlation

Patent number: 7653537

Abstract: A system and method is provided for determining whether a data frame of a coded speech signal corresponds to voice or to noise. In one embodiment, a voice activity detector determines a cross-correlation of data. If the cross-correlation is lower than a predetermined cross-correlation value, then the data frame corresponds to noise. If not, then the voice activity detector determines a periodicity of the cross-correlation and a variance of the periodicity. If the variance is less than a predetermined variance value, then the data frame corresponds to voice. In another embodiment, a method determines energy of the data frame and an average energy of the coded speech signal. If the data frame is one of a predetermined number of initial data frames, then a comparison between the average energy to the energy of the data frame is used to determine whether the data frame is noise or voice.

Type: Grant

Filed: September 28, 2004

Date of Patent: January 26, 2010

Assignee: STMicroelectronics Asia Pacific Pte. Ltd.

Inventors: Kabi Prakash Padhi, Sapna George
Apparatus and method for creating pitch wave signals and apparatus and method compressing, expanding and synthesizing speech signals using these pitch wave signals

Patent number: 7630883

Abstract: A pitch wave signal creation method as a preliminary process for efficiently coding a speech wave signal having a fluctuated pitch period is provided. A speech signal compressing/expanding apparatus and a speech signal synthesizing apparatus using the method, and a signal processing associated therewith are further provided. The pitch wave creation method of the invention is essentially comprised of a method of detecting the instantaneous pitch period of each pitch wave element of the speech wave signal, and a process of converting a corresponding pitch wave element into a normalized pitch wave element having a predetermined fixed time length by expanding and compressing the pitch wave element on a time axis while retaining its wave pattern based on the each detected instantaneous pitch period. The speech signal having a pitch fluctuation can be compressed in high quality and high efficiency by coding or synthesizing the speech wave signal using the pitch wave signal creation method of the invention.

Type: Grant

Filed: August 30, 2002

Date of Patent: December 8, 2009

Assignee: Kabushiki Kaisha Kenwood

Inventor: Yasushi Sato
System for bandwidth extension of narrow-band speech

Patent number: 7613604

Abstract: A system and method are disclosed for extending the bandwidth of a narrowband signal such as a speech signal. The method applies a parametric approach to bandwidth extension but does not require training. The parametric representation relates to a discrete acoustic tube model (DATM). The method comprises computing narrowband linear predictive coefficients (LPCs) from a received narrowband speech signal, computing narrowband partial correlation coefficients (parcors) using recursion, computing Mnb area coefficients from the partial correlation coefficient, and extracting Mwb area coefficients using interpolation. Wideband parcors are computed from the Mwb area coefficients and wideband LPCs are computed from the wideband parcors.

Type: Grant

Filed: March 26, 2007

Date of Patent: November 3, 2009

Assignee: AT&T Intellectual Property II, L.P.

Inventors: David Malah, Richard Vandervoort Cox
System and method for real-time jitter control and packet-loss concealment in an audio signal

Patent number: 7596488

Abstract: An “adaptive audio playback controller” operates by decoding and reading received packets of an audio signal into a signal buffer. Samples of the decoded audio signal are then played out of the signal buffer according to the needs of a player device. Jitter control and packet loss concealment are accomplished by continuously analyzing buffer content in real-time, and determining whether to provide unmodified playback from the buffer contents, whether to compress buffer content, stretch buffer content, or whether to provide for packet loss concealment for overly delayed or lost packets as a function of buffer content. Further, the adaptive audio playback controller also determines where to stretch or compress particular frames or signal segments in the signal buffer, and how much to stretch or compress such segments in order to optimize perceived playback quality.

Type: Grant

Filed: September 15, 2003

Date of Patent: September 29, 2009

Assignee: Microsoft Corporation

Inventors: Dinei Florencio, Philip Chou, Li-Wei He
Pitch detection method and apparatus

Patent number: 7593847

Abstract: A pitch detection method and apparatus, the pitch detection apparatus includes: a data rearrangement unit which rearranges voice data on the basis of a center peak of the voice data included in a single frame; a decomposition unit which decomposes rearranged voice data into even symmetrical components on the basis of a center peak; a pitch determination unit which obtains a segment correlation value between a reference point and at least one or more local peaks in relation to even symmetrical components, and determines the location of a local peak corresponding to a maximum segment correlation value among the obtained segment correlation values, as a pitch period.

Type: Grant

Filed: October 21, 2004

Date of Patent: September 22, 2009

Assignee: Samsung Electronics Co., Ltd.

Inventor: Kwangcheol Oh
Constant pitch variable speed audio decoding

Patent number: 7580833

Abstract: A digital audio signal can be processed to have a constant pitch for a range of playback rates. A playback rate associated with a portion of the digital audio signal can be received, which differs from the rate at which the digital audio signal was recorded. A channel of the digital audio signal can be divided into a plurality of windows and one or more cross-correlation evaluations can be performed for at least a first window and a second window of the channel to determine an appropriate overlap. Further, the first cross-correlation evaluation can be performed using an ideal overlap. One or more sample values associated with a first waveform included in the first window can be averaged with one or more corresponding sample values associated with a second waveform included in the second window for the determined appropriate overlap to generate a rate-modified audio signal, which can be played.

Type: Grant

Filed: September 7, 2005

Date of Patent: August 25, 2009

Assignee: Apple Inc.

Inventor: Daniel Steinberg
Audio coding/decoding with spatial parameters and non-uniform segmentation for transients

Patent number: 7542896

Abstract: In binaural stereo coding, only one monaural channel is encoded. An additional layer holds the parameters to retrieve the left and right signal. An encoder is disclosed which links transient information extracted from the mono encoded signal to parametric multi-channel layers to provide increased performance. Transient positions can either be directly derived from the bit-stream or be estimated from other encoded parameters (e.g. window-switching flag in mp3).

Type: Grant

Filed: July 1, 2003

Date of Patent: June 2, 2009

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Erik Gosuinus Petrus Schuijers, Arnoldus Werner Johannes Oomen
Low-complexity packet loss concealment method for voice-over-IP speech transmission

Patent number: 7411985

Abstract: A low complexity packet loss concealment method for use in voice-over-IP speech transmission calculates a cross-correlation of previous speech data to estimate the pitch period of the previous speech when speech frames have been lost. A tap interval used to calculate the cross-correlation is dynamically adapted, thereby reducing the computational complexity of the process. In addition, the pitch period estimation is bypassed completely when it is determined not to be necessary, as a result of the speech being unvoiced or silence. A waveform “bending” operation is performed into the current frame without inserting any algorithmic delay into each frame.

Type: Grant

Filed: March 21, 2003

Date of Patent: August 12, 2008

Assignee: Lucent Technologies Inc.

Inventors: Minkyu Lee, James William McGowan
Multiple step adaptive method for time scaling

Patent number: 7337109

Abstract: A multiple step adaptive method for time scaling. Synthesizing S3[n] signal from signal S1[n]signal and S2[n]signal. Comprising following steps: (a) calculating a first magnitude of a cross-correlation function of S1[n]signal and S2[n]signal according to a first index; (b) comparing the first magnitude with a threshold value; (c) if first magnitude is smaller than threshold value, calculating a first reference magnitude of cross-correlation function of S1[n]signal and S2[n]signal according to a first reference index behind the first index by a first determined number, or calculating a second reference magnitude of the cross-correlation function of the S1[n] signal and the S2[n] signal according to a second reference index behind the first index by a second number; (d) synthesizing the S3[n] signal by adding S1[n]signal to the S2[n] signal in accordance with a maximum index corresponding to a largest magnitude among all the magnitudes calculated in (c).

Type: Grant

Filed: October 2, 2003

Date of Patent: February 26, 2008

Assignee: ALI Corporation

Inventor: Gin-Der Wu
Cross-correlation signal detector

Patent number: 7336739

Abstract: A method, system, and computer program product are provided for detecting at least one signal of interest within an input signal. At least one set of samples from the input signal is multiplied by at least one set of samples representing a complex conjugate of the input signal to obtain a series of correlation samples. Sets of one or more consecutive correlation samples are summed to obtain a series of pulse sums. A plurality of non-consecutive pulse sums are integrated to obtain a search value.

Type: Grant

Filed: September 26, 2003

Date of Patent: February 26, 2008

Assignee: Lockheed Martin Corporation

Inventors: Karen E. Petzold, Stephen C. Smith
Excitation vector generator, speech coder and speech decoder

Patent number: 7289952

Abstract: A random code vector reading section and a random codebook of a conventional CELP type speech coder/decoder are respectively replaced with an oscillator for outputting different vector streams in accordance with values of input seeds, and a seed storage section for storing a plurality of seeds. This makes it unnecessary to store fixed vectors as they are in a fixed codebook (ROM), thereby considerably reducing the memory capacity.

Type: Grant

Filed: May 7, 2001

Date of Patent: October 30, 2007

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara
System for bandwidth extension of narrow-band speech

Patent number: 7216074

Abstract: A system, computer-readable medium and generated signal are disclosed for extending the bandwidth of a first signal (i.e., a narrowband signal) such as a speech signal. The system produces a second signal from a first signal by computing first area coefficients from a first signal, generating second area coefficients from the first area coefficients and generating a second signal using the second area coefficients. The first signal may be a narrowband signal and second signal may be a wideband signal. The first area coefficients may be narrowband coefficients and the second area coefficients may be wideband area coefficients.

Type: Grant

Filed: April 25, 2005

Date of Patent: May 8, 2007

Assignee: AT&T Corp.

Inventors: David Malah, Richard Vandervoort Cox
Method and apparatus for predicting CCK subsymbols

Patent number: 7187730

Abstract: An apparatus and a method for symbol decoding of baseband data in a wireless communications network is disclosed, and specifically CCK subsymbol prediction and symbol demodulation that occurs at 5.5 Mbps or 11 Mbps. The apparatus is configured to demodulate or predict the data differently, depending on the modulation rate. If the data was modulated at 11 Mbps, the ?3 rotator is rotated through each of its possible phase values and symbol correlation takes four clock cycles to complete. If the data was modulated at 5.5 Mbps, ?3 is not rotated with a set value of 0 within the correlator architecture, thereby saving power and reducing symbol correlation and subsymbol prediction to a single cycle while in such transmission mode.

Type: Grant

Filed: September 19, 2002

Date of Patent: March 6, 2007

Assignee: Marvell International Ltd.

Inventors: Guorong Hu, Yungping Hsu
Nonlinear overlap method for time scaling

Patent number: 7173986

Abstract: A nonlinear overlap method for time scaling to synthesize an S1[n] and an S2[n] into an S3[n] is disclosed. The S1[n] and the S2[n] having N1 and N2 signals respectively. The nonlinear overlap method includes the following steps: (a) delaying the S2[n] by a predetermined number and forming an S5[n], (b) establishing a correlogram of a cross-correlation function of the S1[n] and S5[n], and (c) setting S3[n] as a number of S1[n] when 0<=n<; as a number formed by overlap-adding the S1[n] and an S4[n] in a weighting manner when (the predetermined number+the maximum index+the first threshold)<=n<(N1?a second threshold); and as a number of S4 wherein the first and second thresholds are not equal to zero at the same time, and the S4[n] is formed by delaying the S5[n] by the maximum index.

Type: Grant

Filed: October 5, 2003

Date of Patent: February 6, 2007

Assignee: ALI Corporation

Inventor: Gin-Der Wu
Optical processor enhanced receiver architecture (opera)

Patent number: 7130292

Abstract: A method and apparatus for enhancing the receiving and information identification functions of multiple access communications systems by employing one or more optical processors configured as a bank of 1-D correlators. The present invention is particularly useful in a DS/SS CDMA communications system, resulting in a multiuser CDMA system that approaches carrier to noise performance (C/N) as opposed to being limited by multiple access interference (MAI). The correlators are arranged in parallel to detect and/or demodulate the received signal, in conjunction with one or more complex algorithms to perform near-optimum multiuser detection, perform multipath combining and/or perform carrier Doppler compensation.

Type: Grant

Filed: January 19, 2001

Date of Patent: October 31, 2006

Assignee: Essex Corporation

Inventors: Terry M. Turpin, James L. Lafuse
Method and system for the automatic generation of multi-lingual synchronized sub-titles for audiovisual data

Patent number: 7117231

Abstract: The present invention provides a method and system for computerized synchronization of an audio stream having a first synchronized textual representation usable for subtitling of the audio stream in the original language, with a second synchronized textual representation which can be used as an alternative subtitle including a transcription of the original language into another language. Time synchronous links can be built between the audio stream and, for instance, the textual representations of the words spoken in the audio stream. More particularly, the second representation can inherit the synchronization between the audio stream and the first representation using structure association information determined between the first and the second representation.

Type: Grant

Filed: November 27, 2001

Date of Patent: October 3, 2006

Assignee: International Business Machines Corporation

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
System and method for detecting a narrowband signal

Patent number: 7042221

Abstract: A system for the detection of narrowband signals in wideband noise that combines information across two frequency channels that straddle the frequency of the target signal. Two band pass filters having center frequencies that straddle the frequency of the target signal and that have phase transfer functions that differ by 180 degrees relative to each other at the frequency of the target signal. The presence of the target signal is detected by performing a running cross-correlation of the outputs of saturating, non-linearities that follow from the filters, and determining when the output of the running cross-correlator drops below a predetermined threshold due to the phase shift between the two filter responses caused by the presence of the target signal.

Type: Grant

Filed: July 31, 2003

Date of Patent: May 9, 2006

Assignee: Syracuse University

Inventors: Laurel H. Carney, Michael C. Anzalone
Speech recognition using dual-pass pitch tracking

Patent number: 7039582

Abstract: A computationally efficient and robust pitch detection and tracking system and related methods are presented. According to certain exemplary implementations a method is presented comprising identifying an initial set of pitch period candidates using a first estimation algorithm, filtering the initial set of candidates and passing the filtered candidates through a second, more accurate pitch estimation algorithm to generate a final set of pitch period candidates from which the most likely pitch value is selected.

Type: Grant

Filed: February 22, 2005

Date of Patent: May 2, 2006

Assignee: Microsoft Corporation

Inventors: Eric I-Chao Chang, Jian-Lai Zhou
Controlling a weighting filter based on the spectral content of a speech signal

Patent number: 7010480

Abstract: A method for preparing a speech signal for encoding comprises determining whether the spectral content of an input speech signal is representative of a defined spectral characteristic (e.g., a defined characteristic slope). A frequency specific filter component of a weighting filter is controlled based on the determination of the spectral content of the speech signal or/and its location in the encoder. A core weighting filter component of the weighting filter may be maintained regardless of the spectral content of the speech signal.

Type: Grant

Filed: September 13, 2001

Date of Patent: March 7, 2006

Assignee: Mindspeed Technologies, Inc.

Inventors: Yang Gao, Huan-Yu Su
System and method for combined frequency-domain and time-domain pitch extraction for speech signals

Patent number: 6988064

Abstract: A system, computer readable medium, and method for sampling a speech signal; dividing the sampled speech signal into overlapped frames; extracting first pitch information from a frame using frequency domain analysis; providing at least one pitch candidate, each being associated with a spectral score, from the first pitch information, each of the at least one pitch candidate representing a possible pitch estimate for the frame; extracting second pitch information from the frame using a time domain analysis; providing a correlation score for the at least one pitch candidate from the second pitch information; and selecting one of the at least one pitch candidate to represent the pitch estimate of the frame. The system, computer readable medium, and method are suitable for speech coding and for distributed speech recognition.

Type: Grant

Filed: March 31, 2003

Date of Patent: January 17, 2006

Assignees: Motorola, Inc., International Business Machines Corporation

Inventors: Tenkasi V. Ramabadran, Alexander Sorin
Low power passive correlators for multichannel global positioning system signal receiver

Patent number: 6965631

Abstract: One embodiment of the present invention includes a circular shift register, K storage elements, and a code register. The circular shift register having N data samples circularly shifts a first data sample of the N data samples into a data position at a first clock frequency. The N data samples correspond to signal received from one of K satellites in a global positioning system (GPS). The N data samples are loaded into the circular shift register at a second clock frequency. The K storage elements store K code sequences, respectively. Each of the K code sequences has N code samples and includes a first code sample being written at a code position corresponding to the data position at a third clock frequency. The K storage elements correspond to the K satellites. The code register stores the N code samples loaded from one of the K storage elements at a fourth clock frequency. The fourth clock frequency is K times faster than the first clock frequency.

Type: Grant

Filed: March 13, 2001

Date of Patent: November 15, 2005

Assignee: PRI Research & Development Corp.

Inventors: Kaveh Shakeri, Alireza Mehrnia, Farshid Soheili-Najafabadi
Apparatus and method for providing server state and attribute management for multiple-threaded voice enabled web applications

Patent number: 6883015

Abstract: An application server generates and maintains a server-side data record, also referred to as a “brownie”, that includes application state information and user attribute information for multiple users within a single session controlled by a web-based browser. The brownie includes a session identifier that uniquely identifies the session, and a subsession identifier that uniquely identifies each corresponding user of the application session. As each new user is added to the session, for example by initiating a call to the new user, the application server stores the subsession identifier and corresponding application state information for the new user in the same brownie. In response to receiving a second web page request from the browser that includes the session identifier, the application server initiates a new web application instance, and recovers the brownie from the memory based on the session identifier included in the second page request.

Type: Grant

Filed: March 30, 2000

Date of Patent: April 19, 2005

Assignee: Cisco Technology, Inc.

Inventors: David William Geen, Geetha Ravishankar, Satish Joshi, Melissa L. Denbar, William Bateman Willaford, IV, Zhiwei Zhang
Information processing for retrieving coded audiovisual data

Patent number: 6532445

Abstract: In order to efficiently retrieve AV data by using signal characteristics as retrieval conditions, in a first step, a comparison and determination section computes a correlation coefficient (degree of similarity) of a spectrum coefficient of coded audio data and a spectrum coefficient of a sample waveform, and extracts correlation coefficients such that the value of the computed spectrum coefficient is larger than a threshold value which is set in the first step, and assumes them to be retrieval results. In a second step, the comparison and determination section determines whether or not the retrieval result is satisfactory. When it is determined that the number of pieces of audio data retrieved in the first step is equal to or greater than the predetermined threshold value and the retrieval result is not satisfactory, the process proceeds to a third step.

Type: Grant

Filed: September 16, 1999

Date of Patent: March 11, 2003

Assignee: Sony Corporation

Inventors: Yasuhiro Toguri, Kenzo Akagiri
Method and apparatus for pitch tracking

Patent number: 6226606

Abstract: In a method for tracking pitch in a speech signal, first and second window vectors are created from samples taken across first and second windows of the speech signal. The first window is separated from the second window by a test pitch period. The energy of the speech signal in the first window is combined with the correlation between the first window vector and the second window vector to produce a predictable energy factor. The predictable energy factor is then used to determine a pitch score for the test pitch period. Based in part on the pitch score, a portion of the pitch track is identified.

Type: Grant

Filed: November 24, 1998

Date of Patent: May 1, 2001

Assignee: Microsoft Corporation

Inventors: Alejandro Acero, James G. Droppo, III
Linear prediction coefficient analyzing apparatus for the auto-correlation function of a digital speech signal

Patent number: 6167373

Abstract: A sample speech is analyzed by a speech analyzing unit to obtain sample characteristic parameters, and a coding distortion is calculated from the sample characteristic parameters in each of a plurality of coding modules. The sample characteristic parameters and the coding distortions are statistically processed by a statistical processing unit to obtain a coding module selecting rule. Thereafter, when a speech is analyzed by the speech analyzing unit to obtain characteristic parameters, an appropriate coding module is selected by a coding module selecting unit from the coding modules according to the coding module selecting rule on condition that a coding distortion for the characteristic parameters is minimized in the appropriate coding module. Thereafter, the characteristic parameters of the speech are coded in the appropriate coding module, and a coded speech is obtained. When the coded speech is decoded, a reproduced speech is obtained.

Type: Grant

Filed: December 30, 1999

Date of Patent: December 26, 2000

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventor: Toshiyuki Morii
Formant shift-compensated sound synthesizer and method of operation thereof

Patent number: 6101469

Abstract: For use in a synthesizer having a wave source that produces a periodic wave, frequency shifting circuitry for frequency-shifting the periodic wave and waveshaping circuitry for transforming the periodic wave into a waveform containing a formant, the frequency-shifting causing displacement of the formant, a circuit for, and method of, compensating for the displacement and a synthesizer employing the circuit or the method. In one embodiment, the circuit includes bias circuitry, coupled to the wave source and the frequency shifting circuitry, that introduces a bias into the periodic wave based on a degree to which the frequency shifting circuitry frequency shifts the periodic wave, the bias reducing a degree to which the formant is correspondingly frequency-shifted.

Type: Grant

Filed: March 2, 1998

Date of Patent: August 8, 2000

Assignee: Lucent Technologies Inc.

Inventor: Steven D. Curtin
Speech coding apparatus and speech decoding apparatus

Patent number: 6061648

Abstract: In a speech coding apparatus, an input device inputs a mixed speech signal of a plurality of speakers. A separating device analyzes period characteristics of the input mixed speech signal, and separates the same signal into a plurality of single speech signals each associated with a corresponding one of the speakers, based on a result of the analysis. A first extracting device extracts source speech characteristic parameters included in each of the single speech signals. A second extracting device extracts a generic vocal-tract characteristic parameter from the input mixed speech signal. In a speech decoding apparatus, a first input device inputs the source speech characteristic parameters for each of the speakers. A second input device inputs the vocal-tract characteristic parameter.

Type: Grant

Filed: February 26, 1998

Date of Patent: May 9, 2000

Assignee: Yamaha Corporation

Inventor: Akitoshi Saito
Reduced real-time processing in stochastic celp encoding

Patent number: 6044339

Abstract: Methods are presented for reducing the processing required for CELP speech encoders which have multiple fixed stochastic codebook subframes corresponding to a single adaptive codebook subframe. The search for the optimum excitation vector in the fixed stochastic codebook requires calculating terms involving correlation of the target speech sample and the fixed stochastic codebook excitation vector as well as energy terms involving only the fixed stochastic codebook excitation vector, and for this class of CELP encoders it is possible to simplify the calculations to reduce their complexity and to make advantageous use of an adaptive energy lookup table. In addition, linear interpolation may be employed to estimate values for the adaptive energy lookup table and further reduce the computational burden.

Type: Grant

Filed: December 2, 1997

Date of Patent: March 28, 2000

Assignee: DSPC Israel Ltd.

Inventors: Rafael Zack, Shimon Dahan
Method and device for voice activity detection and a communication device

Patent number: 5963901

Abstract: The invention concerns a voice activity detection device in which an input speech signal (x(n)) is divided in subsignals (S(s)) representing specific frequency bands and noise (N(s)) is estimated in the subsignals. On basis of the estimated noise in the subsignals, subdecision signals (SNR(s)) are generated and a voice activity decision (V.sub.ind) for the input speech signal is formed on basis of the subdecision signals. Spectrum components of the input speech signal and a noise estimate are calculated and compared. More specifically a signal-to-noise ratio is calculated for each subsignal and each signal-to-noise ratio represents a subdecision signal (SNR(s)). From the signal-to-noise ratios a value proportional to their sum is calculated and compared with a threshold value and a voice activity decision signal (V.sub.ind) for the input speech signal is formed on basis of the comparison.

Type: Grant

Filed: December 10, 1996

Date of Patent: October 5, 1999

Assignee: Nokia Mobile Phones Ltd.

Inventors: Antti Vahatalo, Juha Hakkinen, Erkki Paajanen
Method and apparatus for generating modified speech from pitch-synchronous segmented speech waveforms

Patent number: 5933808

Abstract: A system that synchronously segments a speech waveform using pitch period and a center of the pitch waveform. The pitch waveform center is determined by finding a local minimum of a centroid histogram waveform of the low-pass filtered speech waveform for one pitch period. The speech waveform can then be represented by one or more of such pitch waveforms or segments during speech compression, reconstruction or synthesis. The pitch waveform can be modified by frequency enhancement/filtering, waveform stretching/shrinking in speech synthesis or speech disguise. The utterance rate can also be controlled to speed up or slow down the speech.

Type: Grant

Filed: November 7, 1995

Date of Patent: August 3, 1999

Assignee: The United States of America as represented by the Secretary of the Navy

Inventors: George S. Kang, Lawrence J. Fransen
Efficient decomposition in noise and periodic signal waveforms in waveform interpolation

Patent number: 5924061

Abstract: A low-complexity method and apparatus for performing signal decomposition in a low bit-rate WI speech encoder. A time-ordered sequence of sets of time-domain parameters is generated based on samples of a speech signal to be coded, each set of time-domain parameters corresponding to a waveform characterizing the speech signal. A cross correlation is then performed between two or more of said sets of time-domain parameters to produce a set of signals which represents relatively high rates of evolution of characterizing waveform shape across the time-ordered sequence of sets. Finally, the speech signal is coded based on the produced set of signals. A set of signals which represents relatively low rates of evolution of characterizing waveform shape across the time-ordered sequence of sets may also be produced.

Type: Grant

Filed: March 10, 1997

Date of Patent: July 13, 1999

Assignee: Lucent Technologies Inc.

Inventor: Yair Shoham
Pitch period extracting apparatus of speech signal

Patent number: 5819209

Abstract: A pitch period extracting apparatus includes a microcomputer which determines a sampling frequency for an A/D converter, and a range of delay times for calculating autocorrelative values on the basis of the sampling frequency. For example, the delay times are set within a range of 20 samples.ltoreq.k.ltoreq.100 samples in a case of 8 kHz, and a range of 15 samples.ltoreq.k.ltoreq.75 samples in a case of 6 kHz. The microcomputer calculates the autocorrelative values of speech signal data stored in a buffer memory, and outputs a delay time at which a maximum autocorrelative value is obtainable as a pitch period of an inputted speech signal.

Type: Grant

Filed: May 23, 1995

Date of Patent: October 6, 1998

Assignee: Sanyo Electric Co., Ltd.

Inventor: Takeo Inoue
Method and apparatus for speech excitation waveform coding using multiple error waveforms

Patent number: 5809459

Abstract: A method and apparatus (100) for pitch-epoch-synchronous source-filter speech encoding by means of error component modeling methods (310) which capture fundamental orthogonal (uncorrelated) basis elements of an excitation source waveform. A periodic waveform model (318) along with four orthogonal error waveforms, desirably including phase error (319), ensemble error (321), standard deviation error (323), and mean error (324) waveforms, are incorporated together to form a complete description of the excitation. These error waveforms (319,321, 323, 324) represent those portions of the excitation that are not represented by the purely periodic model. By thus orthogonalizing the error components, the perceptual effect of each element is isolated from the composite set, and can thus be encoded separately.

Type: Grant

Filed: May 21, 1996

Date of Patent: September 15, 1998

Assignee: Motorola, Inc.

Inventors: Chad Scott Bergstrom, Carl Steven Gifford, Richard James Pattison, Glen Patrick Abousleman
Coding of a speech or music signal with quantization of harmonics components specifically and then residue components

Patent number: 5806024

Abstract: Harmonics coefficients are estimated in primary coefficients of an orthogonal transform of a speech or a music input signal by using a pitch frequency extracted from the input signal and are quantized into a harmonics code vector. Residue coefficients are calculated by removing the harmonics coefficients from the primary coefficients and quantized into residue code vectors and gain code vectors. It is possible to search harmonics excitation pulses at the harmonics locations for harmonics quantization into the harmonics code vector. On the other hand, it is possible to estimate the harmonics coefficients or excitation pulses by using quantized LSP parameters and to calculate secondary coefficients for use in weighting the harmonics quantization and residue quantization and, if applicable, in excitation pulse search.

Type: Grant

Filed: December 23, 1996

Date of Patent: September 8, 1998

Assignee: NEC Corporation

Inventor: Kazunori Ozawa
Method and apparatus for time-scale modification of a signal

Patent number: 5806023

Abstract: A method used by a controller (112) and a communication receiver (122) for performing a time-scale modification of a signal comprised of an input stream of signal representations to form an output stream of signal representations. An output block of S.sub.o signal representations is selected from the output stream for use in overlapping signal representations from the output block with S.sub.o contiguous signal representations of the input stream, the S.sub.o contiguous signal representations beginning with a predetermined signal representation. The overlapping process includes determining (406) a maximum similarity measure between selected portions of the input and the output streams, applying (410) a weighting function to each, and summing the results of each. The overlapping process further includes replacing (412) a selected portion of the output stream with the latter summation, and selecting (414) a new set of samples from the input stream to append to the output stream.

Type: Grant

Filed: February 23, 1996

Date of Patent: September 8, 1998

Assignee: Motorola, Inc.

Inventor: Sunil Satyamurti
Method of speech synthesis by means of concentration and partial overlapping of waveforms

Patent number: 5774855

Abstract: A synthesis method in which that part of each interval of the original signal which contains the fundamental information is left unchanged, and only the remaining part of the interval is altered. In this way, not only is processing time reduced, but the natural sound of the synthetic signal is also improved. The main part of the interval is an exact reproduction of the original signal. At least the waveforms associated to voiced sounds are subdivided into a plurality of intervals, corresponding to the responses of the vocal duct to a series of excitation impulses of the vocal cords, synchronous with the fundamental frequency of the signal. Each interval is subjected to a weighting. The signals resulting from the weighting are replaced with a replica thereof shifted in time by an amount that depends on a prosodic information. The synthesis is then carried out by overlapping and adding the shifted signals.

Type: Grant

Filed: September 15, 1995

Date of Patent: June 30, 1998

Assignee: CSELT-Centro Studi e Laboratori Tellecomunicazioni S.p.A.

Inventors: Enzo Foti, Luciano Nebbia, Stefano Sandri
Vector quantinizer with distance measure calculated by using correlations

Patent number: 5761632

Abstract: A vector quantizer for a speech coder for coding speech signals at low bit rates. The vector quantizer includes an auto-correlation calculation circuit for calculating an impulse response of a weighting function for each sub-interval of an input signal vector. The vector quantizer also includes a weighted cross-correlation calculation circuit for calculating a weighted cross-correlation of the weighted input signal vector and the weighted codevector having a code length equal to that of the input signal vector. The vector quantizer further includes a weighted auto-correlation calculation circuit for calculating an auto-correlation of the weighted codevectors, by using respective auto-correlations of the impulse responses, the codevectors and the cross-correlations.

Type: Grant

Filed: May 16, 1997

Date of Patent: June 2, 1998

Assignee: NEC Corporation

Inventor: Masahiro Serizawa
Pitch period extracting apparatus of speech signal

Patent number: RE38889

Abstract: A pitch period extracting apparatus includes a microcomputer which determines a sampling frequency for an A/D converter, and a range of delay times for calculating autocorrelative values on the basis of the sampling frequency. For example, the delay times are set within a range of 20 samples?k?100 samples in a case of 8 kHz, and a range of 15 samples?k?75 samples in a case of 6 kHz. The microcomputer calculates the autocorrelative values of speech signal data stored in a buffer memory, and outputs a delay time at which a maximum autocorrelative value is obtainable as a pitch period of an inputted speech signal.

Type: Grant

Filed: October 6, 2000

Date of Patent: November 22, 2005

Assignee: Sanyo Electric Co., Ltd.

Inventor: Takeo Inoue

prev 1 2