Autocorrelation Patents (Class 704/217)

Method and apparatus for determining emotional arousal by speech analysis

Patent number: 7606701

Abstract: An apparatus for determining emotional arousal of a subject by speech analysis, and an associated method. In the method, a speech sample is obtained, the speech sample is pre-processed into silent and active speech segments and the active speech segments are divided into strings of equal length blocks (the blocks having primary speech parameters including pitch and amplitude parameters), a plurality of selected secondary speech parameters indicative of characteristics of equal-pitch are derived, rising-pitch and falling-pitch trends in the strings of blocks, the secondary speech parameters are compared with predefined, subject independent values representing non-emotional speech to generate a processing result indicative of emotional arousal, and the generated processed result is outputted to an output device.

Type: Grant

Filed: August 7, 2002

Date of Patent: October 20, 2009

Assignee: VoiceSense, Ltd.

Inventors: Yoav Degani, Yishai Zamir
Auto-correlators with DC and CW cancellation

Patent number: 7593848

Abstract: Embodiments of methods and means for correcting auto-correlated wireless signal samples are provided. Such embodiments include isolating and subtracting an interference vector from auto-correlated signal samples so that a corrected signal sample data set is derived. The corrected signal samples are then used in detecting and identifying symbols within the original wireless signal. Reliable and expeditious wireless communications can be achieved in accordance with the present embodiments.

Type: Grant

Filed: September 28, 2006

Date of Patent: September 22, 2009

Assignee: Intel Corporation

Inventors: Assaf Gurevitz, Uri Perlmutter
LINEAR PREDICTIVE CODING OF AN AUDIO SIGNAL

Publication number: 20090204397

Abstract: An apparatus for linear predictive coding of an audio signal comprises a segmentation processor (201) which generates signal segments for the audio signal. An autocorrelation processor (401) for generates a first autocorrelation sequence for each signal segment and a modification processor (403) generates a second autocorrelation sequence for each signal segment by modifying the first autocorrelation sequence in response to at least one psychoacoustic characteristic. A prediction coefficient processor (405) determines linear predictive coding coefficients for each signal segment in response to the second autocorrelation sequence. The invention allows a low complexity linear encoding which takes into account psychoacoustic considerations thereby allowing an improved perceived coding quality for a given data rate.

Type: Application

Filed: May 15, 2007

Publication date: August 13, 2009

Inventor: Albertus Cornelis Den Drinker
Method for recovery of lost speech data

Patent number: 7565286

Abstract: A method for lost speech samples recovery in speech transmission systems is disclosed. The method employs a waveform coder operating on digital speech samples. It exploits the composite model of speech, wherein each speech segment contains both periodic and colored noise components, and separately estimates these two components of the unreliable samples. First, adaptive FIR filters computed from received signal statistics are used to interpolate estimates of the periodic component for the unreliable samples. These FIR filters are inherently stable and typically short, since only strongly correlated elements of the signal corresponding to pitch offset samples are used to compute the estimate. These periodic estimates are also computed for sample times corresponding to reliable samples adjacent to the unreliable sample interval. The differences between these reliable samples and the corresponding periodic estimates are considered as samples of the noise component.

Type: Grant

Filed: July 16, 2004

Date of Patent: July 21, 2009

Assignee: Her Majesty the Queen in right of Canada, as represented by the Minister of Industry, Through the Communications Research Centre Canada

Inventors: Ken Gracie, John Lodge
LPC-to-MELP transcoder

Patent number: 7529662

Abstract: A method for transcoding a bit stream encoded according to a linear predictive coding (LPC) standard to a bit stream encoded according to a mixed-excitation linear prediction (MELP) standard, including: decoding a bit stream into a first set of vocoder parameters compatible with the LPC standard; transforming the first set of vocoder parameters into a second set of vocoder parameters compatible with the MELP standard without converting the first set of vocoder parameters to an analog or digital waveform representation; and encoding the second set of vocoder parameters into a bit stream compatible with the MELP vocoder standard.

Type: Grant

Filed: August 31, 2006

Date of Patent: May 5, 2009

Assignee: General Electric Company

Inventors: Richard L. Zinser, Jr., Steven R. Koch
Method and apparatus for discriminating speech from voice-band data in a communication network

Patent number: 7487083

Abstract: A method and an apparatus accurately discriminates between speech and voice-band data (VBD) in a communication network by calculating self similarity ratio (SSR) values, which indicate periodicity characteristics of an input signal segment, and/or autocorrelation coefficients, which indicate spectral characteristics of an input signal segment, to generate a speech/VBD discrimination result. In one implementation, the speech-VBD discriminating apparatus calculates both short-term delay and long-term delay SSR values to analyze the repetition rate of an input signal frame, thereby indicating whether the input signal frame has the periodicity characteristics of a typical speech signal or a VBD signal. The speech-VBD discriminating apparatus further calculates a plurality of short-term autocorrelation coefficients to determine the spectral envelope of an input frame, thereby facilitating accurate speech/VBD discrimination.

Type: Grant

Filed: July 13, 2000

Date of Patent: February 3, 2009

Assignee: Alcatel-Lucent USA Inc.

Inventor: Peng Jie Zhang
SPEECH ANALYSIS APPARATUS, SPEECH ANALYSIS METHOD AND COMPUTER PROGRAM

Publication number: 20090030690

Abstract: A speech analysis apparatus analyzing prosodic characteristics of speech information and outputting a prosodic discrimination result includes an input unit inputting speech information, an acoustic analysis unit calculating relative pitch variation and a discrimination unit performing speech discrimination processing, in which the acoustic analysis unit calculates a current template relative pitch difference, determining whether a difference absolute value between the current template relative pitch difference and a previous template relative pitch difference is equal to or less than a predetermined threshold or not, when the value is not less than the threshold, calculating an adjacent relative pitch difference, and when the adjacent relative pitch difference is equal to or less than a previously set margin value, executing correction processing of adding or subtracting an octave of the current template relative pitch difference to calculate the relative pitch variation by applying the relative pitch differe

Type: Application

Filed: July 21, 2008

Publication date: January 29, 2009

Inventor: Keiichi YAMADA
Method and apparatus for robust speech classification

Patent number: 7472059

Abstract: A speech classification technique for robust classification of varying modes of speech to enable maximum performance of multi-mode variable bit rate encoding techniques. A speech classifier accurately classifies a high percentage of speech segments for encoding at minimal bit rates, meeting lower bit rate requirements. Highly accurate speech classification produces a lower average encoded bit rate, and higher quality decoded speech. The speech classifier considers a maximum number of parameters for each frame of speech, producing numerous and accurate speech mode classifications for each frame. The speech classifier correctly classifies numerous modes of speech under varying environmental conditions.

Type: Grant

Filed: December 8, 2000

Date of Patent: December 30, 2008

Assignee: QUALCOMM Incorporated

Inventor: Pengjun Huang
Method of estimating pitch by using ratio of maximum peak to candidate for maximum of autocorrelation function and device using the method

Patent number: 7457744

Abstract: A device and a method for estimating an open-loop pitch in a general speech CODEC are disclosed. The open-loop pitch estimation device includes an autocorrelation function calculation unit which calculates a normalized autocorrelation function from a perceptual weighing filtered speech signal, a maximum autocorrelation function and lag estimation unit which estimates a maximum autocorrelation function and candidates for the maximum autocorrelation function, a pitch candidate decision unit which decides candidates for a pitch by using the ratio of the estimated maximum autocorrelation function to the candidates for the estimated maximum autocorrelation function, and lags of which values are smaller than a predetermined threshold value, and a pitch estimation unit which estimates a pitch between the candidates for a pitch and the lags corresponding to the estimated maximum autocorrelation function by using a pitch of a previous frame of the speech signal.

Type: Grant

Filed: July 25, 2003

Date of Patent: November 25, 2008

Assignee: Electronics and Telecommunications Research Institute

Inventors: Mi-suk Lee, Dae-hwan Hwang
Pitch prediction for packet loss concealment

Patent number: 7457746

Abstract: There is provided a pitch lag predictor for use by a speech decoder to generate a predicted pitch lag parameter. The pitch lag predictor comprises a summation calculator configured to generate a first summation based on a plurality of previous pitch lag parameters, and a second summation based on a plurality of previous pitch lag parameters and a position of each of the plurality of previous pitch lag parameters with respect to the predicted pitch lag parameter; a coefficient calculator configured to generate a first coefficient using a first equation based on the first summation and the second summation, and a second coefficient using a second equation based on the first summation and the second summation, wherein the first equation is different than the second equation; and a predictor configured to generate the predicted pitch lag parameter based on the first coefficient and the second coefficient.

Type: Grant

Filed: March 20, 2006

Date of Patent: November 25, 2008

Assignee: Mindspeed Technologies, Inc.

Inventor: Yang Gao
Characterization of impaired intervals in a voice over packet session using audio frame loss concealment

Patent number: 7433358

Abstract: An embodiment may include an apparatus comprising a dejitter buffer to receive packets containing audio data, a codec coupled with the dejitter buffer, the codec to receive coded audio frames from the dejitter buffer and decode them, and a concealed seconds meter coupled with the dejitter buffer, the concealed seconds meter to record concealment events by the decoder to provide an objective measure of media impairment. Another exemplary embodiment may be a method comprising receiving packets containing audio information at a dejitter buffer, decomposing the packets to coded audio frames, sending the coded audio frames to a decoder and decoding the frames, generating a concealment output stream if the decoder does not receive a valid frame from the dejitter buffer, and recording concealment events to provide an objective measure of media impairment.

Type: Grant

Filed: July 8, 2005

Date of Patent: October 7, 2008

Assignee: Cisco Technology, Inc.

Inventors: Paul Volkaerts, Kevin Joseph Connor, James C. Frauenthal, Rajesh Kumar
Lossless multi-channel audio codec using adaptive segmentation with random access point (RAP) and multiple prediction parameter set (MPPS) capability

Publication number: 20080215317

Abstract: A lossless audio codec encodes/decodes a lossless variable bit rate (VBR) bitstream with random access point (RAP) capability to initiate lossless decoding at a specified segment within a frame and/or multiple prediction parameter set (MPPS) capability partitioned to mitigate transient effects. This is accomplished with an adaptive segmentation technique that fixes segment start points based on constraints imposed by the existence of a desired RAP and/or detected transient in the frame and selects a optimum segment duration in each frame to reduce encoded frame payload subject to an encoded segment payload constraint. In general, the boundary constraints specify that a desired RAP or detected transient must lie within a certain number of analysis blocks of a segment start point.

Type: Application

Filed: January 30, 2008

Publication date: September 4, 2008

Inventor: Zoran Fejzo
Voice activated device

Patent number: 7415416

Abstract: A voice activated camera is described which allows users to take remote photographs by speaking one or more keywords. In a preferred embodiment, a speech processing unit is provided which is arranged to detect extended periodic signals from a microphone of the camera. A control unit is also provided to control the taking of a photograph when such an extended periodic component is detected by the speech processing unit.

Type: Grant

Filed: September 10, 2004

Date of Patent: August 19, 2008

Assignee: Canon Kabushiki Kaisha

Inventor: David Llewellyn Rees
Digital signal processing method, learning method, apparatuses for them, and program storage medium

Patent number: 7412384

Abstract: A digital signal processing method and learning method and devices therefor, and a program storage medium which are capable of further improving the waveform reproducibility of a digital signal. Self correlation coefficients are calculated by cutting parts out of the digital signal by multiple windows having different sizes, and the parts are classified based on the calculation results of the self correlation coefficients. Then, the digital signal is converted by the prediction method corresponding to the classified class, so that the conversion further suitable for the features of the digital signal can be conducted.

Type: Grant

Filed: July 31, 2001

Date of Patent: August 12, 2008

Assignee: Sony Corporation

Inventors: Tetsujiro Kondo, Tsutomu Watanabe
Apparatus and method of processing an audio signal

Patent number: 7411528

Abstract: In one embodiment, a block of audio data is partitioned into N sub-blocks, and a plurality of code parameters s(0), s(1), . . . s(N?1) are determined. The N sub-blocks are entropy encoded using a plurality of entropy codes defined by the plurality of code parameters, and the code parameters are transmitted. The code parameters may be transmitted by directly transmitting s(0) representing the code parameter of the first sub-block, and transmitting a difference s(i)?s(i?1) for i=1, . . . N?1, s(i) representing the code parameter of an ith sub-block following the first sub-block.

Type: Grant

Filed: July 7, 2006

Date of Patent: August 12, 2008

Assignee: LG Electronics Co., Ltd.

Inventor: Tilman Liebchen
Method and apparatus for reconstructing voice information

Patent number: 7403893

Abstract: A communication system includes a destination that receives voice samples and a voice parameter generated by a source. The destination uses the voice samples and voice parameter to reconstruct voice information in response to a packet loss. The destination may reconstruct voice information from multiple sources.

Type: Grant

Filed: January 19, 2006

Date of Patent: July 22, 2008

Assignee: Cisco Technology, Inc.

Inventors: Pascal H. Huart, Luke K. Surazski
Pitch lag estimation

Publication number: 20080091418

Abstract: Autocorrelation values are determined as a basis for an estimation of a pitch lag in a segment of an audio signal. A first considered delay range for the autocorrelation computations is divided into a first set of sections, and first autocorrelation values are determined for delays in a plurality of sections of this first set of sections. A second considered delay range for the autocorrelation computations is divided into a second set of sections such that sections of the first set and sections of the second set are overlapping. Second autocorrelation values are determined for delays in a plurality of sections of this second set of sections.

Type: Application

Filed: October 13, 2006

Publication date: April 17, 2008

Inventors: Lasse Laaksonen, Anssi Ramo, Adriana Vasilache
Apparatus and method for identifying prosody and apparatus and method for recognizing speech

Publication number: 20070276659

Abstract: A prosody identifying apparatus for identifying input speech on the basis of prosodic features of the input speech is provided. The prosody identifying apparatus includes a sound analyzing section for acquiring an amount of change in movement of a feature distribution obtained from an autocorrelation matrix of the frequency characteristic of the input speech and an identifying section for recognizing the input speech on the basis of an output of the sound analyzing section.

Type: Application

Filed: May 23, 2007

Publication date: November 29, 2007

Inventor: Keiichi Yamada
Excitation vector generator, speech coder and speech decoder

Patent number: 7289952

Abstract: A random code vector reading section and a random codebook of a conventional CELP type speech coder/decoder are respectively replaced with an oscillator for outputting different vector streams in accordance with values of input seeds, and a seed storage section for storing a plurality of seeds. This makes it unnecessary to store fixed vectors as they are in a fixed codebook (ROM), thereby considerably reducing the memory capacity.

Type: Grant

Filed: May 7, 2001

Date of Patent: October 30, 2007

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara
Speech recognition system

Patent number: 7266496

Abstract: The present invention discloses a complete speech recognition system having a training button and a recognition button, and the whole system uses the application specific integrated circuit (ASIC) architecture for the design, and also uses the modular design to divide the speech processing into 4 modules: system control module, autocorrelation and linear predictive coefficient module, cepstrum module, and DTW recognition module. Each module forms an intellectual product (IP) component by itself. Each IP component can work with various products and application requirements for the design reuse to greatly shorten the time to market.

Type: Grant

Filed: December 24, 2002

Date of Patent: September 4, 2007

Assignee: National Cheng-Kung University

Inventors: Jhing-Fa Wang, Jia-Ching Wang, Tai-Lung Chen, Chin-Chan Chang
Method and apparatus for predicting CCK subsymbols

Patent number: 7187730

Abstract: An apparatus and a method for symbol decoding of baseband data in a wireless communications network is disclosed, and specifically CCK subsymbol prediction and symbol demodulation that occurs at 5.5 Mbps or 11 Mbps. The apparatus is configured to demodulate or predict the data differently, depending on the modulation rate. If the data was modulated at 11 Mbps, the ?3 rotator is rotated through each of its possible phase values and symbol correlation takes four clock cycles to complete. If the data was modulated at 5.5 Mbps, ?3 is not rotated with a set value of 0 within the correlator architecture, thereby saving power and reducing symbol correlation and subsymbol prediction to a single cycle while in such transmission mode.

Type: Grant

Filed: September 19, 2002

Date of Patent: March 6, 2007

Assignee: Marvell International Ltd.

Inventors: Guorong Hu, Yungping Hsu
Method for detecting and attenuating inhalation noise in a communication system

Patent number: 7139701

Abstract: A method for detecting and attenuating inhalation noise in a communication system coupled to a pressurized air delivery system, the method including the steps of: generating an inhalation noise model (912, 1012) based on inhalation noise; receiving an input signal (802) that includes inhalation noise; comparing (810) the input signal to the noise model to obtain a similarity measure; determining (854) a gain factor based on the similarity measure; and modifying (852) the input signal based on the gain factor, wherein the inhalation noise in the input signal is attenuated based on the gain factor.

Type: Grant

Filed: June 30, 2004

Date of Patent: November 21, 2006

Assignee: Motorola, Inc.

Inventors: Sara M. Harton, Mark A. Jasiuk, William M. Kushner
Speech processing system

Patent number: 7035790

Abstract: A speech processing system is provided which is operable to receive sets of signal values representative of a speech signal generated by a speech source as distorted by a transmission channel between the speech source and the speech processing system. The system stores data defining a predetermined function derived from a signal model which models both the speech source and the channel and defining a probability density function which gives, for a given set of model parameters, the probability that the signal model has those model parameters given that the signal model is assumed to have generated the received set of signal values. The system applies a current set of received signal values to the stored probability density function and then draws samples from it using a Gibbs sampler. The system then analyses the samples to determine a set of parameter values representative of the speech signal before it was distorted by the channel.

Type: Grant

Filed: May 30, 2001

Date of Patent: April 25, 2006

Assignee: Canon Kabushiki Kaisha

Inventor: Jebu Jacob Rajan
Method for modeling speech harmonic magnitudes

Patent number: 7027980

Abstract: A system or method for modeling a signal, such as a speech signal, in which harmonic frequencies and amplitudes are identified and the harmonic magnitudes are interpolated to obtain spectral magnitudes at a set of fixed frequencies. An inverse transform is applied to the spectral magnitudes to obtain a pseudo auto-correlation sequence, from which linear prediction coefficients are calculated. From the linear prediction coefficients, model harmonic magnitudes are generated by sampling the spectral envelope defined by the linear prediction coefficients. A set of scale factors are then calculated as the ratio of the harmonic magnitudes to the model harmonic magnitudes and interpolated to obtain a second set of scale factors at the set of fixed frequencies. The spectral envelope magnitudes at the set of fixed frequencies are multiplied by the second set of scale factors to obtain new spectral magnitudes and the process is iterated to obtain final linear prediction coefficients.

Type: Grant

Filed: March 28, 2002

Date of Patent: April 11, 2006

Assignee: Motorola, Inc.

Inventors: Tenkasi V. Ramabadran, Aaron M. Smith, Mark A. Jasiuk
Method and apparatus for reconstructing voice information

Patent number: 7013267

Abstract: A communication system includes a destination that receives voice samples and a voice parameter generated by a source. The destination uses the voice samples and voice parameter to reconstruct voice information in response to a packet loss. The destination may reconstruct voice information from multiple sources.

Type: Grant

Filed: July 30, 2001

Date of Patent: March 14, 2006

Assignee: Cisco Technology, Inc.

Inventors: Pascal H. Huart, Luke K. Surazski
Reception AGC circuit

Patent number: 7003057

Abstract: A reception AGC circuit includes a high-speed power calculating circuit for calculating the power of an input signal at a short period, a normal power calculating circuit for calculating the power at a normal period, a circuit for receiving a power calculation result from the high-speed power calculating circuit or normal power calculating circuit to calculate a feedback amplification value, and addition amplification value setting units for self-station communication and peripheral station monitoring which receive the feedback amplification value through a switch and add the feedback amplification value to an amplifier in use.

Type: Grant

Filed: October 25, 2001

Date of Patent: February 21, 2006

Assignee: NEC Corporation

Inventor: Osamu Hasegawa
Systems and methods for correlating images in an image correlation system with reduced computational loads

Patent number: 6996291

Abstract: After one or both of a pair of images are obtained, an auto-correlation function for one of those images is generated to determine a smear amount and possibly a smear direction. The smear amount and direction are used to identify potential locations of a peak portion of the correlation function between the pair of images. The pair of images is then correlated only at offset positions corresponding to the one or more of the potential peak locations. In some embodiments, the pair of images is correlated according to a sparse set of image correlation function value points around the potential peak locations. In other embodiments, the pair of images is correlated at a dense set of correlation function value points around the potential peak locations. The correlation function values of these correlation function value points are then analyzed to determine the offset position of the true correlation function peak.

Type: Grant

Filed: August 6, 2001

Date of Patent: February 7, 2006

Assignee: Mitutoyo Corporation

Inventor: Michael Nahum
Method and device for estimating the pitch of a speech signal using a binary signal

Patent number: 6954726

Abstract: A method of estimating the pitch of a speech signal comprises the steps of sampling the speech signal to obtain a series of samples, dividing the series of samples into segments, each segment having a fixed number of consecutive samples, calculating for each segment a conformity function, and detecting peaks in the conformity function. The method provides also an intermediate signal derived from the speech signal, which is set to logical “1” where the intermediate signals exceeds a pre-selected threshold and to logical “0” where the intermediate signal does not exceed the pre-selected threshold, calculating the autocorrelation of the binary signal, and using the distance between peaks in the autocorrelation of the binary signal as an estimate of the pitch. Elaborate operations needed in prior art algorithms is thus avoided. A device conforming to the method is described.

Type: Grant

Filed: April 5, 2001

Date of Patent: October 11, 2005

Assignee: Telefonaktiebolaget L M Ericsson (Publ)

Inventors: Cecilia Brandel, Henrik Johannisson
Apparatus and method for providing server state and attribute management for multiple-threaded voice enabled web applications

Patent number: 6883015

Abstract: An application server generates and maintains a server-side data record, also referred to as a “brownie”, that includes application state information and user attribute information for multiple users within a single session controlled by a web-based browser. The brownie includes a session identifier that uniquely identifies the session, and a subsession identifier that uniquely identifies each corresponding user of the application session. As each new user is added to the session, for example by initiating a call to the new user, the application server stores the subsession identifier and corresponding application state information for the new user in the same brownie. In response to receiving a second web page request from the browser that includes the session identifier, the application server initiates a new web application instance, and recovers the brownie from the memory based on the session identifier included in the second page request.

Type: Grant

Filed: March 30, 2000

Date of Patent: April 19, 2005

Assignee: Cisco Technology, Inc.

Inventors: David William Geen, Geetha Ravishankar, Satish Joshi, Melissa L. Denbar, William Bateman Willaford, IV, Zhiwei Zhang
Method and apparatus for recording/reproducing or producing a waveform using time position information

Patent number: 6873955

Abstract: Partial waveform data representative of a waveform shape variation are extracted from supplied waveform data, and the extracted partial waveform data are stored along with time position information indicative of their respective time positions. In reproduction, the partial waveform data and time position information are read out, then the partial waveform data are arranged on the time axis in accordance with the time position information, and a waveform is produced on the basis of the waveform data arranged on the time axis. In another implementation, sets of sample identification information and time position information are obtained in accordance with a performance tone waveform to be reproduced, and sample data are obtained from a database in accordance with the sample identification information. The thus-obtained sample data are arranged on the time axis in accordance with the time position information, and the desired waveform is produced on the basis of the sample data arranged on the time axis.

Type: Grant

Filed: September 22, 2000

Date of Patent: March 29, 2005

Assignee: Yamaha Corporation

Inventors: Hideo Suzuki, Motoichi Tamura, Satoshi Usa
Prediction parameter analysis apparatus and a prediction parameter analysis method

Patent number: 6842731

Abstract: A prediction parameter analysis apparatus comprises a windowing part which generates a short time input signal by subjecting an input signal or a signal derived from the input signal to windowing, a component removal part which removes an unnecessary component from the short time input signal to generate a modified short time input signal, an autocorrelation coefficient computation part which computes autocorrelation coefficients based on the modified short time input signal, and a prediction parameter computation part which computes prediction parameters based on the autocorrelation coefficients.

Type: Grant

Filed: May 16, 2002

Date of Patent: January 11, 2005

Assignee: Kabushiki Kaisha Toshiba

Inventor: Kimio Miseki
Method for conditioning a digital speech signal

Patent number: 6775650

Abstract: The invention concerns a method for conditioning a digital speech signal(s) processed by successive frames, which consists carrying out a harmonic analysis to estimate the pitch on each frame where it has a speech activity, and in oversampling at an oversampling frequency (fe) which is a multiple of the estimated pitch.

Type: Grant

Filed: June 2, 2000

Date of Patent: August 10, 2004

Assignee: Matra Nortel Communications

Inventors: Philip Lockwood, Stéphane Lubiarz
Method and apparatus for providing fast data recovery with adaptive pulse code modulation coding

Patent number: 6765514

Abstract: A method and apparatus for providing fast data recovery with adaptive pulse code modulation (ADPCM) coding wherein, an ADPCM encoder periodically records the compressional parameters in memory together with regular compressed codes, and an ADPCM decoder retrieves the previously saved compressional parameters when reading the regular data from memory. In case any error occurs in the data compression and decompression processes which would cause a data divergence in the output of the ADPCM decoder, the previously saved compressional parameters can be used to correct the output data, thus enabling fast data recovery in the data outputting process without affecting downstream data in the data stream.

Type: Grant

Filed: December 11, 2002

Date of Patent: July 20, 2004

Assignee: Avid Electronics Corp.

Inventors: Hsien-Ming Chang, Chung-Liang Yen
Device and method for analysing an audio signal in view of obtaining rhythm information

Publication number: 20040068401

Abstract: An apparatus for analyzing an audio signal with regard to rhythm information of the audio signal comprises a filterbank for dividing the audio signal into at least two sub-band signals. Every sub-band signal is examined with regard to a periodicity of the sub-band signal to obtain rhythm raw-information of every sub-band signal. The rhythm raw-information is subjected to a quality evaluation to obtain a significance measure for every sub-band signal. The rhythm information of the audio signal will finally be established by considering the significance measure of the sub-band signal and the rhythm raw-information. This enables a more robust analysis of the audio signal, since sub-band signals, where significant rhythm information are present, are preferred compared to sub-band signals where less significant rhythm information are present, when establishing the rhythm information.

Type: Application

Filed: August 11, 2003

Publication date: April 8, 2004

Inventors: Jurgen Herre, Jan Rohden, Christian Uhle, Markus Cremer
Voiceband signal classifier

Patent number: 6708146

Abstract: A method and apparatus for classifying signals into a multiplicity of signal classes which employs discriminant functions of low-complexity discriminant variables that are computed directly from the passband signal. The method can be applied to the problem of classifying voiceband data (VBD), facsimile (FAX), native binary data, and speech on a 64 Kbps digital channel. In a hybrid two stage classification system, the first stage employs linear discriminant functions to make classification decisions into a smaller number of possible preliminary signal classes. The decisions of the first stage are then refined by a second stage that uses nonlinear discriminant functions such as quadratic or pseudo-quadratic functions. The second stage of a hybrid classifier then assigns the signal into a larger number of possible classes than does the first stage of the classifier alone.

Type: Grant

Filed: April 30, 1999

Date of Patent: March 16, 2004

Assignee: Telecommunications Research Laboratories

Inventors: Jeremy S. Sewall, Bruce F. Cockburn, Deepak P. Sarda
Voiced/unvoiced speech classifier

Patent number: 6640208

Abstract: A voiced/unvoiced speech classifier (30) includes a speech segmentor (34) which segments an input digitized speech waveform into frames of speech and a band-pass filter (36) which filters the frames of speech. A relative energy generator (38) generates a relative energy value for each filtered frame of speech and a decision parameter generator (52) including an autocorrelation calculator (54) and a pitch calculator (56) generates a decision parameter based on an autocorrelation function and a pitch frequency index for the filtered frames of speech. A normalized energy calculator (46) adjusts the threshold and then normalizes the relative energy. A comparator (60) provides a signal indicative of whether a frame of speech is voiced speech or unvoiced speech depending on a comparison of the decision parameter and the normalized relative energy value for each filtered frame of speech.

Type: Grant

Filed: September 12, 2000

Date of Patent: October 28, 2003

Assignee: Motorola, Inc.

Inventors: Yaxin Zhang, Jianming Song, Anton Madievski
Method for modeling speech harmonic magnitudes

Publication number: 20030187635

Abstract: A system or method for modeling a signal, such as a speech signal, in which harmonic frequencies and amplitudes are identified and the harmonic magnitudes are interpolated to obtain spectral magnitudes at a set of fixed frequencies. An inverse transform is applied to the spectral magnitudes to obtain a pseudo auto-correlation sequence, from which linear prediction coefficients are calculated. From the linear prediction coefficients, model harmonic magnitudes are generated by sampling the spectral envelope defined by the linear prediction coefficients. A set of scale factors are then calculated as the ratio of the harmonic magnitudes to the model harmonic magnitudes and interpolated to obtain a second set of scale factors at the set of fixed frequencies. The spectral envelope magnitudes at the set of fixed frequencies are multiplied by the second set of scale factors to obtain new spectral magnitudes and the process is iterated to obtain final linear prediction coefficients.

Type: Application

Filed: March 28, 2002

Publication date: October 2, 2003

Inventors: Tenkasi V. Ramabadran, Aaron M. Smith, Mark A. Jasiuk
Voice conversion system and methodology

Patent number: 6615174

Abstract: A voice conversion system employs a codebook mapping approach to transforming a source voice to sound like a target voice. Each speech frame is represented by a weighted average of codebook entries. The weights represent a perceptual distance of the speech frame and may be refined by a gradient descent analysis. The vocal tract characteristics, represented by a line spectral frequency vector, the excitation characteristics, represented by a linear predictive coding residual, the duration, and the amplitude of the speech frame are transformed in the same weighted-average framework.

Type: Grant

Filed: February 22, 2000

Date of Patent: September 2, 2003

Assignee: Microsoft Corporation

Inventors: Levent Mustafa Arslan, David Thieme Talkin
Correlation domain formant enhancement

Publication number: 20030144835

Abstract: The present invention comprises a system and method for correlation domain formant enhancement. The method includes attenuating an output speech signal in areas of low spectral amplitude. The attenuated signals are subsequently analyzed, and then converted to a format suitable for transmission.

Type: Application

Filed: September 13, 2002

Publication date: July 31, 2003

Inventors: Richard L. Zinser, Steven R. Koch
Frequency domain formant enhancement

Publication number: 20030093268

Abstract: The present invention comprises a system and method for frequency domain formant enhancement. The method includes attenuating an output speech signal in areas of low spectral amplitude. The attenuated signals are subsequently analyzed, and then converted to a format suitable for transmission.

Type: Application

Filed: September 13, 2002

Publication date: May 15, 2003

Inventors: Richard L. Zinser, Steven R. Koch
Method and system for injecting external content into computer network interactive sessions

Patent number: 6564243

Abstract: A method and system for injecting external content to a user's client computer engaged in an interactive computer network session. A request for selected content from a user is intercepted and a decision is made whether to deliver external content to the user's client computer in addition to the requested content. The method and system allows for local service providers such as ISPs to add their own content to sessions involving remote content suppliers.

Type: Grant

Filed: August 20, 1999

Date of Patent: May 13, 2003

Assignee: Adwise Ltd.

Inventors: Meir Yedidia, Yaron Buznach
Method for the algebraic codebook search of a speech signal encoder

Publication number: 20030046067

Abstract: A method for the algebraic codebook search of a speech signal encoder, preferably using the Code Excited Linear Prediction process, in which, in order to calculate coefficients of the triangular matrix of the auto-correlation matrix of the Toeplitz type, a time interval comprising n speech signal samplings is divided into an integral number of tracks t with p possible pulse positions each, and in which the coefficients are stored in a memory grouped in combinations of adjacent tracks, combinations of non-adjacent tracks, combinations of identical tracks, and coefficients of the main diagonals of the auto-correlation matrix.

Type: Application

Filed: August 13, 2002

Publication date: March 6, 2003

Inventor: Dietmar Gradl
Method and apparatus for seamless multi-rate speech coding

Patent number: 6496794

Abstract: A communications system (100) includes a multi-rate source coder (MRSC) (102), a variable size/rate buffer (VSRB) (112), a speech buffer (104), and a buffer control block (106). The variable size/rate buffer (112) includes a source coder bit buffer (SCBB) (114) and an adaptive transmit frame buffer (116). The source coder bit buffer (114) receives speech frames coded at different rates from the multi-rate source coder (102), and deposits an integer or non-integer number of frames in the adaptive transmit frame buffer (ATFB) (116). A receiver includes a seamless rate transition module (SRTM) (308) and an variable buffer (310). The seamless rate transition module (308) correlates speech data previously coded at different rates, and it then truncates or alternatively appends, concatenates, and warps the speech data to remove any annoying artifacts at the rate change boundary.

Type: Grant

Filed: November 22, 1999

Date of Patent: December 17, 2002

Assignee: Motorola, Inc.

Inventors: John Eric Kleider, Jeffery Scott Chuprun, Richard James Pattison, Chad Bergstrom, Byron Tarver
Prediction parameter analysis apparatus and a prediction parameter analysis method

Publication number: 20020184008

Abstract: A prediction parameter analysis apparatus comprises a windowing part which generates a short time input signal by subjecting an input signal or a signal derived from the input signal to windowing, a component removal part which removes an unnecessary component from the short time input signal to generate a modified short time input signal, an autocorrelation coefficient computation part which computes autocorrelation coefficients based on the modified short time input signal, and a prediction parameter computation part which computes prediction parameters based on the autocorrelation coefficients.

Type: Application

Filed: May 16, 2002

Publication date: December 5, 2002

Inventor: Kimio Miseki
Audio processing device, receiver and filtering method for filtering a useful signal and restoring it in the presence of ambient noise

Patent number: 6487529

Abstract: An audio processing device includes an analyzer and a filter. The analyzer extracts an envelope of a noise signal and derives therefrom noise envelope parameters. The filter has coefficients which vary in response to noise envelope parameters and filters a useful signal to form a filtered signal. The coefficients are varied so that the filter enhances frequency bands of the useful signal that correspond to frequency bands of the noise signal having a higher energy than a predetermined value.

Type: Grant

Filed: October 26, 1999

Date of Patent: November 26, 2002

Assignee: Koninklijke Philips Electronics N.V.

Inventor: Gilles Miet
Systems and methods for improving power spectral estimation of speech signals

Patent number: 6463408

Abstract: A system determines a power spectral density associated with an audio signal that includes a speech signal and/or a noise signal. The system updates an autocorrelation function of the audio signal from samples in the audio signal, estimates an autocorrelation function of the speech signal from the updated autocorrelation function of the audio signal, and calculates a power spectral density of the speech signal using the estimated autocorrelation function. The system then determines the power spectral density of the audio signal from the calculated power spectral density of the speech signal.

Type: Grant

Filed: November 22, 2000

Date of Patent: October 8, 2002

Assignee: Ericsson, Inc.

Inventors: Leonid Krasny, Soontorn Oraintara
Method and apparatus for pitch determination of a low bit rate digital voice message

Patent number: 6418407

Abstract: A pitch determiner (931) of a system controller (106) that generates a smoothed pitch value for a current frame of a low bit rate voice message includes a pitch function generator (955) that generates a pitch detection function (PDF) for each frame of digital samples of a voice signal, a pitch candidate selector (960) that selects a future frame pitch candidate from a pitch detection function (PDF), and a pitch adjuster (978) that generates the smoothed pitch value. The pitch adjuster includes a subharmonic pitch corrector (965) that determines a future frame pitch value by performing pitch subharmonic correction of a future frame pitch candidate using a roughness factor of the frequency transformed window.

Type: Grant

Filed: September 30, 1999

Date of Patent: July 9, 2002

Assignee: Motorola, Inc.

Inventors: Jian-Cheng Huang, Floyd Simpson, Sunil Satyamurti, Kenneth Finlon
Method and apparatus for coding and decoding speech

Patent number: 6415252

Abstract: Bits are allocated to short-term repetition information for unvoiced input signals. Stated differently, more bits are allocated for pitch information during unvoiced input speech than in the prior art. The improved method and apparatus in an encoder (300) and decoder (700) result in improved consistency of amplitude pulses compared to prior art methods which indicates improved stability due to increased search resolution. Also, the improved method and apparatus result in higher energy compared to prior art methods which indicates that the synthesized waveform matches the target waveform more closely, resulting in a higher fixed codebook (FCB) gain.

Type: Grant

Filed: May 28, 1998

Date of Patent: July 2, 2002

Assignee: Motorola, Inc.

Inventors: Weimin Peng, James Patrick Ashley
Method and apparatus for clustering-based signal segmentation

Patent number: 6314392

Abstract: In a computerized method a continuous signal is segmented in order to determine statistically stationary units of the signal. The continuous signal is sampled at periodic intervals to produce a timed sequence of digital samples. Fixed numbers of adjacent digital samples are grouped into a plurality of disjoint sets or frames. A statistical distance between adjacent frames is determined. The adjacent sets are merged into a larger set of samples or cluster if the statistical distance is less than a predetermined threshold. In an iterative process, the statistical distance between the adjacent sets are determined, and as long as the distance is less than the predetermined threshold, the sets are iteratively merged to segment the signal into statistically stationary units.

Type: Grant

Filed: September 20, 1996

Date of Patent: November 6, 2001

Assignee: Digital Equipment Corporation

Inventors: Brian S. Eberman, William D. Goldenthal
Pitch period extracting apparatus of speech signal

Patent number: RE38889

Abstract: A pitch period extracting apparatus includes a microcomputer which determines a sampling frequency for an A/D converter, and a range of delay times for calculating autocorrelative values on the basis of the sampling frequency. For example, the delay times are set within a range of 20 samples?k?100 samples in a case of 8 kHz, and a range of 15 samples?k?75 samples in a case of 6 kHz. The microcomputer calculates the autocorrelative values of speech signal data stored in a buffer memory, and outputs a delay time at which a maximum autocorrelative value is obtainable as a pitch period of an inputted speech signal.

Type: Grant

Filed: October 6, 2000

Date of Patent: November 22, 2005

Assignee: Sanyo Electric Co., Ltd.

Inventor: Takeo Inoue

prev 1 2 3 4 next