Interpolation Patents (Class 704/265)

Interpolation in a speech decoder of a transmission system on the basis of transformed received prediction parameters

Patent number: 6157907

Abstract: A transmission system wherein a speech signal is represented by a plurality of prediction parameters updated once per frame. Each frame comprises a plurality of sub-frames in which an excitation signal generated by a fixed codebook and an adaptive codebook is updated. In order to enhance the reconstructed speech quality the prediction coefficients are interpolated at the decoder by an LPC coefficient interpolator to obtain interpolated prediction coefficients for each sub-frame. According to the present invention the interpolation of the prediction coefficients is not based on the prediction coefficients used for transmission, such as reflection coefficients or Log Area Ratios, but on Line Spectral Frequencies. This reduces degradation of speech quality due to interpolation.

Type: Grant

Filed: February 5, 1998

Date of Patent: December 5, 2000

Assignee: U.S. Philips Corporation

Inventors: Rakesh Taori, Andreas J. Gerrits
Formant-based speech synthesizer employing demi-syllable concatenation with independent cross fade in the filter parameter and source domains

Patent number: 6144939

Abstract: The concatenative speech synthesizer employs demi-syllable subword units to generate speech. The synthesizer is based on a source-filter model that uses source signals that correspond closely to the human glottal source and that uses filter parameters that correspond closely to the human vocal tract. Concatenation of the demi-syllable units is facilitated by two separate cross fade techniques, one applied in the time domain to the demi-syllable source signal waveforms, and one applied in the frequency domain by interpolating the corresponding filter parameters of the concatenated demi-syllables. The dual cross fade technique results in natural sounding synthesis that avoids time-domain glitches without degrading or smearing characteristic resonances in the filter domain.

Type: Grant

Filed: November 25, 1998

Date of Patent: November 7, 2000

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Steve Pearson, Nicholas Kibre, Nancy Niedzielski
Personalized audio information delivery system

Patent number: 6122617

Abstract: An information delivery system for delivery of audible information to a plurality of end-users comprising, in accordance with the preferred embodiments of an apparatus of the present invention, a master controller connected to a plurality of remotely-located information sources and to a plurality of remotely-located end-user information devices. The master controller connects to a plurality of distant local controllers via a data transport network, including a public switched telecommunications network and a broadcast data transport network. Each local controller includes a sound synthesizer which connects to an end-user audio device, such as a loudspeaker system, a tape recorder, or earphone.

Type: Grant

Filed: May 27, 1999

Date of Patent: September 19, 2000

Inventor: Gary S. Tjaden
Pitch synchronized sinusoidal synthesizer

Patent number: 6029133

Abstract: A pitch synchronous sinusoidal synthesizer for multi-band excitation vocoders will produce excitation signals necessary to artificially mimic speech from input data. The input data will contain the pitch frequencies for current and previous synthesizing frame samples, starting phase information for all harmonics within the current synthesizing frame sample, magnitudes for each of the harmonics present within the current synthesizing frame sample, the voiced/unvoiced decisions for each of the harmonics within the current frame sample, and an energy description for the harmonics of the current synthesizing frame sample. The pitch synchronous sinusoidal synthesizer will produce the synthetic speech with a minimum of the distortion caused by the sampling and regeneration of the speech excitation signals. The pitch synchronized sinusoidal synthesizer has a plurality of pitch interpolators.

Type: Grant

Filed: September 15, 1997

Date of Patent: February 22, 2000

Assignee: Tritech Microelectronics, Ltd.

Inventor: Ma Wei
Method and device for short-time Fourier-converting and resynthesizing a speech signal, used as a vehicle for manipulating duration or pitch

Patent number: 5970440

Abstract: A method is described for short-time Fourier-converting a speech signal and for resynthesizing an output speech signal from the modulus of its short-time Fourier transform and from an initial phase. In particular, after the Fourier converting the signal is subjected to a phase-specifying operation. Subsequently speech duration is affected by systematically maintaining, periodically repeating or periodically suppressing result intervals of the successive Fourier converting and phase affecting. Finally, a resynthesizing operation is executed. Speech pitch can likewise be affected through systematically excising or inserting signal intervals. Finally, the two strategies can be combined, so that ultimately, pitch and duration can be affected independently from each other.

Type: Grant

Filed: November 22, 1996

Date of Patent: October 19, 1999

Assignee: U.S. Philips Corporation

Inventors: Raymond N. J. Veldhuis, Haiyan He
Sound characteristic converter, sound-label association apparatus and method therefor

Patent number: 5956685

Abstract: This invention relates to converting the characteristics of sounds such as oices, musical tones, natural sounds, and so on, and more specifically to facilitating the conversion operation, and also to sound-label association suitable for the characteristic conversion. Various embodiments of the invention comprise several of the following elements to provide useful results: sound-label data holding means, display control means, conversion means, sound-label dividing means, label-data dividing means, association forming means, data inputting means, and communication means. Other embodiments of the invention may be practiced as processes or articles of manufacture.

Type: Grant

Filed: March 11, 1997

Date of Patent: September 21, 1999

Assignees: Arcadia, Inc., ATR Human Information Processing Research Laboratories, Co., Inc.

Inventors: Seiichi Tenpaku, Yoh'Ichi Tohkura
Method and apparatus for reproducing speech signals and method for transmitting same

Patent number: 5926788

Abstract: An encoding unit 2 divides speech signals provided to an input terminal 10 into frames and encodes the divided signals on the frame basis to output encoding parameters such as line spectral pair (LSP) parameters, pitch, voiced(V)/unvoiced (UV) or spectral amplitude A.sub.m. The modified encoding parameter calculating unit 3 interpolates the encoding parameters for calculating modified encoding parameters associated with desired time points. A decoding unit 6 synthesizes sine waves and the noise based upon the modified encoding parameters and outputs the synthesized speech signals at an output terminal 37. Speed control can be achieved easily at an arbitrary rate over a wide range with high sound quality with the phoneme and the pitch remaining unchanged.

Type: Grant

Filed: June 17, 1996

Date of Patent: July 20, 1999

Assignee: Sony Corporation

Inventor: Masayuki Nishiguchi
Personalized audio information delivery system

Patent number: 5915238

Abstract: An information delivery system for delivery of audible information to a plurality of end-users comprising, in accordance with the preferred embodiments of an apparatus of the present invention, a master controller connected to a plurality of remotely-located information sources and to a plurality of remotely-located end-user information devices. The master controller connects to a plurality of distant local controllers via a data transport network, including a public switched telecommunications network and a broadcast data transport network. Each local controller includes a sound synthesizer which connects to an end-user audio device, such as a loudspeaker system, a tape recorder, or earphone.

Type: Grant

Filed: July 16, 1996

Date of Patent: June 22, 1999

Inventor: Gary S. Tjaden
Method, device and system for using statistical information to reduce computation and memory requirements of a neural network based speech synthesis system

Patent number: 5913194

Abstract: A method (400), device and system (300) provide, in response to linguistic information, efficient generation of a parametric representation of speech using a neural network.

Type: Grant

Filed: July 14, 1997

Date of Patent: June 15, 1999

Assignee: Motorola, Inc.

Inventors: Orhan Karaali, Noel Massey, Gerald Corrigan
Waveform interpolation speech coding using splines

Patent number: 5903866

Abstract: A low-complexity method and apparatus for performing waveform interpolation in a low bit-rate WI speech decoder, wherein interpolation between received waveforms is performed with use of spline coefficients generated based thereupon. Specifically, two signals are received from a WI encoder, each comprising a set of frequency domain parameters representing a speech signal segment of a corresponding pitch period. Then, spline coefficients are generated from each of the received signals, wherein each set of spline coefficients comprises a spline representation of a time domain transformation of the corresponding set of frequency domain parameters. Finally, the decoder interpolates between the spline representations to generate interpolated time domain data which is used to synthesize a reconstructed speech signal. In certain embodiments of the present invention, the time scale of at least one of the spline representations is modified to enable the interpolation therebetween.

Type: Grant

Filed: March 10, 1997

Date of Patent: May 11, 1999

Assignee: Lucent Technologies Inc.

Inventor: Yair Shoham
Speech decoding device to update the synthesis postfilter and prefilter during unvoiced speech or noise

Patent number: 5899967

Abstract: In the disclosed speech decoding device, activation of a postfilter process is halted during unvoiced sections. However, the updating process of the internal states of the postfilter continues even though the postfilter process is not activated during unvoiced sections. At changes between voiced and unvoiced sections, output signals outputted during voiced sections that have been subjected to a postfilter process and output signals outputted during unvoiced sections that have not been subjected to a postfilter process are interpolated and outputted. In one embodiment, a prefilter controller activates a prefilter state updater for unvoiced sections to update the internal state of the filter of a prefilter section based on excited signals to decrease any perception of noncontinuity in an output signal switching between activation and deactivation of the prefilter when changing between voiced and unvoiced sections.

Type: Grant

Filed: March 25, 1997

Date of Patent: May 4, 1999

Assignee: NEC Corporation

Inventor: Mayumi Nagasaki
Interpolating between representative frame waveforms of a prediction error signal for speech synthesis

Patent number: 5890118

Abstract: A speech synthesis apparatus includes; a memory for storing a plurality of typical waveforms corresponding to a plurality of frames, the typical waveforms each previously obtained by extracting in units of at least one frame from a prediction error signal formed in predetermined units, a voiced speech source generator including an interpolation circuit for performing interpolation between the typical waveforms read out from the memory means to obtain a plurality of interpolation signals each having at least one of an interpolation pitch period and a signal level which changes smoothly between the corresponding frames, a superposition circuit for superposing the interpolation signals obtained by the interpolation circuit to form a voiced speech source signal, an unvoiced speech source generator for generating an unvoiced speech source signal, and a vocal tract filter selectively driven by the voiced speech source signal outputted from the voiced speech source generator and the unvoiced speech source signal fro

Type: Grant

Filed: March 8, 1996

Date of Patent: March 30, 1999

Assignee: Kabushiki Kaisha Toshiba

Inventors: Takehiko Kagoshima, Masami Akamine
Audio data decompression and interpolation apparatus and method

Patent number: 5890126

Abstract: Apparatus for simultaneously decompressing and interpolating compressed audio data. The compressed audio data is stored in differential log format, meaning that the difference between each two consecutive data points is taken and the log of the difference calculated to form each compressed data point. To efficiently decompress and interpolate the compressed data, advantage is taken of the fact that addition of logs is equivalent to multiplication of linear values. Thus the log of an interpolation factor is added to each compressed data point prior to taking the inverse log of the sum. An integrator block completes the interpolation and decompression of the data.

Type: Grant

Filed: March 10, 1997

Date of Patent: March 30, 1999

Assignee: EuPhonics, Incorporated

Inventor: Eric Lindemann
Prototype waveform speech coding with interpolation of pitch, pitch-period waveforms, and synthesis filter

Patent number: 5884253

Abstract: A speech coding system providing reconstructed voiced speech with a smoothly evolving pitch-cycle waveform. A speech signal is represented by isolating and coding prototype waveforms. Each prototype waveform is an exemplary pitch-cycle of voiced speech. A coded prototype waveform is transmitted at regular intervals to a receiver which synthesizes (or reconstructs) an estimate of the original speech segment based on the prototypes. The estimate of the original speech signal is provided by a prototype interpolation process which provides a smooth time-evolution of pitch-cycle waveforms in the reconstructed speech. Illustratively, a frame of original speech is coded by first filtering the frame with a linear predictive filter. Next a pitch-cycle of the filtered original is identified and extracted as a prototype waveform. The prototype waveform is then represented as a set of Fourier series (frequency domain) coefficients.

Type: Grant

Filed: October 3, 1997

Date of Patent: March 16, 1999

Assignee: Lucent Technologies, Inc.

Inventor: Willem Bastiaan Kleijn
Method and apparatus for reducing artifacts that result from time compressing and decompressing speech

Patent number: 5884268

Abstract: A processing system time compresses a voice message before transmission, and processing system time expands the message after reception. To process the message the processing systems perform at least one of: (a) randomizing the order of a sequence of samples form a silent portion of the message after reception thereof before blending the sequence with a last portion of the expanded message; (b) selecting the sequence of samples from the silent portion of the message after reception thereof, the sequence selected being poorly correlated with the last portion of the expanded message, before blending the sequence with the last portion of the message; and (c) compressing the dynamic range of the message before transmission, by an amount dependent upon the signal-to-noise ratio of the message, and aggressively expanding the dynamic range of the message after reception, by a fixed amount.

Type: Grant

Filed: June 27, 1997

Date of Patent: March 16, 1999

Assignee: Motorola, Inc.

Inventors: William Michael Campbell, Clifford Allan Wood, James Earl Womack, Wade Alan Bastien, Deborah Ann Calie, Robert Andrew Rapp, Terence Edward Sumner
Transmission system for quasi periodic signals

Patent number: 5878081

Abstract: In a transmission system for a quasi-periodic signal, an encoder (12) transmits to a decoder (28) no more than a single period from a total number of periods of the quasi-periodic signal in encoded form. In this decoder (28) the lacking periods of the quasi-periodic signal are reconstructed by means of interpolation. For obtaining an enhanced quality of the reconstructed signal, a signal segment is transmitted that is representative of two successive periods of the quasi-periodic signal. This signal segment may contain two successive periods, but may also be a signal segment having the length of a single period and being determined by a weighted sum of two successive periods.

Type: Grant

Filed: October 14, 1997

Date of Patent: March 2, 1999

Assignee: U.S. Philips Corporation

Inventors: Robert J. Sluijter, Eric Kathmann, Rakesh Taori
Speech synthesis with equal interval line spectral pair frequency interpolation

Patent number: 5864796

Abstract: A speech synthesis apparatus in which spectrum emphasis characteristics can be set easily taking into account the frequency response and psychoacoustic hearing sense and in which the degree of freedom in setting the response is larger. An excitation signal ex(n) is synthesized by a synthesis filter 12 to give a synthesized speech signal which is sent to a spectrum emphasis filter 13. The spectrum emphasis filter 13 spectrum-emphasizes the synthesized speech signal and outputs the resulting spectrum-emphasized signal. The vocal tract parameters from an input terminal 21 are converted by a parameter conversion circuit 23 into linear spectral pair (LSP) frequencies which are interpolated by an LSP interpolation circuit 24 with equal-interval line spectral pair frequencies to produce interpolated LSP frequencies. The transfer function of the spectrum emphasis filter 13 is determined on the basis of the interpolated LSP frequencies.

Type: Grant

Filed: February 6, 1997

Date of Patent: January 26, 1999

Assignee: Sony Corporation

Inventors: Akira Inoue, Masayuki Nishiguchi
Method for improving the voice quality in low-rate dynamic bit allocation sub-band coding

Patent number: 5842160

Abstract: Voice quality is improved in low-rate dynamic bit allocation sub-band coding (DBASC) by inserting synthetic filler signals for non-transmitted signal bands. Large spectral holes may exist in an output voice speech spectrum of DBASC with a low transmission rate. The holes are caused by low energy signal sub-bands that are not transmitted and create noticeable artifacts in the received voice signal. To avoid these artifacts, the spectral holes in the received signal are filled with synthetic signals generated from the received signal. The synthetic filler signals are signals from transmitted energy bands that are scaled to the energy level of the non-transmitted band.

Type: Grant

Filed: July 18, 1997

Date of Patent: November 24, 1998

Assignee: Ericsson Inc.

Inventor: Richard L Zinser
Speech coding parameter sequence reconstruction by sequence classification and interpolation

Patent number: 5839102

Abstract: A method and apparatus which allows the transmission of the perceptually important features of a speech-coding parameter at a low bit rate. The speech coding parameter may, for example, comprise the signal power of the speech. The parameter is processed on a block by block basis. The parameter value at the block boundaries is transmitted by conventional methods such as, for example, by means of differential quantization. The shape of the reconstructed parameter contour within block boundaries is based on a classification. The classification determines perceptually important features of the parameter contour within a block. The classification can be performed either at the transmitting end of the coder (using, for example, the original parameter contour with high time resolution and possibly other speech parameters as well) or at the receiving end of the coder (using, for example, the transmitted parameter values, and possibly other transmitted speech parameters as well).

Type: Grant

Filed: November 30, 1994

Date of Patent: November 17, 1998

Assignee: Lucent Technologies Inc.

Inventors: Jesper Haagen, Willem Bastiaan Kleijn
Lossless and loss-limited compression of sampled data signals

Patent number: 5839100

Abstract: An efficient method for compressing audio and other sampled data signals without loss, or with a controlled amount of loss, is described. The compression apparatus contains a subset selector, an approximator, an adder, two derivative encoders, a header encoder, and a compressed block formatter. The decompression apparatus contains a compressed block parser, a header decoder, two integration decoders, an approximator, and an adder. The compressor first divides each block of input samples into a first subset and a second subset. The approximator uses the first subset samples to approximate the second subset samples. An error signal is created by subtracting the approximated second subset samples from the actual second subset samples. The first subset samples and error signal are separately encoded by the derivative encoders, which select the signal's derivative that requires the least amount of storage for a block floating point representation.

Type: Grant

Filed: April 22, 1996

Date of Patent: November 17, 1998

Inventor: Albert William Wegener
System architecture and method for linear interpolation implementation

Patent number: 5832436

Abstract: The present invention comprises an interpolation circuit for performing a linear interpolation operation for a voice synthesizer receiving a plurality of frames of voice data wherein each frame is subdivided into a plurality of sub frames with each sub frame being assigned with a predefined interpolation weighting factor. The interpolation circuit comprises an input port for receiving a plurality of sub frame voice data to be interpolated wherein said sub frame data are received in a predefined order. The interpolation circuit further includes a read-only-memory (ROM) for storing a plurality of multiplications of each of the plurality of sub frame voice data to each of the weighting factors and a read-only-memory (ROM) address generator capable of determining a address in ROM utilizing the sub frame voice data received from the input port and the predefined order of the frame and the sub frame thereof when the voice data is being received.

Type: Grant

Filed: April 16, 1997

Date of Patent: November 3, 1998

Assignee: Industrial Technology Research Institute

Inventor: Chau-kai Hsieh
Method for voice analysis and synthesis using wavelets

Patent number: 5826232

Abstract: The voice synthesis of the invention analyzes a voice signal by orthogonal breakdown on a basis of wavelets with compact support, preferably Daubechies wavelets. The synthesis is carried out on the basis of coefficients which are stored and selected during the analysis, according to the same algorithm as that used for the analysis.

Type: Grant

Filed: February 18, 1993

Date of Patent: October 20, 1998

Assignee: Sextant Avionique

Inventor: Christian Gulli
Speech decoder having an interpolation circuit for updating background noise

Patent number: 5809460

Abstract: In a LPC speech signal decoder, background noise is simulated during periods of silence at the transmitting end based upon a background noise frame containing information about the background noise at the sending end. When the silence persists, the transmitter periodically updates the background noise frame previously send by transmitting an updating background noise frame. When an update background noise frame is received, an interpolation is performed so as to make the simulated background noise sound natural to the listener. The interpolation process includes a step of selecting between interpolation spectrum parameters which are produced by the interpolation process and the updated spectrum parameters which are based solely upon the most recent updated background noise frame.

Type: Grant

Filed: November 7, 1994

Date of Patent: September 15, 1998

Assignee: NEC Corporation

Inventors: Toshihiro Hayata, Yoshihiro Unno
Signal quantizer wherein average level replaces subframe steady-state levels

Patent number: 5794180

Abstract: A quantizer and a low bit rate communication system using the quantizer is described. The quantizer includes a 3-bit and 5-bit encoder where the 3-bit encoder provides the encoded gain for a first half of a sampled frame of speech and the second encoder for the second half of the frame of speech. A special 3-bit code is provided when a steady state is determined by comparing the 3-bit code and neighboring 5-bit codes. The decoder in the system when detecting the special code provides an average of the 5-bit codes if the decoded 5-bit code is within 5 dB of the previous 5-bit code.

Type: Grant

Filed: April 30, 1996

Date of Patent: August 11, 1998

Assignee: Texas Instruments Incorporated

Inventor: Alan V. McCree
Method and apparatus for text-to-voice audio output with accent control and improved phrase control

Patent number: 5758320

Abstract: A text-to-voice audio output unit includes a storage section for storing analyzed information pertaining to words, boundaries between articulations, and accents obtained by analyzing an input character list, a voice synthesis rule section for changing a reduction or damping characteristic of a phrase component of a fundamental frequency of an output voice, and a voice synthesizing section for generating a composite tone based on the analyzed information from the storage section. The reduction or damping characteristic, calculated for each phrase component, is overdamped, critically damped, or underdamped and is based on speech rate, syntactic information, number of articulations, and positional information. When a prosodic phrase is short, the reduction or damping characteristic causes a decrease in the fundamental frequency for a meaningfully-delimited portion, and when a prosodic phrase is long, the reduction or damping characteristic is controlled over the entire prosodic phrase.

Type: Grant

Filed: June 12, 1995

Date of Patent: May 26, 1998

Assignee: Sony Corporation

Inventor: Yasuharu Asano
System for automatically morphing audio information

Patent number: 5749073

Abstract: In the first step of a sound morphing process, each sound which forms the basis for the morph is converted into one or more quantitative representations, such as spectrograms. After the representations have been obtained, the temporal axes of the two sounds are matched, so that similar components of the two sounds, such as onsets, harmonic regions and inharmonic regions, are aligned with one another. Other characteristics of the sounds, such as pitch, formant frequencies, or the like, are then matched. Once the energy in each of the sounds has been accounted for and matched to that of the other sound, the two sounds are cross-faded, to produce a representation of a new sound. This representation is then inverted, to generate the morphed sound.

Type: Grant

Filed: March 15, 1996

Date of Patent: May 5, 1998

Assignee: Interval Research Corporation

Inventor: Malcolm Slaney
Digital sound recording and reproduction device using a coding technique to compress data for reduction of memory requirements

Patent number: 5668924

Abstract: A sound recording and reproducing device codes a sound signal by a code excited linear predictive coding means including an adaptive code book created on the basis of past excitation signals, a stochastic code book having a plurality of different stochastic signals and a pulse code book having a plurality of pulse signals. When decoding the sound data coded frame by frame by code excited linear predictive coding, each frame comprising a plurality of sub-frames, by using an adaptive code book created on the basis of a past excitation signal, and a stochastic code book having a plurality of different stochastic signals, a switch controller controls switches such that the coded data in a predetermined sub-frame of each frame is skipped in accordance with the rate of change of reproduction speed to decode the coded data in the remaining sub-frame or sub-frames without decoding the coded data in the skipped sub-frame, and the contents of the adaptive code book are renewed.

Type: Grant

Filed: September 27, 1995

Date of Patent: September 16, 1997

Assignee: Olympus Optical Co. Ltd.

Inventor: Hidetaka Takahashi
Processing of acoustic waveforms

Patent number: RE36478

Abstract: A sinusoidal model for acoustic waveforms is applied to develop a new analysis/synthesis technique which characterizes a waveform by the amplitudes, frequencies, and phases of component sine waves. These parameters are estimated from a short-time Fourier transform. Rapid changes in the highly-resolved spectral components are tracked using the concept of "birth" and "death" of the underlying sine waves. The component values are interpolated from one frame to the next to yield a representation that is applied to a sine wave generator. The resulting synthetic waveform preserves the general waveform shape and is perceptually indistinguishable from the original. Furthermore, in the presence of noise the perceptual characteristics of the waveform as well as the noise are maintained. The method and devices are particularly useful in speech coding, time-scale modification, frequency scale modification and pitch modification.

Type: Grant

Filed: April 12, 1996

Date of Patent: December 28, 1999

Assignee: Massachusetts Institute of Technology

Inventors: Robert J. McAulay, Thomas F. Quatieri, Jr.

prev 1 2 3 4