Patents by Inventor Allen Gersho

Allen Gersho has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

REW parametric vector quantization and dual-predictive SEW vector quantization for waveform interpolative coding

Patent number: 7584095

Abstract: An enhanced analysis-by-synthesis waveform interpolative speech coder able to operate at 2.8 kbps. Novel features include dual-predictive analysis-by-synthesis quantization of the slowly-evolving waveform, efficient parametrization of the rapidly-evolving waveform magnitude, and analysis-by-synthesis vector quantization of the rapidly evolving waveform parameter. Subjective quality tests indicate that it exceeds G.723.1 at 5.3 kbps, and of G.723.1 at 6.3 kbps.

Type: Grant

Filed: September 23, 2005

Date of Patent: September 1, 2009

Assignee: The Regents of the University of California

Inventors: Oded Gottesman, Allen Gersho
LPC-harmonic vocoder with superframe structure

Patent number: 7315815

Abstract: An enhanced low-bit rate parametric voice coder that groups a number of frames from an underlying frame-based vocoder, such as MELP, into a superframe structure. Parameters are extracted from the group of underlying frames and quantized into the superframe which allows the bit rate of the underlying coding to be reduced without increasing the distortion. The speech data coded in the superframe structure can then be directly synthesized to speech or may be transcoded to a format so that an underlying frame-based vocoder performs the synthesis. The superframe structure includes additional error detection and correction data to reduce the distortion caused by the communication of bit errors.

Type: Grant

Filed: September 22, 1999

Date of Patent: January 1, 2008

Assignee: Microsoft Corporation

Inventors: Allen Gersho, Vladimir Cuperman, Tian Wang, Kazuhito Koishida
LPC-harmonic vocoder with superframe structure

Patent number: 7286982

Abstract: An enhanced low-bit rate parametric voice coder that groups a number of frames from an underlying frame-based vocoder, such as MELP, into a superframe structure. Parameters are extracted from the group of underlying frames and quantized into the superframe which allows the bit rate of the underlying coding to be reduced without increasing the distortion. The speech data coded in the superframe structure can then be directly synthesized to speech or may be transcoded to a format so that an underlying frame-based vocoder performs the synthesis. The superframe structure includes additional error detection and correction data to reduce the distortion caused by the communication of bit errors.

Type: Grant

Filed: July 20, 2004

Date of Patent: October 23, 2007

Assignee: Microsoft Corporation

Inventors: Allen Gersho, Vladimir Cuperman, Tian Wang, Kazuhito Koishida
Frequency domain postfiltering for quality enhancement of coded speech

Patent number: 7124077

Abstract: A method and system of performing postfiltering in the frequency domain to improve the quality of a speech signal, especially for synthesized speech resulting from codecs of low bit-rate, is provided. The method comprises LPC tilt computation and compensation methods and modules, a formant filter gain computation method and module, and an anti-aliasing method and module. The formant filter gain calculation employs an LPC representation, an all-pole modeling, a non-linear transformation and a phase computation. The LPC used for deriving the postfilter may be transmitted from an encoder or may be estimated from a synthesized or other speech signal in a decoder or receiver. The invention may be implemented in a linked decoder and encoder. A separate LPC evaluation unit that is responsible for processing and or deriving the LPC may be implemented within the invention.

Type: Grant

Filed: January 28, 2005

Date of Patent: October 17, 2006

Assignee: Microsoft Corporation

Inventors: Hong Wang, Vladimir Cuperman, Allen Gersho, Hosam A. Khalil
REW parametric vector quantization and dual-predictive SEW vector quantization for waveform interpolative coding

Publication number: 20060069554

Abstract: An enhanced analysis-by-synthesis waveform interpolative speech coder able to operate at 2.8 kbps. Novel features include dual-predictive analysis-by-synthesis quantization of the slowly-evolving waveform, efficient parametrization of the rapidly-evolving waveform magnitude, and analysis-by-synthesis vector quantization of the rapidly evolving waveform parameter. Subjective quality tests indicate that it exceeds G.723.1 at 5.3 kbps, and of G.723.1 at 6.3 kbps.

Type: Application

Filed: September 23, 2005

Publication date: March 30, 2006

Inventors: Oded Gottesman, Allen Gersho
REW parametric vector quantization and dual-predictive SEW vector quantization for waveform interpolative coding

Patent number: 7010482

Abstract: An enhanced analysis-by-synthesis waveform interpolative speech coder able to operate at 2.8 kbps. Novel features include dual-predictive analysis-by-synthesis quantization of the slowly-evolving waveform, efficient parametrization of the rapidly-evolving waveform magnitude, and analysis-by-synthesis vector quantization of the rapidly evolving waveform parameter. Subjective quality tests indicate that it exceeds G.723.1 at 5.3 kbps, and of G.723.1 at 6.3 kbps.

Type: Grant

Filed: March 16, 2001

Date of Patent: March 7, 2006

Assignee: The Regents of the University of California

Inventors: Oded Gottesman, Allen Gersho
Frequency domain postfiltering for quality enhancement of coded speech

Patent number: 6941263

Abstract: A method and system of performing postfiltering in the frequency domain to improve the quality of a speech signal, especially for synthesized speech resulting from codecs of low bit-rate, is provided. The method comprises LPC tilt computation and compensation methods and modules, a formant filter gain computation method and module, and an anti-aliasing method and module. The formant filter gain calculation employs an LPC representation, an all-pole modeling, a non-linear transformation and a phase computation. The LPC used for deriving the postfilter may be transmitted from an encoder or may be estimated from a synthesized or other speech signal in a decoder or receiver. The invention may be implemented in a linked decoder and encoder. A separate LPC evaluation unit that is responsible for processing and or deriving the LPC may be implemented within the invention.

Type: Grant

Filed: June 29, 2001

Date of Patent: September 6, 2005

Assignee: Microsoft Corporation

Inventors: Hong Wang, Vladimir Cuperman, Allen Gersho, Hosam A. Khalil
Frequency domain postfiltering for quality enhancement of coded speech

Publication number: 20050131696

Abstract: A method and system of performing postfiltering in the frequency domain to improve the quality of a speech signal, especially for synthesized speech resulting from codecs of low bit-rate, is provided. The method comprises LPC tilt computation and compensation methods and modules, a formant filter gain computation method and module, and an anti-aliasing method and module. The formant filter gain calculation employs an LPC representation, an all-pole modeling, a non-linear transformation and a phase computation. The LPC used for deriving the postfilter may be transmitted from an encoder or may be estimated from a synthesized or other speech signal in a decoder or receiver. The invention may be implemented in a linked decoder and encoder. A separate LPC evaluation unit that is responsible for processing and or deriving the LPC may be implemented within the invention.

Type: Application

Filed: January 28, 2005

Publication date: June 16, 2005

Applicant: Microsoft Corporation

Inventors: Hong Wang, Vladimir Cuperman, Allen Gersho, Hosam Khalil
LPC-harmonic vocoder with superframe structure

Publication number: 20050075869

Abstract: An enhanced_low-bit rate parametric voice coder that groups a number of frames from an underlying frame-based vocoder, such as MELP, into a superframe structure. Parameters are extracted from the group of underlying frames and quantized into the superframe which allows the bit rate of the underlying coding to be reduced without increasing the distortion. The speech data coded in the superframe structure can then be directly synthesized to speech or may be transcoded to a format so that an underlying frame-based vocoder performs the synthesis. The superframe structure includes additional error detection and correction data to reduce the distortion caused by the communication of bit errors.

Type: Application

Filed: July 20, 2004

Publication date: April 7, 2005

Applicant: Microsoft Corporation

Inventors: Allen Gersho, Vladimir Cuperman, Tian Wang, Kazuhito Koishida
Method for coding speech and music signals

Patent number: 6658383

Abstract: The present invention provides a transform coding method efficient for music signals that is suitable for use in a hybrid codec, whereby a common Linear Predictive (LP) synthesis filter is employed for both speech and music signals. The LP synthesis filter switches between a speech excitation generator and a transform excitation generator, in accordance with the coding of a speech or music signal, respectively. For coding speech signals, the conventional CELP technique may be used, while a novel asymmetrical overlap-add transform technique is applied for coding music signals. In performing the common LP synthesis filtering, interpolation of the LP coefficients is conducted for signals in overlap-add operation regions. The invention enables smooth transitions when the decoder switches between speech and music decoding modes.

Type: Grant

Filed: June 26, 2001

Date of Patent: December 2, 2003

Assignee: Microsoft Corporation

Inventors: Kazuhito Koishida, Vladimir Cuperman, Amir H. Majidimehr, Allen Gersho
Variable bit rate coder, and associated method, for a communication station operable in a communication system

Patent number: 6625226

Abstract: A variable bit rate coder, and an associated method, for encoding a frame of speech, such as frames of data generated during operation of a communication station operable in a cellular communication system. Selection of the coding rate is made responsive to indicia of actual coding performance of a coder at more than one coding rate.

Type: Grant

Filed: December 3, 1999

Date of Patent: September 23, 2003

Inventors: Allen Gersho, Vladimir Cuperman, Jan Linden, Ajit V. Rao, Sassan Ahmadi, Fenghua Liu, Ryan Heidari
Frequency domain postfiltering for quality enhancement of coded speech

Publication number: 20030009326

Abstract: A method and system of performing postfiltering in the frequency domain to improve the quality of a speech signal, especially for synthesized speech resulting from codecs of low bit-rate, is provided. The method comprises LPC tilt computation and compensation methods and modules, a formant filter gain computation method and module, and an anti-aliasing method and module. The formant filter gain calculation employs an LPC representation, an all-pole modeling, a non-linear transformation and a phase computation. The LPC used for deriving the postfilter may be transmitted from an encoder or may be estimated from a synthesized or other speech signal in a decoder or receiver. The invention may be implemented in a linked decoder and encoder. A separate LPC evaluation unit that is responsible for processing and or deriving the LPC may be implemented within the invention.

Type: Application

Filed: June 29, 2001

Publication date: January 9, 2003

Applicant: Microsoft Corporation

Inventors: Hong Wang, Vladimir Cuperman, Allen Gersho, Hosam A. Khalil
Method for coding speech and music signals

Publication number: 20030004711

Abstract: The present invention provides a transform coding method efficient for music signals that is suitable for use in a hybrid codec, whereby a common Linear Predictive (LP) synthesis filter is employed for both speech and music signals. The LP synthesis filter switches between a speech excitation generator and a transform excitation generator, in accordance with the coding of a speech or music signal, respectively. For coding speech signals, the conventional CELP technique may be used, while a novel asymmetrical overlap-add transform technique is applied for coding music signals. In performing the common LP synthesis filtering, interpolation of the LP coefficients is conducted for signals in overlap-add operation regions. The invention enables smooth transitions when the decoder switches between speech and music decoding modes.

Type: Application

Filed: June 26, 2001

Publication date: January 2, 2003

Applicant: Microsoft Corporation

Inventors: Kazuhito Koishida, Vladimir Cuperman, Amir H. Majidimehr, Allen Gersho
Method and apparatus for hybrid coding of speech at 4KBPS having phase alignment between mode-switched frames

Patent number: 6475245

Abstract: A method and apparatus for encoding speech for communication to a decoder for reproduction of the speech where the speech signal is classified into steady state voiced (harmonic), stationary unvoiced, and “transitory” or “transition” speech, and a particular type of coding scheme is used for each class. Harmonic coding is used for steady state voiced speech, “noise-like” coding is used for stationary unvoiced speech, and a special coding mode is used for transition speech, designed to capture the location, the structure, and the strength of the local time events that characterize the transition portions of the speech. The compression schemes can be applied to the speech signal or to the LP residual signal.

Type: Grant

Filed: February 5, 2001

Date of Patent: November 5, 2002

Assignee: The Regents of the University of California

Inventors: Allen Gersho, Eyal Shlomot, Vladimir Cuperman, Chunyan Li
REW parametric vector quantization and dual-predictive SEW vector quantization for waveform interpolative coding

Publication number: 20020116184

Abstract: An enhanced analysis-by-synthesis waveform interpolative speech coder able to operate at 2.8 kbps. Novel features include dual-predictive analysis-by-synthesis quantization of the slowly-evolving waveform, efficient parametrization of the rapidly-evolving waveform magnitude, and analysis-by-synthesis vector quantization of the rapidly evolving waveform parameter. Subjective quality tests indicate that it exceeds G.723.1 at 5.3 kbps, and of G.723.1 at 6.3 kbps.

Type: Application

Filed: March 16, 2001

Publication date: August 22, 2002

Inventors: Oded Gottsman, Allen Gersho
Adaptive windows for analysis-by-synthesis CELP-type speech coding

Patent number: 6311154

Abstract: A speech coder and a method for speech coding wherein the speech signal is represented by an excitation signal applied to a synthesis filter. The speech is partitioned into frames and subframes. A classifier identifies which of several categories the speech frame belongs to, and a different coding method is applied to represent the excitation for each category. For some categories, one or more windows are identified for the frame where all or most of the excitation signal samples are assigned by a coding scheme. Performance is enhanced by coding the important segments of the excitation more accurately. The window locations are determined from a linear prediction residual by identifying peaks of the smoothed residual energy contour. The method adjusts the frame and subframe boundaries so that each window is located entirely within a modified subframe or frame.

Type: Grant

Filed: December 30, 1998

Date of Patent: October 30, 2001

Assignee: Nokia Mobile Phones Limited

Inventors: Allen Gersho, Vladimir Cuperman, Ajit V Rao, Tung-Chiang Yang, Sassan Ahmadi, Fenghua Liu
Method and apparatus for hybrid coding of speech at 4kbps

Publication number: 20010023396

Abstract: A method and apparatus for encoding speech for communication to a decoder for reproduction of the speech where the speech signal is classified into steady state voiced (harmonic), stationary unvoiced, and “transitory” or “transition” speech, and a particular type of coding scheme is used for each class. Harmonic coding is used for steady state voiced speech, “noise-like” coding is used for stationary unvoiced speech, and a special coding mode is used for transition speech, designed to capture the location, the structure, and the strength of the local time events that characterize the transition portions of the speech. The compression schemes can be applied to the speech signal or to the LP residual signal.

Type: Application

Filed: February 5, 2001

Publication date: September 20, 2001

Inventors: Allen Gersho, Eyal Shlomot, Vladimir Cuperman, Chunyan Li
Method and apparatus for hybrid coding of speech at 4kbps

Patent number: 6233550

Abstract: A method and apparatus for encoding speech for communication to a decoder for reproduction of the speech where the speech signal is classified into steady state voiced (harmonic), stationary unvoiced, and “transitory” or “transition” speech, and a particular type of coding scheme is used for each class. Harmonic coding is used for steady state voiced speech, “noise-like” coding is used for stationary unvoiced speech, and a special coding mode is used for transition speech, designed to capture the location, the structure, and the strength of the local time events that characterize the transition portions of the speech. The compression schemes can be applied to the speech signal or to the LP residual signal.

Type: Grant

Filed: August 28, 1998

Date of Patent: May 15, 2001

Assignee: The Regents of the University of California

Inventors: Allen Gersho, Eyal Shlomot, Vladimir Cuperman, Chunyan Li
Universal codebook vector quantization with constrained storage

Patent number: 5889891

Abstract: Vectors associated with multiple sources are quantized with a codebook. A mapping function associates each source with a subset of codevectors in the codebook, each subset being defined prior to quantization. Each vector from a source is matched to a codevector in the subset corresponding to the source. The universal codebook is designed by first providing a codebook and a mapping function. Using the codebook, the mapping function is updated by identifying subsets that reduce distortion when vectors from a source are quantized using a corresponding subset.

Type: Grant

Filed: November 21, 1995

Date of Patent: March 30, 1999

Assignee: Regents of the University of California

Inventors: Allen Gersho, Kenneth Rose, Sangeeta Ramakrishnan
Variable dimension vector quantization

Patent number: 5890110

Abstract: A variable dimension vector quantization method that uses a single "universal" codebook. The method can be given the interpretation of sampling full-dimensioned codevectors in the universal codebook and generating subcodevectors of the same dimension as input data subvector, which dimension may vary in time. A subcodevector is selected from the codebook to have minimum distortion between it and the input data subvector. The subcodevector with minimum distortion corresponds to the representative, full-dimensioned codevector in the codebook. The codebook is designed by inverse sampling of training subvectors to obtain full-dimension vectors, then iteratively clustering the training set until a stable centroid vector is obtained.

Type: Grant

Filed: March 27, 1995

Date of Patent: March 30, 1999

Assignee: The Regents of the University of California

Inventors: Allen Gersho, Amitava Das, Ajit Venkat Rao

1 2 next