Patents by Inventor Vladimir Cuperman

Vladimir Cuperman has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7315815
    Abstract: An enhanced low-bit rate parametric voice coder that groups a number of frames from an underlying frame-based vocoder, such as MELP, into a superframe structure. Parameters are extracted from the group of underlying frames and quantized into the superframe which allows the bit rate of the underlying coding to be reduced without increasing the distortion. The speech data coded in the superframe structure can then be directly synthesized to speech or may be transcoded to a format so that an underlying frame-based vocoder performs the synthesis. The superframe structure includes additional error detection and correction data to reduce the distortion caused by the communication of bit errors.
    Type: Grant
    Filed: September 22, 1999
    Date of Patent: January 1, 2008
    Assignee: Microsoft Corporation
    Inventors: Allen Gersho, Vladimir Cuperman, Tian Wang, Kazuhito Koishida
  • Patent number: 7286982
    Abstract: An enhanced low-bit rate parametric voice coder that groups a number of frames from an underlying frame-based vocoder, such as MELP, into a superframe structure. Parameters are extracted from the group of underlying frames and quantized into the superframe which allows the bit rate of the underlying coding to be reduced without increasing the distortion. The speech data coded in the superframe structure can then be directly synthesized to speech or may be transcoded to a format so that an underlying frame-based vocoder performs the synthesis. The superframe structure includes additional error detection and correction data to reduce the distortion caused by the communication of bit errors.
    Type: Grant
    Filed: July 20, 2004
    Date of Patent: October 23, 2007
    Assignee: Microsoft Corporation
    Inventors: Allen Gersho, Vladimir Cuperman, Tian Wang, Kazuhito Koishida
  • Patent number: 7124077
    Abstract: A method and system of performing postfiltering in the frequency domain to improve the quality of a speech signal, especially for synthesized speech resulting from codecs of low bit-rate, is provided. The method comprises LPC tilt computation and compensation methods and modules, a formant filter gain computation method and module, and an anti-aliasing method and module. The formant filter gain calculation employs an LPC representation, an all-pole modeling, a non-linear transformation and a phase computation. The LPC used for deriving the postfilter may be transmitted from an encoder or may be estimated from a synthesized or other speech signal in a decoder or receiver. The invention may be implemented in a linked decoder and encoder. A separate LPC evaluation unit that is responsible for processing and or deriving the LPC may be implemented within the invention.
    Type: Grant
    Filed: January 28, 2005
    Date of Patent: October 17, 2006
    Assignee: Microsoft Corporation
    Inventors: Hong Wang, Vladimir Cuperman, Allen Gersho, Hosam A. Khalil
  • Patent number: 6941263
    Abstract: A method and system of performing postfiltering in the frequency domain to improve the quality of a speech signal, especially for synthesized speech resulting from codecs of low bit-rate, is provided. The method comprises LPC tilt computation and compensation methods and modules, a formant filter gain computation method and module, and an anti-aliasing method and module. The formant filter gain calculation employs an LPC representation, an all-pole modeling, a non-linear transformation and a phase computation. The LPC used for deriving the postfilter may be transmitted from an encoder or may be estimated from a synthesized or other speech signal in a decoder or receiver. The invention may be implemented in a linked decoder and encoder. A separate LPC evaluation unit that is responsible for processing and or deriving the LPC may be implemented within the invention.
    Type: Grant
    Filed: June 29, 2001
    Date of Patent: September 6, 2005
    Assignee: Microsoft Corporation
    Inventors: Hong Wang, Vladimir Cuperman, Allen Gersho, Hosam A. Khalil
  • Publication number: 20050131696
    Abstract: A method and system of performing postfiltering in the frequency domain to improve the quality of a speech signal, especially for synthesized speech resulting from codecs of low bit-rate, is provided. The method comprises LPC tilt computation and compensation methods and modules, a formant filter gain computation method and module, and an anti-aliasing method and module. The formant filter gain calculation employs an LPC representation, an all-pole modeling, a non-linear transformation and a phase computation. The LPC used for deriving the postfilter may be transmitted from an encoder or may be estimated from a synthesized or other speech signal in a decoder or receiver. The invention may be implemented in a linked decoder and encoder. A separate LPC evaluation unit that is responsible for processing and or deriving the LPC may be implemented within the invention.
    Type: Application
    Filed: January 28, 2005
    Publication date: June 16, 2005
    Applicant: Microsoft Corporation
    Inventors: Hong Wang, Vladimir Cuperman, Allen Gersho, Hosam Khalil
  • Publication number: 20050075869
    Abstract: An enhanced_low-bit rate parametric voice coder that groups a number of frames from an underlying frame-based vocoder, such as MELP, into a superframe structure. Parameters are extracted from the group of underlying frames and quantized into the superframe which allows the bit rate of the underlying coding to be reduced without increasing the distortion. The speech data coded in the superframe structure can then be directly synthesized to speech or may be transcoded to a format so that an underlying frame-based vocoder performs the synthesis. The superframe structure includes additional error detection and correction data to reduce the distortion caused by the communication of bit errors.
    Type: Application
    Filed: July 20, 2004
    Publication date: April 7, 2005
    Applicant: Microsoft Corporation
    Inventors: Allen Gersho, Vladimir Cuperman, Tian Wang, Kazuhito Koishida
  • Patent number: 6785645
    Abstract: An efficient and accurate classification method for classifying speech and music signals, or other diverse signal types, is provided. The method and system are especially, although not exclusively, suited for use in real-time applications. Long-term and short-term features are extracted relative to each frame, whereby short-term features are used to detect a potential switching point at which to switch a coder operating mode, and long-term features are used to classify each frame and validate the potential switch at the potential switch point according to the classification and a predefined criterion.
    Type: Grant
    Filed: November 29, 2001
    Date of Patent: August 31, 2004
    Assignee: Microsoft Corporation
    Inventors: Hosam Adel Khalil, Vladimir Cuperman, Tian Wang
  • Patent number: 6658383
    Abstract: The present invention provides a transform coding method efficient for music signals that is suitable for use in a hybrid codec, whereby a common Linear Predictive (LP) synthesis filter is employed for both speech and music signals. The LP synthesis filter switches between a speech excitation generator and a transform excitation generator, in accordance with the coding of a speech or music signal, respectively. For coding speech signals, the conventional CELP technique may be used, while a novel asymmetrical overlap-add transform technique is applied for coding music signals. In performing the common LP synthesis filtering, interpolation of the LP coefficients is conducted for signals in overlap-add operation regions. The invention enables smooth transitions when the decoder switches between speech and music decoding modes.
    Type: Grant
    Filed: June 26, 2001
    Date of Patent: December 2, 2003
    Assignee: Microsoft Corporation
    Inventors: Kazuhito Koishida, Vladimir Cuperman, Amir H. Majidimehr, Allen Gersho
  • Patent number: 6647366
    Abstract: A method and a system are provided for controlling the coding rates of a multimode coding system with respect to a sequence of input audio signal frames. The method eliminates or minimizes the overflow and underflow of a bit-stream buffer maintained by the coding system for temporarily recording bit-stream data prior to transmission or storage.
    Type: Grant
    Filed: December 28, 2001
    Date of Patent: November 11, 2003
    Assignee: Microsoft Corporation
    Inventors: Tian Wang, Kazuhito Koishida, Vladimir Cuperman
  • Patent number: 6625226
    Abstract: A variable bit rate coder, and an associated method, for encoding a frame of speech, such as frames of data generated during operation of a communication station operable in a cellular communication system. Selection of the coding rate is made responsive to indicia of actual coding performance of a coder at more than one coding rate.
    Type: Grant
    Filed: December 3, 1999
    Date of Patent: September 23, 2003
    Inventors: Allen Gersho, Vladimir Cuperman, Jan Linden, Ajit V. Rao, Sassan Ahmadi, Fenghua Liu, Ryan Heidari
  • Publication number: 20030125932
    Abstract: A method and a system are provided for controlling the coding rates of a multimode coding system with respect to a sequence of input audio signal frames. The method eliminates or minimizes the overflow and underflow of a bit-stream buffer maintained by the coding system for temporarily recording bit-stream data prior to transmission or storage.
    Type: Application
    Filed: December 28, 2001
    Publication date: July 3, 2003
    Applicant: Microsoft Corporation
    Inventors: Tian Wang, Kazuhito Koishida, Vladimir Cuperman
  • Publication number: 20030101050
    Abstract: An efficient and accurate classification method for classifying speech and music signals, or other diverse signal types, is provided. The method and system are especially, although not exclusively, suited for use in real-time applications. Long-term and short-term features are extracted relative to each frame, whereby short-term features are used to detect a potential switching point at which to switch a coder operating mode, and long-term features are used to classify each frame and validate the potential switch at the potential switch point according to the classification and a predefined criterion.
    Type: Application
    Filed: November 29, 2001
    Publication date: May 29, 2003
    Applicant: Microsoft Corporation
    Inventors: Hosam Adel Khalil, Vladimir Cuperman, Tian Wang
  • Publication number: 20030009326
    Abstract: A method and system of performing postfiltering in the frequency domain to improve the quality of a speech signal, especially for synthesized speech resulting from codecs of low bit-rate, is provided. The method comprises LPC tilt computation and compensation methods and modules, a formant filter gain computation method and module, and an anti-aliasing method and module. The formant filter gain calculation employs an LPC representation, an all-pole modeling, a non-linear transformation and a phase computation. The LPC used for deriving the postfilter may be transmitted from an encoder or may be estimated from a synthesized or other speech signal in a decoder or receiver. The invention may be implemented in a linked decoder and encoder. A separate LPC evaluation unit that is responsible for processing and or deriving the LPC may be implemented within the invention.
    Type: Application
    Filed: June 29, 2001
    Publication date: January 9, 2003
    Applicant: Microsoft Corporation
    Inventors: Hong Wang, Vladimir Cuperman, Allen Gersho, Hosam A. Khalil
  • Publication number: 20030004711
    Abstract: The present invention provides a transform coding method efficient for music signals that is suitable for use in a hybrid codec, whereby a common Linear Predictive (LP) synthesis filter is employed for both speech and music signals. The LP synthesis filter switches between a speech excitation generator and a transform excitation generator, in accordance with the coding of a speech or music signal, respectively. For coding speech signals, the conventional CELP technique may be used, while a novel asymmetrical overlap-add transform technique is applied for coding music signals. In performing the common LP synthesis filtering, interpolation of the LP coefficients is conducted for signals in overlap-add operation regions. The invention enables smooth transitions when the decoder switches between speech and music decoding modes.
    Type: Application
    Filed: June 26, 2001
    Publication date: January 2, 2003
    Applicant: Microsoft Corporation
    Inventors: Kazuhito Koishida, Vladimir Cuperman, Amir H. Majidimehr, Allen Gersho
  • Patent number: 6475245
    Abstract: A method and apparatus for encoding speech for communication to a decoder for reproduction of the speech where the speech signal is classified into steady state voiced (harmonic), stationary unvoiced, and “transitory” or “transition” speech, and a particular type of coding scheme is used for each class. Harmonic coding is used for steady state voiced speech, “noise-like” coding is used for stationary unvoiced speech, and a special coding mode is used for transition speech, designed to capture the location, the structure, and the strength of the local time events that characterize the transition portions of the speech. The compression schemes can be applied to the speech signal or to the LP residual signal.
    Type: Grant
    Filed: February 5, 2001
    Date of Patent: November 5, 2002
    Assignee: The Regents of the University of California
    Inventors: Allen Gersho, Eyal Shlomot, Vladimir Cuperman, Chunyan Li
  • Patent number: 6311154
    Abstract: A speech coder and a method for speech coding wherein the speech signal is represented by an excitation signal applied to a synthesis filter. The speech is partitioned into frames and subframes. A classifier identifies which of several categories the speech frame belongs to, and a different coding method is applied to represent the excitation for each category. For some categories, one or more windows are identified for the frame where all or most of the excitation signal samples are assigned by a coding scheme. Performance is enhanced by coding the important segments of the excitation more accurately. The window locations are determined from a linear prediction residual by identifying peaks of the smoothed residual energy contour. The method adjusts the frame and subframe boundaries so that each window is located entirely within a modified subframe or frame.
    Type: Grant
    Filed: December 30, 1998
    Date of Patent: October 30, 2001
    Assignee: Nokia Mobile Phones Limited
    Inventors: Allen Gersho, Vladimir Cuperman, Ajit V Rao, Tung-Chiang Yang, Sassan Ahmadi, Fenghua Liu
  • Publication number: 20010023396
    Abstract: A method and apparatus for encoding speech for communication to a decoder for reproduction of the speech where the speech signal is classified into steady state voiced (harmonic), stationary unvoiced, and “transitory” or “transition” speech, and a particular type of coding scheme is used for each class. Harmonic coding is used for steady state voiced speech, “noise-like” coding is used for stationary unvoiced speech, and a special coding mode is used for transition speech, designed to capture the location, the structure, and the strength of the local time events that characterize the transition portions of the speech. The compression schemes can be applied to the speech signal or to the LP residual signal.
    Type: Application
    Filed: February 5, 2001
    Publication date: September 20, 2001
    Inventors: Allen Gersho, Eyal Shlomot, Vladimir Cuperman, Chunyan Li
  • Patent number: 6233550
    Abstract: A method and apparatus for encoding speech for communication to a decoder for reproduction of the speech where the speech signal is classified into steady state voiced (harmonic), stationary unvoiced, and “transitory” or “transition” speech, and a particular type of coding scheme is used for each class. Harmonic coding is used for steady state voiced speech, “noise-like” coding is used for stationary unvoiced speech, and a special coding mode is used for transition speech, designed to capture the location, the structure, and the strength of the local time events that characterize the transition portions of the speech. The compression schemes can be applied to the speech signal or to the LP residual signal.
    Type: Grant
    Filed: August 28, 1998
    Date of Patent: May 15, 2001
    Assignee: The Regents of the University of California
    Inventors: Allen Gersho, Eyal Shlomot, Vladimir Cuperman, Chunyan Li