Patents by Inventor Wei-ge Chen

Wei-ge Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20080319739
    Abstract: A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.
    Type: Application
    Filed: June 22, 2007
    Publication date: December 25, 2008
    Applicant: Microsoft Corporation
    Inventors: Sanjeev Mehrotra, Wei-Ge Chen
  • Publication number: 20080312758
    Abstract: An audio encoder/decoder provides efficient compression of spectral transform coefficient data characterized by sparse spectral peaks. The audio encoder/decoder applies a temporal prediction of the frequency position of spectral peaks. The spectral peaks in the transform coefficients that are predicted from those in a preceding transform coding block are encoded as a shift in frequency position from the previous transform coding block and two non-zero coefficient levels. The prediction may avoid coding very large zero-level transform coefficient runs as compared to conventional run length coding. For spectral peaks not predicted from those in a preceding transform coding block, the spectral peaks are encoded as a value trio of a length of a run of zero-level spectral transform coefficients, and two non-zero coefficient levels.
    Type: Application
    Filed: June 15, 2007
    Publication date: December 18, 2008
    Applicant: Microsoft Corporation
    Inventors: Kazuhito Koishida, Sanjeev Mehrotra, Wei-Ge Chen
  • Publication number: 20080312759
    Abstract: An audio encoder/decoder performs band partitioning for vector quantization encoding of spectral holes and missing high frequencies that result from quantization when encoding at low bit rates. The encoder/decoder determines a band structure for spectral holes based on two threshold parameters: a minimum hole size threshold and a maximum band size threshold. Spectral holes wider than the minimum hole size threshold are partitioned evenly into bands not exceeding the maximum band size threshold in size. Such hole filling bands are configured up to a preset number of hole filling bands. The bands for missing high frequencies are then configured by dividing the high frequency region into bands having binary-increasing, linearly-increasing or arbitrarily-configured band sizes up to a maximum overall number of bands.
    Type: Application
    Filed: June 15, 2007
    Publication date: December 18, 2008
    Applicant: Microsoft Corporation
    Inventors: Kazuhito Koishida, Sanjeev Mehrotra, Wei-Ge Chen
  • Patent number: 7460993
    Abstract: A transform coder adaptively configures window sizes for transform coding in a two-pass process to maximize coding efficiency, while achieving necessary time resolution to avoid pre-echo. In a first pass, the coder places small size windows over detected transient regions of an input signal in an open-loop window configuration process. In a second pass, the coder adjusts the window size configuration according to measurements of the achieved quality in a closed-loop window configuration process. Where quality measurement shows unacceptable quantization noise, the coder increases window size. Where pre-echo is detected, the coder reduces window size within coding bit rate constraints.
    Type: Grant
    Filed: December 14, 2001
    Date of Patent: December 2, 2008
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
  • Patent number: 7460990
    Abstract: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.
    Type: Grant
    Filed: June 29, 2004
    Date of Patent: December 2, 2008
    Assignee: Microsoft Corporation
    Inventors: Sanjeev Mehrotra, Wei-Ge Chen
  • Patent number: 7454332
    Abstract: A gain-constrained noise suppression for speech more precisely estimates noise, including during speech, to reduce musical noise artifacts introduced from noise suppression. The noise suppression operates by applying a spectral gain G(m, k) to each short-time spectrum value S(m, k) of a speech signal, where m is the frame number and k is the spectrum index. The spectrum values are grouped into frequency bins, and a noise characteristic estimated for each bin classified as a “noise bin.” An energy parameter is smoothed in both the time domain and the frequency domain to improve noise estimation per bin. The gain factors G(m, k) are calculated based on the current signal spectrum and the noise estimation, then smoothed before being applied to the signal spectral values S(m, k).
    Type: Grant
    Filed: June 15, 2004
    Date of Patent: November 18, 2008
    Assignee: Microsoft Corporation
    Inventors: Kazuhito Koishida, Feng Zhuge, Hosam A. Khalil, Tian Wang, Wei-ge Chen
  • Publication number: 20080262855
    Abstract: An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.
    Type: Application
    Filed: May 16, 2008
    Publication date: October 23, 2008
    Applicant: Microsoft Corporation
    Inventors: Sanjeev Mehrotra, Wei-Ge Chen
  • Publication number: 20080259828
    Abstract: A communication end device of a two-way communication system is shown. The device includes an audio signal capture device for capturing local audio to be transmitted to another end device, an audio signal rendering device for playing remote audio received from the other end device, and buffers for buffering the captured and rendered audio signals. The device also includes an audio echo canceller operating to predict echo from the rendered audio signal at a calculated relative offset in the captured audio signal based on an adaptive filter, and subtract the predicted echo from the signal transmitted to the other end device The calculated relative offset that is used by the audio echo canceller for a current signal sample is adjusted if a difference between it and an adjusted relative offset of a preceding sample exceeds a threshold value.
    Type: Application
    Filed: April 23, 2007
    Publication date: October 23, 2008
    Applicant: Microsoft Corporation
    Inventors: Chao He, Qin Li, Wei-ge Chen
  • Publication number: 20080253553
    Abstract: A communications device is presented for providing bi-directional audio communications between a near-end user and a far-end user via a bidirectional communications channel. The communications device includes an adaptive echo canceller receiving a near-end audio signal and a far-end audio signal and providing an echo-canceled near-end audio signal for transmission to the far-end user via the communications channel. The adaptive echo canceller includes a first bank of analysis filters for filtering the near-end audio signal, a second bank of analysis filters for filtering the far-end audio signal, and a bank of synthesis filters for filtering sub-band echo-canceled signals generated within the adaptive echo canceller. The first and second filter banks have a frequency response optimized to reduce echo residual gain.
    Type: Application
    Filed: April 10, 2007
    Publication date: October 16, 2008
    Applicant: Microsoft Corporation
    Inventors: Qin Li, Wei-ge Chen, Chao He
  • Patent number: 7433824
    Abstract: An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.
    Type: Grant
    Filed: August 25, 2003
    Date of Patent: October 7, 2008
    Assignee: Microsoft Corporation
    Inventors: Sanjeev Mehrotra, Wei-ge Chen
  • Publication number: 20080228476
    Abstract: An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.
    Type: Application
    Filed: May 27, 2008
    Publication date: September 18, 2008
    Applicant: Microsoft Corporation
    Inventors: Sanjeev Mehrotra, Wei-ge Chen
  • Publication number: 20080221908
    Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.
    Type: Application
    Filed: May 15, 2008
    Publication date: September 11, 2008
    Applicant: Microsoft Corporation
    Inventors: Naveen Thumpudi, Wei-Ge Chen
  • Patent number: 7424434
    Abstract: A unified lossy and lossless audio compression scheme combines lossy and lossless audio compression within a same audio signal. This approach employs mixed lossless coding of a transition frame between lossy and lossless coding frames to produce seamless transitions. The mixed lossless coding performs a lapped transform and inverse lapped transform to produce an appropriately windowed and folded pseudo-time domain frame, which can then be losslessly coded. The mixed lossless coding also can be applied for frames that exhibit poor lossy compression performance.
    Type: Grant
    Filed: July 14, 2003
    Date of Patent: September 9, 2008
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Chao He
  • Publication number: 20080212809
    Abstract: A surround experience for headphones can be created using various techniques and tools. The described techniques and tools can be applied separately or in combination. For example, a surround experience can be created by splitting input audio channel signals into high-pass and low-pass signals. The low-pass signals can undergo cross-mixing and artificial reverberation. The artificial reverberation can apply different delay profiles to the low-pass audio signals. The high-pass and low-pass audio signals can be combined to generate output audio channel signals. A surround experience can be crated using a system comprising an input module, a crossover module, a channel cross-mixing module, a reverberation module, and an adder module.
    Type: Application
    Filed: June 29, 2007
    Publication date: September 4, 2008
    Applicant: Microsoft Corporation
    Inventors: Serge Smirnov, Chris Messer, Wei-Ge Chen
  • Patent number: 7383180
    Abstract: CBR control strategies provide constant or relatively constant bitrate output with variable quality. The control strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder uses a trellis in two-pass or delayed-decision CBR encoding. The trellis nodes are states derived by quantizing buffer fullness values. The transitions between nodes of a previous stage and nodes of a current stage depend on encoding a current chunk of audio at different quality levels. When pruning the trellis, the encoder uses a cost function that considers smoothness in quality as well as quality in absolute terms. The encoder may store compressed data at different quality levels, then output the compressed data after simplification of the trellis to a suitable point. If the two-pass or delayed-decision CBR encoding fails, the encoder uses one-pass CBR encoding for the sequence or part of the sequence.
    Type: Grant
    Filed: July 18, 2003
    Date of Patent: June 3, 2008
    Assignee: Microsoft Corporation
    Inventors: Naveen Thumpudi, Wei-Ge Chen
  • Publication number: 20080109230
    Abstract: An encoder uses multi-pass VBR control strategies to provide constant or relatively constant quality for VBR output while guaranteeing (within tolerance) either compressed file size or, equivalently, overall average bitrate. The control strategies include various techniques and tools, which can be used in combination or independently. For example, in a first pass, an audio encoder encodes a sequence of audio data partitioned into variable-size chunks. In a second pass, the encoder encodes the sequence according to control parameters to produce output of relatively constant quality. The encoder sets checkpoints in the second pass to adjust the control parameters and/or subsequent checkpoints. The encoder selectively considers a peak bitrate constraint to limit peak bitrate. The encoder stores auxiliary information from the first pass for use in the second pass, which increases the speed of the second pass. Finally, the encoder compares signatures for the input data to check consistency between passes.
    Type: Application
    Filed: December 21, 2007
    Publication date: May 8, 2008
    Applicant: Microsoft Corporation
    Inventors: Naveen Thumpudi, Wei-Ge Chen
  • Patent number: 7343291
    Abstract: An encoder uses multi-pass VBR control strategies to provide constant or relatively constant quality for VBR output while guaranteeing (within tolerance) either compressed file size or, equivalently, overall average bitrate. The control strategies include various techniques and tools, which can be used in combination or independently. For example, in a first pass, an audio encoder encodes a sequence of audio data partitioned into variable-size chunks. In a second pass, the encoder encodes the sequence according to control parameters to produce output of relatively constant quality. The encoder sets checkpoints in the second pass to adjust the control parameters and/or subsequent checkpoints. The encoder selectively considers a peak bitrate constraint to limit peak bitrate. The encoder stores auxiliary information from the first pass for use in the second pass, which increases the speed of the second pass. Finally, the encoder compares signatures for the input data to check consistency between passes.
    Type: Grant
    Filed: July 18, 2003
    Date of Patent: March 11, 2008
    Assignee: Microsoft Corporation
    Inventors: Naveen Thumpudi, Wei-Ge Chen
  • Patent number: 7340394
    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.
    Type: Grant
    Filed: October 26, 2005
    Date of Patent: March 4, 2008
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
  • Publication number: 20080040105
    Abstract: Techniques and tools related to coding and decoding of audio information are described. For example, redundant coded information for decoding a current frame includes signal history information associated with only a portion of a previous frame. As another example, redundant coded information for decoding a coded unit includes parameters for a codebook stage to be used in decoding the current coded unit only if the previous coded unit is not available. As yet another example, coded audio units each include a field indicating whether the coded unit includes main encoded information representing a segment of an audio signal, and whether the coded unit includes redundant coded information for use in decoding main encoded information.
    Type: Application
    Filed: October 9, 2007
    Publication date: February 14, 2008
    Applicant: Microsoft Corporation
    Inventors: Tian Wang, Kazuhito Koishida, Hosam Khalil, Xiaoqin Sun, Wei-Ge Chen
  • Publication number: 20080040121
    Abstract: Techniques and tools related to coding and decoding of audio information are described. For example, redundant coded information for decoding a current frame includes signal history information associated with only a portion of a previous frame. As another example, redundant coded information for decoding a coded unit includes parameters for a codebook stage to be used in decoding the current coded unit only if the previous coded unit is not available. As yet another example, coded audio units each include a field indicating whether the coded unit includes main encoded information representing a segment of an audio signal, and whether the coded unit includes redundant coded information for use in decoding main encoded information.
    Type: Application
    Filed: October 9, 2007
    Publication date: February 14, 2008
    Applicant: Microsoft Corporation
    Inventors: Tian Wang, Kazuhito Koishida, Hosam Khalil, Xiaoqin Sun, Wei-Ge Chen