Patents by Inventor Wei-ge Chen

Wei-ge Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Constant bitrate media encoding techniques

Publication number: 20050015259

Abstract: CBR control strategies provide constant or relatively constant bitrate output with variable quality. The control strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder uses a trellis in two-pass or delayed-decision CBR encoding. The trellis nodes are states derived by quantizing buffer fullness values. The transitions between nodes of a previous stage and nodes of a current stage depend on encoding a current chunk of audio at different quality levels. When pruning the trellis, the encoder uses a cost function that considers smoothness in quality as well as quality in absolute terms. The encoder may store compressed data at different quality levels, then output the compressed data after simplification of the trellis to a suitable point. If the two-pass or delayed-decision CBR encoding fails, the encoder uses one-pass CBR encoding for the sequence or part of the sequence.

Type: Application

Filed: July 18, 2003

Publication date: January 20, 2005

Inventors: Naveen Thumpudi, Wei-Ge Chen
Entropy coding by adapting coding between level and run-length/level modes

Publication number: 20050015249

Abstract: An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.

Type: Application

Filed: August 25, 2003

Publication date: January 20, 2005

Inventors: Sanjeev Mehrotra, Wei-ge Chen
Techniques for quantization of spectral data in transcoding

Publication number: 20040225506

Abstract: A transcoder reduces excess requantization error in quantization of spectral data. The transcoder phase shifts data decompressed by a decompressor. The phase shifting causes a change to corresponding spectral data produced in later transform coding of the decompressed data. When the spectral data is then quantized to reduce bitrate, the earlier phase shifting reduces excess requantization error. After transcoding, a second decompressor can compensate for the phase shifting by, for example, reverse shifting by the amount of the phase shift. Instead of phase shifting, the transcoder can reduce excess requantization error by, for example, adding random noise to the decompressed data or changing transform block sizes.

Type: Application

Filed: June 15, 2004

Publication date: November 11, 2004

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Ming-Chieh Lee
Techniques for quantization of spectral data in transcoding

Patent number: 6757648

Abstract: A transcoder reduces excess requantization error in quantization of spectral data. The transcoder phase shifts data decompressed by a decompressor. The phase shifting causes a change to corresponding spectral data produced in later transform coding of the decompressed data. When the spectral data is then quantized to reduce bitrate, the earlier phase shifting reduces excess requantization error. After transcoding, a second decompressor can compensate for the phase shifting by, for example, reverse shifting by the amount of the phase shift. Instead of phase shifting, the transcoder can reduce excess requantization error by, for example, adding random noise to the decompressed data or changing transform block sizes.

Type: Grant

Filed: June 28, 2001

Date of Patent: June 29, 2004

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Ming-Chieh Lee
Multi-channel audio encoding and decoding

Publication number: 20040049379

Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.

Type: Application

Filed: August 15, 2003

Publication date: March 11, 2004

Applicant: Microsoft Corporation

Inventors: Naveen Thumpudi, Wei-Ge Chen
Quantization and inverse quantization for audio

Publication number: 20040044527

Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of quantization (e.g., weighting) and inverse quantization (e.g., inverse weighting) in audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder quantizes audio data in multiple channels, applying multiple channel-specific quantizer step modifiers, which give the encoder more control over balancing reconstruction quality between channels. The encoder also applies multiple quantization matrices and varies the resolution of the quantization matrices, which allows the encoder to use more resolution if overall quality is good and use less resolution if overall quality is poor. Finally, the encoder compresses one or more quantization matrices using temporal prediction to reduce the bitrate associated with the quantization matrices. An audio decoder performs corresponding inverse processing and decoding.

Type: Application

Filed: August 15, 2003

Publication date: March 4, 2004

Applicant: Microsoft Corporation

Inventors: Naveen Thumpudi, Wei-Ge Chen
Innovations in pure lossless audio compression

Publication number: 20040044534

Abstract: A lossless audio compression scheme is adapted for use in a unified lossy and lossless audio compression scheme. In the lossless compression, the adaptation rate of an adaptive filter is varied based on transient detection, such as increasing the adaptation rate where a transient is detected. A multi-channel lossless compression uses an adaptive filter that processes samples from multiple channels in predictive coding a current sample in a current channel. The lossless compression also encodes using an adaptive filter and Golomb coding with non-power of two divisor.

Type: Application

Filed: July 14, 2003

Publication date: March 4, 2004

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Chao He
Unified lossy and lossless audio compression

Publication number: 20040044521

Abstract: A unified lossy and lossless audio compression scheme combines lossy and lossless audio compression within a same audio signal. This approach employs mixed lossless coding of a transition frame between lossy and lossless coding frames to produce seamless transitions. The mixed lossless coding performs a lapped transform and inverse lapped transform to produce an appropriately windowed and folded pseudo-time domain frame, which can then be losslessly coded. The mixed lossless coding also can be applied for frames that exhibit poor lossy compression performance.

Type: Application

Filed: July 14, 2003

Publication date: March 4, 2004

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Chao He
Mixed lossless audio compression

Publication number: 20040044520

Abstract: A mixed lossless audio compression has application to a unified lossy and lossless audio compression scheme that combines lossy and lossless audio compression within a same audio signal. The mixed lossless compression codes a transition frame between lossy and lossless coding frames to produce seamless transitions. The mixed lossless coding performs a lapped transform and inverse lapped transform to produce an appropriately windowed and folded pseudo-time domain frame, which can then be losslessly coded. The mixed lossless coding also can be applied for frames that exhibit poor lossy compression performance.

Type: Application

Filed: July 14, 2003

Publication date: March 4, 2004

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Chao He
Adaptive window-size selection in transform coding

Publication number: 20030115052

Abstract: A transform coder adaptively configures window sizes for transform coding in a two-pass process to maximize coding efficiency, while achieving necessary time resolution to avoid pre-echo. In a first pass, the coder places small size windows over detected transient regions of an input signal in an open-loop window configuration process. In a second pass, the coder adjusts the window size configuration according to measurements of the achieved quality in a closed-loop window configuration process. Where quality measurement shows unacceptable quantization noise, the coder increases window size. Where pre-echo is detected, the coder reduces window size within coding bit rate constraints.

Type: Application

Filed: December 14, 2001

Publication date: June 19, 2003

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Quantization matrices for digital audio

Publication number: 20030115051

Abstract: Quantization matrices facilitate digital audio encoding and decoding. An audio encoder generates and compresses quantization matrices; an audio decoder decompresses and applies the quantization matrices. The invention includes several techniques and tools, which can be used in combination or separately. For example, the audio encoder can generate quantization matrices from critical band patterns for blocks of audio data. The encoder can compute the quantization matrices directly from the critical band patterns, which can be computed from the same audio data that is being compressed. The audio encoder/decoder can use different modes for generating/applying quantization matrices depending on the coding channel mode of multi-channel audio data. The audio encoder/decoder can use different compression/decompression modes for the quantization matrices, including a parametric compression/decompression mode.

Type: Application

Filed: December 14, 2001

Publication date: June 19, 2003

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Quality and rate control strategy for digital audio

Publication number: 20030115050

Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.

Type: Application

Filed: December 14, 2001

Publication date: June 19, 2003

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Quality improvement techniques in an audio encoder

Publication number: 20030115041

Abstract: An audio encoder implements multi-channel coding decision, band truncation, multi-channel rematrixing, and header reduction techniques to improve quality and coding efficiency. In the multi-channel coding decision technique, the audio encoder dynamically selects between joint and independent coding of a multi-channel audio signal via an open-loop decision based upon (a) energy separation between the coding channels, and (b) the disparity between excitation patterns of the separate input channels. In the band truncation technique, the audio encoder performs open-loop band truncation at a cut-off frequency based on a target perceptual quality measure. In multi-channel rematrixing technique, the audio encoder suppresses certain coefficients of a difference channel by scaling according to a scale factor, which is based on current average levels of perceptual quality, current rate control buffer fullness, coding mode, and the amount of channel separation in the source.

Type: Application

Filed: December 14, 2001

Publication date: June 19, 2003

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Techniques for measurement of perceptual audio quality

Publication number: 20030115042

Abstract: An audio processing tool measures the quality of reconstructed audio data. For example, an audio encoder measures the quality of a block of reconstructed frequency coefficient data in a quantization loop. The invention includes several techniques and tools, which can be used in combination or separately. First, before measuring quality, the tool normalizes the block to account for variation in block sizes. Second, for the quality measurement, the tool processes the reconstructed data by critical bands, which can differ from the quantization bands used to compress the data. Third, the tool accounts for the masking effect of the reconstructed data, not just the masking effect of the original data. Fourth, the tool band weights the quality measurement, which can be used to account for noise substitution or band truncation. Finally, the tool changes quality measurement techniques depending on the channel coding mode.

Type: Application

Filed: December 14, 2001

Publication date: June 19, 2003

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Techniques for quantization of spectral data in transcoding

Publication number: 20030028371

Abstract: A transcoder reduces excess requantization error in quantization of spectral data. The transcoder phase shifts data decompressed by a decompressor. The phase shifting causes a change to corresponding spectral data produced in later transform coding of the decompressed data. When the spectral data is then quantized to reduce bitrate, the earlier phase shifting reduces excess requantization error. After transcoding, a second decompressor can compensate for the phase shifting by, for example, reverse shifting by the amount of the phase shift. Instead of phase shifting, the transcoder can reduce excess requantization error by, for example, adding random noise to the decompressed data or changing transform block sizes.

Type: Application

Filed: June 28, 2001

Publication date: February 6, 2003

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Ming-Chieh Lee
Code book construction for variable to variable length entropy encoding

Patent number: 6404931

Abstract: A method of constructing a code book for groupings of symbols drawn from an alphabet, in which variable-sized groups of symbols are each assigned a variable length code based on probability of occurrence of symbol groupings. Code book entries are added by tentatively extending the high probability groupings with symbols from the alphabet. Code book size is restrained by identification of identify high probability symbol groupings, such that low probability groupings are combined into a single code book entry. Probability of occurrence for each entry is tracked. Extension and combination is repeated until a code book of predetermined size is reached. Each code book entry is assigned an entropy-type code according to the probability associated with each book entry.

Type: Grant

Filed: December 14, 1998

Date of Patent: June 11, 2002

Assignee: Microsoft Corporation

Inventors: Wei-ge Chen, Ming-Chieh Lee
Variable to variable length entropy encoding

Patent number: 6377930

Abstract: Entropy encoding and decoding of data with a code book containing variable length entropy-type codes that are assigned to variable length input symbol groupings. The variable length input sequences are identified by scanning an input channel, such as a live broadcast, non-volatile data storage, or network connection (e.g., LAN, WAN, Internet). Each time a symbol grouping is recognized, a corresponding entropy-type code is output as a replacement for the input stream. Decoding is the inverse process of encoding, where a code word is looked up in the code book and the corresponding original input is obtained.

Type: Grant

Filed: December 14, 1998

Date of Patent: April 23, 2002

Assignee: Microsoft Corporation

Inventors: Wei-ge Chen, Ming-Chieh Lee
Entrophy code mode switching for frequency-domain audio coding

Patent number: 6300888

Abstract: A frequency-domain audio coder selects among different entropy coding modes according to characteristics of an input stream. In particular, the input stream is partitioned into frequency ranges according to some statistical criteria derived from a statistical analysis of typical or actual input to be encoded. Each range is assigned an entropy encoder optimized to encode that range's type of data. During encoding and decoding, a mode selector applies the correct entropy method to the different frequency ranges. Partition boundaries can be decided in advance, allowing the decoder to implicitly know which decoding method to apply to encoded data. Or, adaptive arrangements may be used, in which boundaries are flagged in the output stream by indicating a change in encoding mode for subsequent data. For example, one can create a partition boundary which separates out primarily zero quantized frequency coefficients, from primarily non-zero quantized coefficients, and then apply a coder optimized for such data.

Type: Grant

Filed: December 14, 1998

Date of Patent: October 9, 2001

Assignee: Microsoft Corporation

Inventors: Wei-ge Chen, Ming-Chieh Lee
Multi-level run length coding for frequency-domain audio coding

Patent number: 6223162

Abstract: A technique for entropy coding information relating to frequency domain audio coefficients. For portions of a frequency spectrum having a predominate value of zero, a multi-level run length encoder statistically correlates sequences of zero values with non-zero values and assigns variable length code words. An encoder uses a specialized code book generated with respect to the probability of receiving an input sequence of zero-valued spectral coefficients followed by a non-zero coefficient. A corresponding decoder associates a variable length code word with a sequence of zero value coefficients adjacent a non-zero value coefficient.

Type: Grant

Filed: December 14, 1998

Date of Patent: April 24, 2001

Assignee: Microsoft Corporation

Inventors: Wei-ge Chen, Ming-Chieh Lee
Morphological pure speech detection using valley percentage

Patent number: 6205422

Abstract: A human speech detection method detects pure-speech signals in an audio signal containing a mixture of pure-speech and non-speech or mixed-speech signals. The method accurately detects the pure-speech signals by computing a novel Valley Percentage feature from the audio signal and then classifying the audio signals into pure-speech and non-speech (or mixed-speech) classifications. The Valley Percentage is a measurement of the low energy parts of the audio signal (the valley) in comparison to the high energy parts of the audio signal (the mountain). To classify the audio signal, the method performs a threshold decision on the value of the Valley Percentage. Using a binary mask, a high Valley Percentage is classified as pure-speech and a low Valley Percentage is classified as non-speech (or mixed-speech). The method further employs morphological filters to improve the accuracy of human speech detection.

Type: Grant

Filed: November 30, 1998

Date of Patent: March 20, 2001

Assignee: Microsoft Corporation

Inventors: Chuang Gu, Ming-Chieh Lee, Wei-ge Chen

prev … 6 7 8 9 10 11 next