Patents by Inventor Wei-ge Chen
Wei-ge Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20050015259Abstract: CBR control strategies provide constant or relatively constant bitrate output with variable quality. The control strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder uses a trellis in two-pass or delayed-decision CBR encoding. The trellis nodes are states derived by quantizing buffer fullness values. The transitions between nodes of a previous stage and nodes of a current stage depend on encoding a current chunk of audio at different quality levels. When pruning the trellis, the encoder uses a cost function that considers smoothness in quality as well as quality in absolute terms. The encoder may store compressed data at different quality levels, then output the compressed data after simplification of the trellis to a suitable point. If the two-pass or delayed-decision CBR encoding fails, the encoder uses one-pass CBR encoding for the sequence or part of the sequence.Type: ApplicationFiled: July 18, 2003Publication date: January 20, 2005Inventors: Naveen Thumpudi, Wei-Ge Chen
-
Publication number: 20050015249Abstract: An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.Type: ApplicationFiled: August 25, 2003Publication date: January 20, 2005Inventors: Sanjeev Mehrotra, Wei-ge Chen
-
Publication number: 20040225506Abstract: A transcoder reduces excess requantization error in quantization of spectral data. The transcoder phase shifts data decompressed by a decompressor. The phase shifting causes a change to corresponding spectral data produced in later transform coding of the decompressed data. When the spectral data is then quantized to reduce bitrate, the earlier phase shifting reduces excess requantization error. After transcoding, a second decompressor can compensate for the phase shifting by, for example, reverse shifting by the amount of the phase shift. Instead of phase shifting, the transcoder can reduce excess requantization error by, for example, adding random noise to the decompressed data or changing transform block sizes.Type: ApplicationFiled: June 15, 2004Publication date: November 11, 2004Applicant: Microsoft CorporationInventors: Wei-Ge Chen, Ming-Chieh Lee
-
Patent number: 6757648Abstract: A transcoder reduces excess requantization error in quantization of spectral data. The transcoder phase shifts data decompressed by a decompressor. The phase shifting causes a change to corresponding spectral data produced in later transform coding of the decompressed data. When the spectral data is then quantized to reduce bitrate, the earlier phase shifting reduces excess requantization error. After transcoding, a second decompressor can compensate for the phase shifting by, for example, reverse shifting by the amount of the phase shift. Instead of phase shifting, the transcoder can reduce excess requantization error by, for example, adding random noise to the decompressed data or changing transform block sizes.Type: GrantFiled: June 28, 2001Date of Patent: June 29, 2004Assignee: Microsoft CorporationInventors: Wei-Ge Chen, Ming-Chieh Lee
-
Publication number: 20040049379Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.Type: ApplicationFiled: August 15, 2003Publication date: March 11, 2004Applicant: Microsoft CorporationInventors: Naveen Thumpudi, Wei-Ge Chen
-
Publication number: 20040044527Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of quantization (e.g., weighting) and inverse quantization (e.g., inverse weighting) in audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder quantizes audio data in multiple channels, applying multiple channel-specific quantizer step modifiers, which give the encoder more control over balancing reconstruction quality between channels. The encoder also applies multiple quantization matrices and varies the resolution of the quantization matrices, which allows the encoder to use more resolution if overall quality is good and use less resolution if overall quality is poor. Finally, the encoder compresses one or more quantization matrices using temporal prediction to reduce the bitrate associated with the quantization matrices. An audio decoder performs corresponding inverse processing and decoding.Type: ApplicationFiled: August 15, 2003Publication date: March 4, 2004Applicant: Microsoft CorporationInventors: Naveen Thumpudi, Wei-Ge Chen
-
Publication number: 20040044534Abstract: A lossless audio compression scheme is adapted for use in a unified lossy and lossless audio compression scheme. In the lossless compression, the adaptation rate of an adaptive filter is varied based on transient detection, such as increasing the adaptation rate where a transient is detected. A multi-channel lossless compression uses an adaptive filter that processes samples from multiple channels in predictive coding a current sample in a current channel. The lossless compression also encodes using an adaptive filter and Golomb coding with non-power of two divisor.Type: ApplicationFiled: July 14, 2003Publication date: March 4, 2004Applicant: Microsoft CorporationInventors: Wei-Ge Chen, Chao He
-
Publication number: 20040044521Abstract: A unified lossy and lossless audio compression scheme combines lossy and lossless audio compression within a same audio signal. This approach employs mixed lossless coding of a transition frame between lossy and lossless coding frames to produce seamless transitions. The mixed lossless coding performs a lapped transform and inverse lapped transform to produce an appropriately windowed and folded pseudo-time domain frame, which can then be losslessly coded. The mixed lossless coding also can be applied for frames that exhibit poor lossy compression performance.Type: ApplicationFiled: July 14, 2003Publication date: March 4, 2004Applicant: Microsoft CorporationInventors: Wei-Ge Chen, Chao He
-
Publication number: 20040044520Abstract: A mixed lossless audio compression has application to a unified lossy and lossless audio compression scheme that combines lossy and lossless audio compression within a same audio signal. The mixed lossless compression codes a transition frame between lossy and lossless coding frames to produce seamless transitions. The mixed lossless coding performs a lapped transform and inverse lapped transform to produce an appropriately windowed and folded pseudo-time domain frame, which can then be losslessly coded. The mixed lossless coding also can be applied for frames that exhibit poor lossy compression performance.Type: ApplicationFiled: July 14, 2003Publication date: March 4, 2004Applicant: Microsoft CorporationInventors: Wei-Ge Chen, Chao He
-
Publication number: 20030115052Abstract: A transform coder adaptively configures window sizes for transform coding in a two-pass process to maximize coding efficiency, while achieving necessary time resolution to avoid pre-echo. In a first pass, the coder places small size windows over detected transient regions of an input signal in an open-loop window configuration process. In a second pass, the coder adjusts the window size configuration according to measurements of the achieved quality in a closed-loop window configuration process. Where quality measurement shows unacceptable quantization noise, the coder increases window size. Where pre-echo is detected, the coder reduces window size within coding bit rate constraints.Type: ApplicationFiled: December 14, 2001Publication date: June 19, 2003Applicant: Microsoft CorporationInventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
-
Publication number: 20030115051Abstract: Quantization matrices facilitate digital audio encoding and decoding. An audio encoder generates and compresses quantization matrices; an audio decoder decompresses and applies the quantization matrices. The invention includes several techniques and tools, which can be used in combination or separately. For example, the audio encoder can generate quantization matrices from critical band patterns for blocks of audio data. The encoder can compute the quantization matrices directly from the critical band patterns, which can be computed from the same audio data that is being compressed. The audio encoder/decoder can use different modes for generating/applying quantization matrices depending on the coding channel mode of multi-channel audio data. The audio encoder/decoder can use different compression/decompression modes for the quantization matrices, including a parametric compression/decompression mode.Type: ApplicationFiled: December 14, 2001Publication date: June 19, 2003Applicant: Microsoft CorporationInventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
-
Publication number: 20030115050Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.Type: ApplicationFiled: December 14, 2001Publication date: June 19, 2003Applicant: Microsoft CorporationInventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
-
Publication number: 20030115041Abstract: An audio encoder implements multi-channel coding decision, band truncation, multi-channel rematrixing, and header reduction techniques to improve quality and coding efficiency. In the multi-channel coding decision technique, the audio encoder dynamically selects between joint and independent coding of a multi-channel audio signal via an open-loop decision based upon (a) energy separation between the coding channels, and (b) the disparity between excitation patterns of the separate input channels. In the band truncation technique, the audio encoder performs open-loop band truncation at a cut-off frequency based on a target perceptual quality measure. In multi-channel rematrixing technique, the audio encoder suppresses certain coefficients of a difference channel by scaling according to a scale factor, which is based on current average levels of perceptual quality, current rate control buffer fullness, coding mode, and the amount of channel separation in the source.Type: ApplicationFiled: December 14, 2001Publication date: June 19, 2003Applicant: Microsoft CorporationInventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
-
Publication number: 20030115042Abstract: An audio processing tool measures the quality of reconstructed audio data. For example, an audio encoder measures the quality of a block of reconstructed frequency coefficient data in a quantization loop. The invention includes several techniques and tools, which can be used in combination or separately. First, before measuring quality, the tool normalizes the block to account for variation in block sizes. Second, for the quality measurement, the tool processes the reconstructed data by critical bands, which can differ from the quantization bands used to compress the data. Third, the tool accounts for the masking effect of the reconstructed data, not just the masking effect of the original data. Fourth, the tool band weights the quality measurement, which can be used to account for noise substitution or band truncation. Finally, the tool changes quality measurement techniques depending on the channel coding mode.Type: ApplicationFiled: December 14, 2001Publication date: June 19, 2003Applicant: Microsoft CorporationInventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
-
Publication number: 20030028371Abstract: A transcoder reduces excess requantization error in quantization of spectral data. The transcoder phase shifts data decompressed by a decompressor. The phase shifting causes a change to corresponding spectral data produced in later transform coding of the decompressed data. When the spectral data is then quantized to reduce bitrate, the earlier phase shifting reduces excess requantization error. After transcoding, a second decompressor can compensate for the phase shifting by, for example, reverse shifting by the amount of the phase shift. Instead of phase shifting, the transcoder can reduce excess requantization error by, for example, adding random noise to the decompressed data or changing transform block sizes.Type: ApplicationFiled: June 28, 2001Publication date: February 6, 2003Applicant: Microsoft CorporationInventors: Wei-Ge Chen, Ming-Chieh Lee
-
Patent number: 6404931Abstract: A method of constructing a code book for groupings of symbols drawn from an alphabet, in which variable-sized groups of symbols are each assigned a variable length code based on probability of occurrence of symbol groupings. Code book entries are added by tentatively extending the high probability groupings with symbols from the alphabet. Code book size is restrained by identification of identify high probability symbol groupings, such that low probability groupings are combined into a single code book entry. Probability of occurrence for each entry is tracked. Extension and combination is repeated until a code book of predetermined size is reached. Each code book entry is assigned an entropy-type code according to the probability associated with each book entry.Type: GrantFiled: December 14, 1998Date of Patent: June 11, 2002Assignee: Microsoft CorporationInventors: Wei-ge Chen, Ming-Chieh Lee
-
Patent number: 6377930Abstract: Entropy encoding and decoding of data with a code book containing variable length entropy-type codes that are assigned to variable length input symbol groupings. The variable length input sequences are identified by scanning an input channel, such as a live broadcast, non-volatile data storage, or network connection (e.g., LAN, WAN, Internet). Each time a symbol grouping is recognized, a corresponding entropy-type code is output as a replacement for the input stream. Decoding is the inverse process of encoding, where a code word is looked up in the code book and the corresponding original input is obtained.Type: GrantFiled: December 14, 1998Date of Patent: April 23, 2002Assignee: Microsoft CorporationInventors: Wei-ge Chen, Ming-Chieh Lee
-
Patent number: 6300888Abstract: A frequency-domain audio coder selects among different entropy coding modes according to characteristics of an input stream. In particular, the input stream is partitioned into frequency ranges according to some statistical criteria derived from a statistical analysis of typical or actual input to be encoded. Each range is assigned an entropy encoder optimized to encode that range's type of data. During encoding and decoding, a mode selector applies the correct entropy method to the different frequency ranges. Partition boundaries can be decided in advance, allowing the decoder to implicitly know which decoding method to apply to encoded data. Or, adaptive arrangements may be used, in which boundaries are flagged in the output stream by indicating a change in encoding mode for subsequent data. For example, one can create a partition boundary which separates out primarily zero quantized frequency coefficients, from primarily non-zero quantized coefficients, and then apply a coder optimized for such data.Type: GrantFiled: December 14, 1998Date of Patent: October 9, 2001Assignee: Microsoft CorporationInventors: Wei-ge Chen, Ming-Chieh Lee
-
Patent number: 6223162Abstract: A technique for entropy coding information relating to frequency domain audio coefficients. For portions of a frequency spectrum having a predominate value of zero, a multi-level run length encoder statistically correlates sequences of zero values with non-zero values and assigns variable length code words. An encoder uses a specialized code book generated with respect to the probability of receiving an input sequence of zero-valued spectral coefficients followed by a non-zero coefficient. A corresponding decoder associates a variable length code word with a sequence of zero value coefficients adjacent a non-zero value coefficient.Type: GrantFiled: December 14, 1998Date of Patent: April 24, 2001Assignee: Microsoft CorporationInventors: Wei-ge Chen, Ming-Chieh Lee
-
Patent number: 6205422Abstract: A human speech detection method detects pure-speech signals in an audio signal containing a mixture of pure-speech and non-speech or mixed-speech signals. The method accurately detects the pure-speech signals by computing a novel Valley Percentage feature from the audio signal and then classifying the audio signals into pure-speech and non-speech (or mixed-speech) classifications. The Valley Percentage is a measurement of the low energy parts of the audio signal (the valley) in comparison to the high energy parts of the audio signal (the mountain). To classify the audio signal, the method performs a threshold decision on the value of the Valley Percentage. Using a binary mask, a high Valley Percentage is classified as pure-speech and a low Valley Percentage is classified as non-speech (or mixed-speech). The method further employs morphological filters to improve the accuracy of human speech detection.Type: GrantFiled: November 30, 1998Date of Patent: March 20, 2001Assignee: Microsoft CorporationInventors: Chuang Gu, Ming-Chieh Lee, Wei-ge Chen