Patents by Inventor Wei-ge Chen

Wei-ge Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Prediction of spectral coefficients in waveform coding and decoding

Patent number: 7684981

Abstract: Techniques and tools for prediction of spectral coefficients in encoding and decoding are described herein. For certain types and patterns of content, coefficient prediction exploits correlation between adjacent spectral coefficients, making subsequent entropy encoding more efficient. For example, an audio encoder predictively codes quantized spectral coefficients in the quantized domain and entropy encodes results of the predictive coding. Or, for a particular quantized spectral coefficient, an audio decoder entropy decodes a difference value, computes a predictor in the quantized domain, and combines the predictor and the difference value.

Type: Grant

Filed: July 15, 2005

Date of Patent: March 23, 2010

Assignee: Microsoft Corporation

Inventors: Naveen Thumpudi, Wei-Ge Chen, Chao He
Digital audio processing

Patent number: 7672743

Abstract: A compressed digital audio signal is transmitted from an audio source along a connection wire to an audio receiver. The digital audio signal can encode digital audio data having different sampling frequencies, frames sizes, and other information. The audio receiver that receives the digital audio signal can decode and convert the compressed digital audio signal into multiple synchronized analog signals, which are used to drive multiple speakers. The audio receiver may also synchronize the audio data with associated video data so that the audio playback and video playback are “in sync”, despite delay introduced by the audio signal decoding at the audio receiver.

Type: Grant

Filed: April 25, 2005

Date of Patent: March 2, 2010

Assignee: Microsoft Corporation

Inventors: Christopher Messer, Naveen Thumpudi, Raymond Cheng, Serge Smirnov, Wei-ge Chen, Timothy Onders
Audio encoding and decoding with intra frames and adaptive forward error correction

Patent number: 7668712

Abstract: Various strategies for rate/quality control and loss resiliency in an audio codec are described. The various strategies can be used in combination or independently. For example, a real-time speech codec uses intra frame coding/decoding, adaptive multi-mode forward error correction [“FEC”], and rate/quality control techniques. Intra frames help a decoder recover quickly from packet losses, while compression efficiency is still emphasized with predicted frames. Various strategies for inserting intra frames and signaling intra/predicted frames are described. With the adaptive multi-mode FEC, an encoder adaptively selects between multiple modes to efficiently and quickly provide a level of FEC that takes into account the bandwidth currently available for FEC. The FEC information itself may be predictively encoded and decoded relative to primary encoded information. Various rate/quality and FEC control strategies allow additional adaptation to available bandwidth and network conditions.

Type: Grant

Filed: March 31, 2004

Date of Patent: February 23, 2010

Assignee: Microsoft Corporation

Inventors: Tian Wang, Hosam A. Khalil, Kazuhito Koishida, Wei-Ge Chen, Mu Han
Multi-pass variable bitrate media encoding

Patent number: 7644002

Abstract: An encoder uses multi-pass VBR control strategies to provide constant or relatively constant quality for VBR output while guaranteeing (within tolerance) either compressed file size or, equivalently, overall average bitrate. The control strategies include various techniques and tools, which can be used in combination or independently. For example, in a first pass, an audio encoder encodes a sequence of audio data partitioned into variable-size chunks. In a second pass, the encoder encodes the sequence according to control parameters to produce output of relatively constant quality. The encoder sets checkpoints in the second pass to adjust the control parameters and/or subsequent checkpoints. The encoder selectively considers a peak bitrate constraint to limit peak bitrate. The encoder stores auxiliary information from the first pass for use in the second pass, which increases the speed of the second pass. Finally, the encoder compares signatures for the input data to check consistency between passes.

Type: Grant

Filed: December 21, 2007

Date of Patent: January 5, 2010

Assignee: Microsoft Corporation

Inventors: Naveen Thumpudi, Wei-Ge Chen
QUALITY IMPROVEMENT TECHNIQUES IN AN AUDIO ENCODER

Publication number: 20090326962

Abstract: An audio encoder implements multi-channel coding decision, band truncation, multi-channel rematrixing, and header reduction techniques to improve quality and coding efficiency. In the multi-channel coding decision technique, the audio encoder dynamically selects between joint and independent coding of a multi-channel audio signal via an open-loop decision based upon (a) energy separation between the coding channels, and (b) the disparity between excitation patterns of the separate input channels. In the band truncation technique, the audio encoder performs open-loop band truncation at a cut-off frequency based on a target perceptual quality measure. In multi-channel rematrixing technique, the audio encoder suppresses certain coefficients of a difference channel by scaling according to a scale factor, which is based on current average levels of perceptual quality, current rate control buffer fullness, coding mode, and the amount of channel separation in the source.

Type: Application

Filed: August 27, 2009

Publication date: December 31, 2009

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Frequency segmentation to obtain bands for efficient coding of digital media

Patent number: 7630882

Abstract: Frequency segmentation is important to the quality of encoding spectral data. Segmentation involves breaking the spectral data into units called sub-bands or vectors. Homogeneous segmentation may be suboptimal. Various features are described for providing spectral data intensity dependent segmentation. Finer segmentation is provided for regions of greater spectral variance and coarser segmentation is provided for more homogeneous regions. Sub-bands which have similar characteristics may be merged with very little effect on quality, whereas sub-bands with highly variable data may be better represented if a sub-band is split. Various methods are described for measuring tonality, energy, or shape of a sub-band. These various measurements are discussed in light of making decisions of when to split or merge sub-bands to provide variable frequency segmentation.

Type: Grant

Filed: July 15, 2005

Date of Patent: December 8, 2009

Assignee: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
ROBUST DECODER

Publication number: 20090276212

Abstract: Techniques and tools related to delayed or lost coded audio information are described. For example, a concealment technique for one or more missing frames is selected based on one or more factors that include a classification of each of one or more available frames near the one or more missing frames. As another example, information from a concealment signal is used to produce substitute information that is relied on in decoding a subsequent frame. As yet another example, a data structure having nodes corresponding to received packet delays is used to determine a desired decoder packet delay value.

Type: Application

Filed: July 14, 2009

Publication date: November 5, 2009

Applicant: Microsoft Corporation

Inventors: Hosam A. Khalil, Tian Wang, Kazuhito Koishida, Xiaoqin Sun, Wei-Ge Chen
Selectively using multiple entropy models in adaptive coding and decoding

Patent number: 7599840

Abstract: Techniques and tools for selectively using multiple entropy models in adaptive coding and decoding are described herein. For example, for multiple symbols, an audio encoder selects an entropy model from a first model set that includes multiple entropy models. Each of the multiple entropy models includes a model switch point for switching to a second model set that includes one or more entropy models. The encoder processes the multiple symbols using the selected entropy model and outputs results. Techniques and tools for generating entropy models are also described.

Type: Grant

Filed: July 15, 2005

Date of Patent: October 6, 2009

Assignee: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
Robust decoder

Patent number: 7590531

Abstract: Techniques and tools related to delayed or lost coded audio information are described. For example, a concealment technique for one or more missing frames is selected based on one or more factors that include a classification of each of one or more available frames near the one or more missing frames. As another example, information from a concealment signal is used to produce substitute information that is relied on in decoding a subsequent frame. As yet another example, a data structure having nodes corresponding to received packet delays is used to determine a desired decoder packet delay value.

Type: Grant

Filed: August 4, 2005

Date of Patent: September 15, 2009

Assignee: Microsoft Corporation

Inventors: Hosam A. Khalil, Tian Wang, Kazuhito Koishida, Xiaoqin Sun, Wei-Ge Chen
MIXED LOSSLESS AUDIO COMPRESSION

Publication number: 20090228290

Abstract: A mixed lossless audio compression has application to a unified lossy and lossless audio compression scheme that combines lossy and lossless audio compression within a same audio signal. The mixed lossless compression codes a transition frame between lossy and lossless coding frames to produce seamless transitions. The mixed lossless coding performs a lapped transform and inverse lapped transform to produce an appropriately windowed and folded pseudo-time domain frame, which can then be losslessly coded. The mixed lossless coding also can be applied for frames that exhibit poor lossy compression performance.

Type: Application

Filed: May 18, 2009

Publication date: September 10, 2009

Applicant: Microsoft Corporation

Inventors: Wei-Ge Chen, Chao He
Modification of codewords in dictionary used for efficient coding of digital media spectral data

Patent number: 7562021

Abstract: Coding of spectral data by representing certain portions of the spectral data as a scaled version of a code-vector, where the code-vector is chosen from either a fixed predetermined codebook or a codebook taken from a baseband. Various optional features are described for modifying the code-vectors in the codebook according to some rules which allow the code-vector to better represent the data they are modeling. The code-vector modification comprises a linear or non-linear transform of one or more code-vectors, such as, by exponentiation, negation, reversing, or combining elements from plural code-vectors.

Type: Grant

Filed: July 15, 2005

Date of Patent: July 14, 2009

Assignee: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen, Kazuhito Koishida
Techniques for measurement of perceptual audio quality

Patent number: 7548850

Abstract: An audio processing tool measures the quality of reconstructed audio data. For example, an audio encoder measures the quality of a block of reconstructed frequency coefficient data in a quantization loop. The invention includes several techniques and tools, which can be used in combination or separately. First, before measuring quality, the tool normalizes the block to account for variation in block sizes. Second, for the quality measurement, the tool processes the reconstructed data by critical bands, which can differ from the quantization bands used to compress the data. Third, the tool accounts for the masking effect of the reconstructed data, not just the masking effect of the original data. Fourth, the tool band weights the quality measurement, which can be used to account for noise substitution or band truncation. Finally, the tool changes quality measurement techniques depending on the channel coding mode.

Type: Grant

Filed: June 26, 2006

Date of Patent: June 16, 2009

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Techniques for measurement of perceptual audio quality

Patent number: 7548855

Abstract: An audio processing tool measures the quality of reconstructed audio data. For example, an audio encoder measures the quality of a block of reconstructed frequency coefficient data in a quantization loop. The invention includes several techniques and tools, which can be used in combination or separately. First, before measuring quality, the tool normalizes the block to account for variation in block sizes. Second, for the quality measurement, the tool processes the reconstructed data by critical bands, which can differ from the quantization bands used to compress the data. Third, the tool accounts for the masking effect of the reconstructed data, not just the masking effect of the original data. Fourth, the tool band weights the quality measurement, which can be used to account for noise substitution or band truncation. Finally, the tool changes quality measurement techniques depending on the channel coding mode.

Type: Grant

Filed: June 26, 2006

Date of Patent: June 16, 2009

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Coding with improved time resolution for selected segments via adaptive block transformation of a group of samples from a subband decomposition

Patent number: 7546240

Abstract: A transform coder is described that performs a time-split transform in addition to a discrete cosine type transform. A time-split transform is selectively performed based on characteristics of media data. Transient detection identifies a changing signal characteristic, such as a transient in media data. After encoding an input signal from a time domain to a transform domain, a time-splitting transformer selectively perform an orthogonal sum-difference transform on adjacent coefficients indicated by a changing signal characteristic location. The orthogonal sum-difference transform on adjacent coefficients results in transforming a vector of coefficients in the transform domain as if they were multiplied by an identity matrix including at least one 2×2 time-split block along a diagonal of the matrix. A decoder performs an inverse of the described transforms.

Type: Grant

Filed: July 15, 2005

Date of Patent: June 9, 2009

Assignee: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen, Henrique Sarmento Malvar
Coding and decoding scale factor information

Patent number: 7539612

Abstract: Techniques and tools for representing, coding, and decoding scale factor information are described herein. For example, during encoding of scale factors, an encoder uses one or more of flexible scale factor resolution selection, spatial prediction of scale factors, flexible prediction of scale factors, smoothing of noisy scale factor amplitudes, reordering of scale factor prediction residuals, and prediction of scale factor prediction residuals. Or, during decoding, a decoder uses one or more of flexible scale factor resolution selection, spatial prediction of scale factors, flexible prediction of scale factors, reordering of scale factor prediction residuals, and prediction of scale factor prediction residuals.

Type: Grant

Filed: July 15, 2005

Date of Patent: May 26, 2009

Assignee: Microsoft Corporation

Inventors: Naveen Thumpudi, Wei-Ge Chen, Chao He
Mixed lossless audio compression

Patent number: 7536305

Abstract: A mixed lossless audio compression has application to a unified lossy and lossless audio compression scheme that combines lossy and lossless audio compression within a same audio signal. The mixed lossless compression codes a transition frame between lossy and lossless coding frames to produce seamless transitions. The mixed lossless coding performs a lapped transform and inverse lapped transform to produce an appropriately windowed and folded pseudo-time domain frame, which can then be losslessly coded. The mixed lossless coding also can be applied for frames that exhibit poor lossy compression performance.

Type: Grant

Filed: July 14, 2003

Date of Patent: May 19, 2009

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Chao He
TRANSCODER USING ENCODER GENERATED SIDE INFORMATION

Publication number: 20090125315

Abstract: An audio encoder encodes side information into a compressed audio bitstream containing encoding parameters used by the encoder for one or more encoding techniques, such as a noise-mask-ratio curve used for rate control. A transcoder uses the encoder generated side information to transcode the audio from the original compressed bitstream having an initial bit-rate into a second bitstream having a new bit-rate. Because the side information is derived from the original audio, the transcoder is able to better maintain audio quality of the transcoding. The side information also allows the transcoder to re-encode from an intermediate decoding/encoding stage for faster and lower complexity transcoding.

Type: Application

Filed: November 9, 2007

Publication date: May 14, 2009

Applicant: Microsoft Corporation

Inventors: Kazuhito Koishida, Sanjeev Mehrotra, Wei-Ge Chen
EFFICIENT CODING OF DIGITAL MEDIA SPECTRAL DATA USING WIDE-SENSE PERCEPTUAL SIMILARITY

Publication number: 20090083046

Abstract: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.

Type: Application

Filed: November 26, 2008

Publication date: March 26, 2009

Applicant: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
Multi-channel audio encoding and decoding with multi-channel transform selection

Patent number: 7502743

Abstract: An audio encoder and decoder use architectures and techniques that improve the efficiency of multi-channel audio coding and decoding. The described strategies include various techniques and tools, which can be used in combination or independently. For example, an audio encoder performs a pre-processing multi-channel transform on multi-channel audio data, varying the transform so as to control quality. The encoder groups multiple windows from different channels into one or more tiles and outputs tile configuration information, which allows the encoder to isolate transients that appear in a particular channel with small windows, but use large windows in other channels. Using a variety of techniques, the encoder performs flexible multi-channel transforms that effectively take advantage of inter-channel correlation. An audio decoder performs corresponding processing and decoding. In addition, the decoder performs a post-processing multi-channel transform for any of multiple different purposes.

Type: Grant

Filed: August 15, 2003

Date of Patent: March 10, 2009

Assignee: Microsoft Corporation

Inventors: Naveen Thumpudi, Wei-Ge Chen
BITSTREAM SYNTAX FOR MULTI-PROCESS AUDIO DECODING

Publication number: 20090006103

Abstract: An audio decoder provides a combination of decoding components including components implementing base band decoding, spectral peak decoding, frequency extension decoding and channel extension decoding techniques. The audio decoder decodes a compressed bitstream structured by a bitstream syntax scheme to permit the various decoding components to extract the appropriate parameters for their respective decoding technique.

Type: Application

Filed: June 29, 2007

Publication date: January 1, 2009

Applicant: Microsoft Corporation

Inventors: Kazuhito Koishida, Sanjeev Mehrotra, Chao He, Wei-Ge Chen

prev 1 2 3 4 5 6 7 8 9 … next