Patents by Inventor Ming-Chieh Lee

Ming-Chieh Lee has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

VIDEO CODING / DECODING WITH RE-ORIENTED TRANSFORMS AND SUB-BLOCK TRANSFORM SIZES

Publication number: 20130301704

Abstract: Techniques and tools for video coding/decoding with sub-block transform coding/decoding and re-oriented transforms are described. For example, a video encoder adaptively switches between 8×8, 8×4, and 4×8 DCTs when encoding 8×8 prediction residual blocks; a corresponding video decoder switches between 8×8, 8×4, and 4×8 inverse DCTs during decoding. The video encoder may determine the transform sizes as well as switching levels (e.g., frame, macroblock, or block) in a closed loop evaluation of the different transform sizes and switching levels. When a video encoder or decoder uses spatial extrapolation from pixel values in a causal neighborhood to predict pixel values of a block of pixels, the encoder/decoder can use a re-oriented transform to address non-stationarity of prediction residual values.

Type: Application

Filed: July 16, 2013

Publication date: November 14, 2013

Inventors: Sridhar Srinivasan, Thomas W. Holcomb, Chih-Lung Lin, Pohsiang Hsu, Ming-Chieh Lee
Efficient macroblock header coding for video compression

Patent number: 8582903

Abstract: The coded block parameters used to code blocks of image samples into structures called macroblocks are compressed more efficiently by exploiting the correlation between chrominance and luminance blocks in each macroblock. In particular, the coded block pattern for chrominance and luminance are combined into a single parameter for the macroblock and jointly coded with a single variable length code. To further enhance coding efficiency, the spatial coherence of coded block patterns can be exploited by using spatial prediction to compute predicted values for coded block pattern parameters.

Type: Grant

Filed: October 15, 2012

Date of Patent: November 12, 2013

Assignee: Microsoft Corporation

Inventors: Chih-Lung Lin, Ming-Chieh Lee
Regions of interest for quality adjustments

Patent number: 8576908

Abstract: Quality settings established by an encoder are adjusted based on information associated with regions of interest (“ROIs”). For example, quantization step sizes can be reduced (to improve quality) or increased (to reduce bit rate). ROIs can be identified and quality settings can be adjusted based on input received from a user interface. An overlap setting can be determined for a portion of a picture that corresponds to an ROI overlap area. For example, an overlap setting is chosen from step sizes corresponding to a first overlapping ROI and a second overlapping ROI, or from relative reductions in step size corresponding to the first ROI and the second ROI. ROIs can be parameterized by information (e.g., using data structures) that indicates spatial dimensions of the ROIs and quality adjustment information (e.g., dead zone information, step size information, and quantization mode information).

Type: Grant

Filed: July 2, 2012

Date of Patent: November 5, 2013

Assignee: Microsoft Corporation

Inventors: Chih-Lung Lin, Ming-Chieh Lee, Stacey Spears, Philip Lawrence Garrett, Anil Madhav Murching, Amy Jennifer Beauford, Thomas W. Holcomb, Xiping Zuo
Quality improvement techniques in an audio encoder

Patent number: 8554569

Abstract: An audio encoder implements multi-channel coding decision, band truncation, multi-channel rematrixing, and header reduction techniques to improve quality and coding efficiency. In the multi-channel coding decision technique, the audio encoder dynamically selects between joint and independent coding of a multi-channel audio signal via an open-loop decision based upon (a) energy separation between the coding channels, and (b) the disparity between excitation patterns of the separate input channels. In the band truncation technique, the audio encoder performs open-loop band truncation at a cut-off frequency based on a target perceptual quality measure. In multi-channel rematrixing technique, the audio encoder suppresses certain coefficients of a difference channel by scaling according to a scale factor, which is based on current average levels of perceptual quality, current rate control buffer fullness, coding mode, and the amount of channel separation in the source.

Type: Grant

Filed: August 27, 2009

Date of Patent: October 8, 2013

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Media coding for loss recovery with remotely predicted data units

Patent number: 8548051

Abstract: An improved loss recovery method for coding streaming media classifies each data unit in the media stream as an independent data unit (I unit), a remotely predicted unit (R unit) or a predicted data unit (P unit). Each of these units is organized into independent segments having an I unit, multiple P units and R units interspersed among the P units. The beginning of each segment is the start of a random access point, while each R unit provides a loss recovery point that can be placed independently of the I unit. This approach separates the random access point from the loss recovery points provided by the R units, and makes the stream more impervious to data losses without substantially impacting coding efficiency. The most important data units are transmitted with the most reliability to ensure that the majority of the data received by the client is usable. The I units are the least sensitive to transmission losses because they are coded using only their own data.

Type: Grant

Filed: February 3, 2010

Date of Patent: October 1, 2013

Assignee: Microsoft Corporation

Inventors: Albert Szu-Chi Wang, Ming-Chieh Lee
DYNAMIC INSERTION OF SYNCHRONIZATION PREDICTED VIDEO FRAMES

Publication number: 20130223524

Abstract: A video bitstream can be encoded and sent over a computer network to a decoding computer system. The bitstream can follow a regular prediction structure when an encoding computer system is not notified of lost data from the bitstream. A notification of lost data in the bitstream can be received. The lost data can include at least a portion of a reference frame of the bitstream. In response, a synchronization predicted frame can be dynamically encoded with a prediction that references one or more other previously-sent frames in the bitstream and that does not reference the lost data. The synchronization predicted frame can be inserted in the bitstream in a position where the regular prediction structure would have dictated inserting a different predicted frame with a prediction that would have referenced the lost data according to the regular prediction structure.

Type: Application

Filed: February 29, 2012

Publication date: August 29, 2013

Applicant: MICROSOFT CORPORATION

Inventors: Mei-Hsuan Lu, Ming-Chieh Lee
ENCODING PROCESSING FOR CONFERENCING SYSTEMS

Publication number: 20130208075

Abstract: Optimization of conference call encoding processes is provided. A first client of a multi-party conference call may receive client capability data, including video scalability support, from each of the other clients to the conference call. Based on the client capability data and the transmission capabilities of the first client, including video scalability support, the first client may determine a total number of data streams and properties for each data stream, such that the total number of data streams and the plurality of properties for each data stream are optimized and supported by the respective client capability data and the transmission capabilities. Subsequently, the first client generates one or more data streams according to the total number of data streams and the properties that were determined for each data stream and transmits the one or more data streams to the other clients of the conference call.

Type: Application

Filed: February 13, 2012

Publication date: August 15, 2013

Applicant: Microsoft Corporation

Inventors: Mei-Hsuan Lu, Ming-Chieh Lee
MULTI-LAYER RATE CONTROL

Publication number: 20130208809

Abstract: Concepts and technologies are described herein for multi-layer rate control. In accordance with the concepts and technologies disclosed herein, a video server obtains video data and encodes the video data into a multi-layer video stream. Layers of the video stream cart be output buffers and the buffers can be monitored to determine bit usage. A rate controller can obtain bit usage feedback for each layer of the encoded video stream and determine, based upon the bit usage feedback, a quantization parameter associated with each layer of the encoded video stream. In determining the quantization parameters, the rate controller can consider not only bitrates of the entire encoded video stream, but also bitrates and bit usage feedback associated with each layer of the encoded video stream. Further encoding can be based upon the quantization parameters determined by the video server.

Type: Application

Filed: February 14, 2012

Publication date: August 15, 2013

Applicant: MICROSOFT CORPORATION

Inventors: Mei-Hsuan Lu, Ming-Chieh Lee
CAPABILITY ADVERTISEMENT, CONFIGURATION AND CONTROL FOR VIDEO CODING AND DECODING

Publication number: 20130177071

Abstract: Innovations described herein provide a framework for advertising encoder capabilities, initializing encoder configuration, and signaling run-time control messages for video coding and decoding. For example, an encoding controller receives a request for encoder capability data from a decoding host controller, determines the capability data, and sends the capability data in reply. The capability data can include data that indicate a number of bitstreams, each providing an alternative version of input video, as well as data that indicate scalable video coding capabilities. The decoding host controller creates stream configuration request data based on the encoder capability data, and sends the configuration request data to the encoding controller. During decoding, the decoding host controller can create and send a control message for run-time control of encoding, where the control message includes a stream identifier for a bitstream and layer identifiers for a given layer of the bitstream.

Type: Application

Filed: January 11, 2012

Publication date: July 11, 2013

Applicant: Microsoft Corporation

Inventors: Mei-Hsuan Lu, Ming-Chieh Lee
HARDWARE-ACCELERATED DECODING OF SCALABLE VIDEO BITSTREAMS

Publication number: 20130156101

Abstract: In various respects, hardware-accelerated decoding is adapted for decoding of video that has been encoded using scalable video coding. For example, for a given picture to be decoded, a host decoder determines whether a corresponding base picture will be stored for use as a reference picture. If so, the host decoder directs decoding with an accelerator such that the some of the same decoding operations can be used for the given picture and the reference base picture. Or, as another example, the host decoder groups encoded data associated with a given layer representation in buffers. The host decoder provides the encoded data for the layer to the accelerator. The host decoder repeats the process layer-after-layer in the order that layers appear in the bitstream, according to a defined call pattern for an acceleration interface, which helps the accelerator determine the layers with which buffers are associated.

Type: Application

Filed: December 16, 2011

Publication date: June 20, 2013

Applicant: Microsoft Corporation

Inventors: Mei-Hsuan Lu, Yongjun Wu, Ming-Chieh Lee, Gary J. Sullivan
ADDING TEMPORAL SCALABILITY TO A NON-SCALABLE BITSTREAM

Publication number: 20130114718

Abstract: Innovations described herein facilitate the addition of temporal scalability to non-scalable bitstreams. For example, a bitstream rewriter receives units of encoded video data for a non-scalable bitstream from components of a hardware-based encoder. The bitstream rewriter changes at least some of the units of encoded video data so as to produce a scalable bitstream with temporal scalability. In doing so, the bitstream rewriter can associate an original sequence parameter set (SPS) and original picture parameter set (PPS) with pictures for a temporal base layer, and associate a new SPS and new PPS with pictures for a temporal enhancement layer. The bitstream rewriter can also alter syntax elements in the units of encoded video data, for example, changing syntax elements in a slice header in ways that avoid bit shifting operations for following coded slice data for a unit of encoded video data for the temporal enhancement layer.

Type: Application

Filed: November 3, 2011

Publication date: May 9, 2013

Applicant: Microsoft Corporation

Inventors: Mei-Hsuan Lu, Ming-Chieh Lee
Quantization matrices for digital audio

Patent number: 8428943

Abstract: Quantization matrices facilitate digital audio encoding and decoding. An audio encoder generates and compresses quantization matrices; an audio decoder decompresses and applies the quantization matrices. The invention includes several techniques and tools, which can be used in combination or separately. For example, the audio encoder can generate quantization matrices from critical band patterns for blocks of audio data. The encoder can compute the quantization matrices directly from the critical band patterns, which can be computed from the same audio data that is being compressed. The audio encoder/decoder can use different modes for generating/applying quantization matrices depending on the coding channel mode of multi-channel audio data. The audio encoder/decoder can use different compression/decompression modes for the quantization matrices, including a parametric compression/decompression mode.

Type: Grant

Filed: March 11, 2011

Date of Patent: April 23, 2013

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
MULTI-LAYER ENCODING AND DECODING

Publication number: 20130070859

Abstract: Innovations described herein provide a generic encoding and decoding framework that includes some features of simulcast and some features of scalable video coding. For example, a bitstream multiplexer multiplexes component bitstreams into a multi-layer encoding (MLE) bitstream that provides temporal scalability, spatial resolution scalability and/or signal to noise ratio scalability. Each of the component bitstreams provides an alternative version of input video, and a given component bitstream can be a non-scalable bitstream or scalable bitstream. The multiplexer follows composition rules for the MLE bitstream and may rewrite values of certain syntax elements of component bitstreams using an approach that avoids bit shifting operations. A corresponding demultiplexer receives an MLE bitstream that includes component bitstreams and demultiplexes at least part of at least one of the component bitstreams from the MLE bitstream, following decomposition rules for the demultiplexing.

Type: Application

Filed: September 16, 2011

Publication date: March 21, 2013

Applicant: Microsoft Corporation

Inventors: Mei-Hsuan Lu, Ming-Chieh Lee
TECHNIQUES FOR DYNAMIC SWITCHING BETWEEN CODED BITSTREAMS

Publication number: 20130055326

Abstract: Techniques for dynamic switching in coded bitstreams are described. An apparatus may comprise a switching component operative to determine a timepoint to switch from broadcasting a first video stream to broadcasting a second video stream, the first video stream a first encoding of a video source at a first quality level and the second video stream a second encoding of the video source at a second quality level. Other embodiments are described and claimed.

Type: Application

Filed: August 30, 2011

Publication date: February 28, 2013

Applicant: MICROSOFT CORPORATION

Inventors: Mei-Hsuan Lu, Ming-Chieh Lee
Video conference rate matching

Patent number: 8380790

Abstract: Video conference rate matching may be provided. A video conference server may receive video source streams from clients on a video conference. The server may analyze each client's capabilities and choose a video stream to send to each client based on those capabilities. For example, a client capable of encoding and decoding a high definition video stream may provide three source video streams—a high definition stream, a medium resolution stream, and a low resolution stream. The server may send only the low resolution stream to a client with a low amount of available bandwidth. The server may send the medium resolution stream to another client with sufficient bandwidth for the high definition stream, but which lacks the ability to decode the high definition stream.

Type: Grant

Filed: December 15, 2008

Date of Patent: February 19, 2013

Assignee: Microsoft Corporation

Inventors: Ming Chieh Lee, Mu Han, Tim Moore
TECHNIQUES FOR ADAPTIVE ROUNDING OFFSET IN VIDEO ENCODING

Publication number: 20120307890

Abstract: Techniques adaptive rounding offset in video encoding are described. An apparatus may comprise a rounding offset adaptation component operative to adjust a quantization parameter rounding factor for a current macroblock of a current frame of a video stream being compressed by a video encoding system. Other embodiments are described and claimed.

Type: Application

Filed: June 2, 2011

Publication date: December 6, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Mei-Hsuan Lu, Ming-Chieh Lee
REGIONS OF INTEREST FOR QUALITY ADJUSTMENTS

Publication number: 20120269266

Abstract: Quality settings established by an encoder are adjusted based on information associated with regions of interest (“ROIs”). For example, quantization step sizes can be reduced (to improve quality) or increased (to reduce bit rate). ROIs can be identified and quality settings can be adjusted based on input received from a user interface. An overlap setting can be determined for a portion of a picture that corresponds to an ROI overlap area. For example, an overlap setting is chosen from step sizes corresponding to a first overlapping ROI and a second overlapping ROI, or from relative reductions in step size corresponding to the first ROI and the second ROI. ROIs can be parameterized by information (e.g., using data structures) that indicates spatial dimensions of the ROIs and quality adjustment information (e.g., dead zone information, step size information, and quantization mode information).

Type: Application

Filed: July 2, 2012

Publication date: October 25, 2012

Applicant: Microsoft Corporation

Inventors: Chih-Lung Lin, Ming-Chieh Lee, Stacey Spears, Philip Lawrence Garrett, Anil Madhav Murching, Amy Jennifer Beauford, Thomas W. Holcomb, Xiping Zuo
Encoding macroblock type and coded block pattern information

Patent number: 8290288

Abstract: The coded block parameters used to code blocks of image samples into structures called macroblocks are compressed more efficiently by exploiting the correlation between chrominance and luminance blocks in each macroblock. In particular, the coded block pattern for chrominance and luminance are combined into a single parameter for the macroblock and jointly coded with a single variable length code. To further enhance coding efficiency, the spatial coherence of coded block patterns can be exploited by using spatial prediction to compute predicted values for coded block pattern parameters.

Type: Grant

Filed: September 20, 2007

Date of Patent: October 16, 2012

Assignee: Microsoft Corporation

Inventors: Chih-Lung Lin, Ming-Chieh Lee
Regions of interest for quality adjustments

Patent number: 8243797

Abstract: Quality settings established by an encoder are adjusted based on information associated with regions of interest (“ROIs”). For example, quantization step sizes can be reduced (to improve quality) or increased (to reduce bit rate). ROIs can be identified and quality settings can be adjusted based on input received from a user interface. An overlap setting can be determined for a portion of a picture that corresponds to an ROI overlap area. For example, an overlap setting is chosen from step sizes corresponding to a first overlapping ROI and a second overlapping ROI, or from relative reductions in step size corresponding to the first ROI and the second ROI. ROIs can be parameterized by information (e.g., using data structures) that indicates spatial dimensions of the ROIs and quality adjustment information (e.g., dead zone information, step size information, and quantization mode information).

Type: Grant

Filed: March 30, 2007

Date of Patent: August 14, 2012

Assignee: Microsoft Corporation

Inventors: Chih-Lung Lin, Ming-Chieh Lee, Stacey Spears, Philip Lawrence Garrett, Anil Madhav Murching, Amy Jennifer Beauford, Thomas W. Holcomb, Xiping Zuo
Signaling and repeat padding for skip frames

Patent number: 8107531

Abstract: A video codec efficiently signals that a frame is identical to its reference frame, such that separate coding of its picture content is skipped. Information that a frame is skipped is represented jointly in a coding table of a frame coding type element for bit rate efficiency in signaling. Further, the video codec signals the picture type (e.g., progressive or interlaced) of skipped frames, which permits different repeat padding methods to be applied according to the picture type.

Type: Grant

Filed: November 12, 2004

Date of Patent: January 31, 2012

Assignee: Microsoft Corporation

Inventors: Shankar Regunathan, Chih-Lung Lin, Thomas W. Holcomb, Jie Liang, Ming-Chieh Lee, Pohsiang Hsu

prev … 3 4 5 6 7 8 9 10 11 … next