Patents by Inventor Lihua Zhu

Lihua Zhu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHODS AND APPARATUS FOR VIDEO USABILITY INFORMATION (VUI) FOR SCALABLE VIDEO CODING (SVC)

Publication number: 20180035119

Abstract: There are provided methods and apparatus for video usability information (VUI) for scalable video coding (SVC). An apparatus includes an encoder (100) for encoding video signal data into a bitstream. The encoder specifies video user information, excluding hypothetical reference decoder parameters, in the bitstream using a high level syntax element. The video user information corresponds to a set of interoperability points in the bitstream relating to scalable video coding (340, 355).

Type: Application

Filed: October 9, 2017

Publication date: February 1, 2018

Inventors: Jiancong Luo, Peng Yin, Lihua Zhu

SYNTAX STRUCTURES INDICATING COMPLETION OF CODED REGIONS

Publication number: 20180014033

Abstract: Syntax structures that indicate the completion of coded regions of pictures are described. For example, a syntax structure in an elementary bitstream indicates the completion of a coded region of a picture. The syntax structure can be a type of network abstraction layer unit, a type of supplemental enhancement information message or another syntax structure. For example, a media processing tool such as an encoder can detect completion of a coded region of a picture, then output, in a predefined order in an elementary bitstream, syntax structure(s) that contain the coded region as well as a different syntax structure that indicates the completion of the coded region. Another media processing tool such as a decoder can receive, in a predefined order in an elementary bitstream, syntax structure(s) that contain a coded region of a picture as well as a different syntax structure that indicates the completion of the coded region.

Type: Application

Filed: September 22, 2017

Publication date: January 11, 2018

Applicant: Microsoft Technology Licensing, LLC

Inventors: Yongjun Wu, Lihua Zhu, Shyam Sadhwani, Gary J. Sullivan

EFFICIENT DECODING AND RENDERING OF INTER-CODED BLOCKS IN A GRAPHICS PIPELINE

Publication number: 20170372494

Abstract: Innovations in video decoding and rendering operations for inter-coded blocks in a graphics pipeline, in which at least some of the operations are performed using a graphics processing unit (“GPU”), are described. For example, a video playback tool receives encoded data for a current picture and performs operations to decode the encoded data and reconstruct the current picture. For a given inter-coded block of the current picture, a graphics primitive represents texture values as a point for processing by the GPU. The graphics primitive can have one or more attributes, including a motion vector, a block size, a display index value (indicating a location in a display buffer), and/or a residual index value (indicating a location of residual values). The operations performed by the video playback tool can include interpolation of sample values at fractional-sample offsets and motion compensation performed for inter-coded blocks in multiple passes for different block sizes.

Type: Application

Filed: June 24, 2016

Publication date: December 28, 2017

Applicant: Microsoft Technology Licensing, LLC

Inventors: Lihua Zhu, B. Anil Kumar, Guosheng Sun, Olof L.E. Mases

Dynamically updating quality to higher chroma sampling rate

Patent number: 9854201

Abstract: Innovations in encoding of video pictures in a high-resolution chroma sampling format (such as YUV 4:4:4) using a video encoder operating on coded pictures in a low-resolution chroma sampling format (such as YUV 4:2:0) are presented. For example, according to a set of decision rules, high chroma resolution details are selectively encoded on a region-by-region basis such that increases in bit rate (due to encoding of sample values for the high chroma resolution details) happen when and where corresponding increases in chroma resolution are likely to improve quality in noticeable ways. In this way, available encoders operating on coded pictures in the low-resolution chroma sampling format can be effectively used to provide high chroma resolution details.

Type: Grant

Filed: January 16, 2015

Date of Patent: December 26, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shir Aharon, Lihua Zhu, B. Anil Kumar, Jeroen E. van Eesteren

Methods and apparatus for video usability information (VUI) for scalable video coding (SVC)

Patent number: 9826243

Abstract: There are provided methods and apparatus for video usability information (VUI) for scalable video coding (SVC). An apparatus includes an encoder for encoding video signal data into a bitstream. The encoder specifies video user information, excluding hypothetical reference decoder parameters, in the bitstream using a high level syntax element. The video user information corresponds to a set of interoperability points in the bitstream relating to scalable video coding.

Type: Grant

Filed: March 20, 2008

Date of Patent: November 21, 2017

Assignee: THOMSON Licensing

Inventors: Jiancong Luo, Peng Yin, Lihua Zhu

Syntax structures indicating completion of coded regions

Patent number: 9800898

Abstract: Syntax structures that indicate the completion of coded regions of pictures are described. For example, a syntax structure in an elementary bitstream indicates the completion of a coded region of a picture. The syntax structure can be a type of network abstraction layer unit, a type of supplemental enhancement information message or another syntax structure. For example, a media processing tool such as an encoder can detect completion of a coded region of a picture, then output, in a predefined order in an elementary bitstream, syntax structure(s) that contain the coded region as well as a different syntax structure that indicates the completion of the coded region. Another media processing tool such as a decoder can receive, in a predefined order in an elementary bitstream, syntax structure(s) that contain a coded region of a picture as well as a different syntax structure that indicates the completion of the coded region.

Type: Grant

Filed: May 6, 2015

Date of Patent: October 24, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yongjun Wu, Lihua Zhu, Shyam Sadhwani, Gary J. Sullivan

PROGRESSIVE UPDATES WITH MOTION

Publication number: 20170300312

Abstract: Non-limiting examples of the present disclosure describe detection of gross motion of a region of content. Gross motion of a region of content may be detected. A determination may be made as to a current quality level of the region. Based on detection of the gross motion, residual values may be generated for a progressive update of the region. The residual values are generated using the current quality level of the region as a base to determine a quantization update for a progressive update of the region at a higher quality level as compared with the current quality level of the region. Frame data for the progressive update of the region may be encoded. The frame data may comprise the residual values and motion vectors for progressive update of the region. The frame data may be transmitted for decoding. Other examples are also described.

Type: Application

Filed: April 13, 2016

Publication date: October 19, 2017

Applicant: Microsoft Technology Licensing, LLC

Inventors: Shir Aharon, Guosheng Sun, Costin Hagiu, Mauruthi Geetha Mohan, B. Anil Kumar, Lihua Zhu, Jeroen E. van Eesteren

EFFICIENT DECODING AND RENDERING OF BLOCKS IN A GRAPHICS PIPELINE

Publication number: 20170295379

Abstract: Innovations in video decoding and rendering operations in a graphics pipeline, in which at least some of the operations are performed using a graphics processing unit (“GPU”), are described. For example, a video playback tool aggregates texture values for intra-coded blocks of a picture in central processing unit (“CPU”) memory, then transfers the texture values for the intra-coded blocks from the CPU memory to GPU memory. The video playback tool performs operations to decode the encoded data and reconstruct the picture. For a given block (e.g., of a macroblock, coding unit) of the picture, a graphics primitive represents texture values for the given block as a point for processing by the GPU. The video playback tool uses one or more shader routines, executable by the GPU, to transfer texture values to a display buffer. In some cases, the video playback tool also performs decoding operations with the shader routines.

Type: Application

Filed: April 12, 2016

Publication date: October 12, 2017

Applicant: Microsoft Technology Licensing, LLC

Inventors: Guosheng Sun, Olof L.E. Mases, Lihua Zhu, B. Anil Kumar

Multi-stage image classification

Patent number: 9785865

Abstract: Techniques are described for performing multi-stage image classification. For example, multi-stage image classification can comprise a first classification stage and a second classification stage. The first classification stage can determine an overall classification for an input image (e.g., based on a relative entropy result calculated for the input image). The second classification stage can be performed by dividing the image into a plurality of blocks and classifying individual blocks, or groups of blocks, based on a classification model that is specific to the overall classification of the image determined in the first classification stage.

Type: Grant

Filed: April 10, 2017

Date of Patent: October 10, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Lihua Zhu, Sridhar Sankuratri, Shir Aharon, B. Anil Kumar, Jeroen E. van Eesteren
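
Illustrative note (not part of the patent record): the two-stage idea in the abstract above can be sketched as follows. This is a minimal sketch under stated assumptions, not the patented method. The uniform reference histogram, the 0.5 threshold, the class names ("synthetic", "natural", "text", "image", "flat", "detail"), the per-class block rules, and the use of a flat pixel list split into fixed-size runs as "blocks" are all hypothetical choices made for illustration.

```python
import math


def histogram(pixels, bins=4, max_value=256):
    """Normalized histogram of 8-bit sample values."""
    counts = [0] * bins
    for p in pixels:
        counts[p * bins // max_value] += 1
    return [c / len(pixels) for c in counts]


def relative_entropy(p, q):
    """Relative entropy (KL divergence) of p from q; q must be non-zero where p is."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Hypothetical per-class block models: the rule applied to a block
# depends on the overall class chosen in the first stage.
BLOCK_MODELS = {
    "synthetic": lambda block: "text" if len(set(block)) <= 2 else "image",
    "natural": lambda block: "flat" if max(block) - min(block) < 16 else "detail",
}


def classify(pixels, block_size=4, threshold=0.5):
    # Stage 1: overall class from the relative entropy against a uniform reference.
    hist = histogram(pixels)
    uniform = [1 / len(hist)] * len(hist)
    overall = "synthetic" if relative_entropy(hist, uniform) > threshold else "natural"
    # Stage 2: classify each block with the model specific to the overall class.
    model = BLOCK_MODELS[overall]
    blocks = [pixels[i:i + block_size] for i in range(0, len(pixels), block_size)]
    return overall, [model(block) for block in blocks]


screen_like = [0] * 12 + [255] * 4      # few distinct values, peaked histogram
photo_like = list(range(0, 256, 16))    # values spread evenly over the range
screen_result = classify(screen_like)
photo_result = classify(photo_like)
```

In this sketch the peaked histogram lands in the "synthetic" class and the evenly spread one in the "natural" class, so the two inputs are handled by different block-level rules.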

VIDEO DECODING USING POINT SPRITES

Publication number: 20170289547

Abstract: A GPU loads point sprites that represent coded blocks of transform coefficients of one or more frames encoded in a bitstream and loads a transform kernel as a transform kernel texture. The GPU constructs an output frame using an inverse transform on the coded blocks of transform coefficients by transforming the point sprites with the transform kernel texture and by optionally dequantizing the point sprites. A single render pass may be used in which the rasterization formula performs the inverse transform and optionally dequantization. To preserve bandwidth, a CPU may refrain from sending the GPU at least some zero valued transform coefficients for the point sprites. Also, to reduce processing, the transform coefficients can remain in a zig-zag arrangement. The transform kernel texture used in the decoding can correspond to a modified version of the basis matrices used to encode the frame, which compensates for the zig-zag arrangement.

Type: Application

Filed: April 1, 2016

Publication date: October 5, 2017

Inventors: Lihua Zhu, Guosheng Sun, B. Anil Kumar, Shir Aharon
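
Illustrative note (not part of the patent record): the abstract above says the coefficients can stay in zig-zag order if the transform kernel is modified to compensate. A minimal CPU-side sketch of that idea follows, assuming a 4x4 block and a Hadamard basis as a stand-in for whatever transform the codec actually uses. The function names are hypothetical, and nothing here models the GPU, point sprites, or textures.

```python
N = 4
# Hadamard matrix used as a stand-in integer transform (an assumption).
H = [[1, 1, 1, 1], [1, -1, 1, -1], [1, 1, -1, -1], [1, -1, -1, 1]]


def zigzag_scan(n):
    """Raster index of the k-th coefficient in zig-zag order for an n x n block."""
    order = sorted(
        ((r, c) for r in range(n) for c in range(n)),
        key=lambda rc: (rc[0] + rc[1], rc[0] if (rc[0] + rc[1]) % 2 else rc[1]),
    )
    return [r * n + c for r, c in order]


def basis_matrix():
    """Rows: output samples (y, x). Columns: coefficients (u, v) in raster order."""
    return [[H[u][y] * H[v][x] for u in range(N) for v in range(N)]
            for y in range(N) for x in range(N)]


def apply(matrix, vector):
    """Matrix-vector product on plain lists."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


scan = zigzag_scan(N)
basis = basis_matrix()
# Modified kernel: reorder the basis columns once so that it can be
# applied directly to coefficients that are still in zig-zag order.
modified_basis = [[row[j] for j in scan] for row in basis]

coeffs_raster = [10, 3, 0, 0, -2, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
coeffs_zigzag = [coeffs_raster[j] for j in scan]

reference = apply(basis, coeffs_raster)
from_zigzag = apply(modified_basis, coeffs_zigzag)
```

Both products give the same samples, so the reordering cost is paid once when the kernel is built rather than once per block.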

Gradual updating using transform coefficients for encoding and decoding

Patent number: 9762912

Abstract: Innovations are provided for encoding and/or decoding video and/or image content using transform coefficient level gradual updating. Transform coefficient level gradual updating can be applied by encoding (or decoding) different subsets of the transform coefficients for the blocks, macroblocks, or other coding unit for each of a sequence of pictures. For example, a first subset of the transform coefficients of the blocks of a first picture can be encoded with the first picture, a second subset of the transform coefficients of the blocks of a second picture can be encoded with the second picture, and so on. A decoder can reconstruct pictures with increasing quality by receiving additional subsets of the transform coefficients.

Type: Grant

Filed: January 16, 2015

Date of Patent: September 12, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Lihua Zhu, Shir Aharon, Eric Rall, B. Anil Kumar, Jeroen E. van Eesteren
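
Illustrative note (not part of the patent record): a minimal sketch of coefficient-level gradual updating as the abstract above describes it, assuming the block content is unchanged across the sequence of pictures and the coefficients are split into contiguous ranges in scan order. The function names, the splitting rule, and the sample coefficient values are hypothetical.

```python
def split_into_subsets(num_coeffs, num_pictures):
    """Partition coefficient positions 0..num_coeffs-1 into contiguous subsets."""
    base, extra = divmod(num_coeffs, num_pictures)
    subsets, start = [], 0
    for i in range(num_pictures):
        size = base + (1 if i < extra else 0)
        subsets.append(range(start, start + size))
        start += size
    return subsets


def encode_gradually(block, num_pictures):
    """Yield, for each picture, only the (position, value) pairs of its subset."""
    for subset in split_into_subsets(len(block), num_pictures):
        yield [(pos, block[pos]) for pos in subset]


def decode_gradually(payloads, num_coeffs):
    """Accumulate received subsets and record the block after each picture."""
    current = [0] * num_coeffs
    history = []
    for payload in payloads:
        for pos, value in payload:
            current[pos] = value
        history.append(list(current))
    return history


block = [52, -7, 3, 0, 1, 0, 0, -1]   # hypothetical coefficients in scan order
history = decode_gradually(encode_gradually(block, 3), len(block))
```

Each entry in `history` holds more of the original coefficients than the one before it, and the last entry equals the full block, which corresponds to the increasing quality the abstract mentions.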

SYNTAX AND SEMANTICS FOR BUFFERING INFORMATION TO SIMPLIFY VIDEO SPLICING

Publication number: 20170257639

Abstract: Innovations in syntax and semantics of coded picture buffer removal delay (“CPBRD”) values potentially simplify splicing operations. For example, a video encoder sets a CPBRD value for a current picture that indicates an increment value relative to a nominal coded picture buffer removal time of a preceding picture in decoding order, regardless of whether the preceding picture has a buffering period SEI message. The encoder can signal the CPBRD value according to a single-value approach in which a flag indicates how to interpret the CPBRD value, according to a two-value approach in which another CPBRD value (having a different interpretation) is also signaled, or according to a two-value approach that uses a flag and a delta value. A corresponding video decoder receives and parses the CPBRD value for the current picture. A splicing tool can perform simple concatenation operations to splice bitstreams using the CPBRD value for the current picture.

Type: Application

Filed: May 22, 2017

Publication date: September 7, 2017

Applicant: Microsoft Technology Licensing, LLC

Inventors: Gary J. Sullivan, Lihua Zhu
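
Illustrative note (not part of the patent record): the abstract above describes a CPBRD value that is an increment relative to the nominal removal time of the preceding picture in decoding order. A minimal sketch of why that makes splicing a plain concatenation follows. The clock tick of 1/25 second, the sample increments, and the function name are assumptions, and the flag and two-value signaling variants are not modeled.

```python
from fractions import Fraction


def removal_times(first_time, cpbrd_values, clock_tick):
    """Nominal removal times when each CPBRD value is an increment
    relative to the preceding picture in decoding order."""
    times = [first_time]
    for increment in cpbrd_values:
        times.append(times[-1] + increment * clock_tick)
    return times


tick = Fraction(1, 25)   # hypothetical clock tick, in seconds
stream_a = [1, 1, 1]     # increments for the pictures that follow the first one
stream_b = [2, 1]        # the first picture of stream_b carries an increment too

# Splicing is a plain concatenation: no value in stream_b is rewritten.
spliced = stream_a + stream_b
times = removal_times(Fraction(0), spliced, tick)

# The same schedule results from timing stream_b on its own,
# starting at the removal time of the last picture of stream_a.
times_a = removal_times(Fraction(0), stream_a, tick)
times_b = removal_times(times_a[-1], stream_b, tick)
```

Because each value depends only on the immediately preceding picture, the schedule for the second stream does not change when a different stream is placed in front of it.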

Encoding/decoding of high chroma resolution details

Patent number: 9749646

Abstract: Innovations in encoding and decoding of video pictures in a high-resolution chroma sampling format (such as YUV 4:4:4) using a video encoder and decoder operating on coded pictures in a low-resolution chroma sampling format (such as YUV 4:2:0) are presented. For example, high chroma resolution details are selectively encoded on a region-by-region basis. Or, as another example, coded pictures that contain sample values for low chroma resolution versions of input pictures and coded pictures that contain sample values for high chroma resolution details of the input pictures are encoded as separate sub-sequences of a single sequence of coded pictures, which can facilitate effective motion compensation. In this way, available encoders and decoders operating on coded pictures in the low-resolution chroma sampling format can be effectively used to provide high chroma resolution details.

Type: Grant

Filed: January 16, 2015

Date of Patent: August 29, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Lihua Zhu, Shir Aharon, Yongjun Wu, B. Anil Kumar, Jeroen E. van Eesteren, Costin Hagiu, Shyam Sadhwani

MULTI-STAGE IMAGE CLASSIFICATION

Publication number: 20170213110

Abstract: Techniques are described for performing multi-stage image classification. For example, multi-stage image classification can comprise a first classification stage and a second classification stage. The first classification stage can determine an overall classification for an input image (e.g., based on a relative entropy result calculated for the input image). The second classification stage can be performed by dividing the image into a plurality of blocks and classifying individual blocks, or groups of blocks, based on a classification model that is specific to the overall classification of the image determined in the first classification stage.

Type: Application

Filed: April 10, 2017

Publication date: July 27, 2017

Applicant: Microsoft Technology Licensing, LLC

Inventors: Lihua Zhu, Sridhar Sankuratri, Shir Aharon, B. Anil Kumar, Jeroen E. van Eesteren

Multi-stage image classification

Patent number: 9704066

Abstract: Techniques are described for performing multi-stage image classification. For example, multi-stage image classification can comprise a first classification stage and a second classification stage. The first classification stage can determine an overall classification for an input image (e.g., based on a relative entropy result calculated for the input image). The second classification stage can be performed by dividing the image into a plurality of blocks and classifying individual blocks, or groups of blocks, based on a classification model that is specific to the overall classification of the image determined in the first classification stage.

Type: Grant

Filed: March 20, 2015

Date of Patent: July 11, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Lihua Zhu, Sridhar Sankuratri, Shir Aharon, B. Anil Kumar, Jeroen E. van Eesteren

Syntax and semantics for buffering information to simplify video splicing

Patent number: 9661341

Abstract: Innovations in syntax and semantics of coded picture buffer removal delay (“CPBRD”) values potentially simplify splicing operations. For example, a video encoder sets a CPBRD value for a current picture that indicates an increment value relative to a nominal coded picture buffer removal time of a preceding picture in decoding order, regardless of whether the preceding picture has a buffering period SEI message. The encoder can signal the CPBRD value according to a single-value approach in which a flag indicates how to interpret the CPBRD value, according to a two-value approach in which another CPBRD value (having a different interpretation) is also signaled, or according to a two-value approach that uses a flag and a delta value. A corresponding video decoder receives and parses the CPBRD value for the current picture. A splicing tool can perform simple concatenation operations to splice bitstreams using the CPBRD value for the current picture.

Type: Grant

Filed: December 17, 2013

Date of Patent: May 23, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Gary J. Sullivan, Lihua Zhu

WEB-BROWSER BASED DESKTOP AND APPLICATION REMOTING SOLUTION

Publication number: 20170046013

Abstract: An invention is disclosed for conducting a remote presentation session with a client that uses a web browser to conduct the session. The client previously received browser-native program code that executes within a runtime environment of the web browser. The browser-native program code instantiates a remote presentation client executing within a runtime environment of the web browser. The server generates graphics encoded according to a remote presentation protocol and sends them to the remote presentation client for display in the web browser. The client captures user input at the web browser and sends it to the remote presentation client, which encodes it with the remote presentation protocol and sends it to the server to be processed.

Type: Application

Filed: August 14, 2015

Publication date: February 16, 2017

Inventors: Raymond Matthew Reskusich, Jayashree Sadagopan, Lihua Zhu, Sridhar Sankuratri, Shir Aharon, Jeroen Eduard van Eesteren, Greg Sun, Derrick Isoka, Munindra Nath Das, Travis Michael Howe, B. Anil Kumar

BLOCK VECTOR PREDICTION IN VIDEO AND IMAGE CODING/DECODING

Publication number: 20160330471

Abstract: Innovations in the area of prediction of block vector (“BV”) values improve encoding or decoding of blocks using intra block copy (“BC”) prediction. For example, some of the innovations relate to use of a default BV predictor with a non-zero value. Other innovations relate to use of a selected one of multiple BV predictor candidates for a current block. Still other innovations relate to use of a skip mode in which a current intra-BC-predicted block uses a predicted BV value.

Type: Application

Filed: January 3, 2014

Publication date: November 10, 2016

Applicant: Microsoft Technology Licensing, LLC

Inventors: Lihua Zhu, Gary J. Sullivan, Jizheng Xu, Sridhar Sankuratri, B. Anil Kumar, Feng Wu
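
Illustrative note (not part of the patent record): the three ideas in the abstract above (a non-zero default predictor, a choice among several predictor candidates, and a skip mode that reuses the predicted value) can be sketched together. This is a sketch under assumptions, not the claimed method. The candidate list (the two most recent BV values), the default of one block width to the left, the cost measure, and the symbol layout are all hypothetical.

```python
def candidate_predictors(previous_bvs, block_width):
    """Most recent BV values first; fall back to a non-zero default."""
    candidates = list(reversed(previous_bvs[-2:]))
    if not candidates:
        # Hypothetical default: point at the block immediately to the left.
        candidates = [(-block_width, 0)]
    return candidates


def encode_bv(bv, previous_bvs, block_width):
    """Pick the closest candidate and signal skip or a BV difference."""
    candidates = candidate_predictors(previous_bvs, block_width)
    index = min(
        range(len(candidates)),
        key=lambda i: abs(bv[0] - candidates[i][0]) + abs(bv[1] - candidates[i][1]),
    )
    pred = candidates[index]
    diff = (bv[0] - pred[0], bv[1] - pred[1])
    if diff == (0, 0):
        return {"mode": "skip", "index": index}   # the predicted BV is used as is
    return {"mode": "bv_diff", "index": index, "diff": diff}


def decode_bv(symbol, previous_bvs, block_width):
    """Rebuild the BV from the signaled candidate index and optional difference."""
    pred = candidate_predictors(previous_bvs, block_width)[symbol["index"]]
    if symbol["mode"] == "skip":
        return pred
    return (pred[0] + symbol["diff"][0], pred[1] + symbol["diff"][1])


first = encode_bv((-8, 0), [], 8)                       # no history: default is used
later = encode_bv((-10, -4), [(-16, 0), (-9, -4)], 8)   # two candidates available
```

The decoder builds the same candidate list from the same history, so only the index and, outside skip mode, the difference need to be signaled.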

MULTI-STAGE IMAGE CLASSIFICATION

Publication number: 20160275374

Abstract: Techniques are described for performing multi-stage image classification. For example, multi-stage image classification can comprise a first classification stage and a second classification stage. The first classification stage can determine an overall classification for an input image (e.g., based on a relative entropy result calculated for the input image). The second classification stage can be performed by dividing the image into a plurality of blocks and classifying individual blocks, or groups of blocks, based on a classification model that is specific to the overall classification of the image determined in the first classification stage.

Type: Application

Filed: March 20, 2015

Publication date: September 22, 2016

Applicant: Microsoft Technology Licensing, LLC

Inventors: Lihua Zhu, Sridhar Sankuratri, Shir Aharon, B. Anil Kumar, Jeroen E. van Eesteren

FILTERING TO MITIGATE ARTIFACTS WHEN CHANGING CHROMA SAMPLING RATES

Publication number: 20160212423

Abstract: Innovations in the area of filtering in and around high chroma resolution regions of output pictures are presented. For example, a video processing tool applies a recovery filter in a high chroma resolution region of an output picture, which compensates for anti-aliasing filtering previously performed during chroma sub-sampling, but skips application of the recovery filter in a low chroma resolution region of the output picture. Or, a video processing tool applies a deblocking filter to chroma sample values at a boundary between a low chroma resolution region and high chroma resolution region in the output picture, which can mitigate perceptible distortion at the boundary. Or, a video processing tool selectively applies a deringing filter to chroma sample values in a high chroma resolution region of the output picture, which can mitigate perceptible distortion due to low quality coding of high chroma resolution details.

Type: Application

Filed: January 16, 2015

Publication date: July 21, 2016

Applicant: Microsoft Technology Licensing, LLC

Inventors: Shir Aharon, Lihua Zhu, B. Anil Kumar, Jeroen E. van Eesteren