Patents by Inventor Guan-Ming Su

Guan-Ming Su has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

GENERATING HDR IMAGE FROM CORRESPONDING CAMERA RAW AND SDR IMAGES

Publication number: 20250117904

Abstract: Guided filtering is applied, with a camera raw image as a guidance image, to a first image to generate an intermediate image. A dynamic range mapping is performed on the intermediate image to generate a second image of a different dynamic range. The second image is used to generate specific local reshaping function index values for selecting specific local reshaping functions. The specific local reshaping functions are applied to the second image to generate a locally reshaped image.

Type: Application

Filed: March 8, 2023

Publication date: April 10, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Guan-Ming SU, Tsung-Wei HUANG, Tao CHEN
Efficient user-defined SDR-to-HDR conversion with model templates

Patent number: 12272039

Abstract: Backward reshaping metadata prediction models are trained with training SDR images and corresponding training HDR images. Content creation user input to define user adjusted HDR appearances for the corresponding training HDR images is received. Content-creation-user-specific modified backward reshaping metadata prediction models are generated based on the trained prediction models and the content creation user input. The content-creation-user-specific modified prediction models are used to predict operational parameter values of content-creation-user-specific backward reshaping mappings for backward reshaping SDR images into mapped HDR images of at least one content-creation-user-adjusted HDR appearance.

Type: Grant

Filed: August 12, 2020

Date of Patent: April 8, 2025

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Guan-Ming Su, Harshad Kadu
Wrapped reshaping for codeword augmentation with neighborhood consistency

Patent number: 12244872

Abstract: An input image of a first bit depth in an input domain is received. Forward reshaping operations are performed on the input image to generate a forward reshaped image of a second bit depth in a reshaping domain. An image container containing image data derived from the forward reshaped image is encoded into an output video signal of the second bit depth.

Type: Grant

Filed: November 10, 2021

Date of Patent: March 4, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Janos Horvath, Harshad Kadu, Guan-Ming Su
INTERACTIVE MOTION BLUR ON MOBILE DEVICES

Publication number: 20250069200

Abstract: Novel methods and systems are described for providing interactive motion blur on an image by motion inputs from movements of the mobile device displaying the image. The device can process the motion blur by modules providing motion blur parameter estimation, blur application, and image composition based on metadata and a baseline image from the encoder. A pre-loaded filter bank can provide blur kernels for blur application.

Type: Application

Filed: December 7, 2022

Publication date: February 27, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Dae Yeol Lee, Neeraj J. Gadgil, Guan-Ming Su
Trim-pass correction for cloud-based coding of HDR video

Patent number: 12238344

Abstract: In a cloud-based system for encoding high dynamic range (HDR) video, each node receives a video segment and bumper frames. Each segment is subdivided into primary scenes and secondary scenes to derive scene-based forward reshaping functions that minimize the amount of reshaping-related metadata when coding the video segment. When a parent scene of a secondary scene is processed by two or more neighboring nodes, initial forward reshaping functions and trim-pass correction parameters are adjusted using reference tone-mapping functions and updated scene-based trim-pass correction parameters.

Type: Grant

Filed: September 17, 2021

Date of Patent: February 25, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Harshad Kadu, Guan-Ming Su
FILM GRAIN PARAMETERS ADAPTATION BASED ON VIEWING ENVIRONMENT

Publication number: 20250063203

Abstract: Methods, systems, and bitstream syntax are described for metadata signaling and film-grain parameter adaptation based on a viewing environment which may differ from a reference environment. Example adaptation models are provided for viewing parameters that include: ambient room illumination, viewing distance, and pixels per inch in a target display. Example systems include a single reference viewing environment model and a multi-reference viewing environment model supporting adaptation of film-grain model parameters via adaptation functions or interpolation.

Type: Application

Filed: December 19, 2022

Publication date: February 20, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Guan-Ming SU, Harshad KADU, Peng YIN
Adaptive local reshaping for SDR-to-HDR up-conversion

Patent number: 12206907

Abstract: A global index value is generated for selecting a global reshaping function for an input image of a relatively low dynamic range using luma codewords in the input image. Image filtering is applied to the input image to generate a filtered image. The filtered values of the filtered image provide a measure of local brightness levels in the input image. Local index values are generated for selecting specific local reshaping functions for the input image using the global index value and the filtered values of the filtered image. A reshaped image of a relatively high dynamic range is generated by reshaping the input image with the specific local reshaping functions selected using the local index values.

Type: Grant

Filed: October 1, 2021

Date of Patent: January 21, 2025

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Tsung-Wei Huang, Guan-Ming Su, Neeraj J. Gadgil
Recursive segment to scene segmentation for cloud-based coding of HDR video

Patent number: 12190587

Abstract: In a cloud-based system for encoding high dynamic range (HDR) video, each node receives a video segment and bumper frames. Each segment is subdivided into primary scenes and secondary scenes to derive scene-based forward reshaping functions that minimize the amount of reshaping-related metadata when coding the video segment, while maintaining temporal continuity among scenes processed by multiple nodes. Methods to generate scene-based forward and backward reshaping functions to optimize video coding and improve the coding efficiency of reshaping-related metadata are also examined.

Type: Grant

Filed: September 17, 2021

Date of Patent: January 7, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Harshad Kadu, Guan-Ming Su, Neeraj J. Gadgil, Tsung-Wei Huang
DATA STRUCTURE FOR MULTIMEDIA APPLICATONS

Publication number: 20250005068

Abstract: Embodiments described herein provide a unified container format for delivering different multimedia applications. One embodiment provides a data structure utilized for implementing a plurality of multimedia applications. The data structure includes a first metadata level including low-level metadata used to perform operations associated with media data in a bitstream. The data structure includes a second metadata level including mid-level metadata used to apply operation metadata to render the media data. The data structure includes a third metadata level including upper-level metadata used to utilize the low-level metadata and the mid-level metadata to deliver the plurality of multimedia applications.

Type: Application

Filed: December 19, 2022

Publication date: January 2, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Guan-Ming SU, Tao CHEN, Sheng QU, Samir N. HULYALKAR
FACE REGION DETECTION AND LOCAL RESHAPING ENHANCEMENT

Publication number: 20240428612

Abstract: Methods and corresponding systems to process face regions are disclosed. The described methods include providing face bounding boxes and confidence levels for the faces, generating a histogram of the pixels and the faces, generating a probability of face, and generating a face probability map. A face contrast adjustment and a face saturation adjustment can be applied to the face probability map.

Type: Application

Filed: July 25, 2022

Publication date: December 26, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Tsung-Wei Huang, Guan-Ming Su
LUMINANCE BASED CODING TOOLS FOR VIDEO COMPRESSION

Publication number: 20240430455

Abstract: Sample data and metadata related to spatial regions in images may be received from a coded video signal. It is determined whether specific spatial regions in the images correspond to a specific region of luminance levels. In response to determining the specific spatial regions correspond to the specific region of luminance levels, signal processing and video compression operations are performed on sets of samples in the specific spatial regions. The signal processing and video compression operations are at least partially dependent on the specific region of luminance levels.

Type: Application

Filed: June 26, 2023

Publication date: December 26, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Peng YIN, Guan-Ming SU, Taoran LU, Tao CHEN, Walter J. HUSAK
Rate-control-aware reshaping in HDR imaging

Patent number: 12177459

Abstract: Given an input image in a high dynamic range (HDR) which is mapped to a second image in a second dynamic range using a reshaping function, to improve coding efficiency, a reshaping function generator may adjust the codeword range of the HDR input under certain criteria, such as for noisy HDR images with a relatively-small codeword range. An example of generating a scaler for adjusting the HDR codeword range based on the original codeword range and a metric of the percentage of edge-points in the HDR image is provided. The adjusted reshaping function allows for more efficient rate control during the compression of reshaped images.

Type: Grant

Filed: November 25, 2020

Date of Patent: December 24, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Harshad Kadu, Ji Qi, Guan-Ming Su
RESHAPER FOR LEARNING BASED IMAGE/VIDEO CODING

Publication number: 20240422345

Abstract: An input image represented in an input domain is received from an input video signal. Forward reshaping is performed on the input image to generate a forward reshaped image represented in a reshaped image domain. Non-reshaping encoding operations are performed to encode the reshaped image into an encoded video signal. At least one of the non-reshaping encoding operations is implemented with an ML model that has been previously trained with training images in one or more training datasets in a preceding training stage. A recipient device of the encoded video signal is caused to generate a reconstructed image from the forward reshaped image.

Type: Application

Filed: August 5, 2022

Publication date: December 19, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Peng YIN, Fangjun PU, Taoran LU, Arjun ARORA, Guan-Ming SU, Tao CHEN, Sean Thomas MCCARTHY, Walter J. HUSAK
APPLYING MINIMUM AND AVERAGE DISTANCE CONSTRAINT IN VIDEO STREAMING

Publication number: 20240406461

Abstract: Input images are received as input to a multi-node system. The input images are divided into segments assigned to respective nodes of the multi-node system. Primary and secondary scenes are identified in the segments to ensure compliance with minimum and average distance constraints. Scene-level forward reshaping mappings are generated for the scenes by a respective node for an assigned segment. Forward reshaped images in the segment are generated by the node using the forward reshaping mappings and encoded into an output video signal, which enables a recipient device to generate reconstructed images and to render display images derived from the reconstructed images on an image display.

Type: Application

Filed: August 14, 2022

Publication date: December 5, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Harshad KADU, Guan-Ming SU
Iterative optimization of reshaping functions in single-layer HDR image codec

Patent number: 12149753

Abstract: A method, for generating (a) a forward reshaping function for compressing an input high-dynamic range (HDR) image into a reshaped standard-dynamic-range (SDR) image and (b) a backward reshaping function for decompressing the reshaped SDR image into a reconstructed HDR image, includes (i) optimizing the forward reshaping function to minimize a deviation between the reshaped SDR image and an input SDR image corresponding to the input HDR image, (ii) optimizing the backward reshaping function to minimize a deviation between the reconstructed HDR image and the input HDR image, and (iii) until a termination condition is met, applying a correction to the input SDR image and reiterating, based on the input SDR image as corrected, the steps of optimizing the forward and backward reshaping functions.

Type: Grant

Filed: April 21, 2021

Date of Patent: November 19, 2024

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Guan-Ming Su, Harshad Kadu
Applying minimum and average distance constraint in video streaming

Patent number: 12143644

Abstract: Input images are received as input to a multi-node system. The input images are divided into segments assigned to respective nodes of the multi-node system. Primary and secondary scenes are identified in the segments to ensure compliance with minimum and average distance constraints. Scene-level forward reshaping mappings are generated for the scenes by a respective node for an assigned segment. Forward reshaped images in the segment are generated by the node using the forward reshaping mappings and encoded into an output video signal, which enables a recipient device to generate reconstructed images and to render display images derived from the reconstructed images on an image display.

Type: Grant

Filed: August 14, 2022

Date of Patent: November 12, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Harshad Kadu, Guan-Ming Su
Chained reshaping function optimization

Patent number: 12143593

Abstract: An input image to a pipeline of chained reshaping functions is received. Reference images are generated from the input image. The input image and the reference images are used to determine operational parameters for chained reshaping functions in the pipeline of chained reshaping functions. A reshaped image generated from one or more of the chained reshaping functions is encoded in a video signal along with image metadata. The image metadata includes some or all of the operational parameters specifying the chained reshaping functions. A recipient device of the video signal is caused to use the image metadata and the reshaped image to generate a reconstructed image.

Type: Grant

Filed: June 1, 2022

Date of Patent: November 12, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventor: Guan-Ming Su
Noise synthesis for digital images

Patent number: 12136140

Abstract: Apparatus and methods for providing software and hardware based solutions to the problem of synthesizing noise for a digital image. According to one aspect, a probability image is generated and noise blocks are randomly placed at locations in the probability image where the locations have probability values that are compared to a threshold criterion, creating a synthesized noise image. Embodiments include generating synthesized film grain images and synthesized digital camera noise images.

Type: Grant

Filed: December 21, 2020

Date of Patent: November 5, 2024

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Harshad Kadu, Bharath Vishwanath, Guan-Ming Su, Samir N. Hulyalkar
TRANSMISSION OF VOLUMETRIC IMAGES IN MULTIPLANE IMAGING FORMAT

Publication number: 20240357181

Abstract: Methods and apparatus for transmission of volumetric images in the MPI format. According to an example embodiment, texture and alpha layers of multiplane images are packed, as tiles, into a sequence of video frames. The sequence of video frames is then compressed to generate a video bitstream, which is transmitted together with a metadata bitstream specifying at least the parameters of the packing arrangement for the tiles in the sequence of video frames. Example packing arrangements include various selectable spatial and temporal arrangements for texture layers, alpha layers, and camera views. In some examples, the metadata bitstream is implemented using a SEI message and includes parameters selected from the group consisting of a size of the reference view, the number of layers in the multiplane image, the number of simultaneous views, one or more characteristics of the packing arrangement, layer merging information, dynamic range adjustment information, and reference view information.

Type: Application

Filed: May 22, 2024

Publication date: October 24, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Taoran Lu, Peng YIN, Guan-Ming Su, Dae Yeol Lee, Sean Thomas McCarthy, Tsung-Wei Huang, Sejin Oh
Frame-rate scalable video coding

Patent number: 12108061

Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

Type: Grant

Filed: March 26, 2024

Date of Patent: October 1, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su

1 2 3 4 5 … next