Patents by Inventor Guan-Ming Su
Guan-Ming Su has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20230300381Abstract: Methods and systems for generating a set of forward and backward reshaping functions for the efficient coding of high-dynamic range (HDR) images are provided. Given an initial set of forward reshaping functions, output forward reshaping functions are constructed by a) using the forward reshaping functions to generate a first set of corresponding backward reshaping functions b) generating a second set of backward reshaping functions using a multi-segment polynomial representation with a common set of pivot points c) generating an output set of backward reshaping functions by optimizing the polynomial representation of the second set of backward reshaping functions to minimize gap values between consecutive segments and d) using the output set of backward reshaping functions to generate the output set of forward reshaping functions by minimizing the distance between original input HDR codewords and reconstructed HDR codewords.Type: ApplicationFiled: April 20, 2021Publication date: September 21, 2023Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventor: Guan-Ming SU
-
Publication number: 20230291937Abstract: In a cloud-based system for encoding high dynamic range (HDR) video, a computing node is assigned to be a dispatcher node, segmenting the input video into scenes and generating a scene to segment allocation to be used by other computing nodes. The scene to segment allocation process includes one or more iterations with an initial random assignment of scenes to computing nodes, followed by a refined assignment based on optimizing the allocation cost across all the computing nodes. Methods to generate scene-based forward and backward reshaping functions to optimize video coding and improve the coding efficiency of reshaping-related metadata are also examined.Type: ApplicationFiled: July 8, 2021Publication date: September 14, 2023Applicant: Dolby Laboratories Licensing CorporationInventors: Guan-Ming SU, Harshad KADU, Neeraj J. GADGIL
-
Patent number: 11729400Abstract: Sample data and metadata related to spatial regions in images may be received from a coded video signal. It is determined whether specific spatial regions in the images correspond to a specific region of luminance levels. In response to determining the specific spatial regions correspond to the specific region of luminance levels, signal processing and video compression operations are performed on sets of samples in the specific spatial regions. The signal processing and video compression operations are at least partially dependent on the specific region of luminance levels.Type: GrantFiled: May 28, 2021Date of Patent: August 15, 2023Assignee: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Peng Yin, Guan-Ming Su, Taoran Lu, Tao Chen, Walter J. Husak
-
Publication number: 20230254494Abstract: Given input HDR and SDR images representing the same scene, a prediction model to predict the HDR image from a compressed representation of the input SDR image is generated as follows: a) generate noise data based at least on the characteristics of the HDR image b) generate a noisy SDR image by adding the noise data to the SDR image c) generate an augmented HDR data set and an augmented SDR data set by using the input HDR and SDR images and the noisy SDR image d) generate a prediction model to predict the augmented HDR data set based on the augmented SDR data set and e) solve the prediction model according to a minimization-error criterion to generate a set of prediction parameters to be transmitted to a decoder together with a compressed representation of the input SDR image to reconstruct an approximation of the input HDR image.Type: ApplicationFiled: June 21, 2021Publication date: August 10, 2023Applicant: Dolby Laboratories Licensing CorporationInventors: GUAN-MING SU, HARSHAD KADU
-
Publication number: 20230232028Abstract: A method for distributing High Dynamic Range (HDR) content to playback devices for displaying images where the HDR content is encoded to an HDR bitstream and the HDR bitstream is subsequently decoded by a playback device. The HDR bitstream contains auxiliary metadata packets that are based upon the processing capability of the playback device.Type: ApplicationFiled: June 30, 2021Publication date: July 20, 2023Applicant: Dolby Laboratories Licensing CorporationInventors: Robin ATKINS, Guan-Ming SU, Gopi LAKSHMINARAYANAN
-
Publication number: 20230171417Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.Type: ApplicationFiled: January 12, 2023Publication date: June 1, 2023Applicant: Dolby Laboratories Licensing CorporationInventors: Robin Atkins, Peng YIN, Taoran LU, Fangjun PU, Sean Thomas MCCARTHY, Walter J. HUSAK, Tao CHEN, Guan-Ming SU
-
Publication number: 20230171436Abstract: A backward reshaping mapping table is initially generated as an inverse of a forward reshaping mapping table. The backward reshaping mapping table is updated by replacing the content-mapped luminance codewords with forward reshaped luminance codewords generated by applying a luminance forward mapping to the sampled luminance codewords. The luminance forward mapping is constructed from the forward reshaping mapping table. The backward reshaping mapping table and the luminance forward mapping are used to generate backward reshaping mappings for creating a reconstructed image from a forward reshaped image. The forward reshaped image is encoded, in a video signal, along with image metadata specifying the backward reshaping mappings. A recipient device of the video signal applies the backward reshaping mappings to the forward reshaped image to create the reconstructed image of the second dynamic range.Type: ApplicationFiled: October 16, 2020Publication date: June 1, 2023Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Qing Song, Arun Raj, Guan-Ming Su
-
Patent number: 11663702Abstract: Methods and systems for reducing banding artifacts when displaying images are described. Identified image bands are filtered using an adaptive sparse finite response filter, where the tap-distance in the sparse filter is adapted according to an estimated width of each image band. Image debanding may be performed across multiple pixel orientations, such as rows, columns, a 45-degree angle, or a ?45-degree angle. Given a threshold to decide whether sparse filtering needs to be performed or not, an iterative debanding process is also proposed.Type: GrantFiled: December 10, 2019Date of Patent: May 30, 2023Assignee: Dolby Laboratories Licensing CorporationInventors: Neeraj J. Gadgil, Qing Song, Guan-Ming Su
-
Publication number: 20230156343Abstract: In an encoder, a high-dynamic range (HDR) image is encoded using a family of local forward reshaping functions selected according to an array of forward mapping indices (FMI) indicating which local forward reshaping function needs to be used for each pixel in the HDR image to generate a reshaped standard dynamic range (SDR) image. A decoder, given the reshaped SDR image, iteratively generates a reconstructed HDR image and estimated reshaped SDR images by adjusting a local FMI array and a local array of backward mapping indices (BMI) until an error metric related to the difference between the local BMI and FMI arrays and the difference between the estimate SDR images and the reshaped SDR image satisfy a convergence criterion. Techniques for generating families of local forward reshaping functions and local backward reshaping functions based on a global forward reshaping function are also presented.Type: ApplicationFiled: April 1, 2021Publication date: May 18, 2023Applicant: Dolby Laboratories Licensing CorporationInventors: Guan-Ming Su, Harshad Kadu, Tsung-Wei Huang
-
Publication number: 20230084705Abstract: A source color grade and a destination color grade may be received by an end-to-end image mapping pipeline comprising forward and backward paths. Forward neural networks in the forward path may be applied to generate, from the source color grade, a forward reshaped color grade corresponding to the destination color grade. Backward neural networks in the backward path may be applied to generate, from the forward reshaped color grade, a backward reshaped color grade corresponding to the source color grade. An overall neural network cost function may be specified for the forward path and the backward path to generate a cost that may be used as a basis for determining operational parameters for the forward and backward neural networks.Type: ApplicationFiled: February 17, 2021Publication date: March 16, 2023Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventor: Guan-Ming SU
-
Publication number: 20230073388Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.Type: ApplicationFiled: September 21, 2022Publication date: March 9, 2023Applicant: Dolby Laboratories Licensing CorporationInventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
-
Publication number: 20230050950Abstract: Apparatus and methods for providing software and hardware based solutions to the problem of synthesizing noise for a digital image. According to one aspect, a probability image is generated and noise blocks are randomly placed at locations in the probability image where the locations have probability values that are compared to a threshold criterion, creating a synthesized noise image. Embodiments include generating synthesized film grain images and synthesized digital camera noise images.Type: ApplicationFiled: December 21, 2020Publication date: February 16, 2023Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Harshad KADU, Bharath VISHWANATH, Guan-Ming SU, Samir N. HULYALKAR
-
Patent number: 11582472Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.Type: GrantFiled: February 8, 2022Date of Patent: February 14, 2023Assignee: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
-
Publication number: 20230039038Abstract: Given an input image in a high dynamic range (HDR) which is mapped to a second image in a second dynamic range using a reshaping function, to improve coding efficiency, a reshaping function generator may adjust the codeword range of the HDR input under certain criteria, such as for noisy HDR images with a relatively-small codeword range. An example of generating a scaler for adjusting the HDR codeword range based on the original codeword range and a metric of the percentage of edge-points in the HDR image is provided. The adjusted reshaping function allows for more efficient rate control during the compression of reshaped images.Type: ApplicationFiled: November 25, 2020Publication date: February 9, 2023Applicant: Dolby Laboratories Licensing CorporationInventors: Harshad Kadu, Ji Qi, Guan-Ming Su
-
Patent number: 11567950Abstract: A confidentiality preserving system and method for performing a rank-ordered search and retrieval of contents of a data collection. The system includes at least one computer system including a search and retrieval algorithm using term frequency and/or similar features for rank-ordering selective contents of the data collection, and enabling secure retrieval of the selective contents based on the rank-order. The search and retrieval algorithm includes a baseline algorithm, a partially server oriented algorithm, and/or a fully server oriented algorithm. The partially and/or fully server oriented algorithms use homomorphic and/or order preserving encryption for enabling search capability from a user other than an owner of the contents of the data collection. The confidentiality preserving method includes using term frequency for rank-ordering selective contents of the data collection, and retrieving the selective contents based on the rank-order.Type: GrantFiled: December 4, 2020Date of Patent: January 31, 2023Assignee: UNIVERSITY OF MARYLAND, COLLEGE PARKInventors: Ashwin Swaminathan, Yinian Mao, Guan-Ming Su, Hongmei Gou, Avinash Varna, Shan He, Min Wu, Douglas W. Oard
-
Publication number: 20230007294Abstract: A first predictor is applied to an input image to generate first-stage predicted codewords approximating prediction target codewords of a prediction target image. Second-stage prediction target values are created by performing an inverse cascade operation on the prediction target codewords and the first-stage predicted codewords. A second predictor is applied to the input image to generate second-stage predicted values approximating the second-stage prediction target values. Multiple sets of cascade prediction coefficients are generated to comprise first and second sets of cascade prediction coefficients specifying the first and second predictors. The multiple sets of cascade prediction coefficients are encoded, in a video signal, as image metadata. The video signal is further encoded with the input image.Type: ApplicationFiled: December 3, 2020Publication date: January 5, 2023Applicant: Dolby Laboratories Licensing CorporationInventors: Harshad KADU, Guan-Ming SU
-
Publication number: 20230005243Abstract: A method for image segmentation includes (a) clustering, based upon k-means clustering, pixels of an image into first clusters, (b) outputting a cluster map of the first clusters (c) re-clustering the pixels into a new plurality of non-disjoint pixel-clusters, and (d) classifying the non-disjoint pixel-clusters in categories, according to a user-indicated classification. Another method for image segmentation includes (a) forming a graph with each node of the graph corresponding to a first respective non-disjoint pixel-cluster of the image and connected to each terminal of the graph and to all other nodes corresponding to other respective non-disjoint pixel-clusters that, in the image, are within a neighborhood of the first respective non-disjoint pixel-cluster, (b) setting weights of connections of the graph according to a user-indicated classification in categories respectively associated with the terminals, and (c) segmenting the image into the categories by cutting the graph based upon the weights.Type: ApplicationFiled: December 2, 2020Publication date: January 5, 2023Applicant: Dolby Laboratories Licensing CorporationInventors: Amirhossein KHALILIAN-GOURTANI, Neeraj J. GADGIL, Guan-Ming Su
-
Publication number: 20220408081Abstract: A set of tensor-product B-Spline (TPB) basis functions is determined. A set of selected TPB prediction parameters to be used with the set of TPB basis functions for generating predicted image data in mapped images from source image data in source images of a source color grade is generated. The set of selected TPB prediction parameters is generated by minimizing differences between the predicted image data in the mapped images and reference image data in reference images of a reference color grade. The reference images correspond to the source images and depict same visual content as depicted by the source images. The set of selected TPB prediction parameters is encoded in a video signal as a part of image metadata along with the source image data in the source images. The mapped images are caused to be reconstructed and rendered with a recipient device of the video signal.Type: ApplicationFiled: September 29, 2020Publication date: December 22, 2022Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Guan-Ming Su, Harshad Kadu, Qing Song, Neeraj J. Gadgil
-
Patent number: 11523127Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.Type: GrantFiled: March 11, 2020Date of Patent: December 6, 2022Assignee: Dolby Laboratories Licensing CorporationInventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
-
Publication number: 20220301124Abstract: Backward reshaping metadata prediction models are trained with training SDR images and corresponding training HDR images. Content creation user input to define user adjusted HDR appearances for the corresponding training HDR images is received. Content-creation-user-specific modified backward reshaping metadata prediction models are generated based on the trained prediction models and the content creation user input. The content-creation-user-specific modified prediction models are used to predict operational parameter values of content-creation-user-specific backward reshaping mappings for backward reshaping SDR images into mapped HDR images of at least one content-creation-user-adjusted HDR appearance.Type: ApplicationFiled: August 12, 2020Publication date: September 22, 2022Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Guan-Ming SU, Harshad KADU