Patents by Inventor Harshad Kadu

Harshad Kadu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11430095
    Abstract: Different candidate image data feature types are evaluated to identify one or more specific image data feature types to be used in training a prediction model for optimizing one or more image metadata parameters. A plurality of image data features of the one or more selected image data feature types is extracted from one or more images. The plurality of image data features of the one or more selected image data feature types is reduced into a plurality of significant image data features. A total number of image data features in the plurality of significant image data features is no larger than a total number of image data features in the plurality of image data features of the one or more selected image data feature types. The plurality of significant image data features is applied to training the prediction model for optimizing one or more image metadata parameters.
    Type: Grant
    Filed: September 18, 2019
    Date of Patent: August 30, 2022
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Harshad Kadu, Guan-Ming Su
  • Patent number: 11388408
    Abstract: Methods and systems for generating an interpolated reshaping function for the efficient coding of high-dynamic range images are provided. The interpolated reshaping function is constructed based on a set of pre-computed basis reshaping functions. Interpolation schemes are derived for pre-computed basis reshaping functions represented as look-up tables, multi-segment polynomials, or matrices of coefficients in a multivariate, multi-regression representation. Encoders and decoders using asymmetric reshaping and interpolated reshaping functions for mobile applications are also presented.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: July 12, 2022
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Harshad Kadu, Qing Song, Guan-Ming Su
  • Patent number: 11277627
    Abstract: 3D mapping statistics are generated for a first image of a first dynamic range and a second image of a second dynamic range different from the first dynamic range. Multivariate multiple regression (MMR) coefficients are generated by solving an optimization problem formulated using an MMR matrix built with the 3D mapping statistics without a letterbox constraint, and used to generate chroma mappings for predicting chroma codeword values of the second image. It is determined whether a letterbox exists in the images. If so, it is determined whether the chroma mappings accurately predict chroma codeword values in the second image. A reconstructed image generated by a recipient device by backward reshaping one of the images is rendered by a display device operating in conjunction with the recipient device.
    Type: Grant
    Filed: May 9, 2019
    Date of Patent: March 15, 2022
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Qing Song, Harshad Kadu, Guan-Ming Su
  • Publication number: 20220058783
    Abstract: Training image pairs comprising training SDR image and corresponding training HDR images are received. Each training image pair in the training image pairs comprises a training SDR image and a corresponding training HDR image. The training SDR image and the corresponding training HDR image in the training image pair depict same visual content but with different luminance dynamic ranges. Training image feature vectors are extracted from training SDR images in the training image pairs. The training image feature vectors are used to train backward reshaping metadata prediction models for predicting operational parameter values of backward reshaping mappings used to backward reshape SDR images into mapped HDR images.
    Type: Application
    Filed: December 16, 2019
    Publication date: February 24, 2022
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Harshad KADU, Neeraj J. GADGIL, Guan-Ming SU
  • Publication number: 20220046245
    Abstract: Methods and systems for generating an interpolated reshaping function for the efficient coding of high-dynamic range images are provided. The interpolated reshaping function is constructed based on a set of pre-computed basis reshaping functions. Interpolation schemes are derived for pre-computed basis reshaping functions represented as look-up tables, multi-segment polynomials, or matrices of coefficients in a multivariate, multi-regression representation. Encoders and decoders using asymmetric reshaping and interpolated reshaping functions for mobile applications are also presented.
    Type: Application
    Filed: November 27, 2019
    Publication date: February 10, 2022
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Harshad KADU, Qing SONG, Guan-Ming SU
  • Publication number: 20210350512
    Abstract: Different candidate image data feature types are evaluated to identify one or more specific image data feature types to be used in training a prediction model for optimizing one or more image metadata parameters. A plurality of image data features of the one or more selected image data feature types is extracted from one or more images. The plurality of image data features of the one or more selected image data feature types is reduced into a plurality of significant image data features. A total number of image data features in the plurality of significant image data features is no larger than a total number of image data features in the plurality of image data features of the one or more selected image data feature types. The plurality of significant image data features is applied to training the prediction model for optimizing one or more image metadata parameters.
    Type: Application
    Filed: September 18, 2019
    Publication date: November 11, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Harshad KADU, Guan-Ming SU
  • Publication number: 20210195221
    Abstract: 3D mapping statistics are generated for a first image of a first dynamic range and a second image of a second dynamic range different from the first dynamic range. Multivariate multiple regression (MMR) coefficients are generated by solving an optimization problem formulated using an MMR matrix built with the 3D mapping statistics without a letterbox constraint, and used to generate chroma mappings for predicting chroma codeword values of the second image. It is determined whether a letterbox exists in the images. If so, it is determined whether the chroma mappings accurately predict chroma codeword values in the second image. A reconstructed image generated by a recipient device by backward reshaping one of the images is rendered by a display device operating in conjunction with the recipient device.
    Type: Application
    Filed: May 9, 2019
    Publication date: June 24, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Qing SONG, Harshad KADU, Guan-Ming SU
  • Patent number: 10701375
    Abstract: A tone-mapping function that maps input images of a high dynamic range into reference tone-mapped images of a relatively narrow dynamic range is generated. A luma forward reshaping function is derived, based on first bit depths and second bit depths, for forward reshaping luma codewords of the input images into forward reshaped luma codewords of forward reshaped images approximating the reference tone-mapped images. A chroma forward reshaping mapping is derived for predicting chroma codewords of the forward reshaped images. Backward reshaping metadata that is to be used by recipient devices to generate a luma backward reshaping function and a chroma backward reshaping mapping is transmitted with the forward reshaped images to the recipient devices. Techniques for the joint derivation of forward luma and chroma reshaping functions are also presented.
    Type: Grant
    Filed: March 22, 2017
    Date of Patent: June 30, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Guan-Ming Su, Jon Scott Miller, Walter J. Husak, Yee Jin Lee, Harshad Kadu
  • Patent number: 10701404
    Abstract: Real-time forward reshaping, comprising selecting a statistical sliding window that indexes with the current frame, having also, a look-back frame and a look-ahead frame, determining whether they are part of the current scene, determining a noise parameter, a luma transfer function and a luma forward reshaping function based on the luma transfer function and the noise parameter within the current scene, selecting a central tendency sliding window of the current frame and the look-back frame within the current scene, and determining a central tendency luma forward reshaping function. The chroma reshaping comprises analyzing statistics for the extended dynamic range (EDR) weights and EDR upper bounds, mapping these to standard dynamic range (SDR) weights and SDR upper bounds based on the central tendency luma forward reshaping function, determining a chroma content-dependent polynomial and a central tendency chroma forward reshaping polynomial and generating chroma MMR coefficients.
    Type: Grant
    Filed: August 28, 2017
    Date of Patent: June 30, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Qing Song, Harshad Kadu, Qian Chen, Guan-Ming Su
  • Patent number: 10659749
    Abstract: In a method to reconstruct a high dynamic range video signal, a decoder receives parameters in the input bitstream to generate a prediction function. Using the prediction function, it generates a first set of nodes for a first prediction lookup table, wherein each node is characterized by an input node value and an output node value. Then, it modifies the output node values of one or more of the first set of nodes to generate a second set of nodes for a second prediction lookup table, and generates output prediction values using the second lookup table. Low-complexity methods to modify the output node value of a current node in the first set of nodes based on computing modified slopes between the current node and nodes surrounds the current node are presented.
    Type: Grant
    Filed: June 28, 2017
    Date of Patent: May 19, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Harshad Kadu, Guan-Ming Su
  • Patent number: 10609424
    Abstract: A standard dynamic range (SDR) image is received. Composer metadata of the first level through the N-th level is generated. Composer metadata of the j-th level is generated based on the composer metadata of the first level through (j?1)-th level. The composer metadata of the first level through the composer metadata of the j-th level is to be used for mapping the SDR image to the j-th target image specifically optimized for the j-th reference target display. The SDR image is encoded with the composer metadata of the first level through the k-th level in an output SDR video signal, where 1<=k<=N. A display device renders a display image derived from a composed target image composed from the SDR image based on the composer metadata of the first level through the k-th level in the output SDR video signal.
    Type: Grant
    Filed: March 6, 2019
    Date of Patent: March 31, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Qing Song, Harshad Kadu, Guan-Ming Su
  • Patent number: 10575028
    Abstract: Given HDR and SDR video inputs representing the same content, segment-based methods are described to generate a backward-compatible reshaped SDR video which preserves the artistic intent or “look” of the inputs and satisfies other coding requirements. For each frame in a segment, reshaping functions are generated based on a support frames set determined based on a sliding window of frames that is adjusted based on scene cuts in the segment and which may include frames from both the current segment and neighboring segments. For luma reshaping, a mapping that preserves the cumulative density function of the luminance histogram values in the EDR and SDR inputs is combined with a minimum codeword allocation derived based on the EDR signal and the support frame set. For chroma reshaping, methods for segment-based forward and backward reshaping using multivariate, multi-regression models are also presented.
    Type: Grant
    Filed: September 11, 2017
    Date of Patent: February 25, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Harshad Kadu, Qian Chen, Guan-Ming Su
  • Patent number: 10542269
    Abstract: In a method to reconstruct a high dynamic range video signal, a decoder receives parameters in the input bitstream to generate a prediction function. Using the prediction function, it generates a first set of nodes for a first prediction lookup table, wherein each node is characterized by an input node value and an output node value. Then, it modifies the output node values of one or more of the first set of nodes to generate a second set of nodes for a second prediction lookup table, and generates output prediction values using the second lookup table. Low-complexity methods to modify the output node value of a current node in the first set of nodes based on computing modified slopes between the current node and nodes surrounding the current node are presented.
    Type: Grant
    Filed: December 7, 2016
    Date of Patent: January 21, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Harshad Kadu, Guan-Ming Su, Hanyang Sun
  • Publication number: 20190349607
    Abstract: Given HDR and SDR video inputs representing the same content, segment-based methods are described to generate a backward-compatible reshaped SDR video which preserves the artistic intent or “look” of the inputs and satisfies other coding requirements. For each frame in a segment, reshaping functions are generated based on a support frames set determined based on a sliding window of frames that is adjusted based on scene cuts in the segment and which may include frames from both the current segment and neighboring segments. For luma reshaping, a mapping that preserves the cumulative density function of the luminance histogram values in the EDR and SDR inputs is combined with a minimum codeword allocation derived based on the EDR signal and the support frame set. For chroma reshaping, methods for segment-based forward and backward reshaping using multivariate, multi-regression models are also presented.
    Type: Application
    Filed: September 11, 2017
    Publication date: November 14, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Harshad Kadu, Qian Chen, Guan-Ming Su
  • Publication number: 20190281325
    Abstract: A standard dynamic range (SDR) image is received. Composer metadata of the first level through the N-th level is generated. Composer metadata of the j-th level is generated based on the composer metadata of the first level through (j?1)-th level. The composer metadata of the first level through the composer metadata of the j-th level is to be used for mapping the SDR image to the j-th target image specifically optimized for the j-th reference target display. The SDR image is encoded with the composer metadata of the first level through the k-th level in an output SDR video signal, where 1<=k<=N. A display device renders a display image derived from a composed target image composed from the SDR image based on the composer metadata of the first level through the k-th level in the output SDR video signal.
    Type: Application
    Filed: March 6, 2019
    Publication date: September 12, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Qing Song, Harshad Kadu, Guan-Ming Su
  • Patent number: 10397576
    Abstract: In a system for coding high dynamic range (HDR) images using lower-dynamic range (LDR) images, a reshaping function allows for a more efficient distribution of the codewords in the lower dynamic range images for improved compression. A trim pass of the LDR images by a colorist may satisfy a director's intent for a given “look,” but may also result in unpleasant clipping artifacts in the reconstructed HDR images. Given an original forward reshaping function which maps HDR luminance values to LDR pixel values, a processor identifies areas of potential clipping and generates modified forward and backward reshaping functions to reduce the visibility of potential artifacts from the trim pass process while preserving the director's intent.
    Type: Grant
    Filed: September 8, 2017
    Date of Patent: August 27, 2019
    Assignee: Dolby Laboratoreis Licensing Corporation
    Inventors: Harshad Kadu, Guan-Ming Su
  • Publication number: 20190222866
    Abstract: Real-time forward reshaping, comprising selecting a statistical sliding window that indexes with the current frame, having also, a look-back frame and a look-ahead frame, determining whether they are part of the current scene, determining a noise parameter, a luma transfer function and a luma forward reshaping function based on the luma transfer function and the noise parameter within the current scene, selecting a central tendency sliding window of the current frame and the look-back frame within the current scene, and determining a central tendency luma forward reshaping function. The chroma reshaping comprises analyzing statistics for the extended dynamic range (EDR) weights and EDR upper bounds, mapping these to standard dynamic range (SDR) weights and SDR upper bounds based on the central tendency luma forward reshaping function, determining a chroma content-dependent polynomial and a central tendency chroma forward reshaping polynomial and generating chroma MMR coefficients.
    Type: Application
    Filed: August 28, 2017
    Publication date: July 18, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Qing SONG, Harshad KADU, Qian CHEN, Guan-Ming SU
  • Publication number: 20190208173
    Abstract: In a method to reconstruct a high dynamic range video signal, a decoder receives parameters in the input bitstream to generate a prediction function. Using the prediction function, it generates a first set of nodes for a first prediction lookup table, wherein each node is characterized by an input node value and an output node value. Then, it modifies the output node values of one or more of the first set of nodes to generate a second set of nodes for a second prediction lookup table, and generates output prediction values using the second lookup table. Low-complexity methods to modify the output node value of a current node in the first set of nodes based on computing modified slopes between the current node and nodes surrounds the current node are presented.
    Type: Application
    Filed: June 28, 2017
    Publication date: July 4, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Harshad KADU, Guan-Ming SU
  • Patent number: 10264287
    Abstract: An SDR CDF is constructed based on an SDR histogram generated from a distribution of SDR codewords in SDR images. An HDR CDF is constructed based on an HDR histogram generated from a distribution of HDR codewords in HDR images that correspond to the SDR images. A histogram transfer function is generated based on the SDR CDF and the HDR CDF. The SDR images are transmitted along with backward reshaping metadata to recipient devices. The backward reshaping metadata is generated at least in part on the histogram transfer function.
    Type: Grant
    Filed: October 4, 2017
    Date of Patent: April 16, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Bihan Wen, Harshad Kadu, Guan-Ming Su
  • Publication number: 20190110054
    Abstract: A tone-mapping function that maps input images of a high dynamic range into reference tone-mapped images of a relatively narrow dynamic range is generated. A luma forward reshaping function is derived, based on first bit depths and second bit depths, for forward reshaping luma codewords of the input images into forward reshaped luma codewords of forward reshaped images approximating the reference tone-mapped images. A chroma forward reshaping mapping is derived for predicting chroma codewords of the forward reshaped images. Backward reshaping metadata that is to be used by recipient devices to generate a luma backward reshaping function and a chroma backward reshaping mapping is transmitted with the forward reshaped images to the recipient devices. Techniques for the joint derivation of forward luma and chroma reshaping functions are also presented.
    Type: Application
    Filed: March 22, 2017
    Publication date: April 11, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Guan-Ming Su, Jon Scott Miller, Walter J. Husak, Yee Jin Lee, Harshad Kadu