Patents by Inventor Sandeep Kanumuri

Sandeep Kanumuri has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20190116379
    Abstract: Innovations in signaling of reference picture list (“RPL”) modification information. For example, a video encoder evaluates a condition that depends at least in part on a variable indicating a number of total reference pictures. Depending on the results of the evaluation, the encoder signals in a bitstream a flag that indicates whether an RPL is modified according to syntax elements explicitly signaled in the bitstream. A video decoder evaluates the condition and, depending on results of the evaluation, parses from a bitstream a flag that indicates whether an RPL is modified according to syntax elements explicitly signaled in the bitstream. The condition can be evaluated as part of processing for an RPL modification structure that includes the flag, or as part of processing for a slice header. The encoder and decoder can also evaluate other conditions that affect syntax elements for list entries of the RPL modification information.
    Type: Application
    Filed: November 15, 2018
    Publication date: April 18, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Gary J. Sullivan, Sandeep Kanumuri
  • Patent number: 10250882
    Abstract: Innovations in control and use of chroma quantization parameter (“QP”) values that depend on luma QP values. More generally, the innovations relate to control and use of QP values for a secondary color component that depend on QP values for a primary color component. For example, during encoding, an encoder determines a QP index from a primary component QP and secondary component QP offset. The encoder maps the QP index to a secondary component QP, which has an extended range. The encoder outputs at least part of a bitstream including the encoded content. A corresponding decoder receives at least part of a bitstream including encoded content. During decoding, the decoder determines a QP index from a primary component QP and secondary component QP offset, then maps the QP index to a secondary component QP, which has an extended range.
    Type: Grant
    Filed: July 6, 2016
    Date of Patent: April 2, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Gary J. Sullivan, Sandeep Kanumuri
  • Publication number: 20190075318
    Abstract: The present invention provides low complexity planar mode coding in which a value of a bottom-right pixel in a prediction block is calculated from a value of at least one pixel in at least one of an array of horizontal boundary pixels and an array of vertical boundary pixels. Linear and bi-linear interpolations are performed on the value of the bottom-right pixel and values of at least some of the horizontal and vertical boundary pixels to derive values of remaining pixels in the prediction block. A residual between the prediction block and an original block is signaled to a decoder.
    Type: Application
    Filed: October 17, 2018
    Publication date: March 7, 2019
    Inventors: Frank Jan BOSSEN, Sandeep KANUMURI
  • Publication number: 20190052898
    Abstract: The present invention provides low complexity planar mode coding in which a value of a bottom-right pixel in a prediction block is calculated from a value of at least one pixel in at least one of an array of horizontal boundary pixels and an array of vertical boundary pixels. Linear and bi-linear interpolations are performed on the value of the bottom-right pixel and values of at least some of the horizontal and vertical boundary pixels to derive values of remaining pixels in the prediction block. A residual between the prediction block and an original block is signaled to a decoder.
    Type: Application
    Filed: October 17, 2018
    Publication date: February 14, 2019
    Inventors: Frank Jan BOSSEN, Sandeep KANUMURI
  • Publication number: 20190052899
    Abstract: The present invention provides low complexity planar mode coding in which a value of a bottom-right pixel in a prediction block is calculated from a value of at least one pixel in at least one of an array of horizontal boundary pixels and an array of vertical boundary pixels. Linear and bi-linear interpolations are performed on the value of the bottom-right pixel and values of at least some of the horizontal and vertical boundary pixels to derive values of remaining pixels in the prediction block. A residual between the prediction block and an original block is signaled to a decoder.
    Type: Application
    Filed: October 17, 2018
    Publication date: February 14, 2019
    Inventors: Frank Jan BOSSEN, Sandeep KANUMURI
  • Publication number: 20190052900
    Abstract: The present invention provides low complexity planar mode coding in which a value of a bottom-right pixel in a prediction block is calculated from a value of at least one pixel in at least one of an array of horizontal boundary pixels and an array of vertical boundary pixels. Linear and bi-linear interpolations are performed on the value of the bottom-right pixel and values of at least some of the horizontal and vertical boundary pixels to derive values of remaining pixels in the prediction block. A residual between the prediction block and an original block is signaled to a decoder.
    Type: Application
    Filed: October 17, 2018
    Publication date: February 14, 2019
    Inventors: Frank Jan BOSSEN, Sandeep KANUMURI
  • Publication number: 20190052902
    Abstract: The present invention provides low complexity planar mode coding in which a value of a bottom-right pixel in a prediction block is calculated from a value of at least one pixel in at least one of an array of horizontal boundary pixels and an array of vertical boundary pixels. Linear and bi-linear interpolations are performed on the value of the bottom-right pixel and values of at least some of the horizontal and vertical boundary pixels to derive values of remaining pixels in the prediction block. A residual between the prediction block and an original block is signaled to a decoder.
    Type: Application
    Filed: October 17, 2018
    Publication date: February 14, 2019
    Inventors: Frank Jan BOSSEN, Sandeep KANUMURI
  • Publication number: 20190052901
    Abstract: The present invention provides low complexity planar mode coding in which a value of a bottom-right pixel in a prediction block is calculated from a value of at least one pixel in at least one of an array of horizontal boundary pixels and an array of vertical boundary pixels. Linear and bi-linear interpolations are performed on the value of the bottom-right pixel and values of at least some of the horizontal and vertical boundary pixels to derive values of remaining pixels in the prediction block. A residual between the prediction block and an original block is signaled to a decoder.
    Type: Application
    Filed: October 17, 2018
    Publication date: February 14, 2019
    Inventors: Frank Jan BOSSEN, Sandeep KANUMURI
  • Patent number: 10205963
    Abstract: The present invention provides low complexity planar mode coding in which a value of a bottom-right pixel in a prediction block is calculated from a value of at least one pixel in at least one of an array of horizontal boundary pixels and an array of vertical boundary pixels. Linear and bi-linear interpolations are performed on the value of the bottom-right pixel and values of at least some of the horizontal and vertical boundary pixels to derive values of remaining pixels in the prediction block. A residual between the prediction block and an original block is signaled to a decoder.
    Type: Grant
    Filed: October 3, 2016
    Date of Patent: February 12, 2019
    Assignee: NTT DOCOMO, INC.
    Inventors: Frank Jan Bossen, Sandeep Kanumuri
  • Patent number: 10165302
    Abstract: Innovations in signaling of reference picture list (“RPL”) modification information. For example, a video encoder evaluates a condition that depends at least in part on a variable indicating a number of total reference pictures. Depending on the results of the evaluation, the encoder signals in a bitstream a flag that indicates whether an RPL is modified according to syntax elements explicitly signaled in the bitstream. A video decoder evaluates the condition and, depending on results of the evaluation, parses from a bitstream a flag that indicates whether an RPL is modified according to syntax elements explicitly signaled in the bitstream. The condition can be evaluated as part of processing for an RPL modification structure that includes the flag, or as part of processing for a slice header. The encoder and decoder can also evaluate other conditions that affect syntax elements for list entries of the RPL modification information.
    Type: Grant
    Filed: July 20, 2017
    Date of Patent: December 25, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Gary J. Sullivan, Sandeep Kanumuri
  • Publication number: 20180350326
    Abstract: Described are examples for storing an advanced color image. A set of values stored in a first number of byte values can be determined, where each value in the set of values corresponds to a parameter of a set of parameters representing one or more pixels of an advanced color image. The set of values can be packed into a set of byte values, where a second number of byte values in the set of byte values is less than the first number of byte values. The set of byte values can be stored or communicated in representing the one or more pixels of the advanced color image.
    Type: Application
    Filed: October 9, 2017
    Publication date: December 6, 2018
    Inventors: Sandeep KANUMURI, Sudhanshu SOHONI
  • Patent number: 10148871
    Abstract: Described are examples for generating high dynamic range (HDR)/wide color gamut (WCG) output from an image sensor. A raw red, green, blue (RGB) image obtained by the image sensor can be received. A plurality of color transform operations can be applied to the raw RGB image to generate a HDR/WCG image. The HDR/WCG image can be stored in a memory, displayed on a display, transmitted to another device, etc.
    Type: Grant
    Filed: December 7, 2016
    Date of Patent: December 4, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Naveen Thumpudi, Sandeep Kanumuri, Jesse Hakanen, Matthew R. Wozniak
  • Publication number: 20180329978
    Abstract: Innovations for category-prefixed data batching (“CPDB”) of entropy-coded data or other payload data for coded media data, as well as innovations for corresponding recovery of the entropy-coded data (or other payload data) formatted with CPDB. The CPDB can be used in conjunction with coding/decoding for video content, image content, audio content or another type of content. For example, after receiving coded media data in multiple categories from encoding units, a formatting tool formats payload data with CPDB, generating a batch prefix for a batch of the CPDB-formatted payload data. The batch prefix includes a category identifier and a data quantity indicator. The formatting tool outputs the CPDB-formatted payload data to a bitstream. At the decoder side, a formatting tool receives the CPDB-formatted payload data in a bitstream, recovers the payload data from the CPDB-formatted payload data, and outputs the payload data (e.g., to decoding units).
    Type: Application
    Filed: January 5, 2018
    Publication date: November 15, 2018
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Gary J. Sullivan, Leon Bottou, Sandeep Kanumuri, Yongjun Wu
  • Publication number: 20180295253
    Abstract: The present disclosure provides devices and techniques for processing a media capture stream captured by a camera device using a chain device media foundation transform (DMFT). The techniques include configuring multiple DMFTs such that an original equipment manufacturer (OEM) may have flexibility in independently selecting various functionalities from different sources (e.g., OS, OEM, IHV, ISV, or VARs) in order to maximize hardware capabilities while minimizing the drawbacks of creating a single DMFT. To that end, the implementation of the present disclosure includes devices and techniques of chainable DMFTs such that a device transform manager may select a set of functionalities (e.g., face recognition, color effects, etc.) from multiple vendors to customize the camera's capabilities according to the OEM specification.
    Type: Application
    Filed: April 2, 2018
    Publication date: October 11, 2018
    Inventors: Sandeep KANUMURI, Naveen THUMPUDI, Sathyanarayanan KARIVARADASWAMY, Rui ZHAO
  • Publication number: 20180234686
    Abstract: Video frames of a higher-resolution chroma sampling format such as YUV 4:4:4 are packed into video frames of a lower-resolution chroma sampling format such as YUV 4:2:0 for purposes of video encoding. For example, sample values for a frame in YUV 4:4:4 format are packed into two frames in YUV 4:2:0 format. After decoding, the video frames of the lower-resolution chroma sampling format can be unpacked to reconstruct the video frames of the higher-resolution chroma sampling format. In this way, available encoders and decoders operating at the lower-resolution chroma sampling format can be used, while still retaining higher resolution chroma information. In example implementations, frames in YUV 4:4:4 format are packed into frames in YUV 4:2:0 format such that geometric correspondence is maintained between Y, U and V components for the frames in YUV 4:2:0 format.
    Type: Application
    Filed: April 17, 2018
    Publication date: August 16, 2018
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Lihua Zhu, Sridhar Sankuratri, B. Anil Kumar, Yongjun Wu, Sandeep Kanumuri, Shyam Sadhwani, Gary J. Sullivan
  • Patent number: 10038842
    Abstract: One or more techniques and/or systems are disclosed for generating improved planar panorama imagery, which may be used to view street-level, human-scale locations along a desired path, for example. One or more desired objects can be detected in source imagery that may be used to produce the resulting planar panorama imagery. Respective object regions comprising a desired object can be identified in one or more frames of the source imagery. One or more portions of the source imagery that do not comprise the object region for the desired object can be joined with at least some of the one or more frames that do comprise the object region for the desired object. The resulting planar panorama imagery can thus comprise the desired object. Also, the planar panorama imagery can be generated so as to lack one or more non-desired objects and/or adverse effects that may result therefrom.
    Type: Grant
    Filed: May 18, 2015
    Date of Patent: July 31, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Samuel Thomas Williams, Jonathan Paul Rosenberg, Johannes Kopf, Don Dongwoo Kim, Sandeep Kanumuri
  • Publication number: 20180160038
    Abstract: Described are examples for generating high dynamic range (HDR)/wide color gamut (WCG) output from an image sensor. A raw red, green, blue (RGB) image obtained by the image sensor can be received. A plurality of color transform operations can be applied to the raw RGB image to generate a HDR/WCG image. The HDR/WCG image can be stored in a memory, displayed on a display, transmitted to another device, etc.
    Type: Application
    Filed: December 7, 2016
    Publication date: June 7, 2018
    Inventors: Naveen THUMPUDI, Sandeep KANUMURI, Jesse HAKANEN, Matthew R. WOZNIAK
  • Patent number: 9979960
    Abstract: Video frames of a higher-resolution chroma sampling format such as YUV 4:4:4 are packed into video frames of a lower-resolution chroma sampling format such as YUV 4:2:0 for purposes of video encoding. For example, sample values for a frame in YUV 4:4:4 format are packed into two frames in YUV 4:2:0 format. After decoding, the video frames of the lower-resolution chroma sampling format can be unpacked to reconstruct the video frames of the higher-resolution chroma sampling format. In this way, available encoders and decoders operating at the lower-resolution chroma sampling format can be used, while still retaining higher resolution chroma information. In example implementations, frames in YUV 4:4:4 format are packed into frames in YUV 4:2:0 format such that geometric correspondence is maintained between Y, U and V components for the frames in YUV 4:2:0 format.
    Type: Grant
    Filed: September 13, 2013
    Date of Patent: May 22, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Lihua Zhu, Sridhar Sankuratri, B. Anil Kumar, Yongjun Wu, Sandeep Kanumuri, Shyam Sadhwani, Gary J. Sullivan
  • Patent number: 9967418
    Abstract: The present disclosure provides devices and techniques for processing a media capture stream captured by a camera device using a chain device media foundation transform (DMFT). The techniques include configuring multiple DMFTs such that an original equipment manufacturer (OEM) may have flexibility in independently selecting various functionalities from different sources (e.g., OS, OEM, IHV, ISV, or VARs) in order to maximize hardware capabilities while minimizing the drawbacks of creating a single DMFT. To that end, the implementation of the present disclosure includes devices and techniques of chainable DMFTs such that a device transform manager may select a set of functionalities (e.g., face recognition, color effects, etc.) from multiple vendors to customize the camera's capabilities according to the OEM specification.
    Type: Grant
    Filed: October 31, 2016
    Date of Patent: May 8, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Sandeep Kanumuri, Naveen Thumpudi, Sathyanarayanan Karivaradaswamy, Rui Zhao
  • Publication number: 20180124270
    Abstract: The present disclosure provides devices and techniques for processing a media capture stream captured by a camera device using a chain device media foundation transform (DMFT). The techniques include configuring multiple DMFTs such that an original equipment manufacturer (OEM) may have flexibility in independently selecting various functionalities from different sources (e.g., OS, OEM, IHV, ISV, or VARs) in order to maximize hardware capabilities while minimizing the drawbacks of creating a single DMFT. To that end, the implementation of the present disclosure includes devices and techniques of chainable DMFTs such that a device transform manager may select a set of functionalities (e.g., face recognition, color effects, etc.) from multiple vendors to customize the camera's capabilities according to the OEM specification.
    Type: Application
    Filed: October 31, 2016
    Publication date: May 3, 2018
    Inventors: Sandeep KANUMURI, Naveen THUMPUDI, Sathyanarayanan KARIVARADASWAMY, Rui ZHAO
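
Illustrative sketch: the planar mode coding abstracts listed above (for example, patent 10205963) describe deriving a bottom-right pixel from boundary pixels and then filling the rest of the prediction block by linear and bi-linear interpolation. The Python below is one possible reading of that description, not the patented method. The function names `planar_predict` and `residual`, the choice of averaging the last top and last left boundary pixels to obtain the bottom-right value, and the use of floating-point arithmetic are all assumptions made for illustration.

```python
def planar_predict(top, left):
    """Predict an n x n block from its boundary pixels (hypothetical helper).

    top  : n pixels directly above the block (horizontal boundary)
    left : n pixels directly left of the block (vertical boundary)
    Returns pred[y][x] as a list of rows.
    """
    n = len(top)
    if n == 0 or len(left) != n:
        raise ValueError("top and left must be non-empty and of equal length")

    # ASSUMPTION: the bottom-right pixel is the mean of the last pixel of
    # each boundary array. The abstract only says it is calculated from
    # at least one boundary pixel.
    br = (top[n - 1] + left[n - 1]) / 2.0

    # Linear interpolation along the right column (top[n-1] -> br)
    # and along the bottom row (left[n-1] -> br).
    right = [((n - 1 - y) * top[n - 1] + (y + 1) * br) / n for y in range(n)]
    bottom = [((n - 1 - x) * left[n - 1] + (x + 1) * br) / n for x in range(n)]

    # Bi-linear interpolation: average a horizontal and a vertical
    # linear interpolation for every pixel in the block.
    pred = [[0.0] * n for _ in range(n)]
    for y in range(n):
        for x in range(n):
            horiz = ((n - 1 - x) * left[y] + (x + 1) * right[y]) / n
            vert = ((n - 1 - y) * top[x] + (y + 1) * bottom[x]) / n
            pred[y][x] = (horiz + vert) / 2.0
    return pred


def residual(original, pred):
    """Per-pixel difference between the original block and the prediction."""
    return [[o - p for o, p in zip(orow, prow)]
            for orow, prow in zip(original, pred)]
```

For instance, `planar_predict([100] * 4, [100] * 4)` yields a flat 4 x 4 block, and `residual(original, pred)` gives the difference that an encoder in this reading would signal to the decoder.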