Patents by Inventor Shyam Sadhwani

Shyam Sadhwani has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210377578
    Abstract: A media processing tool adds custom data to an elementary media bitstream or media container. The custom data indicates nominal range of samples of media content, but the meaning of the custom data is not defined in the codec format or media container format. For example, the custom data indicates the nominal range is full range or limited range. For playback, a media processing tool parses the custom data and determines an indication of media content type. A rendering engine performs color conversion operations whose logic changes based at least in part on the media content type. In this way, a codec format or media container format can in effect be extended to support full nominal range media content as well as limited nominal range media content, and hence preserve full or correct color fidelity, while maintaining backward compatibility and conformance with the codec format or media container format.
    Type: Application
    Filed: August 5, 2021
    Publication date: December 2, 2021
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Naveen Thumpudi, Shyam Sadhwani
  • Patent number: 11115691
    Abstract: A media processing tool adds custom data to an elementary media bitstream or media container. The custom data indicates nominal range of samples of media content, but the meaning of the custom data is not defined in the codec format or media container format. For example, the custom data indicates the nominal range is full range or limited range. For playback, a media processing tool parses the custom data and determines an indication of media content type. A rendering engine performs color conversion operations whose logic changes based at least in part on the media content type. In this way, a codec format or media container format can in effect be extended to support full nominal range media content as well as limited nominal range media content, and hence preserve full or correct color fidelity, while maintaining backward compatibility and conformance with the codec format or media container format.
    Type: Grant
    Filed: June 11, 2020
    Date of Patent: September 7, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Naveen Thumpudi, Shyam Sadhwani
  • Patent number: 10841494
    Abstract: Video image stabilization provides better performance on a generic platform for computing devices by evaluating available multimedia digital signal processing components, and selecting the available components to utilize according to a hierarchy structure for video stabilization performance for processing parts of the video stabilization. The video stabilization has improved motion vector estimation that employs refinement motion vector searching according to a pyramid block structure relationship starting from a downsampled resolution version of the video frames. The video stabilization also improves global motion transform estimation by performing a random sample consensus approach for processing the local motion vectors, and selection criteria for motion vector reliability. The video stabilization achieves the removal of hand shakiness smoothly by real-time one-pass or off-line two-pass temporal smoothing with error detection and correction.
    Type: Grant
    Filed: April 3, 2019
    Date of Patent: November 17, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Nikola Borisov, Weidong Zhao, Shyam Sadhwani, Naveen Thumpudi
  • Publication number: 20200304848
    Abstract: A media processing tool adds custom data to an elementary media bitstream or media container. The custom data indicates nominal range of samples of media content, but the meaning of the custom data is not defined in the codec format or media container format. For example, the custom data indicates the nominal range is full range or limited range. For playback, a media processing tool parses the custom data and determines an indication of media content type. A rendering engine performs color conversion operations whose logic changes based at least in part on the media content type. In this way, a codec format or media container format can in effect be extended to support full nominal range media content as well as limited nominal range media content, and hence preserve full or correct color fidelity, while maintaining backward compatibility and conformance with the codec format or media container format.
    Type: Application
    Filed: June 11, 2020
    Publication date: September 24, 2020
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Naveen Thumpudi, Shyam Sadhwani
  • Patent number: 10750205
    Abstract: Approaches to selectively using start code emulation prevention (“SCEP”) on encoded data for media content are described herein. For example, a media encoder selectively performs SCEP processing on encoded data for media content, and sets a value of a syntax element that indicates whether or not to perform SCEP processing on the encoded data. The encoder stores the encoded data for output as part of a bitstream, where the syntax element is signaled in association with the bitstream. A media decoder receives the encoded data, determines, from the value of the syntax element, whether or not to perform SCEP processing on the encoded data, and selectively performs SCEP processing on the encoded data. In this way, the computational cost of scanning operations for SCEP processing can be avoided in many scenarios, and bit rate increases due to insertion of SCEP bytes can be limited.
    Type: Grant
    Filed: March 6, 2019
    Date of Patent: August 18, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Bakkama Srinath Reddy, Shyam Sadhwani, Victor Cherepanov
  • Patent number: 10715847
    Abstract: A media processing tool adds custom data to an elementary media bitstream or media container. The custom data indicates nominal range of samples of media content, but the meaning of the custom data is not defined in the codec format or media container format. For example, the custom data indicates the nominal range is full range or limited range. For playback, a media processing tool parses the custom data and determines an indication of media content type. A rendering engine performs color conversion operations whose logic changes based at least in part on the media content type. In this way, a codec format or media container format can in effect be extended to support full nominal range media content as well as limited nominal range media content, and hence preserve full or correct color fidelity, while maintaining backward compatibility and conformance with the codec format or media container format.
    Type: Grant
    Filed: August 10, 2018
    Date of Patent: July 14, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Naveen Thumpudi, Shyam Sadhwani
  • Patent number: 10715568
    Abstract: Systems, methods, and non-transitory computer-readable media can identify a set of participants in a group video call, wherein each participant is associated with an uplink capacity and a downlink capacity, and the set of participants includes a set of sender participants and a set of subscriber participants. For a first sender participant of the set of sender participants, one or more video stream layers to be uploaded by the first sender participant are determined based on downlink capacities of one or more subscriber participants of the set of subscriber participants. Each subscriber participant of the one or more subscriber participants is assigned to receive one video stream layer of the one or more video stream layers to be uploaded by the first sender participant.
    Type: Grant
    Filed: July 3, 2019
    Date of Patent: July 14, 2020
    Assignee: Facebook, Inc.
    Inventors: Arthur Cavalcanti Alem, Bret Lorimore, Shyam Sadhwani, Chi Wang Ho
  • Patent number: 10701401
    Abstract: Syntax structures that indicate the completion of coded regions of pictures are described. For example, a syntax structure in an elementary bitstream indicates the completion of a coded region of a picture. The syntax structure can be a type of network abstraction layer unit, a type of supplemental enhancement information message or another syntax structure. For example, a media processing tool such as an encoder can detect completion of a coded region of a picture, then output, in a predefined order in an elementary bitstream, syntax structure(s) that contain the coded region as well as a different syntax structure that indicates the completion of the coded region. Another media processing tool such as a decoder can receive, in a predefined order in an elementary bitstream, syntax structure(s) that contain a coded region of a picture as well as a different syntax structure that indicates the completion of the coded region.
    Type: Grant
    Filed: January 4, 2019
    Date of Patent: June 30, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Lihua Zhu, Shyam Sadhwani, Gary J. Sullivan
  • Patent number: 10574955
    Abstract: Innovations in reconstruction and rendering of panoramic video are described. For example, a view-dependent operation controller of a panoramic video playback system receives an indication of a view direction for an application and, based at least in part on the view direction, identifies a section of a picture of panoramic video in an input projection. The view-dependent operation controller limits operations of a color converter, video decoder, and/or streaming controller to the identified section. In this way, the panoramic video playback system can avoid performing operations to reconstruct sections of the picture of panoramic video that will not be viewed. As another example, a mapper of a panoramic video playback system re-projects at least some sample values in an input flat projection towards a center location for a view direction, producing an output flat projection, which an application can use to generate one or more screen projections.
    Type: Grant
    Filed: February 6, 2019
    Date of Patent: February 25, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Matthew Wozniak, Bakkama Srinath Reddy, Vladislav Alexandrov, David H. Mebane, Dibyajyoti Roy, Shyam Sadhwani
  • Publication number: 20200007597
    Abstract: Systems, methods, and non-transitory computer-readable media can identify a set of participants in a group video call, wherein each participant is associated with an uplink capacity and a downlink capacity, and the set of participants includes a set of sender participants and a set of subscriber participants. For a first sender participant of the set of sender participants, one or more video stream layers to be uploaded by the first sender participant are determined based on downlink capacities of one or more subscriber participants of the set of subscriber participants. Each subscriber participant of the one or more subscriber participants is assigned to receive one video stream layer of the one or more video stream layers to be uploaded by the first sender participant.
    Type: Application
    Filed: July 3, 2019
    Publication date: January 2, 2020
    Inventors: Arthur Cavalcanti Alem, Bret Lorimore, Shyam Sadhwani, Chi Wang Ho
  • Patent number: 10523953
    Abstract: Video frames of a higher-resolution chroma sampling format such as YUV 4:4:4 are packed into video frames of a lower-resolution chroma sampling format such as YUV 4:2:0 for purposes of video encoding. For example, sample values for a frame in YUV 4:4:4 format are packed into two frames in YUV 4:2:0 format. After decoding, the video frames of the lower-resolution chroma sampling format can be unpacked to reconstruct the video frames of the higher-resolution chroma sampling format. In this way, available encoders and decoders operating at the lower-resolution chroma sampling format can be used, while still retaining higher resolution chroma information. In example implementations, frames in YUV 4:4:4 format are packed into frames in YUV 4:2:0 format such that geometric correspondence is maintained between Y, U and V components for the frames in YUV 4:2:0 format.
    Type: Grant
    Filed: April 17, 2018
    Date of Patent: December 31, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Lihua Zhu, Sridhar Sankuratri, B. Anil Kumar, Yongjun Wu, Sandeep Kanumuri, Shyam Sadhwani, Gary J. Sullivan
  • Patent number: 10455286
    Abstract: Video content is protected using a digital rights management (DRM) mechanism, the video content having been previously encrypted and compressed for distribution, and also including metadata such as closed captioning data, which might be encrypted or clear. The video content is obtained by a system of a computing device, the metadata is extracted from the video content and provided to a video decoder, and the video content is provided to a secure DRM component. The secure DRM component decrypts the video content and provides the decrypted video content to a secure decoder component of a video decoder. As part of the decryption, the secure DRM component drops the metadata that was included in the obtained video content. However, the video decoder receives the extracted metadata in a non-protected environment and thus is able to provide the extracted metadata and the decoded video content to a content playback application.
    Type: Grant
    Filed: March 8, 2019
    Date of Patent: October 22, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Balachandar Sivakumar, Shyam Sadhwani
  • Publication number: 20190297263
    Abstract: Video image stabilization provides better performance on a generic platform for computing devices by evaluating available multimedia digital signal processing components, and selecting the available components to utilize according to a hierarchy structure for video stabilization performance for processing parts of the video stabilization. The video stabilization has improved motion vector estimation that employs refinement motion vector searching according to a pyramid block structure relationship starting from a downsampled resolution version of the video frames. The video stabilization also improves global motion transform estimation by performing a random sample consensus approach for processing the local motion vectors, and selection criteria for motion vector reliability. The video stabilization achieves the removal of hand shakiness smoothly by real-time one-pass or off-line two-pass temporal smoothing with error detection and correction.
    Type: Application
    Filed: April 3, 2019
    Publication date: September 26, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Nikola Borisov, Weidong Zhao, Shyam Sadhwani, Naveen Thumpudi
  • Patent number: 10389772
    Abstract: Systems, methods, and non-transitory computer-readable media can identify a set of participants in a group video call, wherein each participant is associated with an uplink capacity and a downlink capacity, and the set of participants includes a set of sender participants and a set of subscriber participants. For a first sender participant of the set of sender participants, one or more video stream layers to be uploaded by the first sender participant are determined based on downlink capacities of one or more subscriber participants of the set of subscriber participants. Each subscriber participant of the one or more subscriber participants is assigned to receive one video stream layer of the one or more video stream layers to be uploaded by the first sender participant.
    Type: Grant
    Filed: January 31, 2018
    Date of Patent: August 20, 2019
    Assignee: Facebook, Inc.
    Inventors: Arthur Cavalcanti Alem, Bret Lorimore, Shyam Sadhwani, Chi Wang Ho
  • Patent number: 10380358
    Abstract: Techniques are described for communicating encoded data using start code emulation prevention. The described techniques include obtaining at least one partially encrypted packet, identifying at least one portion of the packet that is unencrypted, and determining that the identified unencrypted portion(s) emulates a start code. Start code emulation prevention data or emulation prevention bytes (EPBs) may be inserted into only the unencrypted portion of the packet. The modified packet may be communicated to another device/storage, along with an indication of which portion(s) of the packet are unencrypted. Upon receiving the packet and indication, the receiving device may identify and remove the EPBs in the identified unencrypted portion(s) of the packet, and decrypt the packet to recover the data. In some aspects, upon identifying the indication, the receiving device may only search for EPBs in the unencrypted portion(s) of the packet, thus yielding a more efficient start code emulation prevention process.
    Type: Grant
    Filed: July 18, 2018
    Date of Patent: August 13, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yongjun Wu, Shyam Sadhwani, Robert K. Hughes
  • Publication number: 20190246084
    Abstract: Innovations in reconstruction and rendering of panoramic video are described. For example, a view-dependent operation controller of a panoramic video playback system receives an indication of a view direction for an application and, based at least in part on the view direction, identifies a section of a picture of panoramic video in an input projection. The view-dependent operation controller limits operations of a color converter, video decoder, and/or streaming controller to the identified section. In this way, the panoramic video playback system can avoid performing operations to reconstruct sections of the picture of panoramic video that will not be viewed. As another example, a mapper of a panoramic video playback system re-projects at least some sample values in an input flat projection towards a center location for a view direction, producing an output flat projection, which an application can use to generate one or more screen projections.
    Type: Application
    Filed: February 6, 2019
    Publication date: August 8, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Matthew Wozniak, Bakkama Srinath Reddy, Vladislav Alexandrov, David H. Mebane, Dibyajyoti Roy, Shyam Sadhwani
  • Publication number: 20190238600
    Abstract: Systems, methods, and non-transitory computer-readable media can identify a set of participants in a group video call, wherein each participant is associated with an uplink capacity and a downlink capacity, and the set of participants includes a set of sender participants and a set of subscriber participants. For a first sender participant of the set of sender participants, one or more video stream layers to be uploaded by the first sender participant are determined based on downlink capacities of one or more subscriber participants of the set of subscriber participants. Each subscriber participant of the one or more subscriber participants is assigned to receive one video stream layer of the one or more video stream layers to be uploaded by the first sender participant.
    Type: Application
    Filed: January 31, 2018
    Publication date: August 1, 2019
    Inventors: Arthur Cavalcanti Alem, Bret Lorimore, Shyam Sadhwani, Chi Wang Ho
  • Patent number: 10368074
    Abstract: Innovations in opportunistic frame dropping for variable-frame-rate encoding of digital video are presented. In general, a computing system selectively drops a frame when the cost of encoding the frame (e.g., in terms of use of computational resources and/or power) is expected to outweigh the benefit of encoding the frame (e.g., in terms of better quality). For example, a frame dropping module detects whether there is significant change in a given frame relative to a control frame, which is a previous frame stored in a control frame buffer. If significant change is detected, the frame dropping module stores the given frame in the control frame buffer, thereby replacing the control frame, and passes the given frame to a video encoder. Otherwise, the frame dropping module drops the given frame without replacing the control frame in the control frame buffer and without passing the given frame to the video encoder.
    Type: Grant
    Filed: March 18, 2016
    Date of Patent: July 30, 2019
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Thomas W. Holcomb, Shyam Sadhwani, Bakkama Srinath Reddy
  • Publication number: 20190208226
    Abstract: Approaches to selectively using start code emulation prevention (“SCEP”) on encoded data for media content are described herein. For example, a media encoder selectively performs SCEP processing on encoded data for media content, and sets a value of a syntax element that indicates whether or not to perform SCEP processing on the encoded data. The encoder stores the encoded data for output as part of a bitstream, where the syntax element is signaled in association with the bitstream. A media decoder receives the encoded data, determines, from the value of the syntax element, whether or not to perform SCEP processing on the encoded data, and selectively performs SCEP processing on the encoded data. In this way, the computational cost of scanning operations for SCEP processing can be avoided in many scenarios, and bit rate increases due to insertion of SCEP bytes can be limited.
    Type: Application
    Filed: March 6, 2019
    Publication date: July 4, 2019
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Bakkama Srinath Reddy, Shyam Sadhwani, Victor Cherepanov
  • Publication number: 20190208276
    Abstract: Video content is protected using a digital rights management (DRM) mechanism, the video content having been previously encrypted and compressed for distribution, and also including metadata such as closed captioning data, which might be encrypted or clear. The video content is obtained by a system of a computing device, the metadata is extracted from the video content and provided to a video decoder, and the video content is provided to a secure DRM component. The secure DRM component decrypts the video content and provides the decrypted video content to a secure decoder component of a video decoder. As part of the decryption, the secure DRM component drops the metadata that was included in the obtained video content. However, the video decoder receives the extracted metadata in a non-protected environment and thus is able to provide the extracted metadata and the decoded video content to a content playback application.
    Type: Application
    Filed: March 8, 2019
    Publication date: July 4, 2019
    Inventors: Yongjun Wu, Balachandar Sivakumar, Shyam Sadhwani
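
Several abstracts above (for example Patent number 10750205 and Patent number 10380358) refer to start code emulation prevention (SCEP). As a rough illustration only, the sketch below shows the widely used byte-stuffing rule from H.264/HEVC-style bitstreams: after two consecutive zero bytes, a 0x03 byte is inserted before any byte in the range 0x00 to 0x03, so that the payload cannot imitate a 00 00 01 start code. The pack and unpack helpers, and the one-byte flag they use to signal whether SCEP was applied, are hypothetical and are not taken from any of the listed patents or from a codec specification.

```python
def add_emulation_prevention(payload: bytes) -> bytes:
    """Insert 0x03 after any two zero bytes that are followed by a byte <= 0x03."""
    out = bytearray()
    zeros = 0  # number of consecutive zero bytes just written
    for b in payload:
        if zeros >= 2 and b <= 0x03:
            out.append(0x03)  # emulation prevention byte
            zeros = 0
        out.append(b)
        zeros = zeros + 1 if b == 0 else 0
    return bytes(out)


def remove_emulation_prevention(data: bytes) -> bytes:
    """Drop each 0x03 byte that directly follows two zero bytes."""
    out = bytearray()
    zeros = 0
    for b in data:
        if zeros >= 2 and b == 0x03:
            zeros = 0  # skip the emulation prevention byte
            continue
        out.append(b)
        zeros = zeros + 1 if b == 0 else 0
    return bytes(out)


def pack(payload: bytes, use_scep: bool) -> bytes:
    """Hypothetical framing: a one-byte flag says whether SCEP was applied."""
    body = add_emulation_prevention(payload) if use_scep else payload
    return bytes([1 if use_scep else 0]) + body


def unpack(packet: bytes) -> bytes:
    """Scan for emulation prevention bytes only when the flag says they exist."""
    flag, body = packet[0], packet[1:]
    return remove_emulation_prevention(body) if flag else body


if __name__ == "__main__":
    raw = bytes([0, 0, 1, 0, 0, 0, 0, 3, 5])
    print(add_emulation_prevention(raw).hex(" "))
    print(unpack(pack(raw, use_scep=True)) == raw)
```

When the flag is 0, the receiver skips the scan entirely, which is the kind of saving the selective SCEP abstract describes; the sender is then responsible for ensuring that start codes are not needed to locate that data.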