Patents Assigned to VID Scale, Inc.
  • Publication number: 20220400254
    Abstract: Embodiments contemplate coding video data by generating a video encoded bitstream that may include reference picture set (RPS) extensions for inter-layer reference pictures, and the extensions may include inter-layer delta Picture Order Counts (POCs). Embodiments may also include signaling that lower layer reference pictures may be available in a lower layer decoder picture buffer (DPB), and/or an aggregate DPB, that may be added to the RPS set of a higher layer. The bitstream may include a signal indicating whether the higher layer RPS may be specified by a lower layer RPS, and the lower layer RPS may be temporal, inter-layer prediction (ILP), or both.
    Type: Application
    Filed: August 19, 2022
    Publication date: December 15, 2022
    Applicant: VID SCALE, INC.
    Inventors: Yong He, Yan Ye, George W. McClellan, Jie Dong
  • Publication number: 20220394075
    Abstract: A decoding complexity may be used to predict power consumption for receiving, decoding, and/or displaying multimedia content at a wireless transmit/receive unit (WTRU). The decoding complexity may be based on decoding complexity feedback received from a reference device, such as another WTRU. The decoding complexity feedback may be based on measurements performed at the reference device for receiving decoding, and/or displaying the multimedia content. A content providing device may indicate the decoding complexity of requested media content to a WTRU, or another network entity. The decoding complexity may be indicated in a streaming protocol or file associated with the media content. The WTRU, or other network entity, may use the decoding complexity determine its preferences regarding transmission of the media content. The content providing device may determine whether to transmit the media content based on the decoding complexity and/or the preferences of the WTRU or other network entity.
    Type: Application
    Filed: June 16, 2022
    Publication date: December 8, 2022
    Applicant: Vid Scale, Inc.
    Inventors: Osama Lotfallah, Eduardo Asbun, Hang Liu, Yuriy Reznik
  • Publication number: 20220385897
    Abstract: A video processing apparatus may comprise one or more processors that are configured to determine an interpolation filter length for an interpolation filter associated with a coding unit (CU) based on a size of the CU. The one or more processor may be configured to determine an interpolated reference sample based on the determined interpolation filter length for the interpolation filter and a reference sample for the CU. The one or more processor may be configured to predict the CU based on the interpolated reference sample. For example, if a first CU has a size that is greater than the size of a second CU, the one or more processors may be configured to use a shorter interpolation filter for the first CU than for the second CU.
    Type: Application
    Filed: September 18, 2020
    Publication date: December 1, 2022
    Applicant: VID SCALE, INC.
    Inventors: Wei Chen, Yuwen He, Hua Yang
  • Publication number: 20220385942
    Abstract: Systems, methods, and instrumentalities are disclosed for discontinuous face boundary filtering for 360-degree video coding. A face discontinuity may be filtered (e.g., to reduce seam artifacts) in whole or in part, for example, using coded samples or padded samples on either side of the face discontinuity. Filtering may be applied, for example, as an in-loop filter or a post-processing step. 2D positional information related to two sides of the face discontinuity may be signaled in a video bitstream so that filtering may be applied independent of projection formats and/or frame packing techniques.
    Type: Application
    Filed: August 9, 2022
    Publication date: December 1, 2022
    Applicant: VID SCALE, INC.
    Inventors: Philippe Hanhart, Yan Ye, Yuwen He
  • Patent number: 11516485
    Abstract: Methods and systems are disclosed for a mobile device to decode video based on available power and/or energy. For example, the mobile device may receive a media description file (MDF) from for a video stream from a video server. The MDF may include complexity information associated with a plurality of video segments. The complexity information may be related to the amount of processing power to be utilized for decoding the segment at the mobile device. The mobile device may determine at least one power metric for the mobile device. The mobile device may determine a first complexity level to be requested for a first video segment based on the complexity information from the MDF and the power metric. The mobile device may dynamically alter the decoding process to save energy based on the detected power/energy level.
    Type: Grant
    Filed: May 18, 2021
    Date of Patent: November 29, 2022
    Assignee: VID SCALE, Inc.
    Inventors: Yuwen He, Yan Ye, Yong He, George W. McClellan, Eun Seok Ryu
  • Publication number: 20220377344
    Abstract: Described herein are systems, methods, and instrumentalities associated with video coding. The signaling of certain syntax elements may be moved from a slice header to a picture header and/or a layer access unit delimiter (AUD). The dependency between AUD and one or more parameter sets may be explored. Syntax elements may be signaled to enable wrap-around motion compensation for certain sub-picture(s) and specify wrap-around motion compensation offsets for the sub-picture(s).
    Type: Application
    Filed: September 18, 2020
    Publication date: November 24, 2022
    Applicant: VID SCALE, INC.
    Inventors: Yong He, Yuwen He
  • Publication number: 20220377324
    Abstract: A filter may be applied to a subset of components associated with a sample in a coding block. The output of the filter may be used to modify values for other component(s). For example, a filter may be applied to a selected (for example, dominant) component(s). The output of the filter may be used to modify a value for one of the other components (for example, non-dominant components). The output of the filter may be used, for example, after a weighting factor is applied to the filter output, to modify a value for another one of the other components. A joint refinement signal may be obtained, for example, as the filtered output signal minus the filter input signal of the selected component(s). A properly weighted version of the joint refinement signal may be applied to modify the other components.
    Type: Application
    Filed: September 23, 2020
    Publication date: November 24, 2022
    Applicant: VID SCALE, INC.
    Inventors: Hua Yang, Yuwen He, Wei Chen, Hongyu Li
  • Publication number: 20220377316
    Abstract: An apparatus may be configured to determine a reference picture listed in a first reference picture list and a reference picture listed in a second reference picture list, for a coding block. The apparatus may be configured to determine whether to perform bi-directional optical flow (BDOF) for the coding block based at least in part on whether a distance between a picture associated with the coding block and the reference picture listed in the first reference picture list differs from a distance between the picture associated with the coding block and the reference picture listed in the second reference picture list. The apparatus may be configured to decode the coding block based on the determination of whether to perform BDOF for the coding block.
    Type: Application
    Filed: September 23, 2020
    Publication date: November 24, 2022
    Applicant: VID SCALE, INC.
    Inventors: Wei Chen, Yuwen He, Hua Yang
  • Publication number: 20220377385
    Abstract: Systems, methods, and instrumentalities may be provided for discounting reconstructed samples and/or coding information from spatial neighbors across face discontinuities. Whether a current block is located at a face discontinuity may be determined. The face discontinuity may be a face boundary between two or more adjoining blocks that are not spherical neighbors. The coding availability of a neighboring block of the current block may be determined, e.g., based on whether the neighboring block is on the same side of the face discontinuity as the current block. For example, the neighboring block may be determined to be available for decoding the current block if it is on the same side of the face discontinuity as the current block, and unavailable if it is not on the same side of the face discontinuity. The neighboring block may be a spatial neighboring block or a temporal neighboring block.
    Type: Application
    Filed: July 29, 2022
    Publication date: November 24, 2022
    Applicant: Vid Scale, Inc.
    Inventors: Philippe Hanhart, Yuwen He, Yan Ye
  • Publication number: 20220368947
    Abstract: Processing video data may include capturing the video data with multiple cameras and stitching the video data together to obtain a 360-degree video. A frame-packed picture may be provided based on the captured and stitched video data. A current sample location may be identified in the frame-packed picture. Whether a neighboring sample location is located outside of a content boundary of the frame-packed picture may be determined. When the neighboring sample location is located outside of the content boundary, a padding sample location may be derived based on at least one circular characteristic of the 360-degree video content and the projection geometry. The 360-degree video content may be processed based on the padding sample location.
    Type: Application
    Filed: July 8, 2022
    Publication date: November 17, 2022
    Applicant: VID SCALE, INC.
    Inventors: Yuwen He, Yan Ye, Philippe Hanhart, Xiaoyu Xiu
  • Publication number: 20220368943
    Abstract: A device may determine whether to enable or disable bi-directional optical flow (BIO) for a current coding unit (CU) (e.g., block and/or sub-block). Prediction information for the CU may be identified and may include prediction signals associated with a first reference block and a second reference block (e.g., or a first reference sub-block and a second reference sub-block). A prediction difference may be calculated and may be used to determine the similarity between the two prediction signals. The CU may be reconstructed based on the similarity. For example, whether to reconstruct the CU with BIO enabled or BIO disabled may be based on whether the two prediction signals are similar. It may be determined to enable BIO for the CU when the two prediction signals are determined to be dissimilar. For example, the CU may be reconstructed with BIO disabled when the two prediction signals are determined to be similar.
    Type: Application
    Filed: May 20, 2022
    Publication date: November 17, 2022
    Applicant: VID SCALE, INC.
    Inventors: Yan Ye, Xiaoyu Xiu, Yuwen He
  • Patent number: 11503328
    Abstract: Systems, methods, and instrumentalities are disclosed for motion vector clipping when affine motion mode is enabled for a video block. A video coding device may determine that an affine mode for a video block is enabled. The video coding device may determine a plurality of control point affine motion vectors associated with the video block. The video coding device may store the plurality of clipped control point affine motion vectors for motion vector prediction of a neighboring control point affine motion vector. The video coding device may derive a sub-block motion vector associated with a sub-block of the video block, clip the derived sub-block motion vector, and store it for spatial motion vector prediction or temporal motion vector prediction. For example, the video coding device may clip the derived sub-block motion vector based on a motion field range that may be based on a bit depth value.
    Type: Grant
    Filed: June 27, 2019
    Date of Patent: November 15, 2022
    Assignee: VID SCALE, Inc.
    Inventors: Yuwen He, Xiaoyu Xiu, Yan Ye, Philippe Hanhart
  • Publication number: 20220353520
    Abstract: A media content processing device may decode visual volumetric content based on one or more messages, which may indicate which attribute sub-bitstream of one or more attribute sub-bitstreams indicated in a parameter set is active, The parameter set may include a visual volumetric video-based parameter set. The message indicating one or more active attribute sub-bitstreams may be received by the decoder, A decoder may perform decoding, such as determining which attribute sub-bitstream to use for decoding visual media content, based on the one or more messages, The one or more messages may be generated and sent to a decoder, for example, to indicate the deactivation of the one or more attribute sub-bitstreams. The decoder may determine an inactive attribute sub-bitstream and skip the inactive attribute sub-bitstream for decoding the visual media content based on the one or more messages.
    Type: Application
    Filed: July 2, 2020
    Publication date: November 3, 2022
    Applicant: VID SCALE, INC.
    Inventor: Ahmed Hamza
  • Patent number: 11490074
    Abstract: Improved method and apparatus for signaling of reference pictures used for temporal prediction. The signaling schemes and construction process for different reference picture lists in HEVC Working Draft 5 (WD5) are improved.
    Type: Grant
    Filed: July 8, 2020
    Date of Patent: November 1, 2022
    Assignee: VID SCALE, INC.
    Inventors: Yan Ye, Yong He
  • Publication number: 20220345701
    Abstract: Systems, methods, and instrumentalities may be used for decoding and/or encoding a coding unit (CD), An intra-prediction mode for a CD may be determined. A split mode may be determined based on the intra-prediction mode, to generate a plurality of sub-partitions in the CU. A prediction for a first sub-partition of the plurality of sub-partitions in the CU may be based on a reference sample in a second sub-partition of the plurality of sub-partitions in the CU. The CU may be decoded and/or encoded, for example, based on the determined split mode.
    Type: Application
    Filed: September 17, 2020
    Publication date: October 27, 2022
    Applicant: VID SCALE, INC.
    Inventors: Hua Yang, Yuwen He, Wei Chen
  • Publication number: 20220329923
    Abstract: Systems, methods, and instrumentalities are disclosed that relate to the processing of a media container file associated with 3D video data. The media container file may indicate that certain video-based point cloud compression (V-PCC) component tracks may be played together as a playout group. These V-PCG component tracks may represent respective encoded versions of one or more V-PCC components, and a video decoding device may play the tracks together in response to determining that the tracks belong to the same playout track group. The video decoding device may also determine from the media container file that certain PCC component tracks include tile groups that correspond to different objects in a point cloud or different parts of a same object in the point cloud. The video decoding device may decode these tile groups independently from each other so that a subset of the objects or parts of the point cloud may be accessed without also accessing the rest of the objects or parts.
    Type: Application
    Filed: September 23, 2020
    Publication date: October 13, 2022
    Applicant: VID SCALE, INC.
    Inventors: Ahmed Hamza, Yong He
  • Patent number: 11470308
    Abstract: Systems and methods are described for reducing the complexity of using bi-directional optical flow (BIO) in video coding. In some embodiments, bit-width reduction steps are introduced in the BIO motion refinement process to reduce the maximum bit-width used for BIO calculations. In some embodiments, simplified interpolation filters are used to generate predicted samples in an extended region around a current coding unit. In some embodiments, different interpolation filters are used for vertical versus horizontal interpolation. In some embodiments, BIO is disabled for coding units with small heights and/or for coding units that are predicted using a sub-block level inter prediction technique, such as advanced temporal motion vector prediction (ATMVP) or affine prediction.
    Type: Grant
    Filed: September 17, 2019
    Date of Patent: October 11, 2022
    Assignee: VID SCALE, INC.
    Inventors: Xiaoyu Xiu, Yuwen He, Yan Ye, Jiancong Luo
  • Patent number: 11470344
    Abstract: Systems and methods are described for selecting a motion vector (MV) to use in frame-rate up conversion (FRUC) coding of a block of video. In one embodiment, a first set of motion vector candidates is identified for FRUC prediction of the block. A search center is defined based on the first set of motion vector candidates, and a search window is determined, the search window having a selected width and being centered on the search center. A search for a selected MV is performed within the search window. In some embodiments, an initial set of MVs is processed with a clustering algorithm to generate a smaller number of MVs that are used as the first set. The selected MV may be subject to a motion refinement search, which may also be performed over a constrained search range. In additional embodiments, search iterations are constrained to limit complexity.
    Type: Grant
    Filed: January 28, 2019
    Date of Patent: October 11, 2022
    Assignee: VID SCALE, INC.
    Inventors: Xiaoyu Xiu, Yuwen He, Yan Ye
  • Patent number: 11457198
    Abstract: A video coding device may be configured to periodically select the frame packing configuration (e.g., face layout and/or face rotations parameters) associated with a RAS, The device may receive a plurality of pictures, which may each comprise a plurality of faces. The pictures may be grouped Into a plurality of RASs. The device may select a frame packing configuration with the lowest cost for a first RAS. For example, the cost of a frame packing configuration may be determined based on the first picture of the first RAS. The device may select a frame packing configuration for a second RAS. The frame packing configuration for the first RAS may be different than the frame packing configuration for the second RAS. The frame packing configuration for the first RAS and the frame packing configuration for the second RAS may be signaled in the video bitstream.
    Type: Grant
    Filed: January 14, 2019
    Date of Patent: September 27, 2022
    Assignee: VID SCALE, Inc.
    Inventors: Philippe Hanhart, Yuwen He, Yan Ye
  • Patent number: 11445172
    Abstract: Embodiments contemplate coding video data by generating a video encoded bitstream that may include reference picture set (RPS) extensions for inter-layer reference pictures, and the extensions may include inter-layer delta Picture Order Counts (POCs). Embodiments may also include signaling that lower layer reference pictures may be available in a lower layer decoder picture buffer (DPB), and/or an aggregate DPB, that may be added to the RPS set of a higher layer. The bitstream may include a signal indicating whether the higher layer RPS may be specified by a lower layer RPS, and the lower layer RPS may be temporal, inter-layer prediction (ILP), or both.
    Type: Grant
    Filed: January 31, 2013
    Date of Patent: September 13, 2022
    Assignee: VID SCALE, Inc.
    Inventors: Yong He, Yan Ye, George W. McClellan, Jie Dong