Patents Assigned to VID Scale, Inc.
-
Publication number: 20220345701
Abstract: Systems, methods, and instrumentalities may be used for decoding and/or encoding a coding unit (CU). An intra-prediction mode for a CU may be determined. A split mode may be determined based on the intra-prediction mode, to generate a plurality of sub-partitions in the CU. A prediction for a first sub-partition of the plurality of sub-partitions in the CU may be based on a reference sample in a second sub-partition of the plurality of sub-partitions in the CU. The CU may be decoded and/or encoded, for example, based on the determined split mode.
Type: Application
Filed: September 17, 2020
Publication date: October 27, 2022
Applicant: VID SCALE, INC.
Inventors: Hua Yang, Yuwen He, Wei Chen
-
Publication number: 20220329923
Abstract: Systems, methods, and instrumentalities are disclosed that relate to the processing of a media container file associated with 3D video data. The media container file may indicate that certain video-based point cloud compression (V-PCC) component tracks may be played together as a playout group. These V-PCC component tracks may represent respective encoded versions of one or more V-PCC components, and a video decoding device may play the tracks together in response to determining that the tracks belong to the same playout track group. The video decoding device may also determine from the media container file that certain PCC component tracks include tile groups that correspond to different objects in a point cloud or different parts of a same object in the point cloud. The video decoding device may decode these tile groups independently from each other so that a subset of the objects or parts of the point cloud may be accessed without also accessing the rest of the objects or parts.
Type: Application
Filed: September 23, 2020
Publication date: October 13, 2022
Applicant: VID SCALE, INC.
Inventors: Ahmed Hamza, Yong He
-
Patent number: 11470308
Abstract: Systems and methods are described for reducing the complexity of using bi-directional optical flow (BIO) in video coding. In some embodiments, bit-width reduction steps are introduced in the BIO motion refinement process to reduce the maximum bit-width used for BIO calculations. In some embodiments, simplified interpolation filters are used to generate predicted samples in an extended region around a current coding unit. In some embodiments, different interpolation filters are used for vertical versus horizontal interpolation. In some embodiments, BIO is disabled for coding units with small heights and/or for coding units that are predicted using a sub-block level inter prediction technique, such as advanced temporal motion vector prediction (ATMVP) or affine prediction.
Type: Grant
Filed: September 17, 2019
Date of Patent: October 11, 2022
Assignee: VID SCALE, INC.
Inventors: Xiaoyu Xiu, Yuwen He, Yan Ye, Jiancong Luo
-
Patent number: 11470344
Abstract: Systems and methods are described for selecting a motion vector (MV) to use in frame-rate up conversion (FRUC) coding of a block of video. In one embodiment, a first set of motion vector candidates is identified for FRUC prediction of the block. A search center is defined based on the first set of motion vector candidates, and a search window is determined, the search window having a selected width and being centered on the search center. A search for a selected MV is performed within the search window. In some embodiments, an initial set of MVs is processed with a clustering algorithm to generate a smaller number of MVs that are used as the first set. The selected MV may be subject to a motion refinement search, which may also be performed over a constrained search range. In additional embodiments, search iterations are constrained to limit complexity.
Type: Grant
Filed: January 28, 2019
Date of Patent: October 11, 2022
Assignee: VID SCALE, INC.
Inventors: Xiaoyu Xiu, Yuwen He, Yan Ye
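The constrained search described in this abstract can be illustrated with a minimal sketch. The abstract does not say how the search center is derived or how candidates outside the window are handled, so the rounded mean in `search_center`, the clipping in `clip_to_window`, and the caller-supplied `cost` function are all assumptions made for illustration, not the patented method.

```python
from typing import Callable, List, Tuple

MV = Tuple[int, int]  # (horizontal, vertical) motion vector, integer units


def search_center(candidates: List[MV]) -> MV:
    """Assumed rule: use the rounded mean of the candidate MVs as the center."""
    n = len(candidates)
    sx = sum(mv[0] for mv in candidates)
    sy = sum(mv[1] for mv in candidates)
    return (round(sx / n), round(sy / n))


def clip_to_window(mv: MV, center: MV, width: int) -> MV:
    """Clamp an MV into a square window of the given width around center."""
    half = width // 2
    x = min(max(mv[0], center[0] - half), center[0] + half)
    y = min(max(mv[1], center[1] - half), center[1] + half)
    return (x, y)


def select_mv(candidates: List[MV], cost: Callable[[MV], float], width: int) -> MV:
    """Pick the lowest-cost candidate after constraining all of them to the window."""
    center = search_center(candidates)
    clipped = [clip_to_window(mv, center, width) for mv in candidates]
    return min(clipped, key=cost)


# Hypothetical cost: distance to a "true" motion of (7, 9).
candidates = [(0, 0), (4, 0), (8, 12)]
best = select_mv(candidates, lambda mv: abs(mv[0] - 7) + abs(mv[1] - 9), width=8)
```

Bounding every candidate to one window keeps the reference samples that must be fetched within a fixed region, which is the kind of complexity limit the abstract refers to.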
-
Patent number: 11457198
Abstract: A video coding device may be configured to periodically select the frame packing configuration (e.g., face layout and/or face rotations parameters) associated with a RAS. The device may receive a plurality of pictures, which may each comprise a plurality of faces. The pictures may be grouped into a plurality of RASs. The device may select a frame packing configuration with the lowest cost for a first RAS. For example, the cost of a frame packing configuration may be determined based on the first picture of the first RAS. The device may select a frame packing configuration for a second RAS. The frame packing configuration for the first RAS may be different than the frame packing configuration for the second RAS. The frame packing configuration for the first RAS and the frame packing configuration for the second RAS may be signaled in the video bitstream.
Type: Grant
Filed: January 14, 2019
Date of Patent: September 27, 2022
Assignee: VID SCALE, Inc.
Inventors: Philippe Hanhart, Yuwen He, Yan Ye
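The per-RAS selection in this abstract amounts to a lowest-cost choice evaluated on the first picture of each RAS. The sketch below shows only that selection step; the configuration labels, the picture representation, and the `cost` function are hypothetical placeholders, since the abstract does not define how cost is measured.

```python
from typing import Callable, List, Sequence, TypeVar

Picture = TypeVar("Picture")
Config = TypeVar("Config")


def select_frame_packing(
    ras_list: Sequence[Sequence[Picture]],
    configs: Sequence[Config],
    cost: Callable[[Picture, Config], float],
) -> List[Config]:
    """For each RAS, return the config with the lowest cost on its first picture."""
    selected = []
    for ras in ras_list:
        first_picture = ras[0]
        selected.append(min(configs, key=lambda cfg: cost(first_picture, cfg)))
    return selected


# Hypothetical data: pictures are plain labels, costs come from a lookup table.
toy_cost = {("p0", "3x2"): 5.0, ("p0", "2x3"): 7.0,
            ("p8", "3x2"): 9.0, ("p8", "2x3"): 4.0}
choices = select_frame_packing(
    [["p0", "p1"], ["p8", "p9"]],
    ["3x2", "2x3"],
    lambda pic, cfg: toy_cost[(pic, cfg)],
)
```

In this toy run the two RASs end up with different configurations, matching the case the abstract describes where each choice would then be signaled in the bitstream.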
-
Patent number: 11445172
Abstract: Embodiments contemplate coding video data by generating a video encoded bitstream that may include reference picture set (RPS) extensions for inter-layer reference pictures, and the extensions may include inter-layer delta Picture Order Counts (POCs). Embodiments may also include signaling that lower layer reference pictures may be available in a lower layer decoder picture buffer (DPB), and/or an aggregate DPB, that may be added to the RPS set of a higher layer. The bitstream may include a signal indicating whether the higher layer RPS may be specified by a lower layer RPS, and the lower layer RPS may be temporal, inter-layer prediction (ILP), or both.
Type: Grant
Filed: January 31, 2013
Date of Patent: September 13, 2022
Assignee: VID SCALE, Inc.
Inventors: Yong He, Yan Ye, George W. McClellan, Jie Dong
-
Patent number: 11432010
Abstract: Systems, methods, and instrumentalities are disclosed for discontinuous face boundary filtering for 360-degree video coding. A face discontinuity may be filtered (e.g., to reduce seam artifacts) in whole or in part, for example, using coded samples or padded samples on either side of the face discontinuity. Filtering may be applied, for example, as an in-loop filter or a post-processing step. 2D positional information related to two sides of the face discontinuity may be signaled in a video bitstream so that filtering may be applied independent of projection formats and/or frame packing techniques.
Type: Grant
Filed: December 18, 2018
Date of Patent: August 30, 2022
Assignee: VID SCALE, Inc.
Inventors: Philippe Hanhart, Yan Ye, Yuwen He
-
Patent number: 11432001
Abstract: Systems, methods, and instrumentalities are disclosed relating to intra prediction of a video signal based on mode-dependent subsampling. A block of coefficients associated with a first sub block of a video block, one or more blocks of coefficients associated with one or more remaining sub blocks of the video block, and an indication of a prediction mode for the video block may be received. One or more interpolating techniques, a predicted first sub block, and the predicted sub blocks of the one or more remaining sub blocks may be determined. A reconstructed first sub block and one or more reconstructed remaining sub blocks may be generated. A reconstructed video block may be formed based on the prediction mode, the reconstructed first sub block, and the one or more reconstructed remaining sub blocks.
Type: Grant
Filed: March 14, 2017
Date of Patent: August 30, 2022
Assignee: VID SCALE, Inc.
Inventors: Yan Ye, Qian Chen, Jie Dong
-
Publication number: 20220272326
Abstract: Systems, methods, and instrumentalities are described herein for calculating local illumination compensation (LIC) parameters for a bi-predicted coding unit (CU). The LIC parameters may be used to generate adjusted samples for the current CU and to address local illumination changes that may exist among temporal neighboring pictures. LIC parameters may be calculated based on bi-predicted reference template samples and template samples for a current CU. Bi-predicted reference template samples may be generated based on reference template samples neighboring temporal reference CUs. For example, the bi-predicted reference template samples may be generated based on averaging the reference template samples. The reference template samples may correspond to template samples for the current CU. A CU may be or may include a coding block and/or a sub-block that may be derived by dividing the coding block.
Type: Application
Filed: April 28, 2022
Publication date: August 25, 2022
Applicant: VID SCALE, INC.
Inventors: Xiaoyu Xiu, Yuwen He, Yan Ye, Saurav Bandyopadhyay
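As a rough illustration of the calculation this abstract outlines, the sketch below averages two reference templates and then fits a linear model, current = alpha * reference + beta, to the current CU's template. The averaging step comes from the abstract; the linear model and the least-squares fit are assumptions chosen for illustration, and the function names are hypothetical.

```python
from typing import List, Tuple


def average_templates(t0: List[float], t1: List[float]) -> List[float]:
    """Bi-predicted reference template: sample-wise average of two templates."""
    return [(a + b) / 2 for a, b in zip(t0, t1)]


def lic_parameters(ref: List[float], cur: List[float]) -> Tuple[float, float]:
    """Least-squares fit of cur ~= alpha * ref + beta (assumed model)."""
    n = len(ref)
    sx, sy = sum(ref), sum(cur)
    sxx = sum(x * x for x in ref)
    sxy = sum(x * y for x, y in zip(ref, cur))
    denom = n * sxx - sx * sx
    if denom == 0:
        # Flat reference template: fall back to a pure offset.
        return 1.0, (sy - sx) / n
    alpha = (n * sxy - sx * sy) / denom
    beta = (sy - alpha * sx) / n
    return alpha, beta


ref_template = average_templates([10, 20, 30, 40], [14, 24, 34, 44])
alpha, beta = lic_parameters(ref_template, [27, 47, 67, 87])
```

Under this model, an adjusted sample would be `alpha * predicted_sample + beta`, applied to the bi-predicted samples of the current CU.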
-
Patent number: 11425418
Abstract: External overlapped block motion compensation (OBMC) may be performed for samples of a coding unit (CU) located along an inter-CU boundary of the CU while internal OBMC may be performed separately for samples located along inter-sub-block boundaries inside the CU. External OBMC may be applied based on substantially similar motion information associated with multiple external blocks neighboring the CU. The external blocks may be treated as a group to provide OBMC for multiple boundary samples together in an external OBMC operation. Internal OBMC may be applied using the same sub-block size used for sub-block level motion derivation. Internal OBMC may be disabled for the CU, for example, if the CU is coded in a spatial-temporal motion vector prediction (STMVP) mode.
Type: Grant
Filed: November 1, 2018
Date of Patent: August 23, 2022
Assignee: VID SCALE, Inc.
Inventors: Xiaoyu Xiu, Yuwen He, Yan Ye, Yan Zhang
-
Publication number: 20220261616
Abstract: Systems, methods, and instrumentalities are disclosed for clustering-based quantization for neural network (NN) compression. A distribution of weights in weight tensors in NN layers may be analyzed to identify cluster outliers. Cluster inliers may be coded from cluster outliers, for example, using scalar and/or vector quantization. Weight-rearrangement may rearrange weights for higher dimensional weight tensors into lower dimensional matrices. For example, weight rearrangement may flatten a convolutional kernel into a vector. Correlation between kernels may be preserved, for example, by treating a filter or kernels across a channel as a point. A tensor may be split into multiple subspaces, for example, along an input and/or an output channel. Predictive coding may be performed for a current block of weights or weight matrix based on a reshaped or previously coded block or matrix. Arrangement, inlier, outlier, and/or prediction information may be signaled to a decoder for reconstruction of a compressed NN.
Type: Application
Filed: July 1, 2020
Publication date: August 18, 2022
Applicant: VID SCALE, INC.
Inventors: Duanshun Li, Dong Tian, Hua Yang, Yuwen He
-
Patent number: 11418816
Abstract: Systems, methods, and instrumentalities may be provided for discounting reconstructed samples and/or coding information from spatial neighbors across face discontinuities. Whether a current block is located at a face discontinuity may be determined. The face discontinuity may be a face boundary between two or more adjoining blocks that are not spherical neighbors. The coding availability of a neighboring block of the current block may be determined, e.g., based on whether the neighboring block is on the same side of the face discontinuity as the current block. For example, the neighboring block may be determined to be available for decoding the current block if it is on the same side of the face discontinuity as the current block, and unavailable if it is not on the same side of the face discontinuity. The neighboring block may be a spatial neighboring block or a temporal neighboring block.
Type: Grant
Filed: September 19, 2018
Date of Patent: August 16, 2022
Assignee: VID SCALE, Inc.
Inventors: Philippe Hanhart, Yuwen He, Yan Ye
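The availability rule in this abstract can be sketched as a same-side test. For simplicity the sketch assumes each face discontinuity is a vertical line at a known x position in the frame-packed picture and that blocks are identified by the x coordinate of their top-left sample; both are illustrative assumptions, as is the function naming.

```python
from typing import Iterable


def same_side(cur_x: int, nbr_x: int, discontinuity_x: int) -> bool:
    """True if both blocks lie on the same side of a vertical discontinuity."""
    return (cur_x < discontinuity_x) == (nbr_x < discontinuity_x)


def neighbor_available(cur_x: int, nbr_x: int, discontinuities: Iterable[int]) -> bool:
    """A neighbor is usable only if no discontinuity separates it from the current block."""
    return all(same_side(cur_x, nbr_x, d) for d in discontinuities)


# Hypothetical layout: faces are 64 samples wide, with a discontinuity at x = 64.
left_neighbor_ok = neighbor_available(cur_x=72, nbr_x=64, discontinuities=[64])
across_boundary_ok = neighbor_available(cur_x=64, nbr_x=56, discontinuities=[64])
```

Here the neighbor at x = 56 is treated as unavailable for the block at x = 64 because the two sit on opposite sides of the discontinuity, so its samples and coding information would not be used.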
-
Patent number: 11405621
Abstract: Sampling grid information may be determined for multi-layer video coding systems. The sampling grid information may be used to align the video layers of a coding system. Sampling grid correction may be performed based on the sampling grid information. The sampling grids may also be detected. In some embodiments, a sampling grid precision may also be detected and/or signaled.
Type: Grant
Filed: April 22, 2020
Date of Patent: August 2, 2022
Assignee: VID SCALE, Inc.
Inventors: Yan Ye, Yuwen He, Jie Dong
-
Publication number: 20220239947
Abstract: Systems, methods, and instrumentalities are disclosed herein that relate to video-based point cloud streams in one or more ISO Base Media File Format (ISOBMFF) container files. A container format for point cloud data is provided and the container format indicates at least a relationship between a 3D region of the point cloud and one or more video-based point cloud compression (V-PCC) tracks. The V-PCC tracks may be grouped together and linked to the 3D region to allow spatial access to the 3D region.
Type: Application
Filed: May 21, 2020
Publication date: July 28, 2022
Applicant: VID SCALE, INC.
Inventors: Hamza Ahmed, Yong He
-
Publication number: 20220239921
Abstract: Systems, methods, and instrumentalities are disclosed for sub-block/block refinement, including sub-block/block boundary refinement, such as block boundary prediction refinement with optical flow (BBPROF). A block comprising a current sub-block may be decoded based on a sample value for a first pixel that is obtained based on, for example, an MV for a current sub-block, an MV for a sub-block adjacent the current sub-block, and a sample value for a second pixel adjacent the first pixel. BBPROF may include determining spatial gradients at pixel(s)/sample location(s). An MV difference may be calculated between a current sub-block and one or more neighboring sub-blocks. An MV offset may be determined at pixel(s)/sample location(s) based on the MV difference. A sample value offset for the pixel in a current sub-block may be determined. The prediction for a reference picture list may be refined by adding the calculated sample value offset to the sub-block prediction.
Type: Application
Filed: June 2, 2020
Publication date: July 28, 2022
Applicant: VID SCALE, INC.
Inventors: Wei Chen, Jiancong Luo, Yuwen He
-
Patent number: 11395015
Abstract: Media content coded using scalable coding techniques may be cached among a group of cache devices. Layered segments of the media content may be pre-loaded onto the cache devices, which may be located throughout a content distribution network, including a home network. The caching location of the media content may be determined based on multiple factors including a content preference associated with the group of cache devices and device capabilities. A cache controller may manage the caching of the media content.
Type: Grant
Filed: January 29, 2020
Date of Patent: July 19, 2022
Assignee: VID SCALE, Inc.
Inventors: Yong He, Yuwen He, Yan Ye, Ralph Neff
-
Patent number: 11388438
Abstract: Processing video data may include capturing the video data with multiple cameras and stitching the video data together to obtain a 360-degree video. A frame-packed picture may be provided based on the captured and stitched video data. A current sample location may be identified in the frame-packed picture. Whether a neighboring sample location is located outside of a content boundary of the frame-packed picture may be determined. When the neighboring sample location is located outside of the content boundary, a padding sample location may be derived based on at least one circular characteristic of the 360-degree video content and the projection geometry. The 360-degree video content may be processed based on the padding sample location.
Type: Grant
Filed: December 4, 2020
Date of Patent: July 12, 2022
Assignee: VID SCALE, Inc.
Inventors: Yuwen He, Yan Ye, Philippe Hanhart, Xiaoyu Xiu
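To make the padding derivation concrete, the sketch below assumes an equirectangular projection (ERP), which the abstract does not name. In ERP the picture wraps horizontally, and crossing the top or bottom edge passes over a pole and lands on the opposite half of the sphere. The function name and the restriction to offsets smaller than the picture height are assumptions of this sketch.

```python
from typing import Tuple


def erp_padding_location(x: int, y: int, width: int, height: int) -> Tuple[int, int]:
    """Map a sample location outside an ERP picture to the in-picture location
    holding the same spherical content (assumes -height <= y < 2 * height)."""
    if y < 0:
        # Crossing the top edge passes over the north pole: reflect the row
        # and jump half a picture width (180 degrees of longitude).
        y = -y - 1
        x += width // 2
    elif y >= height:
        # Crossing the bottom edge passes over the south pole.
        y = 2 * height - 1 - y
        x += width // 2
    # Longitude is circular, so columns wrap around horizontally.
    return (x % width, y)


left_wrap = erp_padding_location(-1, 0, width=8, height=4)
top_wrap = erp_padding_location(0, -1, width=8, height=4)
```

A codec using such a mapping could fetch padded reference samples from the derived location instead of repeating the edge sample, which is the kind of processing "based on the padding sample location" that the abstract describes.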
-
Publication number: 20220201327
Abstract: A video coding system (e.g., an encoder and/or a decoder) may perform face-based sub-block motion compensation for 360-degree video to predict samples (e.g., of a sub-block). The video coding system may receive a 360-degree video content. The 360-degree video content may include a current block. The current block may include a plurality of sub-blocks. The system may determine whether a sub-block mode is used for the current block. The system may predict a sample in the current block based on the sub-block level face association. For a first sub-block in the current block, the system may identify a first location of the first sub-block. The system may associate the first sub-block with a first face based on the identified first location of the first sub-block. The system may predict a first sample in the first sub-block based on the first face that is associated with the first sub-block.
Type: Application
Filed: March 9, 2022
Publication date: June 23, 2022
Applicant: VID SCALE, INC.
Inventors: Xiaoyu Xiu, Yuwen He, Yan Ye
-
Patent number: 11368676
Abstract: Systems, methods, and instrumentalities are described herein for calculating local illumination compensation (LIC) parameters for a bi-predicted coding unit (CU). The LIC parameters may be used to generate adjusted samples for the current CU and to address local illumination changes that may exist among temporal neighboring pictures. LIC parameters may be calculated based on bi-predicted reference template samples and template samples for a current CU. Bi-predicted reference template samples may be generated based on reference template samples neighboring temporal reference CUs. For example, the bi-predicted reference template samples may be generated based on averaging the reference template samples. The reference template samples may correspond to template samples for the current CU. A CU may be or may include a coding block and/or a sub-block that may be derived by dividing the coding block.
Type: Grant
Filed: January 15, 2019
Date of Patent: June 21, 2022
Assignee: VID SCALE, Inc.
Inventors: Xiaoyu Xiu, Yuwen He, Yan Ye, Saurav Bandyopadhyay
-
Patent number: 11368509
Abstract: A decoding complexity may be used to predict power consumption for receiving, decoding, and/or displaying multimedia content at a wireless transmit/receive unit (WTRU). The decoding complexity may be based on decoding complexity feedback received from a reference device, such as another WTRU. The decoding complexity feedback may be based on measurements performed at the reference device for receiving, decoding, and/or displaying the multimedia content. A content providing device may indicate the decoding complexity of requested media content to a WTRU, or another network entity. The decoding complexity may be indicated in a streaming protocol or file associated with the media content. The WTRU, or other network entity, may use the decoding complexity to determine its preferences regarding transmission of the media content. The content providing device may determine whether to transmit the media content based on the decoding complexity and/or the preferences of the WTRU or other network entity.
Type: Grant
Filed: January 25, 2019
Date of Patent: June 21, 2022
Assignee: VID SCALE, Inc.
Inventors: Osama Lotfallah, Eduardo Asbun, Hang Liu, Yuriy Reznik