Patents by Inventor Kai Zhang

Kai Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240146907
    Abstract: Embodiments of the present disclosure provide a method for video processing. The method comprises: determining, during a conversion between a current video block of a video and a bitstream of the video, a geometric partition mode (GPM) partition index for the current video block, the GPM partition index being absent from the bitstream; and performing the conversion based on the GPM partition index. Compared with conventional solutions, the proposed method can advantageously reduce the coding bits while maintaining the coding quality.
    Type: Application
    Filed: January 8, 2024
    Publication date: May 2, 2024
    Inventors: Zhipin DENG, Kai ZHANG, Li ZHANG
  • Publication number: 20240146927
    Abstract: Techniques for video processing, including video coding, video decoding and video transcoding are described. One example method includes performing a conversion between a video picture that includes one or more tiles and one or more rectangular slices and a bitstream of the video according to a rule. The rule specifies that, for iteratively determining information about the one or more rectangular slices, a variable indicating a tile index is updated only for slices having indices that are smaller than a value equal to a number of slices in the video picture minus 1.
    Type: Application
    Filed: November 27, 2023
    Publication date: May 2, 2024
    Inventors: Kai Zhang, Li Zhang, Ye-Kui Wang, Zhipin Deng, Kui Fan, Jizheng Xu, Hongbin Liu
  • Publication number: 20240146929
    Abstract: Devices, systems, and methods for palette mode coding are described. An exemplary method for video processing includes performing a conversion between a block of a video and a bitstream representation of the video. The bitstream representation is processed according to a format rule that specifies that a first indication of usage of a palette mode and a second indication of usage of an intra block copy (IBC) mode are signaled dependent on each other.
    Type: Application
    Filed: October 17, 2023
    Publication date: May 2, 2024
    Inventors: Weijia Zhu, Li Zhang, Jizheng Xu, Kai Zhang, Hongbin Liu, Yue Wang
  • Publication number: 20240146973
    Abstract: Methods, systems and apparatus for video processing are described. One example video processing method includes performing a conversion between a video including a video region and a bitstream of the video according to a rule, wherein the rule specifies that an adaptive loop filtering operation is allowed for the video region in response to an absence of one or more adaptation parameter set (APS) network abstraction layer (NAL) units that include adaptive loop filtering data.
    Type: Application
    Filed: December 1, 2023
    Publication date: May 2, 2024
    Inventors: Zhipin DENG, Ye-Kui WANG, Li ZHANG, Kai ZHANG
  • Publication number: 20240146908
    Abstract: Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, during a conversion between a target video block of a video and a bitstream of the video, a cost metric for a target motion candidate for the target video block at least based on a matching cost of the target motion candidate; and performing the conversion based on a comparison of the cost metric and a further matching cost for the target video block. Compared with the conventional solution, the proposed method can advantageously improve the coding effectiveness and coding efficiency.
    Type: Application
    Filed: December 13, 2023
    Publication date: May 2, 2024
    Inventors: Na Zhang, Kai Zhang, Li Zhang
  • Patent number: 11973971
    Abstract: Devices, systems and methods for processing video are described. In a representative aspect, a video processing method includes: maintaining tables, wherein each table includes a set of motion candidates and each motion candidate is associated with corresponding motion information; performing a conversion between a first video block and a bitstream representation of a video including the first video block based on the tables; and updating, after performing the conversion, zero or more tables based on an update rule.
    Type: Grant
    Filed: July 7, 2021
    Date of Patent: April 30, 2024
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD, BYTEDANCE INC.
    Inventors: Li Zhang, Kai Zhang, Hongbin Liu, Yue Wang
  • Patent number: 11972519
    Abstract: Described herein are techniques for learning neural reflectance shaders from images. A set of one or more machine learning models can be trained to optimize an illumination latent code and a set of reflectance latent codes for an object within a set of input images. A shader can then be generated based on a machine learning model of the one or more machine learning models. The shader is configured to sample the illumination latent code and the set of reflectance latent codes for the object. A 3D representation of the object can be rendered using the generated shader.
    Type: Grant
    Filed: June 24, 2022
    Date of Patent: April 30, 2024
    Assignee: Intel Corporation
    Inventors: Benjamin Ummenhofer, Shenlong Wang, Sanskar Agrawal, Yixing Lao, Kai Zhang, Stephan Richter, Vladlen Koltun
  • Patent number: 11973973
    Abstract: A method of video processing includes determining a refined prediction sample P'(x,y) at a position (x,y) in a video block by modifying a prediction sample P(x,y) at the position (x,y) with a first gradient component Gx(x,y) in a first direction estimated at the position (x,y), a second gradient component Gy(x,y) in a second direction estimated at the position (x,y), a first motion displacement Vx(x,y) estimated for the position (x,y), and a second motion displacement Vy(x,y) estimated for the position (x,y), where x and y are integer numbers, and performing a conversion between the video block and a bitstream representation of the video block using a reconstructed sample value Rec(x,y) at the position (x,y) that is obtained based on the refined prediction sample P'(x,y) and a residue sample value Res(x,y).
    Type: Grant
    Filed: September 16, 2021
    Date of Patent: April 30, 2024
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD, BYTEDANCE INC.
    Inventors: Kai Zhang, Li Zhang, Hongbin Liu, Jizheng Xu, Yue Wang
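The abstract of patent 11973973 names the inputs to the refinement but not the exact formula. The sketch below assumes the common additive optical-flow form, in which each gradient component is scaled by the matching motion displacement and added to the prediction sample, and assumes the reconstruction is the refined sample plus the residue. The function names are hypothetical, and rounding and clipping are left out.

```python
def refine_prediction_sample(p, gx, gy, vx, vy):
    """Refined sample P'(x,y) under the ASSUMED additive form:
    P' = P + Gx * Vx + Gy * Vy (not stated in the abstract)."""
    return p + gx * vx + gy * vy


def reconstruct_sample(p_refined, res):
    """Reconstructed sample Rec(x,y), ASSUMED to be P' plus the residue Res."""
    return p_refined + res


# Example values are arbitrary illustrations, not taken from the patent.
p_refined = refine_prediction_sample(p=100, gx=4, gy=-2, vx=0.5, vy=0.25)
rec = reconstruct_sample(p_refined, res=3)
print(p_refined, rec)  # 101.5 104.5
```

With zero motion displacement the refined sample equals the original prediction sample, so the refinement only takes effect where a displacement has been estimated.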
  • Patent number: 11973962
    Abstract: Devices, systems and methods for applying intra-block copy (IBC) in video coding are described. In general, methods for integrating IBC with existing motion compensation algorithms for video encoding and decoding are described. In a representative aspect, a method for video encoding using IBC includes determining whether a current block of the current picture is to be encoded using a motion compensation algorithm, and encoding, based on the determining, the current block by selectively applying an intra-block copy to the current block. In a representative aspect, another method for video encoding using IBC includes determining whether a current block of the current picture is to be encoded using an intra-block copy, and encoding, based on the determining, the current block by selectively applying a motion compensation algorithm to the current block.
    Type: Grant
    Filed: August 26, 2021
    Date of Patent: April 30, 2024
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD, BYTEDANCE INC.
    Inventors: Kai Zhang, Li Zhang, Hongbin Liu, Yue Wang
  • Patent number: 11973931
    Abstract: A method of video processing is described. The method includes: deriving, for a conversion between a current block of a video and a coded representation of the video, a prediction block for the current block that is determined according to a tool in which weights are used to determine a weighted sum of two initial prediction blocks used to determine the prediction block; and performing the conversion using the prediction block; wherein the deriving includes: applying the weights to two intermediate prediction samples that respectively belong to the two initial prediction blocks to derive a final prediction sample; performing a right shift operation for the final prediction sample, wherein the right shift operation is pbSample >> (shift1 + 3), pbSample represents the final prediction sample, and shift1 is set equal to Max(2, 14 - bitDepth); and determining the prediction block based on the final prediction sample.
    Type: Grant
    Filed: March 10, 2022
    Date of Patent: April 30, 2024
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD, BYTEDANCE INC.
    Inventors: Yang Wang, Hongbin Liu, Li Zhang, Kai Zhang, Jizheng Xu, Yue Wang
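The right-shift arithmetic in the abstract of patent 11973931 can be worked through with a small sketch. Only the shift expression and the definition of shift1 come from the abstract. The function names, the example weights (chosen to sum to 8, which would match the extra shift of 3) and the 14-bit intermediate sample values are assumptions made for illustration, and no rounding offset or clipping is applied.

```python
def total_right_shift(bit_depth: int) -> int:
    """shift1 + 3, where shift1 = Max(2, 14 - bitDepth) as in the abstract."""
    shift1 = max(2, 14 - bit_depth)
    return shift1 + 3


def weighted_prediction_sample(p0: int, p1: int, w0: int, w1: int,
                               bit_depth: int) -> int:
    """Weighted sum of two intermediate prediction samples, then the shift.

    ASSUMPTION: the weights and sample precision are illustrative only;
    the abstract does not give them.
    """
    pb_sample = w0 * p0 + w1 * p1
    return pb_sample >> total_right_shift(bit_depth)


print(total_right_shift(8))   # 9  (shift1 = 6)
print(total_right_shift(10))  # 7  (shift1 = 4)
print(total_right_shift(16))  # 5  (shift1 is floored at 2)
print(weighted_prediction_sample(8192, 8192, 4, 4, bit_depth=10))  # 512
```

In the last line, two mid-range 14-bit intermediate samples with equal weights map back to the mid-range value for a 10-bit video, which is the behavior one would expect if the shift normalizes both the weights and the intermediate precision.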
  • Publication number: 20240135591
    Abstract: Embodiments of the present disclosure provide a method for point cloud coding. The method comprises: classifying, during a conversion between a current frame of a point cloud sequence and a bitstream of the point cloud sequence, a target point in the current frame into a first set of classes based on a second set of thresholds, the number of thresholds in the second set being larger than the number of classes in the first set; and performing the conversion based on the classification. Compared with the conventional solution, the proposed method can advantageously improve the accuracy of global motion estimation and coding quality.
    Type: Application
    Filed: December 28, 2023
    Publication date: April 25, 2024
    Inventors: Yingzhan XU, Kai ZHANG, Li ZHANG
  • Publication number: 20240137505
    Abstract: A mechanism for processing video data is disclosed. The mechanism determines whether to apply a bilateral filter to filter samples in a video unit based on video information related to the samples. A conversion is performed between visual media data and a bitstream based on the bilateral filter and the samples.
    Type: Application
    Filed: December 22, 2023
    Publication date: April 25, 2024
    Inventors: Wenbin Yin, Kai Zhang, Li Zhang
  • Publication number: 20240137510
    Abstract: A mechanism for processing video data implemented by a video coding apparatus is disclosed. The mechanism determines to split a coding tree unit (CTU) into one or more coding units (CUs). One or more of the CUs may be one or more prediction tree units (PTUs). The mechanism further determines to recursively split the CUs into prediction units (PUs). The mechanism performs a conversion between visual media data and a bitstream based on the PUs.
    Type: Application
    Filed: December 29, 2023
    Publication date: April 25, 2024
    Inventors: Kai Zhang, Li Zhang, Zhipin Deng, Na Zhang, Yang Wang
  • Publication number: 20240137573
    Abstract: A mechanism for processing video data is disclosed. The mechanism determines to apply a bilateral filter to samples in a current block of a current picture. The bilateral filter includes filter weights that vary based on a distance between surrounding samples and a central sample and differences in intensities of the surrounding samples and the central sample. A conversion is performed between visual media data and a bitstream based on the bilateral filter.
    Type: Application
    Filed: December 23, 2023
    Publication date: April 25, 2024
    Inventors: Wenbin Yin, Kai Zhang, Li Zhang
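The abstract of publication 20240137573 describes filter weights that depend on both spatial distance and intensity difference. A minimal sketch of a textbook bilateral filter with that property follows. The Gaussian weight shape, the parameters `radius`, `sigma_d` and `sigma_r`, and the function name are assumptions for illustration and are not taken from the application.

```python
import math


def bilateral_filter_sample(block, cy, cx, radius=1, sigma_d=1.0, sigma_r=10.0):
    """Filter the sample at (cy, cx) of a 2D list of intensities.

    ASSUMPTION: Gaussian weights in both distance and intensity difference,
    as in the classic bilateral filter; the application may differ.
    """
    center = block[cy][cx]
    weighted_sum = 0.0
    weight_total = 0.0
    for y in range(max(0, cy - radius), min(len(block), cy + radius + 1)):
        for x in range(max(0, cx - radius), min(len(block[0]), cx + radius + 1)):
            dist_sq = (y - cy) ** 2 + (x - cx) ** 2
            diff = block[y][x] - center
            # Weight shrinks with distance from the center and with
            # intensity difference from the central sample.
            weight = (math.exp(-dist_sq / (2 * sigma_d ** 2))
                      * math.exp(-(diff * diff) / (2 * sigma_r ** 2)))
            weighted_sum += weight * block[y][x]
            weight_total += weight
    return weighted_sum / weight_total


# A flat block with one bright outlier: the outlier gets a near-zero weight.
block = [[100, 100, 100],
         [100, 100, 100],
         [100, 100, 200]]
print(bilateral_filter_sample(block, 1, 1))
```

A plain 3x3 average of this block would be about 111.1, while the bilateral result stays very close to 100, which shows how the intensity term keeps edges from being smeared.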
  • Publication number: 20240135592
    Abstract: Embodiments of the present disclosure provide a method for point cloud coding. The method comprises: obtaining, during a conversion between a current frame of a point cloud sequence and a bitstream of the point cloud sequence, a compensated reference frame of the current frame by performing motion compensation on a reference frame of the current frame based on a set of rotation matrixes comprising at least one fixed rotation matrix or a plurality of rotation matrixes; and performing the conversion based on the compensated reference frame. Compared with the conventional solution, the proposed method can advantageously improve coding efficiency.
    Type: Application
    Filed: December 28, 2023
    Publication date: April 25, 2024
    Inventors: Yingzhan XU, Kai ZHANG, Li ZHANG
  • Publication number: 20240137529
    Abstract: Embodiments of the present disclosure provide a method for video processing. The method comprises: determining, during a conversion between a current video block of a video and a bitstream of the video, at least one target intra prediction mode for the current video block based on neighboring reconstructed samples of the current video block; determining a prediction or a reconstruction of the current video block based on a combination of the at least one target intra prediction mode and one of an inter coding tool or a candidate coding tool, the candidate coding tool being used for determining a reference block for the current video block with samples in a current picture associated with the current video block; and performing the conversion based on the prediction or the reconstruction of the current video. Compared with conventional solutions, the proposed method can advantageously improve coding efficiency and coding quality.
    Type: Application
    Filed: December 28, 2023
    Publication date: April 25, 2024
    Inventors: Yang WANG, Li ZHANG, Kai ZHANG, Zhipin DENG
  • Publication number: 20240137518
    Abstract: A method of processing video data. The method includes using coded information during application of a super resolution (SR) process to a video unit; and performing a conversion between a video including the video unit and a bitstream of the video based on the SR process as applied using the coded information. A corresponding video coding apparatus and non-transitory computer-readable recording medium are also disclosed.
    Type: Application
    Filed: December 27, 2023
    Publication date: April 25, 2024
    Inventors: Chaoyi Lin, Yue Li, Kai Zhang, Zhaobin Zhang, Li Zhang
  • Publication number: 20240137554
    Abstract: A method for processing a video includes performing a conversion between a current block of visual media data and a corresponding coded representation of the visual media data. The conversion of the current block includes determining whether a use of one or both of a bi-directional optical flow (BIO) technique or a decoder-side motion vector refinement (DMVR) technique to the current block is enabled or disabled. Determining the use of the BIO technique or the DMVR technique is based on a cost criterion associated with the current block.
    Type: Application
    Filed: December 6, 2023
    Publication date: April 25, 2024
    Inventors: Hongbin Liu, Li Zhang, Kai Zhang, Jizheng Xu, Yue Wang
  • Publication number: 20240137572
    Abstract: Methods, systems and apparatus for video processing are described. One example video processing method includes performing a conversion between a video comprising a video block and a bitstream of the video according to a rule, wherein the video block is a coding tree node that includes one or more coding units, and wherein the rule specifies that coded information of the video block is indicative of whether a coding mode is enabled for the one or more coding units of the video block.
    Type: Application
    Filed: November 29, 2023
    Publication date: April 25, 2024
    Inventors: Zhipin Deng, Ye-Kui Wang, Li Zhang, Kai Zhang
  • Publication number: 20240137517
    Abstract: A method of processing video data. The method includes applying a super resolution (SR) process to a video unit at a specific position relative to one or more in-loop filters when the one or more in-loop filters are applied to the video unit, and performing a conversion between a video comprising the video unit and a bitstream of the video based on the SR process and the one or more in-loop filters as applied. A corresponding video coding apparatus and non-transitory computer-readable recording medium are also disclosed.
    Type: Application
    Filed: December 27, 2023
    Publication date: April 25, 2024
    Inventors: Chaoyi Lin, Yue Li, Kai Zhang, Zhaobin Zhang, Li Zhang