Patents by Inventor Siwei Ma

Siwei Ma has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12659474
    Abstract: A method implemented by a video coding apparatus. The method includes applying a first filter to an unfiltered sample of a video unit to generate a filtered sample. The first filter is a neural network (NN) filter based on a non-deep learning-based filter (NDLF) being disabled, and the first filter is the NDLF based on the NN filter being disabled. The method also includes performing a conversion between a video media file and a bitstream based on the filtered sample that was generated.
    Type: Grant
    Filed: October 16, 2023
    Date of Patent: June 16, 2026
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., BYTEDANCE INC., BYTEDANCE (HK) LIMITED
    Inventors: Yue Li, Li Zhang, Kai Zhang, Junru Li, Meng Wang, Siwei Ma, Shiqi Wang
  • Patent number: 12659508
    Abstract: Embodiments of the present disclosure provide a solution for video processing. In some embodiments, a method for video processing is proposed, which comprises: reordering at least one intra string copy (ISC) reference string, the at least one ISC reference string matching with at least one string of a block of a video; reconstructing the block based on the reordered at least one ISC reference string; and performing a conversion between the reconstructed block and a bitstream of the video. The proposed method can advantageously improve the coding efficiency.
    Type: Grant
    Filed: April 1, 2022
    Date of Patent: June 16, 2026
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., BYTEDANCE INC.
    Inventors: Kai Zhang, Li Zhang, Yuhuai Zhang, Junru Li, Siwei Ma
  • Patent number: 12621499
    Abstract: A method implemented by a video coding apparatus includes applying a neural network (NN) filter to an unfiltered sample of a video unit to generate a filtered sample. The NN filter is applied based on a syntax element of the video unit. The method also includes converting between a video media file and a bitstream based on the filtered sample that was generated.
    Type: Grant
    Filed: April 2, 2024
    Date of Patent: May 5, 2026
    Assignees: Lemon Inc., Beijing Bytedance Network Technology Co., Ltd., Bytedance Inc., Bytedance (HK) Limited
    Inventors: Yue Li, Li Zhang, Kai Zhang, Junru Li, Meng Wang, Siwei Ma, Shiqi Wang
  • Patent number: 12598300
    Abstract: Methods, system and apparatus for video processing are described. One example method of processing video data includes performing a conversion between a current block of a video and a bitstream of the video according to a rule. The rule specifies that whether a transform skip mode is enabled is determined based on coding information of the current block. The transform skip mode is a coding mode in which a transform is skipped on a prediction residual of a video block.
    Type: Grant
    Filed: February 21, 2023
    Date of Patent: April 7, 2026
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., BYTEDANCE INC.
    Inventors: Li Zhang, Kai Zhang, Yuhuai Zhang, Hongbin Liu, Yue Wang, Siwei Ma
  • Patent number: 12587648
    Abstract: Methods, system and apparatus for video processing are described. One example method of processing video data includes performing a conversion between a current block of a video and a bitstream of the video. Samples of the current block are represented in the bitstream using coefficients that are arranged according to a rule responsive to locations of the samples of the current block.
    Type: Grant
    Filed: May 23, 2023
    Date of Patent: March 24, 2026
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., BYTEDANCE INC.
    Inventors: Li Zhang, Kai Zhang, Yuhuai Zhang, Hongbin Liu, Yue Wang, Siwei Ma
  • Patent number: 12556699
    Abstract: Methods, system and apparatus for video processing are described. One example method of processing video data includes performing a conversion between a current block of a video and a bitstream of the video according to a rule. The rule specifies that whether a transform skip mode is enabled is determined based on coding information of the current block. The transform skip mode is a coding mode in which a transform is skipped on a prediction residual of a video block.
    Type: Grant
    Filed: February 21, 2023
    Date of Patent: February 17, 2026
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., BYTEDANCE INC.
    Inventors: Li Zhang, Kai Zhang, Yuhuai Zhang, Hongbin Liu, Yue Wang, Siwei Ma
  • Publication number: 20250358403
    Abstract: A bitstreams generating method includes obtaining a historical motion information candidate list used for encoding each of a plurality of image blocks included in a region of a current frame, encoding the plurality of image blocks according to the historical motion information candidate list, and generating bitstreams including one or more indexes of motion information. The historical motion information candidate list is a history-based motion vector prediction (HMVP) candidate list. During a process of performing prediction for all the plurality of image blocks included in the region, the historical motion information candidate list remains unchanged. Only the motion information of the last image block in the region is used to update the historical motion information candidate list after an encoding process of the last image block is finished.
    Type: Application
    Filed: August 4, 2025
    Publication date: November 20, 2025
    Inventors: Suhong WANG, Xiaozhen ZHENG, Shanshe WANG, Siwei MA
  • Patent number: 12477115
    Abstract: Methods, systems and devices for sample string processing in intra coding are described. An example method of video processing includes determining, for a conversion between a video comprising a current block and a bitstream of the video, whether a first sample string of the current block forms a rectangle, and performing, based on the determining, the conversion, wherein the determining is based on a position and a length of the first sample string.
    Type: Grant
    Filed: April 21, 2023
    Date of Patent: November 18, 2025
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., BYTEDANCE INC.
    Inventors: Hongbin Liu, Kai Zhang, Li Zhang, Yue Wang, Junru Li, Tianliang Fu, Siwei Ma
  • Patent number: 12469114
    Abstract: This application discloses an image processing method, device, system, electronic equipment, and storage medium, applied at the encoding end. The method includes extracting a one-dimensional feature vector from an original image block; transforming the original image block into a multidimensional feature map based on the one-dimensional feature vector; quantizing and encoding the one-dimensional feature vector to generate a first code stream; discretely encoding the multidimensional feature map to generate a second code stream, thereby efficiently compressing the spatial-independent vector and the multidimensional feature map; and sending the first and second code streams to the decoding end. Since the encoding stream comprises two layers each representing different types of image information, image reconstruction from the two-layer code streams maintains information integrity even at low bit rates, thus improving visual effects and experience.
    Type: Grant
    Filed: June 25, 2025
    Date of Patent: November 11, 2025
    Assignees: ADVANCED INSTITUTE OF INFORMATION TECHNOLOGY (AIIT), PEKING UNIVERSITY, PEKING UNIVERSITY
    Inventors: Siwei Ma, Yun Jiang, Bo Teng, Jianhui Chang, Wen Gao
  • Publication number: 20250328992
    Abstract: This application discloses an image processing method, device, system, electronic equipment, and storage medium, applied at the encoding end. The method includes extracting a one-dimensional feature vector from an original image block; transforming the original image block into a multidimensional feature map based on the one-dimensional feature vector; quantizing and encoding the one-dimensional feature vector to generate a first code stream; discretely encoding the multidimensional feature map to generate a second code stream, thereby efficiently compressing the spatial-independent vector and the multidimensional feature map; and sending the first and second code streams to the decoding end. Since the encoding stream comprises two layers each representing different types of image information, image reconstruction from the two-layer code streams maintains information integrity even at low bit rates, thus improving visual effects and experience.
    Type: Application
    Filed: June 25, 2025
    Publication date: October 23, 2025
    Applicants: ADVANCED INSTITUTE OF INFORMATION TECHNOLOGY (AIIT), PEKING UNIVERSITY, Peking University
    Inventors: Siwei Ma, Yun Jiang, Bo Teng, Jianhui Chang, Wen Gao
  • Publication number: 20250301150
    Abstract: An encoding method includes binarizing a chrominance prediction mode of an image block of an image that is permitted to use a CCLM and/or regular intra-frame chrominance prediction mode to obtain a bit string including adjacent bits. The CCLM includes first, second, and third modes. The regular intra-frame chrominance prediction mode is any intra-frame chrominance prediction mode other than CCLM. A first bit of the adjacent bits indicates whether CCLM is used. A second bit indicates whether to use the first mode when the first bit indicates to use CCLM. A third bit indicates whether the second or third mode is used when the second bit indicates the first mode is not used. The method further includes encoding the first bit and the second bit using mutually independent probability models, respectively, and encoding the third bit using bypass mode.
    Type: Application
    Filed: June 10, 2025
    Publication date: September 25, 2025
    Inventors: Siwei MA, Jiaqi ZHANG, Xiaozhen ZHENG, Shanshe WANG
  • Patent number: 12407814
    Abstract: A bitstreams generating method includes obtaining a historical motion information candidate list used for encoding each of a plurality of image blocks included in a region of a current frame, encoding the plurality of image blocks according to the historical motion information candidate list, and generating bitstreams including one or more indexes of motion information. The historical motion information candidate list is a history-based motion vector prediction (HMVP) candidate list. During the process of encoding all the plurality of image blocks included in the region, the historical motion information candidate list is not updated. Only the motion information of the last image block in the region is used to update the historical motion information candidate list after the encoding process of the last image block is finished.
    Type: Grant
    Filed: March 18, 2024
    Date of Patent: September 2, 2025
    Assignee: SZ DJI TECHNOLOGY CO., LTD.
    Inventors: Suhong Wang, Xiaozhen Zheng, Shanshe Wang, Siwei Ma
  • Patent number: 12407830
    Abstract: Methods, systems and devices for video coding using sample string vectors are described. An example method of video processing includes determining, for a conversion between a video including a current block and a bitstream of the video, an availability of a block vector of one or more non-adjacent neighboring blocks of the current block for a prediction of a block vector of the current block according to a rule, and performing, based on the determining, the conversion, wherein the rule specifies a pre-defined order for checking the one or more non-adjacent neighboring blocks.
    Type: Grant
    Filed: April 20, 2023
    Date of Patent: September 2, 2025
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., BYTEDANCE INC.
    Inventors: Hongbin Liu, Kai Zhang, Li Zhang, Yue Wang, Junru Li, Siwei Ma
  • Publication number: 20250274605
    Abstract: A video image decoding method includes receiving a bitstream that includes an index indicating a combination of two motion vector offsets, determining a base motion vector list that includes a dual-prediction base motion vector group including a first base motion vector and a second base motion vector, and determining the two motion vector offsets from a preset offset set according to the index included in the bitstream. The two motion vector offsets correspond to the first base motion vector and the second base motion vector, respectively. The method further includes determining motion vectors of a current image block according to the first base motion vector, the second base motion vector, and the two motion vector offsets, and performing a prediction on the current image block according to the motion vectors of the current image block.
    Type: Application
    Filed: April 28, 2025
    Publication date: August 28, 2025
    Inventors: Xiaozhen ZHENG, Suhong WANG, Shanshe WANG, Siwei MA, Weiran LI
  • Patent number: 12382041
    Abstract: A video processing method includes performing a conversion between a video block of a video and a bitstream of the video according to a rule. The rule specifies whether or how usage of a secondary transform within a video unit is indicated in the bitstream. The secondary transform is applied before quantization or after de-quantization.
    Type: Grant
    Filed: September 26, 2022
    Date of Patent: August 5, 2025
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., BYTEDANCE INC.
    Inventors: Kai Zhang, Li Zhang, Tianliang Fu, Yue Wang, Siwei Ma
  • Patent number: 12363311
    Abstract: An encoding method includes binarizing a chrominance prediction mode of an image block of an image that is permitted to use a CCLM and/or regular intra-frame chrominance prediction mode to obtain a bit string including adjacent bits. The CCLM includes a first mode, a second mode, and a third mode, and the regular intra-frame chrominance prediction mode is any intra-frame chrominance prediction mode other than CCLM. A first bit of the adjacent bits indicates whether CCLM is used according to whether a value thereof is 1. A second bit indicates whether to use the first mode when the first bit indicates to use CCLM. A third bit indicates whether the second or third mode is used when the second bit indicates the first mode is not used. The method further includes using mutually independent probability models to respectively encode the first bit and the second bit.
    Type: Grant
    Filed: December 19, 2023
    Date of Patent: July 15, 2025
    Assignee: SZ DJI TECHNOLOGY CO., LTD.
    Inventors: Siwei Ma, Jiaqi Zhang, Xiaozhen Zheng, Shanshe Wang
  • Patent number: 12328432
    Abstract: Methods, system and apparatus for video processing are described. One example video processing method includes determining, for a conversion between a current video block of a video and a bitstream of the video, a usage of an identity transform mode for the conversion of the current video block according to a rule. The rule specifies that the usage is based on representative coefficients of one or more representative blocks of the video. The method also includes performing the conversion based on the determining.
    Type: Grant
    Filed: September 7, 2022
    Date of Patent: June 10, 2025
    Assignees: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., BYTEDANCE INC.
    Inventors: Li Zhang, Kai Zhang, Yuhuai Zhang, Hongbin Liu, Yue Wang, Siwei Ma
  • Patent number: 12294737
    Abstract: A video image processing method includes dividing a current image block into a plurality of sub-blocks, in response to a target neighboring block of the current image block satisfying a preset condition, determining related blocks of the sub-blocks in a collocated frame of the current image block according to a motion vector of the target neighboring block, encoding or decoding the current image block according to motion vectors of the related blocks of the sub-blocks, determining a group of control point motion vectors in an affine transformation mode using one or more neighboring blocks of the current image block, and adding the group of control point motion vectors to a motion vector candidate list of the current image block.
    Type: Grant
    Filed: April 1, 2024
    Date of Patent: May 6, 2025
    Assignee: SZ DJI TECHNOLOGY CO., LTD.
    Inventors: Xiaozhen Zheng, Tianliang Fu, Shanshe Wang, Siwei Ma, Weiran Li, Suhong Wang
  • Patent number: 12294738
    Abstract: A video encoding method includes determining a base motion vector list that includes a dual-prediction base motion vector group including a first base motion vector and a second base motion vector, determining two motion vector offsets according to a preset offset set, determining motion vectors of a current image block according to the first base motion vector, the second base motion vector, and the two motion vector offsets, and performing prediction on the current image block according to the motion vectors of the current image block.
    Type: Grant
    Filed: May 24, 2024
    Date of Patent: May 6, 2025
    Assignee: SZ DJI TECHNOLOGY CO., LTD.
    Inventors: Xiaozhen Zheng, Suhong Wang, Shanshe Wang, Siwei Ma, Weiran Li
  • Patent number: 12294736
    Abstract: A video processing method includes obtaining motion information of a left neighboring block of a current image block, and, where the left neighboring block satisfies a preset condition, determining, in a collocated frame of the current image block, a related block of a sub-block of the current image block using a motion vector of the left neighboring block as a motion offset, and determining a motion vector of the sub-block according to a motion vector of the related block. The preset condition includes a reference image of the left neighboring block being the same as the collocated frame of the current image block. Determining the motion vector of the sub-block includes, where reference images of the related block and the sub-block are not specific reference images, determining the motion vector of the sub-block according to a scaled motion vector obtained by scaling the motion vector of the related block.
    Type: Grant
    Filed: April 1, 2024
    Date of Patent: May 6, 2025
    Assignee: SZ DJI TECHNOLOGY CO., LTD.
    Inventors: Xiaozhen Zheng, Suhong Wang, Shanshe Wang, Siwei Ma, Weiran Li