Patents by Inventor Xiaozhong Xu

Xiaozhong Xu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240080463
    Abstract: The various implementations described herein include methods and systems for coding video. In one aspect, a method includes receiving video data including a current image frame having a sample with a first color component and a second color component. The method further includes obtaining a first clipping range for pixel values of the first color component in the sample based on a syntax element value in the received video data and deriving a second clipping range for pixel values of the second color component in the sample based on the first clipping range. The method further includes in response to a pixel value of the second color component exceeding the second clipping range, restricting the pixel value to a minimum value or a maximum value corresponding to the second clipping range. The method further includes reconstructing the current non-monochrome image frame using the restricted pixel value for the sample.
    Type: Application
    Filed: June 26, 2023
    Publication date: March 7, 2024
    Inventors: Xin ZHAO, Guichun LI, Xiaozhong XU, Shan LIU
  • Publication number: 20240080471
    Abstract: Aspects of the disclosure provide a method and an apparatus including processing circuitry that determines, based on a syntax element in a coded video bitstream, that a current block including a plurality of subblocks is coded in a subblock-based temporal motion vector prediction (SbTMVP) mode. Motion vector offset (MVO) information indicating an MVO is received. The MVO indicates a motion offset of a displacement vector (DV) used to adjust a location of a collocated block in a collocated reference picture. An updated DV of the current block is determined based on the DV and the MVO. SbTMVP information of a respective subblock in the plurality of subblocks is derived based on motion information of a corresponding subblock in the collocated block indicated by the updated DV. The plurality of subblocks in the SbTMVP mode is reconstructed based on the SbTMVP information of the subblock in the plurality of subblocks.
    Type: Application
    Filed: November 6, 2023
    Publication date: March 7, 2024
    Applicant: Tencent America LLC
    Inventors: Lien-Fei CHEN, Xin ZHAO, Roman CHERNYAK, Xiaozhong XU, Shan LIU
  • Patent number: 11924434
    Abstract: Aspects of the disclosure provide methods and apparatuses for mesh coding (e.g., compression and decompression). In some examples, an apparatus for mesh coding includes processing circuitry. The processing circuitry decodes a plurality of initial maps in two-dimension from a bitstream carrying a three-dimensional (3D) mesh frame. The processing circuitry determines at least two sampling rates associated with different portions of the plurality of initial maps and recovers from the plurality of initial maps and based on the at least two sampling rates associated with the different portions of the plurality of initial maps to obtain a plurality of recovered maps. A first portion of the plurality of initial maps is recovered based on a first sampling rate, and a second portion of the plurality of initial maps is recovered based on a second sampling rate. The processing circuitry reconstructs the 3D mesh frame based on the plurality of recovered maps.
    Type: Grant
    Filed: September 6, 2022
    Date of Patent: March 5, 2024
    Assignee: TENCENT AMERICA LLC
    Inventors: Xiaozhong Xu, Xiang Zhang, Shan Liu, Chao Huang, Jun Tian
  • Patent number: 11922664
    Abstract: A processing circuitry decodes a plurality of maps in 2D from a bitstream carrying a mesh frame. The mesh frame represents a surface of an object with polygons. The plurality of maps includes a decoded geometry map and a decoded attribute map with an adaptive 2D atlas sampling applied. The processing circuitry determines at least a first sampling rate and a second sampling rate according to syntaxes signaled in the bitstream. The first sampling rate is applied to a first region of the mesh frame and the second sampling rate is applied to a second region of the mesh frame during the adaptive 2D atlas sampling. The processing circuitry reconstructs, based on the plurality of maps, at least a first vertex of the mesh frame according to the first sampling rate, and a second vertex of the mesh frame according to the second sampling rate.
    Type: Grant
    Filed: September 14, 2022
    Date of Patent: March 5, 2024
    Assignee: TENCENT AMERICA LLC
    Inventors: Xiang Zhang, Shan Liu, Xiaozhong Xu, Chao Huang, Jun Tian
  • Publication number: 20240073433
    Abstract: Coding information of a mesh is received. The coding information includes a plurality of first coordinates and a plurality of second coordinates corresponding to a plurality of vertices and a texture map that are associated with the mesh. A respective first coordinate and a respective second coordinate associated with each of the plurality of vertices are normalized by adjusting the respective first coordinate based on a first factor and the respective second coordinate based on a second factor. The first factor and the second factor are associated with at least one of (i) a bit depth value indicating a coded range of the first coordinates and the second coordinates and (ii) a size of the texture map. The normalized respective first coordinate and the normalized respective second coordinate are expanded based on the first factor and the second factor respectively.
    Type: Application
    Filed: June 9, 2023
    Publication date: February 29, 2024
    Applicant: Tencent America LLC
    Inventors: Jun TIAN, Xiaozhong XU, Chao HUANG, Xiang ZHANG, Shan LIU
  • Publication number: 20240062470
    Abstract: A method and apparatus that renders a mesh from one or more virtual camera views of an object, the one or more virtual camera views being projections of the mesh onto a plurality of 2D planes; inputs the one or more virtual camera views into a fully-convolutional model for real-time segmentation; generates one or more 2D pixel-wise masks based on the fully-convolutional model, the 2D pixel-wise masks being associated with one or more segmented elements; performs remeshing and mesh optimization using the one or more 2D pixel-wise masks; and generates an output mesh based on the remeshing and mesh optimization.
    Type: Application
    Filed: May 10, 2023
    Publication date: February 22, 2024
    Inventors: Ethan SCHUR, Xiaozhong Xu, Shan Liu, Xiang Zhang
  • Publication number: 20240062466
    Abstract: Aspects of the disclosure provide methods and apparatuses for point cloud processing. In some examples, an apparatus for point cloud processing includes processing circuitry. For example, the processing circuitry obtains point cloud data corresponding to a point cloud in a three dimensional (3D) space, projects the point cloud in the 3D space to one or more two dimensional (2D) planes to generate one or more images. The processing circuitry generates a pixel wise mask for object instances in the point cloud according to the one or more images. The pixel wise mask includes first pixels that are associated with a first object instance in the point cloud. The processing circuitry processes the point cloud based on the pixel wise mask, a portion of the point cloud corresponding the first pixels in the pixel wise mask is processed based on one or more processing parameters determined for the first object instance.
    Type: Application
    Filed: June 8, 2023
    Publication date: February 22, 2024
    Applicant: Tencent America LLC
    Inventors: Ethan SCHUR, Xiaozhong XU, Shan LIU, Xiang ZHANG
  • Publication number: 20240064334
    Abstract: A method and apparatus comprising computer code configured to cause a processor or processors to obtain a mesh sequence comprising a plurality of meshes corresponding to volumetric data of at least one three-dimensional (3D) visual content, obtain a frame of the mesh sequence corresponding to the volumetric data, the frame including a plurality of vertices of a mesh of the mesh sequence, determine a motion field comprising motion vectors of the plurality of vertices of the mesh, and encode the volumetric data based on the motion field.
    Type: Application
    Filed: May 4, 2023
    Publication date: February 22, 2024
    Applicant: TENCENT AMERICA LLC
    Inventors: Chao HUANG, Xiaozhong Xu, Jun Tian, Xiang Zhang, Shan Liu
  • Patent number: 11909987
    Abstract: A method for controlling block intra prediction is provided for a decoder. The method includes: obtaining a coded video bitstream; decoding, prediction information of a current coding unit (CU) in a segment of a current picture from the coded video bitstream; determining, according to the prediction information, a prediction type of a chroma coding block (CB) of the current CU, the prediction type being intra prediction or inter prediction; and reconstructing the chroma CB according to the prediction information, where a width of any reconstructed chroma CB intra-predicted from the coded video bitstream is greater than 2.
    Type: Grant
    Filed: March 12, 2023
    Date of Patent: February 20, 2024
    Assignee: TENCENT AMERICA LLC
    Inventors: Liang Zhao, Xiang Li, Xin Zhao, Shan Liu, Guichun Li, Xiaozhong Xu
  • Patent number: 11909956
    Abstract: Systems and methods for deep neural network (DNN)-based cross component prediction are provided. A method includes inputting a reconstructed luma block of an image or video into a DNN; and predicting, by the DNN, a reconstructed chroma block of the image or video based on the reconstructed luma block that is input. Luma and chroma reference information and side information may also be input into the DNN to predict the reconstructed chroma block. The various inputs may also be generated using processes such as downsampling and transformation.
    Type: Grant
    Filed: May 20, 2022
    Date of Patent: February 20, 2024
    Assignee: TENCENT AMERICA LLC
    Inventors: Sheng Lin, Wei Jiang, Wei Wang, Liqiang Wang, Shan Liu, Xiaozhong Xu
  • Patent number: 11902584
    Abstract: A method, computer program, and computer system for encoding or decoding video data, and indicating, with a syntax element, types of slices for all slices of a coded picture, the syntax element being coded using an unsigned integer.
    Type: Grant
    Filed: September 21, 2020
    Date of Patent: February 13, 2024
    Assignee: TENCENT AMERICA LLC
    Inventors: Ling Li, Xiaozhong Xu, Byeongdoo Choi, Xiang Li, Stephan Wenger, Shan Liu
  • Publication number: 20240048773
    Abstract: Aspects of the disclosure include methods, apparatuses, and non-transitory computer-readable storage mediums for video encoding/decoding. An apparatus includes processing circuitry that receives metadata associated with a coded video bitstream. The metadata includes labeling information of one or more objects detected in a first picture that is coded in the coded video bitstream. The processing circuitry decodes the labeling information of the one or more objects in the first picture that is coded in the coded video bitstream. The processing circuitry applies the labeling information to the one or more objects in the first picture.
    Type: Application
    Filed: October 23, 2023
    Publication date: February 8, 2024
    Applicant: TENCENT AMERICA LLC
    Inventors: Shan Liu, Xiaozhong Xu, Wen Gao
  • Publication number: 20240048763
    Abstract: Whether each of a plurality of duplicate vertices is skippable in a mesh is determined. One or more skippable duplicate vertices of the plurality of duplicate vertices in the mesh are excluded to generate an updated mesh. Vertices of the updated mesh are divided into a plurality of vertex groups, where the plurality of vertex groups includes a first vertex group. A prediction mode of the first vertex group is determined at least based on estimation errors of vertices in the first vertex group of the updated mesh. Each of the estimation errors indicates a difference between a respective vertex in the first vertex group and a corresponding reference vertex in the reference frame. Prediction information of the vertices in the first vertex group is generated based on the determined prediction mode.
    Type: Application
    Filed: June 8, 2023
    Publication date: February 8, 2024
    Applicant: Tencent America LLC
    Inventors: Jun TIAN, Chao HUANG, Xiaozhong XU, Xiang ZHANG, Shan LIU
  • Publication number: 20240040148
    Abstract: A method and apparatus comprising computer code configured to cause a processor or processors to obtain an input mesh corresponding to volumetric data of at least one three-dimensional (3D) visual content, partitioning the input mesh into a plurality of submeshes, each of the submeshes comprising a respective one of instances of objects, determine a similarity between at least two of the instance of objects, grouping the submeshes based on the determined similarity, and code the volumetric data based on grouping the submeshes.
    Type: Application
    Filed: May 4, 2023
    Publication date: February 1, 2024
    Applicant: TENCENT AMERICA LLC
    Inventors: Thuong NGUYEN CANH, Xiang ZHANG, Xiaozhong XU, Shan LIU
  • Patent number: 11889086
    Abstract: An apparatus for video decoding includes processing circuitry. The circuitry can be configured to determine whether to apply the PROF to an affine coded block, and responsive to a determination to apply the PROF to the affine coded block, perform a PROF process. For example, during the PROF process, a prediction sample I(i,j) at a sample location (i,j) in the affine coded block can be generated. Spatial gradients g_x (i,j) and g_y (i,j) at the sample location (i,j) in the affine coded block can be generate. A prediction refinement ?I(i,j) based on the spatial gradients g_x (i,j) and g_y (i,j) can be generated. The prediction refinement ?I(i,j) can added to the prediction sample I(i,j) to generate a refined prediction sample. The PROF for affine prediction can be enabled according to a syntax element received at the decoder, or can be enabled by default.
    Type: Grant
    Filed: March 15, 2023
    Date of Patent: January 30, 2024
    Assignee: TENCENT AMERICA LLC
    Inventors: Guichun Li, Xiang Li, Xiaozhong Xu, Shan Liu
  • Publication number: 20240029316
    Abstract: The various implementations described herein include methods and systems for encoding video. In one aspect, a method includes receiving a mesh with polygons representing a surface of an object; detecting a first symmetric region in the mesh that includes a first symmetry line to divide the first symmetric region into a first partition and a second partition. The method includes recursively determining whether one of the first partition or the second partition includes a second symmetric region until no symmetric region is detected in both the first partition and the second partition. The method includes in response to detecting the second symmetric region within one of the first partition or the second partition: determining a second symmetry line within the second symmetric region to divide the first or the second partition into a third sub-partition and a fourth sub-partition; and compressing information of the third sub-partition, the second symmetry line and the first symmetry line into a bitstream.
    Type: Application
    Filed: June 9, 2023
    Publication date: January 25, 2024
    Inventors: Thuong NGUYEN CANH, Xiaozhong XU, Xiang ZHANG, Chao HUANG, Shan LIU
  • Patent number: 11882305
    Abstract: Aspects of the disclosure provide methods and apparatuses for video encoding/decoding. In some examples, an apparatus for video encoding includes receiving circuitry and processing circuitry. The processing circuitry determines whether a block is under a local dual tree structure, and disallows a palette based coding mode for encoding the block in response to the block being under the local dual tree structure.
    Type: Grant
    Filed: September 10, 2021
    Date of Patent: January 23, 2024
    Assignee: TENCENT AMERICA LLC
    Inventors: Xiaozhong Xu, Shan Liu
  • Publication number: 20240022758
    Abstract: Aspects of the disclose provide methods and an apparatus for video and/or image coding. The apparatus includes processing circuitry that receives a bitstream including a plurality of pictures. The processing circuitry determines an initial motion vector (MV) associated with a motion compensation padding (MCP) block according to motion information of a boundary block located at a picture boundary of a picture in the plurality of pictures. The boundary block is located in the picture. The MCP block is located in an MCP area outside of the picture and adjacent to the picture boundary. The initial MV has a fractional precision. The processing circuitry generates a final MV associated with the MCP block based on the initial MV. The final MV has an integer precision. The processing circuitry reconstructs, according to the generated final MV having the integer precision, at least a sample in the MCP block.
    Type: Application
    Filed: June 8, 2023
    Publication date: January 18, 2024
    Applicant: Tencent America LLC
    Inventors: Lien-Fei CHEN, Xiaozhong XU, Guichun LI, Xin ZHAO, Shan LIU
  • Publication number: 20240022739
    Abstract: Aspects of the disclosure provide methods and apparatuses for video encoding/decoding. In some examples, processing circuitry receives a coded video bitstream including a current picture. The current picture includes a chroma block in a chroma separate tree, the chroma block being collocated in a same luma area with one or more luma blocks. The processing circuitry decodes, from the coded video bitstream, a syntax element that indicates a current picture referencing (CPR) mode for the chroma block, and determines, in response to the CPR mode, a chroma block vector for the chroma block according to one or more luma block vectors associated with the one or more luma blocks. The chroma block vector indicates a reference chroma block in the current picture. The processing circuitry reconstructs the chroma block based on the reference chroma block in the current picture.
    Type: Application
    Filed: November 8, 2022
    Publication date: January 18, 2024
    Applicant: Tencent America LLC
    Inventors: Guichun LI, Xin ZHAO, Lien-Fei CHEN, Xiaozhong XU, Shan LIU
  • Publication number: 20240022764
    Abstract: A plurality of candidate predictions of a current vertex is determined. The current vertex is one of a plurality of vertices of a mesh. Each of the plurality of candidate predictions is determined based on a respective one of a plurality of triangles in the mesh and including two-dimensional (2D) coordinates in a 2D domain. An average prediction of the plurality of candidate predictions is computed. A 2D prediction of the current vertex is selected from a prediction list that includes the plurality of candidate predictions and the average prediction. A three-dimensional (3D) prediction of the current vertex is determined based on one or more neighboring vertices of the current vertex in the mesh. A prediction residue comprising a prediction error of the current vertex is encoded. The prediction residue is obtained based on the selected 2D prediction and the determined 3D prediction of the current vertex.
    Type: Application
    Filed: June 28, 2023
    Publication date: January 18, 2024
    Applicant: Tencent America LLC
    Inventors: Jun TIAN, Chao HUANG, Xiaozhong XU, Xiang ZHANG, Shan LIU