Patents by Inventor Yan Ye

Yan Ye has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Video coding with large macroblocks

Patent number: 12389043

Abstract: Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels. In one example, an apparatus includes a video encoder configured to encode a coded unit comprising a plurality of video blocks, wherein at least one of the plurality of video blocks comprises a size of more than 16×16 pixels and to generate syntax information for the coded unit that includes a maximum size value, wherein the maximum size value indicates a size of a largest one of the plurality of video blocks in the coded unit. The syntax information may also include a minimum size value. In this manner, the encoder may indicate to a decoder the proper syntax decoder to apply to the coded unit.

Type: Grant

Filed: August 1, 2023

Date of Patent: August 12, 2025

Assignee: QUALCOMM Incorporated

Inventors: Peisong Chen, Yan Ye, Marta Karczewicz
LOSSLESS CODING OF VIDEO DATA

Publication number: 20250234003

Abstract: The present disclosure provides systems and methods for performing residual coding of video data. According to certain disclosed embodiments, the methods include: receiving control information for coding a video sequence; and determining, based on the control information, a coding method for coding a prediction residual signal of a coding block in the video sequence. The coding method is one of a transform residual coding and a transform-skip residual coding.

Type: Application

Filed: April 4, 2025

Publication date: July 17, 2025

Inventors: Mohammed Golam SARWER, Jiancong LUO, Yan YE
LOSSLESS CODING OF VIDEO DATA

Publication number: 20250234002

Abstract: The present disclosure provides systems and methods for performing residual coding of video data. According to certain disclosed embodiments, the methods include: receiving control information for coding a video sequence; and determining, based on the control information, a coding method for coding a prediction residual signal of a coding block in the video sequence. The coding method is one of a transform residual coding and a transform-skip residual coding.

Type: Application

Filed: April 4, 2025

Publication date: July 17, 2025

Inventors: Mohammed Golam SARWER, Jiancong LUO, Yan YE
Block partitioning methods for video coding

Patent number: 12363320

Abstract: The present disclosure provides systems and methods for processing video content. The method can include: partitioning, along a partitioning edge, a plurality of blocks associated with a picture into a first partition and a second partition; performing inter prediction on the plurality of blocks, to generate a first prediction signal for the first partition and a second prediction signal for the second partition; and blending the first and second prediction signals for edge blocks associated with the partitioning edge.

Type: Grant

Filed: February 23, 2024

Date of Patent: July 15, 2025

Assignee: Alibaba Group Holding Limited

Inventors: Ru-Ling Liao, Jie Chen, Yan Ye, Jiancong Luo
Neural network based in-loop filtering for video coding

Patent number: 12363350

Abstract: The present disclosure provides methods for performing training and executing of a multi-density neural network in video processing. An exemplary method comprises: receiving a video stream comprising a plurality of pictures; processing the plurality of pictures using a first branch of a first block in the neural network, wherein the neural network is configured to reduce blocking artifacts in video compression of the video stream and the first branch comprises one or more residual blocks; and processing the plurality of pictures using a second branch of the first block in the neural network, wherein the second branch comprises a down-sampling processing, an up-sampling processing, and one or more residual blocks.

Type: Grant

Filed: January 19, 2021

Date of Patent: July 15, 2025

Assignee: Alibaba Group Holding Limited

Inventors: Zhao Wang, Changyue Ma, Ru-Ling Liao, Yan Ye
Sign data hiding of video recording

Patent number: 12363276

Abstract: In a method of encoding a video sequence into a bitstream, an encoder determines, based on a value of a first encoded flag, whether to encode a second flag into the bitstream. The first flag indicates whether sign data hiding is turned off at a slice level for a video frame. The second flag indicates whether transform skip residual coding is turned off at the slice level for the video frame.

Type: Grant

Filed: March 24, 2021

Date of Patent: July 15, 2025

Assignee: Alibaba Group Holding Limited

Inventors: Mohammed Golam Sarwer, Yan Ye
TRICYCLIC BORONIC ACID DERIVATIVE, AND PREPARATION METHOD THEREFOR AND APPLICATION THEREOF

Publication number: 20250223303

Abstract: Disclosed are a tricyclic boronic acid derivative, and a preparation method therefor and an application thereof. Specifically, discloses are a compound represented by formula (I), an optical isomer and pharmaceutically acceptable salt thereof, and an application of the compound as a ?-lactamase inhibitor and an application of a composition as an antibacterial drug.

Type: Application

Filed: October 19, 2022

Publication date: July 10, 2025

Applicant: SHANGHAI JEMINCARE PHARMACEUTICAL CO., LTD.

Inventors: Jianmin ZHANG, Shuchun GUO, Zilong ZHANG, Cheng ZHENG, Lingjun LI, Jianbo FENG, Fang BAO, He HU, Nan WU, Zhitao ZHANG, Yu WANG, Yan YE, Jianbiao PENG
METHOD AND COMPRESSION FRAMEWORK WITH POST-PROCESSING FOR MACHINE VISION

Publication number: 20250227311

Abstract: A video processing method includes compressing and reconstructing an original visual signal to obtain a reconstructed visual signal; processing the reconstructed visual signal to obtain a post-processed visual signal; and feeding the post-processed visual signal to a machine task network

Type: Application

Filed: December 26, 2024

Publication date: July 10, 2025

Inventors: Binzhe LI, Shurun WANG, Yan YE
RELATIVE DIFFERENCE METRIC FOR FRAME CODING AND TWO-STAGE TRAINING FOR GENERATIVE FACE VIDEO COMPRESSION

Publication number: 20250227268

Abstract: Generative Face Video Compression (“GFVC”) techniques are provided to improve performance of facial video compression. A computing system is configured to compute a relative difference metric describing differences in features between frames, and determining, based on the relative difference metric, whether a current frame can be synthesized without entropy coding, or should be re-coded. A computing system is configured to perform two-stage training to stabilize Generative Adversarial Networks (“GAN”) training in GFVC.

Type: Application

Filed: January 2, 2025

Publication date: July 10, 2025

Inventors: Renjie Zou, Bolin Chen, Ru-ling Liao, Jie Chen, Yan Ye
TEMPLATE-MATCHING-BASED SUBBLOCK MOTION REFINEMENT FOR MOTION PREDICTION

Publication number: 20250227241

Abstract: Methods and systems implement application of template-matching-based motion refinement on subblocks of coding blocks. A VVC-standard encoder and a VVC-standard decoder can configure one or more processors of a computing system to obtain sub-templates of each subblock of a coding block from respective motion vectors of neighboring subblocks.

Type: Application

Filed: January 2, 2025

Publication date: July 10, 2025

Inventors: Jie Chen, Ru-ling Liao, Yan Ye, Xinwei Li
CHROMA INTRA PREDICTION MODES

Publication number: 20250227292

Abstract: Methods and systems implement fusion of intra TMP mode with other intra prediction modes that utilize adjacent samples, to improve prediction accuracy. A VVC-standard encoder and a VVC-standard decoder can configure one or more processors of a computing system to apply non-CCP modes on the template of the collocated luma block, and reorder non-CCP modes based on template matching cost; additionally reorder angular modes, such as a subset of efficient angular modes based on the template of the collocated luma block; prune non-CCP modes from the ordered list based on similarity; move a non-CCP mode of the reordered ordered list based on template matching cost difference relative to a predecessor; copy and reorder the ordered list of non-CCP modes once for each respective distinctly signaled chroma fusion mode; fuse a chroma DBV mode with a CCP mode; and select a least-cost reordered block vector for chroma DBV mode.

Type: Application

Filed: January 2, 2025

Publication date: July 10, 2025

Inventors: Xinwei Li, Ru-ling Liao, Jie Chen, Yan Ye
Decoder-side chroma intra prediction mode gradient-based derivation

Patent number: 12355982

Abstract: A VVC-standard encoder and a VVC-standard decoder are provided, implementing a decoder-side chroma intra prediction mode gradient-based derivation method, which improves coding efficiency of chroma intra prediction, thereby saving on signaling cost. A VVC-standard decoder configures one or more processors of a computing system to derive one of multiple possible chroma intra prediction modes by computing gradients of adjacent luma samples and chroma samples of a current chroma block. With minimal increase in signaling cost, the VVC-standard coding and decoding processes are enhanced to base intra prediction modes for chroma blocks on collocated luma block-adjacent reconstructed luma samples adjacent reconstructed chroma samples, enabling prior computational work done on coding and decoding adjacent blocks to be referenced. In this fashion, coding gains can be achieved in matching texture characteristics of the current chroma block based on a texture gradient including adjacent blocks.

Type: Grant

Filed: December 29, 2022

Date of Patent: July 8, 2025

Assignee: Alibaba (China) Co., Ltd

Inventors: Xinwei Li, Ru-Ling Liao, Jie Chen, Yan Ye
Fusion of video prediction modes

Patent number: 12348724

Abstract: The present disclosure provides methods and systems for fusing chroma intra prediction modes. An exemplary method includes: generating a plurality of predicted chroma samples associated with a pixel, by using a plurality of chroma intra prediction modes respectively; and determining a first predicted chroma sample, based on a weighted sum of the plurality of predicted chroma samples.

Type: Grant

Filed: December 23, 2022

Date of Patent: July 1, 2025

Assignee: Alibaba Innovation Private Limited

Inventors: Xinwei Li, Ru-Ling Liao, Jie Chen, Yan Ye
Filters for motion compensation interpolation with reference down-sampling

Patent number: 12348785

Abstract: The present disclosure provides systems and methods for processing video content using motion compensation interpolation. The methods include: encoding or decoding a picture block by applying one or more filters to a reference picture, wherein the applying of the one or more filters generates a sample at a fractional sample position, and wherein the one or more filters comprise an 8-tap filter that has a plurality of coefficients [p0, p1, . . . , p7] for each 1/16 fractional sample position p.

Type: Grant

Filed: December 28, 2023

Date of Patent: July 1, 2025

Assignee: Alibaba Group Holding Limited

Inventors: Jiancong Luo, Yan Ye, Mohammed Golam Sarwer
Methods for simplifying adaptive loop filter in video coding

Patent number: 12348787

Abstract: Systems, methods and instrumentalities are disclosed for adaptively selecting an adaptive loop filter (ALF) procedure for a frame based on which temporal layer the frame is in. ALF procedures may vary in computational complexity. One or more frames including the current frame may be in a temporal layer of a coding scheme. The decoder may determine the current frame's temporal layer level within the coding scheme. The decoder may select an ALF procedure based on the current frame's temporal layer level. If the current frame's temporal layer level is higher within the coding scheme than some other temporal layer levels, an ALF procedure that is less computationally complex may be selected for the current frame. Then the decoder may perform the selected ALF procedure on the current frame.

Type: Grant

Filed: April 28, 2023

Date of Patent: July 1, 2025

Assignee: InterDigital VC Holdings, Inc.

Inventors: Rahul Vanam, Yuwen He, Yan Ye
METHODS FOR CODING OR DECODING VIDEO PARAMETER SET OR SEQUENCE PARAMETER SET

Publication number: 20250211795

Abstract: The present disclosure provides a computer-implemented method for encoding video. The method includes: determining whether a coded video sequence (CVS) contains equal number of profile, tier and level (PTL) syntax structures and output layer sets (OLSs); and in response to the CVS containing equal number of PTL syntax structures and OLSs, coding the bitstream without signaling a first PTL syntax element specifying an index, to a list of PTL syntax structures in the VPS, of a PTL syntax structure that applies to a corresponding OLS in the VPS.

Type: Application

Filed: March 7, 2025

Publication date: June 26, 2025

Inventors: Jie CHEN, Jiancong LUO, Yan YE, Ru-Ling LIAO
METHOD AND APPARATUS FOR ENCODING OR DECODING VIDEO

Publication number: 20250211774

Abstract: The present disclosure provides a computer-implemented method for decoding video. The method includes receiving a bitstream including a coding unit of a video frame, the coding unit being coded in a skip mode or a direct mode; determining whether the coding units has a width greater than a threshold width or a height greater than a threshold height; in response to the coding unit having the width greater than the threshold width or the height greater than the threshold height, enabling a motion vector angular prediction process to store one or more motion vector angular prediction candidates in a candidate list storing a plurality of motion candidates associated with the coding unit; and performing an inter prediction to the coding unit according to the candidate list and an index parsed from the bitstream.

Type: Application

Filed: March 10, 2025

Publication date: June 26, 2025

Inventors: Ru-Ling LIAO, Jie CHEN, Yan YE
ON BLOCK LEVEL BI-PREDICTION WITH WEIGHTED AVERAGING

Publication number: 20250211760

Abstract: Video encoding and decoding techniques for bi-prediction with weighted averaging are disclosed. According to certain embodiments, a computer-implemented video signaling method includes signaling, by a processor to a video decoder, a bitstream including weight information used for prediction of a coding unit (CU). The weight information indicates: if weighted prediction is enabled for a bi-prediction mode of the CU, disabling weighted averaging for the bi-prediction mode.

Type: Application

Filed: March 7, 2025

Publication date: June 26, 2025

Inventor: Yan YE
METHODS FOR CODING OR DECODING VIDEO PARAMETER SET OR SEQUENCE PARAMETER SET

Publication number: 20250211794

Abstract: The present disclosure provides a computer-implemented method for encoding video. The method includes: determining whether a coded video sequence (CVS) contains equal number of profile, tier and level (PTL) syntax structures and output layer sets (OLSs); and in response to the CVS containing equal number of PTL syntax structures and OLSs, coding the bitstream without signaling a first PTL syntax element specifying an index, to a list of PTL syntax structures in the VPS, of a PTL syntax structure that applies to a corresponding OLS in the VPS.

Type: Application

Filed: March 7, 2025

Publication date: June 26, 2025

Inventors: Jie CHEN, Jiancong LUO, Yan YE, Ru-Ling LIAO
METHODS, APPARATUS, AND NON-TRANSITORY COMPUTER READABLE MEDIUM FOR CROSS-COMPONENT SAMPLE ADAPTIVE OFFSET

Publication number: 20250203089

Abstract: The present disclosure provides a video data processing method for cross-component sample adaptive offset (CCSAO). The method includes determining an index based on a vertical coordinate of a chroma sample within a picture; determining a luma sample based on the index; classifying the chroma sample based on a reconstructed value associated with the luma sample; determining an offset based on the classification; and adding the offset to a reconstructed value associated with the chroma sample.

Type: Application

Filed: February 27, 2025

Publication date: June 19, 2025

Inventors: Xinwei LI, Jie CHEN, Ru-Ling LIAO, Yan YE

1 2 3 4 5 … next