Patents by Inventor Ximin Zhang

Ximin Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250088633
    Abstract: Techniques related to video coding include content adaptive quantization that provides a selection between objective quality and subjective quality delta QP offsets. An adaptive method generates an objective quality delta QP offset that achieves a best peak signal-to-noise ratio (PSNR) and/or structural similarity (SSIM) score, which refers to a similarity between images. Also, the adaptive method generates a subjective quality delta QP offset that achieves the best video multi-method assessment fusion (VMAF) score and/or multi-scale structural similarity (MSSIM) score.
    Type: Application
    Filed: November 26, 2024
    Publication date: March 13, 2025
    Applicant: Intel Corporation
    Inventors: Zhijun Lei, Ximin Zhang, Sang-hee Lee
  • Patent number: 12244807
    Abstract: Techniques related to adaptive quantization matrix selection using machine learning for video coding are discussed. Such techniques include applying a machine learning model to generate an estimated quantization parameter for a frame and selecting a set of quantization matrices for encode of the frame from a number of sets of quantization matrices based on the estimated quantization parameter.
    Type: Grant
    Filed: November 30, 2020
    Date of Patent: March 4, 2025
    Assignee: Intel Corporation
    Inventors: James Holland, Sang-hee Lee, Ximin Zhang, Zhan Lou
  • Publication number: 20250056010
    Abstract: Techniques related to quantization parameter estimation for coding intra and scene change frames are discussed. Such techniques include generating features based on an intra or scene change frame including a proportion of smooth blocks and one or both of a measure of block variance and a prediction distortion, and applying a machine learning model to generate an estimated quantization parameter for encoding the intra or scene change frame.
    Type: Application
    Filed: August 23, 2024
    Publication date: February 13, 2025
    Applicant: Intel Corporation
    Inventors: Ximin Zhang, Sang-Hee Lee, Keith W. Rowe
  • Publication number: 20250047851
    Abstract: Techniques related to adaptive quality boosting for low latency video coding are discussed. Such techniques include segmenting each of a number of temporally adjacent video frames into unique high encode quality regions and encoding each of the video frames by applying a coding quality boost to the high encode quality regions relative to other regions of the video frames.
    Type: Application
    Filed: October 21, 2024
    Publication date: February 6, 2025
    Applicant: Intel Corporation
    Inventors: Ximin Zhang, Changliang Wang, Sang-hee Lee, Keith Rowe
  • Publication number: 20250022485
    Abstract: Disclosed in the embodiments of the present disclosure are a video generation method, an apparatus, an electronic device and a storage medium. The method comprises: during an image editing process of subjecting an initial image to a series of image editing operations to obtain a target image, acquiring initial key frames, the initial key frames being intermediate images between the initial image and the target image during the image editing process, the different initial key frames being intermediate images generated when different image editing operations of the series of image editing operations are triggered during the image editing process, and the intermediate images showing editing effects corresponding to the image editing operations; and generating a target video according to the initial key frames so as to record the image editing process of the target image. The present disclosure can be used for recording an image editing process.
    Type: Application
    Filed: September 30, 2024
    Publication date: January 16, 2025
    Inventors: Ximin ZHU, Yan Liu, Zhixuan Wei, Qingyuan Hu, Guanming Cai, Xinghua Zhang, Jiaming Li, Zhi Chen, Kai Li, Tianqi Chen, Binbin Li
  • Patent number: 12192478
    Abstract: An example apparatus for adaptively encoding video frames includes a network analyzer to predict an instant bitrate based on channel throughput feedback received from a network. The apparatus also includes a content analyzer to generate ladder info based on a received frame. The apparatus further includes an adaptive decision executer to determine a frame rate, a video resolution, and a target frame size based on the predicted instant bitrate and the ladder outputs. The apparatus further includes an encoder to encode the frame based on the frame rate, the video resolution, and the target frame size.
    Type: Grant
    Filed: November 14, 2019
    Date of Patent: January 7, 2025
    Assignee: INTEL CORPORATION
    Inventors: Yunbiao Lin, Changliang Wang, Ximin Zhang, Fan He, Jill Boyce, Sri Ranjan Srikantam
  • Patent number: 12184855
    Abstract: Techniques related to video coding include content adaptive quantization that provides a selection between objective quality and subjective quality delta QP offsets.
    Type: Grant
    Filed: November 19, 2020
    Date of Patent: December 31, 2024
    Assignee: Intel Corporation
    Inventors: Zhijun Lei, Ximin Zhang, Sang-hee Lee
  • Patent number: 12166986
    Abstract: Techniques related to adaptive quality boosting for low latency video coding are discussed. Such techniques include segmenting each of a number of temporally adjacent video frames into unique high encode quality regions and encoding each of the video frames by applying a coding quality boost to the high encode quality regions relative to other regions of the video frames.
    Type: Grant
    Filed: December 14, 2020
    Date of Patent: December 10, 2024
    Assignee: Intel Corporation
    Inventors: Ximin Zhang, Changliang Wang, Sang-hee Lee, Keith Rowe
  • Publication number: 20240357138
    Abstract: An example apparatus for encoding video frames includes a mask selector to select a subset of visual masks according to an actual target compression ratio and GOP configuration and a complexity estimator to estimate a picture level spatial/temporal complexity for a current frame. The example apparatus further includes a GOP adaptive visual mask selector to specify a visual mask from the subset of the visual masks corresponding to the estimated spatial and temporal complexity value a good enough picture QP deriver to derive a good enough picture QP value using the visual mask. The example apparatus also includes an adjustor to adjust the good enough picture QP value based on block level human visual system sensitivity and statistics of already encoded frames to obtain a final human visual system QP map.
    Type: Application
    Filed: May 15, 2024
    Publication date: October 24, 2024
    Inventors: Ximin Zhang, Sang-Hee Lee, Keith Rowe
  • Publication number: 20240348801
    Abstract: Using a fixed group of pictures (GOP) size in video encoding significantly hinders compression efficiency due to its inability to adapt to the dynamic nature of video content. While encoding leverages spatio-temporal redundancy within a GOP for compression, a predetermined size fails to capture the varying complexity of scenes. This leads to wasted bits in low-motion segments and insufficient reference frame variation for high-motion areas, resulting in visual artifacts and reduced compression efficiency. To address this limitation, a GOP size recommendation engine involving machine learning models can determine frame-level GOP size recommendations based on pre-encoder frame statistics. The frame-level GOP size recommendations are used to adapt the GOP size for encoding video frames.
    Type: Application
    Filed: June 25, 2024
    Publication date: October 17, 2024
    Applicant: Intel Corporation
    Inventors: Sebastian Possos, Yi-jen Chiu, Ximin Zhang
  • Patent number: 12120312
    Abstract: Techniques related to quantization parameter estimation for coding intra and scene change frames are discussed. Such techniques include generating features based on an intra or scene change frame including a proportion of smooth blocks and one or both of a measure of block variance and a prediction distortion, and applying a machine learning model to generate an estimated quantization parameter for encoding the intra or scene change frame.
    Type: Grant
    Filed: November 17, 2020
    Date of Patent: October 15, 2024
    Assignee: Intel Corporation
    Inventors: Ximin Zhang, Sang-hee Lee, Keith W. Rowe
  • Patent number: 12108185
    Abstract: An embodiment may include a display processor, memory to store a 2D frame corresponding to a projection from a 360 video, and a quality selector to select a quality factor for a block of the 2D frame based on quality information from neighboring blocks of the 2D frame, including blocks which are neighboring only in the 360 video space. The system may also include a range adjuster to adjust a search range for the 2D frame based on a search area of the 2D frame, a viewport manager to determine if a request for a viewport of the 2D frame extends beyond a first edge of the 2D frame and to fill the requested viewport with wrap-around image information, and/or a motion estimator to estimate motion information based on both color information and depth information. Other embodiments are disclosed and claimed.
    Type: Grant
    Filed: August 30, 2021
    Date of Patent: October 1, 2024
    Assignee: Intel Corporation
    Inventors: Joydeep Ray, Abhishek R. Appu, Stanley J. Baran, Sang-Hee Lee, Atthar H. Mohammed, Jong Dae Oh, Hiu-Fai R. Chan, Ximin Zhang
  • Publication number: 20240283927
    Abstract: In some compression techniques, in-loop filters can be included to effectively remove coding artifacts and improve the objective quality measurement at the same time. To avoid increasing complexity in the encoder, a single pass encoding solution can be implemented to efficiently and effectively make reasonably optimal in-loop filtering decisions. The solution can improve the in-loop filtering bit usage (e.g., reduce bitrate) and reduce the complexity in the encoder at the same time.
    Type: Application
    Filed: April 16, 2024
    Publication date: August 22, 2024
    Applicant: Intel Corporation
    Inventors: Ximin Zhang, Yi-jen Chiu, Keith W. Rowe
  • Publication number: 20240283952
    Abstract: A lightweight but effective adaptive coding tool selection system with content classification can be implemented to reduce complexity and maintain quality in a video encoder. Content classification may classify a current frame between at least three classifications: screen content, weak screen content, and natural content. Content classification may make use of two statistics, e.g., color number and variance, of blocks that are 8×8 pixels or larger in size. The statistics may be used to calculate three frame-level statistics, e.g., proportion/percentage of blocks with few colors, proportion/percentage of blocks with zero variance, and proportion/percentage of blocks with big/large variance. The frame-level statistics are used to classify the current frame. Based on the classification, coding tool control flags or control signals may be generated accordingly to configure the encoder to, e.g., turn on or off certain coding tools, and/or use certain parameter values for the coding tools.
    Type: Application
    Filed: April 16, 2024
    Publication date: August 22, 2024
    Applicant: Intel Corporation
    Inventors: Minzhi Sun, Ximin Zhang, Yi-jen Chiu, James Holland
  • Publication number: 20240259568
    Abstract: Different approaches for reducing complexity and computations in inter-prediction encoding are described. The approaches may involve one or more of quantization parameter and motion information being used to make a precision decision at a picture level that can improve compression efficiency. The approaches may involve finding prediction costs for the motion vector difference candidates and then performing rate-distortion optimization using the selected motion vector difference candidate having the lowest prediction cost. Prediction costs may be determined using sums of absolute transformed differences, which can be calculated efficiently in hardware. The rate-distortion cost of using merge mode with motion vector difference with the selected motion vector difference candidate may be compared against one or more other rate-distortion costs. In addition, in some scenarios, the approaches may involve finding prediction costs for a subset of the motion vector difference candidates to further reduce computations.
    Type: Application
    Filed: April 15, 2024
    Publication date: August 1, 2024
    Applicant: Intel Corporation
    Inventors: Qian Xu, Ximin Zhang, Yi-jen Chiu
  • Publication number: 20240214594
    Abstract: Techniques related to accelerated video enhancement using deep learning selectively applied based on video codec information are discussed. Such techniques include applying a deep learning video enhancement network selectively to decoded non-skip blocks that are in low quantization parameter frames, bypassing the deep learning network for decoded skip blocks in low quantization parameter frames, and applying non-deep learning video enhancement to high quantization parameter frames.
    Type: Application
    Filed: January 4, 2024
    Publication date: June 27, 2024
    Applicant: Intel Corporation
    Inventors: Chen Wang, Ximin Zhang, Huan Dou, Yi-Jen Chiu, Sang-Hee Lee
  • Patent number: 12022096
    Abstract: An example apparatus for encoding video frames includes a mask selector to select a subset of visual masks according to an actual target compression ratio and GOP configuration and a complexity estimator to estimate a picture level spatial/temporal complexity for a current frame. The example apparatus further includes a GOP adaptive visual mask selector to specify a visual mask from the subset of the visual masks corresponding to the estimated spatial and temporal complexity value a good enough picture QP deriver to derive a good enough picture QP value using the visual mask. The example apparatus also includes an adjustor to adjust the good enough picture QP value based on block level human visual system sensitivity and statistics of already encoded frames to obtain a final human visual system QP map.
    Type: Grant
    Filed: May 7, 2020
    Date of Patent: June 25, 2024
    Assignee: Intel Corporation
    Inventors: Ximin Zhang, Sang-Hee Lee, Keith Rowe
  • Publication number: 20240135485
    Abstract: The disclosure relates to tuning configuration parameters for graphics pipeline for better user experience. A device for graphics processing, comprising: hardware engines; a graphics pipeline at least partly implemented by the hardware engines; and a tuner, coupled to the hardware engines and the graphics pipeline, the tuner to: collect statuses of the device during runtime for a previous frame; determine configuration parameters based on the collected statuses, the configuration parameters associated with three-dimensional 3D rendering, pre-processing and video encoding of the graphics pipeline; and tune the graphics pipeline with the determined configuration parameters for processing a next frame.
    Type: Application
    Filed: September 1, 2023
    Publication date: April 25, 2024
    Applicant: Intel Corporation
    Inventors: Fan He, Yi Qian, Ning Luo, Yunbiao Lin, Changliang Wang, Ximin Zhang
  • Publication number: 20240107078
    Abstract: Methods, articles, and systems of image processing comprise obtaining image data of frames of a video sequence. The method also includes determining multiple reference frames of a current frame in the video sequence. The multiple reference frames each have at least one motion compensated (MC) block of image data. Also, the method then includes generating a weight that factors noise, distortion variance, and dispersion distribution between the same MC block position and the current block. Thereafter, the method includes generating denoised filtered image data comprising applying one of the weights to the image data of the motion compensated (MC) block.
    Type: Application
    Filed: November 30, 2023
    Publication date: March 28, 2024
    Applicant: Intel Corporation
    Inventors: Minzhi Sun, Ximin Zhang, Yi-jen Chiu
  • Patent number: 11889096
    Abstract: Techniques related to accelerated video enhancement using deep learning selectively applied based on video codec information are discussed. Such techniques include applying a deep learning video enhancement network selectively to decoded non-skip blocks that are in low quantization parameter frames, bypassing the deep learning network for decoded skip blocks in low quantization parameter frames, and applying non-deep learning video enhancement to high quantization parameter frames.
    Type: Grant
    Filed: June 26, 2020
    Date of Patent: January 30, 2024
    Assignee: Intel Corporation
    Inventors: Chen Wang, Ximin Zhang, Huan Dou, Yi-Jen Chiu, Sang-Hee Lee