Patents by Inventor Yuanyi XUE

Yuanyi XUE has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12284360
    Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.
    Type: Grant
    Filed: October 19, 2023
    Date of Patent: April 22, 2025
    Assignees: DISNEY ENTERPRISES, INC., ETH ZÜRICH (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)
    Inventors: Yang Zhang, Mingyang Song, Christopher Richard Schroers, Tunc Ozan Aydin, Yuanyi Xue, Scott Labrozzi
  • Publication number: 20250126309
    Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. The first representation is analyzed to determine a first list of potential bitrates for the first portion of video. The method analyzes potential bitrates and quality associated with the respective potential bitrates to refine the first list of potential bitrates to a second list of bitrates. The second list of bitrates includes a different list of bitrates than the first list of potential bitrates. The method outputs the second list of bitrates for encoding the first portion of video.
    Type: Application
    Filed: December 20, 2024
    Publication date: April 17, 2025
    Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.
    Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi Xue, Xuchang Huangfu, Xiaobo Liu
  • Patent number: 12278969
    Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.
    Type: Grant
    Filed: August 4, 2023
    Date of Patent: April 15, 2025
    Assignees: Disney Enterprises, Inc., ETH Zurich (Eidgenossische Technische Hochschule Zurich)
    Inventors: Christopher Richard Schroers, Roberto Gerson de Albuquerque Azevedo, Nicholas David Gregory, Yuanyi Xue, Scott Labrozzi, Abdelaziz Djelouah
  • Publication number: 20250117909
    Abstract: In some embodiments, a method receives a first image and a second image for a comparison of film grain. The first image and the second image are converted from a spatial domain to a frequency domain to generate a first frequency domain representation for the first image and a second frequency domain representation of the second image. The method compares a first distribution of frequency components from the first frequency domain representation to a second distribution of frequency components from the second frequency domain representation. A score for an assessment of differences of the film grain in the first image and the second image is generated based on the comparing.
    Type: Application
    Filed: September 18, 2024
    Publication date: April 10, 2025
    Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.
    Inventors: Xuewei Meng, Wenhao Zhang, Chen Liu, Xuchang Huangfu, Yuanyi Xue
  • Publication number: 20250106408
    Abstract: In some embodiments, a method analyzes flagged locations from a plurality of locations in an encoding of a video to form a cluster of locations. Draft micro-chunk boundaries for the cluster are determined based on searching for a first start location and a first end location in the encoding. The method searches in a first search range before the first start location and a second search range after the first end location for a second start location in the first search range and a second end location in the second search range. The second start location and the second end location form a micro-chunk. An encoding parameter set is determined for the micro-chunk formed by the second start location and the second end location based on content characteristics of the micro-chunk. The method uses the encoding parameter set to encode the micro-chunk for insertion in the encoding of the video.
    Type: Application
    Filed: September 25, 2023
    Publication date: March 27, 2025
    Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.
    Inventors: YUANYI XUE, Roberto Gerson De Albuquerque Azevedo, Christopher Richard Schroers, SCOTT LABROZZI, Wenhao Zhang
  • Publication number: 20250095115
    Abstract: In some embodiments, a grain analysis system is configured for analyzing a first video frame and outputting respective first film grain information for film grain that is included in the first video frame or configured for analyzing a second video frame and outputting second film grain information. At least one of a grain removal system and a grain synthesis system is included. The grain removal system is configured for removing the film grain from the first video frame using the first film grain information to generate a third video frame corresponding to the first video frame with film grain removed. The grain analysis system is separate from the grain removal system. The grain synthesis system is configured for synthesizing film grain for the third video frame using the first film grain information or the second film grain information. The grain analysis system is separate from the grain synthesis system.
    Type: Application
    Filed: September 20, 2023
    Publication date: March 20, 2025
    Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd., ETH Zürich (Eidgenössische Technische Hochschule Zürich)
    Inventors: Abdelaziz Djelouah, Yang Zhang, Roberto Gerson De Albuquerque Azevedo, Elham Amin Mansour, Mingyang Song, Christopher Richard Schroers, Yuanyi Xue, Scott Labrozzi, Wenhao Zhang, Xuewei Meng, Jeroen Schulte
  • Publication number: 20250080797
    Abstract: One embodiment of the present invention sets forth a technique for inserting content into a media program. The technique includes determining a plurality of markers corresponding to a plurality of locations within a media program. The technique also includes for each marker included in the plurality of markers, automatically analyzing a first set of intervals within the media program that lead up to the marker and a second set of intervals within the media program that immediately follow the marker and determine a set of audio characteristics associated with the first set of intervals and the second set of intervals. The technique further includes generating a plurality of scores for the plurality of markers based on the set of audio characteristics for each marker and inserting additional content at one or more markers included in the plurality of markers based on the plurality of scores.
    Type: Application
    Filed: December 13, 2022
    Publication date: March 6, 2025
    Inventors: Yuanyi XUE, Michael John BRACCO, Scott Christopher LABROZZI, Christopher Richard SCHROERS, Wenhao ZHANG
  • Patent number: 12236979
    Abstract: A system includes processing hardware and a memory storing software code. The processing hardware executes the software code to receive automation data for media content having a default playback experience, analyze, using the automation data, at least one parameter of the media content, and generate, based on the analyzing, one or more automation instruction(s) for at least one portion(s) of the media content. The automation instruction(s) include at least one of: one or more bounding timestamps of the media content portion(s), an increased or reduced playback speed for the media content portion(s) relative to the default playback experience, or a variable playback speed for the media content portion(s). The software code is further executed to outputs the automation instruction(s) to a media delivery platform configured to distribute and control the quality of the media content or to a media player configured to automate playback of the media content.
    Type: Grant
    Filed: July 14, 2022
    Date of Patent: February 25, 2025
    Assignee: Disney Enterprises, Inc.
    Inventors: Manuel Briand, Yuanyi Xue, Nathan Crowe, Scott Labrozzi, Michael Bracco, Mugdha Oltikar
  • Patent number: 12225272
    Abstract: One embodiment of the present invention sets forth a technique for inserting content into a media program. The technique includes determining a plurality of markers corresponding to a plurality of locations within a media program. The technique also includes for each marker included in the plurality of markers, automatically analyzing a first set of intervals within the media program that lead up to the marker and a second set of intervals within the media program that immediately follow the marker and determine a set of audio characteristics associated with the first set of intervals and the second set of intervals. The technique further includes generating a plurality of scores for the plurality of markers based on the set of audio characteristics for each marker and inserting additional content at one or more markers included in the plurality of markers based on the plurality of scores.
    Type: Grant
    Filed: February 24, 2023
    Date of Patent: February 11, 2025
    Assignees: Disney Enterprises, Inc., BEIJING YOJAJA SOFTWARE TECHNOLOGY DEVELOPMENT CO., LTD.
    Inventors: Yuanyi Xue, Michael John Bracco, Scott Christopher Labrozzi, Christopher Richard Schroers, Wenhao Zhang
  • Patent number: 12225252
    Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.
    Type: Grant
    Filed: March 6, 2023
    Date of Patent: February 11, 2025
    Assignees: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.
    Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi Xue, Xuchang Huangfu, Xiaobo Liu
  • Publication number: 20250016382
    Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.
    Type: Application
    Filed: September 13, 2024
    Publication date: January 9, 2025
    Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
  • Publication number: 20240430440
    Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.
    Type: Application
    Filed: October 19, 2023
    Publication date: December 26, 2024
    Applicants: Disney Enterprises, Inc., ETH Zürich (Eidgenössische Technische Hochschule Zürich)
    Inventors: Yang Zhang, Mingyang Song, Christopher Richard Schroers, Tunc Ozan Aydin, Yuanyi Xue, Scott Labrozzi
  • Publication number: 20240362896
    Abstract: In some embodiments, a method sends information for a sample of content, a first question, and a second question for output on an interface. The first question receives, from a subject, a first response for a sample level rating for an artifact that is perceived to be visible in the sample and the second question receives, from the subject, a second response for regions in the sample that are perceived to contain the artifact. The method receives the first response for the sample level rating and the second response for regions that are perceived to contain the artifact. First responses are combined from multiple subjects to generate an opinion score for the sample and second responses are combined to generate region scores for regions. The method generates training data from the opinion score and the region scores to train a process to perform an action based on the artifacts.
    Type: Application
    Filed: April 11, 2024
    Publication date: October 31, 2024
    Applicants: Disney Enterprises, Inc., Beijing Hulu Software Technology Development Co., Ltd.
    Inventors: Yuanyi XUE, Scott LABROZZI, Wenhao ZHANG, Christopher Richard SCHROERS, Roberto Gerson DE ALBUQUERQUE AZEVEDO, Xuchang HUANGFU, Lemei HUANG, Yang ZHANG
  • Patent number: 12120359
    Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.
    Type: Grant
    Filed: March 25, 2022
    Date of Patent: October 15, 2024
    Assignees: Disney Enterprises, Inc., ETH Zürich (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)
    Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson De Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
  • Publication number: 20240305842
    Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.
    Type: Application
    Filed: March 6, 2023
    Publication date: September 12, 2024
    Applicants: Beijing Hulu Software Technology Development Co., Ltd., Disney Enterprises, Inc.
    Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi XUE, Xuchang Huangfu, Xiaobo Liu
  • Patent number: 12087024
    Abstract: According to one implementation, an image compression system includes a computing platform having a hardware processor and a system memory storing a software code. The hardware processor executes the software code to receive an input image, transform the input image to a latent space representation of the input image, and quantize the latent space representation of the input image to produce multiple quantized latents. The hardware processor further executes the software code to encode the quantized latents using a probability density function of the latent space representation of the input image, to generate a bitstream, and convert the bitstream into an output image corresponding to the input image. The probability density function of the latent space representation of the input image is obtained based on a normalizing flow mapping of one of the input image or the latent space representation of the input image.
    Type: Grant
    Filed: March 6, 2020
    Date of Patent: September 10, 2024
    Assignees: Disney Enterprises, Inc., ETH Zurich
    Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Scott Labrozzi, Yuanyi Xue, Erika Varis Doggett, Jared McPhillen, Christopher Richard Schroers
  • Publication number: 20240283957
    Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.
    Type: Application
    Filed: May 2, 2024
    Publication date: August 22, 2024
    Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue
  • Publication number: 20240196066
    Abstract: One embodiment of the present invention sets forth a technique for inserting content into a media program. The technique includes determining a plurality of markers corresponding to a plurality of locations within a media program. The technique also includes for each marker included in the plurality of markers, automatically analyzing a first set of intervals within the media program that lead up to the marker and a second set of intervals within the media program that immediately follow the marker and determine a set of audio characteristics associated with the first set of intervals and the second set of intervals. The technique further includes generating a plurality of scores for the plurality of markers based on the set of audio characteristics for each marker and inserting additional content at one or more markers included in the plurality of markers based on the plurality of scores.
    Type: Application
    Filed: February 24, 2023
    Publication date: June 13, 2024
    Inventors: Yuanyi XUE, Michael John BRACCO, Scott Christopher LABROZZI, Christopher Richard SCHROERS, Wenhao ZHANG
  • Patent number: 12010369
    Abstract: Embodiments provide for improved stream generation. A first encoded segment is generated by encoding a first segment, of a plurality of segments in a media asset, using a first bitrate of a plurality of bitrates specified in an encoding ladder. A second encoded segment is generated by encoding the first segment using a second bitrate, where the second bitrate is lower than the first bitrate. Upon receiving a request for the first segment at the first bitrate, the second encoded segment is output based at least in part on determining that a first quality of the second encoded segment is within a tolerance of a second quality of the first encoded segment.
    Type: Grant
    Filed: January 31, 2023
    Date of Patent: June 11, 2024
    Assignee: Disney Enterprises, Inc.
    Inventors: Scott C. Labrozzi, Chetan K. Mathur, Yuanyi Xue, Michael J. Bracco
  • Patent number: 12010335
    Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.
    Type: Grant
    Filed: March 25, 2022
    Date of Patent: June 11, 2024
    Assignee: Disney Enterprises, Inc.
    Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue