Patents by Inventor Scott Labrozzi

Scott Labrozzi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12284360
    Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.
    Type: Grant
    Filed: October 19, 2023
    Date of Patent: April 22, 2025
    Assignees: DISNEY ENTERPRISES, INC., ETH ZÜRICH (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)
    Inventors: Yang Zhang, Mingyang Song, Christopher Richard Schroers, Tunc Ozan Aydin, Yuanyi Xue, Scott Labrozzi
  • Publication number: 20250126309
    Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. The first representation is analyzed to determine a first list of potential bitrates for the first portion of video. The method analyzes potential bitrates and quality associated with the respective potential bitrates to refine the first list of potential bitrates to a second list of bitrates. The second list of bitrates includes a different list of bitrates than the first list of potential bitrates. The method outputs the second list of bitrates for encoding the first portion of video.
    Type: Application
    Filed: December 20, 2024
    Publication date: April 17, 2025
    Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.
    Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi Xue, Xuchang Huangfu, Xiaobo Liu
  • Patent number: 12278969
    Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.
    Type: Grant
    Filed: August 4, 2023
    Date of Patent: April 15, 2025
    Assignees: Disney Enterprises, Inc., ETH Zurich (Eidgenossische Technische Hochschule Zurich)
    Inventors: Christopher Richard Schroers, Roberto Gerson de Albuquerque Azevedo, Nicholas David Gregory, Yuanyi Xue, Scott Labrozzi, Abdelaziz Djelouah
  • Publication number: 20250106408
    Abstract: In some embodiments, a method analyzes flagged locations from a plurality of locations in an encoding of a video to form a cluster of locations. Draft micro-chunk boundaries for the cluster are determined based on searching for a first start location and a first end location in the encoding. The method searches in a first search range before the first start location and a second search range after the first end location for a second start location in the first search range and a second end location in the second search range. The second start location and the second end location form a micro-chunk. An encoding parameter set is determined for the micro-chunk formed by the second start location and the second end location based on content characteristics of the micro-chunk. The method uses the encoding parameter set to encode the micro-chunk for insertion in the encoding of the video.
    Type: Application
    Filed: September 25, 2023
    Publication date: March 27, 2025
    Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.
    Inventors: YUANYI XUE, Roberto Gerson De Albuquerque Azevedo, Christopher Richard Schroers, SCOTT LABROZZI, Wenhao Zhang
  • Publication number: 20250095115
    Abstract: In some embodiments, a grain analysis system is configured for analyzing a first video frame and outputting respective first film grain information for film grain that is included in the first video frame or configured for analyzing a second video frame and outputting second film grain information. At least one of a grain removal system and a grain synthesis system is included. The grain removal system is configured for removing the film grain from the first video frame using the first film grain information to generate a third video frame corresponding to the first video frame with film grain removed. The grain analysis system is separate from the grain removal system. The grain synthesis system is configured for synthesizing film grain for the third video frame using the first film grain information or the second film grain information. The grain analysis system is separate from the grain synthesis system.
    Type: Application
    Filed: September 20, 2023
    Publication date: March 20, 2025
    Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd., ETH Zürich (Eidgenössische Technische Hochschule Zürich)
    Inventors: Abdelaziz Djelouah, Yang Zhang, Roberto Gerson De Albuquerque Azevedo, Elham Amin Mansour, Mingyang Song, Christopher Richard Schroers, Yuanyi Xue, Scott Labrozzi, Wenhao Zhang, Xuewei Meng, Jeroen Schulte
  • Patent number: 12236979
    Abstract: A system includes processing hardware and a memory storing software code. The processing hardware executes the software code to receive automation data for media content having a default playback experience, analyze, using the automation data, at least one parameter of the media content, and generate, based on the analyzing, one or more automation instruction(s) for at least one portion(s) of the media content. The automation instruction(s) include at least one of: one or more bounding timestamps of the media content portion(s), an increased or reduced playback speed for the media content portion(s) relative to the default playback experience, or a variable playback speed for the media content portion(s). The software code is further executed to outputs the automation instruction(s) to a media delivery platform configured to distribute and control the quality of the media content or to a media player configured to automate playback of the media content.
    Type: Grant
    Filed: July 14, 2022
    Date of Patent: February 25, 2025
    Assignee: Disney Enterprises, Inc.
    Inventors: Manuel Briand, Yuanyi Xue, Nathan Crowe, Scott Labrozzi, Michael Bracco, Mugdha Oltikar
  • Patent number: 12225252
    Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.
    Type: Grant
    Filed: March 6, 2023
    Date of Patent: February 11, 2025
    Assignees: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.
    Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi Xue, Xuchang Huangfu, Xiaobo Liu
  • Publication number: 20250016382
    Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.
    Type: Application
    Filed: September 13, 2024
    Publication date: January 9, 2025
    Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
  • Publication number: 20240430440
    Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.
    Type: Application
    Filed: October 19, 2023
    Publication date: December 26, 2024
    Applicants: Disney Enterprises, Inc., ETH Zürich (Eidgenössische Technische Hochschule Zürich)
    Inventors: Yang Zhang, Mingyang Song, Christopher Richard Schroers, Tunc Ozan Aydin, Yuanyi Xue, Scott Labrozzi
  • Publication number: 20240422380
    Abstract: A system includes a hardware processor and a memory storing a video/audio (V/A) synchronizer including video and audio encoders. The hardware processor executes the V/A synchronizer to receive raw video and audio extracted from media content, partition the raw video into video frame patches, partition the raw audio into audio samples, pre-process the video frame patches and the audio samples for encoding. The hardware processor further executes the V/A synchronizer to encode, using the video encoder, the pre-processed video frame patches to provide pre-processed and encoded video frame patches used to provide a latent representation of the raw video, encode, using the audio encoder, the pre-processed audio samples to provide pre-processed and encoded audio samples used to provide a latent representation of the raw audio, and synchronize, using the latent representations of the raw video and the raw audio, the raw audio with the raw video.
    Type: Application
    Filed: May 24, 2024
    Publication date: December 19, 2024
    Inventors: Clara Fernandez Labrador, Cafer Mertcan Akcay, Christopher Richard Schroers, Joan Massich Vall, Scott Labrozzi, Mitchel Jacobs, Katherine Hinsen, Eitan Abecassis
  • Publication number: 20240362896
    Abstract: In some embodiments, a method sends information for a sample of content, a first question, and a second question for output on an interface. The first question receives, from a subject, a first response for a sample level rating for an artifact that is perceived to be visible in the sample and the second question receives, from the subject, a second response for regions in the sample that are perceived to contain the artifact. The method receives the first response for the sample level rating and the second response for regions that are perceived to contain the artifact. First responses are combined from multiple subjects to generate an opinion score for the sample and second responses are combined to generate region scores for regions. The method generates training data from the opinion score and the region scores to train a process to perform an action based on the artifacts.
    Type: Application
    Filed: April 11, 2024
    Publication date: October 31, 2024
    Applicants: Disney Enterprises, Inc., Beijing Hulu Software Technology Development Co., Ltd.
    Inventors: Yuanyi XUE, Scott LABROZZI, Wenhao ZHANG, Christopher Richard SCHROERS, Roberto Gerson DE ALBUQUERQUE AZEVEDO, Xuchang HUANGFU, Lemei HUANG, Yang ZHANG
  • Patent number: 12126879
    Abstract: A system includes a computing platform having processing hardware, and a memory storing software code. The software code is executed to receive content having a sequence of content segments, and marker data identifying a location within the sequence, identify, using the content and the marker data, segment boundaries of a content segment containing the location, determine, using the location and the segment boundaries, whether the location is situated within a predetermined interval of one of the segment boundaries, and re-encode a subsection of the sequence to produce a new segment boundary at the location. When the location is not situated within the predetermined interval, the subsection of the sequence includes the content segment containing the location. When the location is situated within the predetermined interval, the subsection of the sequence includes the content segment containing the location and a content segment adjoining the content segment containing the location.
    Type: Grant
    Filed: July 8, 2022
    Date of Patent: October 22, 2024
    Assignee: Disney Enterprises, Inc.
    Inventors: Scott Labrozzi, William B. May, Jr.
  • Patent number: 12120359
    Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.
    Type: Grant
    Filed: March 25, 2022
    Date of Patent: October 15, 2024
    Assignees: Disney Enterprises, Inc., ETH Zürich (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)
    Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson De Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
  • Publication number: 20240305842
    Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.
    Type: Application
    Filed: March 6, 2023
    Publication date: September 12, 2024
    Applicants: Beijing Hulu Software Technology Development Co., Ltd., Disney Enterprises, Inc.
    Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi XUE, Xuchang Huangfu, Xiaobo Liu
  • Patent number: 12087024
    Abstract: According to one implementation, an image compression system includes a computing platform having a hardware processor and a system memory storing a software code. The hardware processor executes the software code to receive an input image, transform the input image to a latent space representation of the input image, and quantize the latent space representation of the input image to produce multiple quantized latents. The hardware processor further executes the software code to encode the quantized latents using a probability density function of the latent space representation of the input image, to generate a bitstream, and convert the bitstream into an output image corresponding to the input image. The probability density function of the latent space representation of the input image is obtained based on a normalizing flow mapping of one of the input image or the latent space representation of the input image.
    Type: Grant
    Filed: March 6, 2020
    Date of Patent: September 10, 2024
    Assignees: Disney Enterprises, Inc., ETH Zurich
    Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Scott Labrozzi, Yuanyi Xue, Erika Varis Doggett, Jared McPhillen, Christopher Richard Schroers
  • Publication number: 20240283957
    Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.
    Type: Application
    Filed: May 2, 2024
    Publication date: August 22, 2024
    Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue
  • Publication number: 20240267534
    Abstract: A method receives a video. The method analyzes information for a pixel of a frame in the video to determine a first value and a second value for the pixel. The first value is based on an image structure formed by the pixel in the frame and the second value is based on interframe motion of the image structure at the pixel. A third value is determined for an amount of judder based on the first value and the second value. The method outputs the third value to evaluate the video.
    Type: Application
    Filed: February 5, 2024
    Publication date: August 8, 2024
    Applicant: Disney Enterprises, Inc.
    Inventors: Christopher Richard Schroers, Blake Sloan, Mitchel Jacobs, Scott Labrozzi, Shinobu Hattori, Felix Klose
  • Patent number: 12010335
    Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.
    Type: Grant
    Filed: March 25, 2022
    Date of Patent: June 11, 2024
    Assignee: Disney Enterprises, Inc.
    Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue
  • Patent number: 11983906
    Abstract: Systems and methods for predicting a target set of pixels are disclosed. In one embodiment, a method may include obtaining target content. The target content may include a target set of pixels to be predicted. The method may also include convolving the target set of pixels to generate an estimated set of pixels. The method may include matching a second set of pixels in the target content to the target set of pixels. The second set of pixels may be within a distance from the target set of pixels. The method may include refining the estimated set of pixels to generate a refined set of pixels using a second set of pixels in the target content.
    Type: Grant
    Filed: March 25, 2022
    Date of Patent: May 14, 2024
    Assignee: Disney Enterprises, Inc.
    Inventors: Christopher Schroers, Erika Doggett, Stephan Mandt, Jared Mcphillen, Scott Labrozzi, Romann Weber, Mauro Bamert
  • Publication number: 20240040167
    Abstract: A system includes a computing platform having processing hardware, and a memory storing software code. The software code is executed to receive digital content indexed to a timeline, receive insertion data identifying a timecode of the timeline, and encode the digital content using the insertion data to provide segmented content having a segment boundary at the timecode, and first and second segments adjoining the segment boundary, wherein the first segment precedes, and the second segment succeeds, the segment boundary. The software code also re-processes the first and second segments to apply a fade-out within or to the first segment and a fade-in within or to the second segment, wherein re-processing the first and second segments provides encoded segments having the segment boundary configured as an insertion point for supplemental content.
    Type: Application
    Filed: July 26, 2022
    Publication date: February 1, 2024
    Inventor: Scott Labrozzi