Patents by Inventor Scott Labrozzi
Scott Labrozzi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12284360Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.Type: GrantFiled: October 19, 2023Date of Patent: April 22, 2025Assignees: DISNEY ENTERPRISES, INC., ETH ZÜRICH (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)Inventors: Yang Zhang, Mingyang Song, Christopher Richard Schroers, Tunc Ozan Aydin, Yuanyi Xue, Scott Labrozzi
-
Publication number: 20250126309Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. The first representation is analyzed to determine a first list of potential bitrates for the first portion of video. The method analyzes potential bitrates and quality associated with the respective potential bitrates to refine the first list of potential bitrates to a second list of bitrates. The second list of bitrates includes a different list of bitrates than the first list of potential bitrates. The method outputs the second list of bitrates for encoding the first portion of video.Type: ApplicationFiled: December 20, 2024Publication date: April 17, 2025Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi Xue, Xuchang Huangfu, Xiaobo Liu
-
Patent number: 12278969Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.Type: GrantFiled: August 4, 2023Date of Patent: April 15, 2025Assignees: Disney Enterprises, Inc., ETH Zurich (Eidgenossische Technische Hochschule Zurich)Inventors: Christopher Richard Schroers, Roberto Gerson de Albuquerque Azevedo, Nicholas David Gregory, Yuanyi Xue, Scott Labrozzi, Abdelaziz Djelouah
-
Publication number: 20250106408Abstract: In some embodiments, a method analyzes flagged locations from a plurality of locations in an encoding of a video to form a cluster of locations. Draft micro-chunk boundaries for the cluster are determined based on searching for a first start location and a first end location in the encoding. The method searches in a first search range before the first start location and a second search range after the first end location for a second start location in the first search range and a second end location in the second search range. The second start location and the second end location form a micro-chunk. An encoding parameter set is determined for the micro-chunk formed by the second start location and the second end location based on content characteristics of the micro-chunk. The method uses the encoding parameter set to encode the micro-chunk for insertion in the encoding of the video.Type: ApplicationFiled: September 25, 2023Publication date: March 27, 2025Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.Inventors: YUANYI XUE, Roberto Gerson De Albuquerque Azevedo, Christopher Richard Schroers, SCOTT LABROZZI, Wenhao Zhang
-
Publication number: 20250095115Abstract: In some embodiments, a grain analysis system is configured for analyzing a first video frame and outputting respective first film grain information for film grain that is included in the first video frame or configured for analyzing a second video frame and outputting second film grain information. At least one of a grain removal system and a grain synthesis system is included. The grain removal system is configured for removing the film grain from the first video frame using the first film grain information to generate a third video frame corresponding to the first video frame with film grain removed. The grain analysis system is separate from the grain removal system. The grain synthesis system is configured for synthesizing film grain for the third video frame using the first film grain information or the second film grain information. The grain analysis system is separate from the grain synthesis system.Type: ApplicationFiled: September 20, 2023Publication date: March 20, 2025Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd., ETH Zürich (Eidgenössische Technische Hochschule Zürich)Inventors: Abdelaziz Djelouah, Yang Zhang, Roberto Gerson De Albuquerque Azevedo, Elham Amin Mansour, Mingyang Song, Christopher Richard Schroers, Yuanyi Xue, Scott Labrozzi, Wenhao Zhang, Xuewei Meng, Jeroen Schulte
-
Patent number: 12236979Abstract: A system includes processing hardware and a memory storing software code. The processing hardware executes the software code to receive automation data for media content having a default playback experience, analyze, using the automation data, at least one parameter of the media content, and generate, based on the analyzing, one or more automation instruction(s) for at least one portion(s) of the media content. The automation instruction(s) include at least one of: one or more bounding timestamps of the media content portion(s), an increased or reduced playback speed for the media content portion(s) relative to the default playback experience, or a variable playback speed for the media content portion(s). The software code is further executed to outputs the automation instruction(s) to a media delivery platform configured to distribute and control the quality of the media content or to a media player configured to automate playback of the media content.Type: GrantFiled: July 14, 2022Date of Patent: February 25, 2025Assignee: Disney Enterprises, Inc.Inventors: Manuel Briand, Yuanyi Xue, Nathan Crowe, Scott Labrozzi, Michael Bracco, Mugdha Oltikar
-
Patent number: 12225252Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.Type: GrantFiled: March 6, 2023Date of Patent: February 11, 2025Assignees: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi Xue, Xuchang Huangfu, Xiaobo Liu
-
Publication number: 20250016382Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.Type: ApplicationFiled: September 13, 2024Publication date: January 9, 2025Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
-
Publication number: 20240430440Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.Type: ApplicationFiled: October 19, 2023Publication date: December 26, 2024Applicants: Disney Enterprises, Inc., ETH Zürich (Eidgenössische Technische Hochschule Zürich)Inventors: Yang Zhang, Mingyang Song, Christopher Richard Schroers, Tunc Ozan Aydin, Yuanyi Xue, Scott Labrozzi
-
Publication number: 20240422380Abstract: A system includes a hardware processor and a memory storing a video/audio (V/A) synchronizer including video and audio encoders. The hardware processor executes the V/A synchronizer to receive raw video and audio extracted from media content, partition the raw video into video frame patches, partition the raw audio into audio samples, pre-process the video frame patches and the audio samples for encoding. The hardware processor further executes the V/A synchronizer to encode, using the video encoder, the pre-processed video frame patches to provide pre-processed and encoded video frame patches used to provide a latent representation of the raw video, encode, using the audio encoder, the pre-processed audio samples to provide pre-processed and encoded audio samples used to provide a latent representation of the raw audio, and synchronize, using the latent representations of the raw video and the raw audio, the raw audio with the raw video.Type: ApplicationFiled: May 24, 2024Publication date: December 19, 2024Inventors: Clara Fernandez Labrador, Cafer Mertcan Akcay, Christopher Richard Schroers, Joan Massich Vall, Scott Labrozzi, Mitchel Jacobs, Katherine Hinsen, Eitan Abecassis
-
Publication number: 20240362896Abstract: In some embodiments, a method sends information for a sample of content, a first question, and a second question for output on an interface. The first question receives, from a subject, a first response for a sample level rating for an artifact that is perceived to be visible in the sample and the second question receives, from the subject, a second response for regions in the sample that are perceived to contain the artifact. The method receives the first response for the sample level rating and the second response for regions that are perceived to contain the artifact. First responses are combined from multiple subjects to generate an opinion score for the sample and second responses are combined to generate region scores for regions. The method generates training data from the opinion score and the region scores to train a process to perform an action based on the artifacts.Type: ApplicationFiled: April 11, 2024Publication date: October 31, 2024Applicants: Disney Enterprises, Inc., Beijing Hulu Software Technology Development Co., Ltd.Inventors: Yuanyi XUE, Scott LABROZZI, Wenhao ZHANG, Christopher Richard SCHROERS, Roberto Gerson DE ALBUQUERQUE AZEVEDO, Xuchang HUANGFU, Lemei HUANG, Yang ZHANG
-
Patent number: 12126879Abstract: A system includes a computing platform having processing hardware, and a memory storing software code. The software code is executed to receive content having a sequence of content segments, and marker data identifying a location within the sequence, identify, using the content and the marker data, segment boundaries of a content segment containing the location, determine, using the location and the segment boundaries, whether the location is situated within a predetermined interval of one of the segment boundaries, and re-encode a subsection of the sequence to produce a new segment boundary at the location. When the location is not situated within the predetermined interval, the subsection of the sequence includes the content segment containing the location. When the location is situated within the predetermined interval, the subsection of the sequence includes the content segment containing the location and a content segment adjoining the content segment containing the location.Type: GrantFiled: July 8, 2022Date of Patent: October 22, 2024Assignee: Disney Enterprises, Inc.Inventors: Scott Labrozzi, William B. May, Jr.
-
Patent number: 12120359Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.Type: GrantFiled: March 25, 2022Date of Patent: October 15, 2024Assignees: Disney Enterprises, Inc., ETH Zürich (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson De Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
-
Publication number: 20240305842Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.Type: ApplicationFiled: March 6, 2023Publication date: September 12, 2024Applicants: Beijing Hulu Software Technology Development Co., Ltd., Disney Enterprises, Inc.Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi XUE, Xuchang Huangfu, Xiaobo Liu
-
Patent number: 12087024Abstract: According to one implementation, an image compression system includes a computing platform having a hardware processor and a system memory storing a software code. The hardware processor executes the software code to receive an input image, transform the input image to a latent space representation of the input image, and quantize the latent space representation of the input image to produce multiple quantized latents. The hardware processor further executes the software code to encode the quantized latents using a probability density function of the latent space representation of the input image, to generate a bitstream, and convert the bitstream into an output image corresponding to the input image. The probability density function of the latent space representation of the input image is obtained based on a normalizing flow mapping of one of the input image or the latent space representation of the input image.Type: GrantFiled: March 6, 2020Date of Patent: September 10, 2024Assignees: Disney Enterprises, Inc., ETH ZurichInventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Scott Labrozzi, Yuanyi Xue, Erika Varis Doggett, Jared McPhillen, Christopher Richard Schroers
-
Publication number: 20240283957Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.Type: ApplicationFiled: May 2, 2024Publication date: August 22, 2024Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue
-
Publication number: 20240267534Abstract: A method receives a video. The method analyzes information for a pixel of a frame in the video to determine a first value and a second value for the pixel. The first value is based on an image structure formed by the pixel in the frame and the second value is based on interframe motion of the image structure at the pixel. A third value is determined for an amount of judder based on the first value and the second value. The method outputs the third value to evaluate the video.Type: ApplicationFiled: February 5, 2024Publication date: August 8, 2024Applicant: Disney Enterprises, Inc.Inventors: Christopher Richard Schroers, Blake Sloan, Mitchel Jacobs, Scott Labrozzi, Shinobu Hattori, Felix Klose
-
Patent number: 12010335Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.Type: GrantFiled: March 25, 2022Date of Patent: June 11, 2024Assignee: Disney Enterprises, Inc.Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue
-
Patent number: 11983906Abstract: Systems and methods for predicting a target set of pixels are disclosed. In one embodiment, a method may include obtaining target content. The target content may include a target set of pixels to be predicted. The method may also include convolving the target set of pixels to generate an estimated set of pixels. The method may include matching a second set of pixels in the target content to the target set of pixels. The second set of pixels may be within a distance from the target set of pixels. The method may include refining the estimated set of pixels to generate a refined set of pixels using a second set of pixels in the target content.Type: GrantFiled: March 25, 2022Date of Patent: May 14, 2024Assignee: Disney Enterprises, Inc.Inventors: Christopher Schroers, Erika Doggett, Stephan Mandt, Jared Mcphillen, Scott Labrozzi, Romann Weber, Mauro Bamert
-
Publication number: 20240040167Abstract: A system includes a computing platform having processing hardware, and a memory storing software code. The software code is executed to receive digital content indexed to a timeline, receive insertion data identifying a timecode of the timeline, and encode the digital content using the insertion data to provide segmented content having a segment boundary at the timecode, and first and second segments adjoining the segment boundary, wherein the first segment precedes, and the second segment succeeds, the segment boundary. The software code also re-processes the first and second segments to apply a fade-out within or to the first segment and a fade-in within or to the second segment, wherein re-processing the first and second segments provides encoded segments having the segment boundary configured as an insertion point for supplemental content.Type: ApplicationFiled: July 26, 2022Publication date: February 1, 2024Inventor: Scott Labrozzi