Patents by Inventor Aljoscha Smolic

Aljoscha Smolic has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11936936
    Abstract: A method including receiving video of an event; generating an overlay for the video; generating an information message containing information enabling a receiver of the video and the overlay to selectively display or hide the overlay; and transmitting the video, the overlay, and the information message. The video is transmitted in a primary stream of a multi-stream transmission including a primary stream and one or more auxiliary streams. The overlay is transmitted in a first one of the auxiliary streams.
    Type: Grant
    Filed: October 9, 2014
    Date of Patent: March 19, 2024
    Assignee: DISNEY ENTERPRISES, INC.
    Inventors: Aljoscha Smolic, Nikolce Stefanoski, Oliver Wang
  • Patent number: 11140440
    Abstract: Novel systems and methods are described for creating, compressing, and distributing video or image content graded for a plurality of displays with different dynamic ranges. In implementations, the created content is “continuous dynamic range” (CDR) content—a novel representation of pixel-luminance as a function of display dynamic range. The creation of the CDR content includes grading a source content for a minimum dynamic range and a maximum dynamic range, and defining a luminance of each pixel of an image or video frame of the source content as a continuous function between the minimum and the maximum dynamic ranges. In additional implementations, a novel graphical user interface for creating and editing the CDR content is described.
    Type: Grant
    Filed: May 2, 2019
    Date of Patent: October 5, 2021
    Assignee: Disney Enterprises, Inc.
    Inventors: Aljoscha Smolic, Alexandre Chapiro, Simone Croci, Tunc Ozan Aydin, Nikolce Stefanoski, Markus Gross
  • Patent number: 11010398
    Abstract: There is provided a system including a computing platform having a hardware processor and a memory, and a metadata extraction and management unit stored in the memory. The hardware processor is configured to execute the metadata extraction and management unit to extract a plurality of metadata types from a media asset sequentially and in accordance with a prioritized order of extraction based on metadata type, aggregate the plurality of metadata types to produce an aggregated metadata describing the media asset, use the aggregated metadata to include at least one database entry in a graphical database, wherein the at least one database entry describes the media asset, display a user interface for a user to view tags of metadata associated with the media asset, and correcting presence of one of the tags of metadata associated with the media asset, in response to an input from the user via the user interface.
    Type: Grant
    Filed: May 21, 2018
    Date of Patent: May 18, 2021
    Assignee: Disney Enterprises, Inc.
    Inventors: Miquel Angel Farre Guiu, Marc Junyent Martin, Jordi Pont-Tuset, Pablo Beltran, Nimesh Narayan, Leonid Sigal, Aljoscha Smolic, Anthony M. Accardo
  • Patent number: 10699396
    Abstract: Systems and methods are disclosed for weighting the image quality prediction of any visual-attention-agnostic quality metric with a saliency map. By accounting for the salient regions of an image or video frame, the disclosed systems and methods may dramatically improve the precision of the visual-attention-agnostic quality metric during image or video quality assessment. In one implementation, a method of saliency-weighted video quality assessment includes: determining a per-pixel image quality vector of an encoded video frame; determining per-pixel saliency values of the encoded video frame or a reference video frame corresponding to the encoded video frame; and computing a saliency-weighted image quality metric of the encoded video frame by weighting the per-pixel image quality vector using the per-pixel saliency values.
    Type: Grant
    Filed: February 1, 2018
    Date of Patent: June 30, 2020
    Assignee: Disney Enterprises, Inc.
    Inventors: Tunc Ozan Aydin, Nikolce Stefanoski, Aljoscha Smolic, Mark Arana
  • Patent number: 10664973
    Abstract: There is provided a system including a memory and a processor configured to obtain a first frame of a video content including an object and a first region based on a segmentation hierarchy of the first frame, insert a synthetic object into the first frame, merge an object segmentation hierarchy of the synthetic object with the segmentation hierarchy of the first frame to create a merged segmentation hierarchy, select a second region based on the merged segmentation hierarchy, provide the first frame including the first region and the second region to a crowd user for creating a corrected frame, receive the corrected frame from the crowd user including a first corrected region including the object and a second corrected region including the synthetic object, determine a quality based on the synthetic object and the second corrected region, and accept the first corrected region based on the quality.
    Type: Grant
    Filed: July 10, 2018
    Date of Patent: May 26, 2020
    Assignee: Disney Enterprises, Inc.
    Inventors: Miquel Angel Farre Guiu, Marc Junyent Martin, Aljoscha Smolic
  • Publication number: 20190261049
    Abstract: Novel systems and methods are described for creating, compressing, and distributing video or image content graded for a plurality of displays with different dynamic ranges. In implementations, the created content is “continuous dynamic range” (CDR) content—a novel representation of pixel-luminance as a function of display dynamic range. The creation of the CDR content includes grading a source content for a minimum dynamic range and a maximum dynamic range, and defining a luminance of each pixel of an image or video frame of the source content as a continuous function between the minimum and the maximum dynamic ranges. In additional implementations, a novel graphical user interface for creating and editing the CDR content is described.
    Type: Application
    Filed: May 2, 2019
    Publication date: August 22, 2019
    Inventors: Aljoscha Smolic, Alexandre Chapiro, Simone Croci, Tunc Ozan Aydin, Nikolce Stefanoski, Markus Gross
  • Patent number: 10349127
    Abstract: Novel systems and methods are described for creating, compressing, and distributing video or image content graded for a plurality of displays with different dynamic ranges. In implementations, the created content is “continuous dynamic range” (CDR) content—a novel representation of pixel-luminance as a function of display dynamic range. The creation of the CDR content includes grading a source content for a minimum dynamic range and a maximum dynamic range, and defining a luminance of each pixel of an image or video frame of the source content as a continuous function between the minimum and the maximum dynamic ranges. In additional implementations, a novel graphical user interface for creating and editing the CDR content is described.
    Type: Grant
    Filed: September 22, 2015
    Date of Patent: July 9, 2019
    Assignees: Disney Enterprises, Inc., Eidgenoessische Technische Hochschule Zurich (ETH Zurich)
    Inventors: Aljoscha Smolic, Alexandre Chapiro, Simone Croci, Tunc Ozan Aydin, Nikolce Stefanoski, Markus Gross
  • Patent number: 10248864
    Abstract: There is provided a method that includes receiving a video having video shots, and creating video shot groups based on similarities between the video shots, where each video shot group of the video shot groups includes one or more of the video shots and has different ones of the video shots than other video shot groups. The method further includes creating at least one video supergroup including at least one video shot group of the video shot groups based on interactions among the one or more of the video shots in each of the video shot groups, and divide the at least one video supergroup into connected video supergroups, each connected video supergroup of the connected video supergroups including one or more of the video shot groups based on the interactions among the one or more of video shots in each of the video shot groups.
    Type: Grant
    Filed: December 3, 2015
    Date of Patent: April 2, 2019
    Assignee: Disney Enterprises, Inc.
    Inventors: Miquel Angel Farre Guiu, Pablo Beltran Sanchidrian, Aljoscha Smolic
  • Patent number: 10157318
    Abstract: A storyboard interface displaying key frames of a video may be presented to a user. Individual key frames may represent individual shots of the video. Shots may be grouped based on similarity. Key frames may be displayed in a chronological order of the corresponding shots. Key frames of grouped shots may be spatially correlated within the storyboard interface. For example, shots of a common group may be spatially correlated so that they may be easily discernable as a group even though the shots may not be temporally consecutive and/or or even temporally close to each other in the timeframe of the video itself.
    Type: Grant
    Filed: December 12, 2016
    Date of Patent: December 18, 2018
    Assignees: Disney Enterprises, Inc., ETH Zurich
    Inventors: Aljoscha Smolic, Marc Junyent Martin, Jordi Pont-Tusert, Alexandre Chapiro, Miquel Angel Farre Guiu
  • Publication number: 20180322636
    Abstract: There is provided a system including a memory and a processor configured to obtain a first frame of a video content including an object and a first region based on a segmentation hierarchy of the first frame, insert a synthetic object into the first frame, merge an object segmentation hierarchy of the synthetic object with the segmentation hierarchy of the first frame to create a merged segmentation hierarchy, select a second region based on the merged segmentation hierarchy, provide the first frame including the first region and the second region to a crowd user for creating a corrected frame, receive the corrected frame from the crowd user including a first corrected region including the object and a second corrected region including the synthetic object, determine a quality based on the synthetic object and the second corrected region, and accept the first corrected region based on the quality.
    Type: Application
    Filed: July 10, 2018
    Publication date: November 8, 2018
    Inventors: Miquel Angel Farre Guiu, Marc Junyent Martin, Aljoscha Smolic
  • Patent number: 10102630
    Abstract: A system is provided for tagging an object in a video having a plurality of frames. The system includes a memory storing a segmentation hierarchy of a first frame of the plurality of frames and having a plurality of elements, a display, and a processor configured to display the first frame including the plurality of elements on the display, receive a first input selecting a first element of the plurality of elements displayed on the display, select a first region of the first frame based on the first input, display the first region of the first frame on the display, receive a second input from the user altering the first region of the first frame displayed on the display, and alter the first region by selecting a second region of the first frame based on the second input from the user and the segmentation hierarchy.
    Type: Grant
    Filed: April 21, 2015
    Date of Patent: October 16, 2018
    Assignee: Disney Enterprises, Inc.
    Inventors: Aljoscha Smolic, Jordi Pont-Tuset, Miquel Angel Farre Guiu
  • Publication number: 20180276286
    Abstract: There is provided a system including a computing platform having a hardware processor and a memory, and a metadata extraction and management unit stored in the memory. The hardware processor is configured to execute the metadata extraction and management unit to extract a plurality of metadata types from a media asset sequentially and in accordance with a prioritized order of extraction based on metadata type, aggregate the plurality of metadata types to produce an aggregated metadata describing the media asset, use the aggregated metadata to include at least one database entry in a graphical database, wherein the at least one database entry describes the media asset, display a user interface for a user to view tags of metadata associated with the media asset, and correcting presence of one of the tags of metadata associated with the media asset, in response to an input from the user via the user interface.
    Type: Application
    Filed: May 21, 2018
    Publication date: September 27, 2018
    Inventors: Miquel Angel Farre Guiu, Marc Junyent Martin, Jordi Pont-Tuset, Pablo Beltran Sanchidrian, Nimesh Narayan, Leonid Sigal, Aljoscha Smolic, Anthony M. Accardo
  • Patent number: 10068616
    Abstract: According to one implementation, a video processing system for performing thumbnail generation includes a computing platform having a hardware processor and a system memory storing a thumbnail generator software code. The hardware processor executes the thumbnail generator software code to receive a video file, and identify a plurality of shots in the video file, each of the plurality of shots including a plurality of frames of the video file. For each of the plurality of shots, the hardware processor further executes the thumbnail generator software code to filter the plurality of frames to obtain a plurality of key frame candidates, determine a ranking of the plurality of key frame candidates based in part on a blur detection analysis and an image distribution analysis of each of the plurality of key frame candidates, and generate a thumbnail based on the ranking.
    Type: Grant
    Filed: January 11, 2017
    Date of Patent: September 4, 2018
    Assignee: Disney Enterprises, Inc.
    Inventors: Miquel Angel Farre Guiu, Aljoscha Smolic, Marc Junyent Martin, Asier Aduriz, Tunc Ozan Aydin, Christopher A. Eich
  • Patent number: 10037605
    Abstract: There is provided a system including a memory and a processor configured to obtain a first frame of a video content including an object and a first region based on a segmentation hierarchy of the first frame, insert a synthetic object into the first frame, merge an object segmentation hierarchy of the synthetic object with the segmentation hierarchy of the first frame to create a merged segmentation hierarchy, select a second region based on the merged segmentation hierarchy, provide the first frame including the first region and the second region to a crowd user for creating a corrected frame, receive the corrected frame from the crowd user including a first corrected region including the object and a second corrected region including the synthetic object, determine a quality based on the synthetic object and the second corrected region, and accept the first corrected region based on the quality.
    Type: Grant
    Filed: August 23, 2016
    Date of Patent: July 31, 2018
    Assignee: Disney Enterprises, Inc.
    Inventors: Miquel Angel Farre Guiu, Marc Junyent Martin, Aljoscha Smolic
  • Publication number: 20180197577
    Abstract: According to one implementation, a video processing system for performing thumbnail generation includes a computing platform having a hardware processor and a system memory storing a thumbnail generator software code. The hardware processor executes the thumbnail generator software code to receive a video file, and identify a plurality of shots in the video file, each of the plurality of shots including a plurality of frames of the video file.
    Type: Application
    Filed: January 11, 2017
    Publication date: July 12, 2018
    Inventors: Miquel Angel Farre Guiu, Aljoscha Smolic, Marc Junyent Martin, Asier Aduriz, Tunc Ozan Aydin, Christopher A. Eich
  • Patent number: 10007713
    Abstract: There are provided systems and methods for performing metadata extraction and management. Such a system includes a computing platform having a hardware processor, a system memory, and metadata extraction and management unit stored in the system memory. The system is configured to extract multiple metadata types from a media asset, and to aggregate the multiple metadata types to produce an aggregated metadata describing the media asset. The system is further configured to transform the aggregated metadata into at least one database entry identifying the media asset, and to map the at least one database entry into a graphical database so as to relate the media asset to at least one other media asset represented in the graphical database.
    Type: Grant
    Filed: October 15, 2015
    Date of Patent: June 26, 2018
    Assignee: Disney Enterprises, Inc.
    Inventors: Miguel Angel Farre Guiu, Marc Junyent Martin, Jordi Pont-Tuset, Pablo Beltran, Nimesh Narayan, Leonid Sigal, Aljoscha Smolic, Anthony M. Accardo
  • Publication number: 20180158184
    Abstract: Systems and methods are disclosed for weighting the image quality prediction of any visual-attention-agnostic quality metric with a saliency map. By accounting for the salient regions of an image or video frame, the disclosed systems and methods may dramatically improve the precision of the visual-attention-agnostic quality metric during image or video quality assessment. In one implementation, a method of saliency-weighted video quality assessment includes: determining a per-pixel image quality vector of an encoded video frame; determining per-pixel saliency values of the encoded video frame or a reference video frame corresponding to the encoded video frame; and computing a saliency-weighted image quality metric of the encoded video frame by weighting the per-pixel image quality vector using the per-pixel saliency values.
    Type: Application
    Filed: February 1, 2018
    Publication date: June 7, 2018
    Applicant: Disney Enterprises, Inc.
    Inventors: Tunc Ozan Aydin, Nikolce Stefanoski, Aljoscha Smolic, Mark Arana
  • Patent number: 9986278
    Abstract: There is provided a server for providing an interactive broadcast. The server includes a memory configured to store a story manager including an event controller, a story controller and an action processor, and a hardware processor configured to execute the story manager. The story manager is configured to provide, using the event controller, an event based on a control script, story elements metadata, one or more user performance analyses, and one or more user preferences. The story manager is also configured to generate, using the story controller, an action command based on the event received from the event controller and a story state. The story manager is further configured to determine, using the action processor, an action corresponding to the action command for initiating one or more control processes for distributing the interactive broadcast.
    Type: Grant
    Filed: August 18, 2015
    Date of Patent: May 29, 2018
    Assignee: Disney Enterprises, Inc.
    Inventors: Nikolce Stefanoski, Aljoscha Smolic
  • Patent number: 9922411
    Abstract: Systems and methods are disclosed for weighting the image quality prediction of any visual-attention-agnostic quality metric with a saliency map. By accounting for the salient regions of an image or video frame, the disclosed systems and methods may dramatically improve the precision of the visual-attention-agnostic quality metric during image or video quality assessment. In one implementation, a method of saliency-weighted video quality assessment includes: determining a per-pixel image quality vector of an encoded video frame; determining per-pixel saliency values of the encoded video frame or a reference video frame corresponding to the encoded video frame; and computing a saliency-weighted image quality metric of the encoded video frame by weighting the per-pixel image quality vector using the per-pixel saliency values.
    Type: Grant
    Filed: November 30, 2015
    Date of Patent: March 20, 2018
    Assignee: Disney Enterprises, Inc.
    Inventors: Tunc Ozan Aydin, Nikolce Stefanoski, Aljoscha Smolic, Mark Arana
  • Publication number: 20180061057
    Abstract: There is provided a system including a memory and a processor configured to obtain a first frame of a video content including an object and a first region based on a segmentation hierarchy of the first frame, insert a synthetic object into the first frame, merge an object segmentation hierarchy of the synthetic object with the segmentation hierarchy of the first frame to create a merged segmentation hierarchy, select a second region based on the merged segmentation hierarchy, provide the first frame including the first region and the second region to a crowd user for creating a corrected frame, receive the corrected frame from the crowd user including a first corrected region including the object and a second corrected region including the synthetic object, determine a quality based on the synthetic object and the second corrected region, and accept the first corrected region based on the quality.
    Type: Application
    Filed: August 23, 2016
    Publication date: March 1, 2018
    Inventors: MIQUEL ANGEL FARRE GUIU, MARC JUNYENT MARTIN, ALJOSCHA SMOLIC