Patents by Inventor Aljoscha Smolic
Aljoscha Smolic has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20170154415Abstract: Systems and methods are disclosed for weighting the image quality prediction of any visual-attention-agnostic quality metric with a saliency map. By accounting for the salient regions of an image or video frame, the disclosed systems and methods may dramatically improve the precision of the visual-attention-agnostic quality metric during image or video quality assessment. In one implementation, a method of saliency-weighted video quality assessment includes: determining a per-pixel image quality vector of an encoded video frame; determining per-pixel saliency values of the encoded video frame or a reference video frame corresponding to the encoded video frame; and computing a saliency-weighted image quality metric of the encoded video frame by weighting the per-pixel image quality vector using the per-pixel saliency values.Type: ApplicationFiled: November 30, 2015Publication date: June 1, 2017Applicant: Disney Enterprises, Inc.Inventors: TUNC OZAN AYDIN, NIKOLCE STEFANOSKI, ALJOSCHA SMOLIC, MARK ARANA
-
Publication number: 20170109419Abstract: There are provided systems and methods for performing metadata extraction and management. Such a system includes a computing platform having a hardware processor, a system memory, and metadata extraction and management unit stored in the system memory. The system is configured to extract multiple metadata types from a media asset, and to aggregate the multiple metadata types to produce an aggregated metadata describing the media asset. The system is further configured to transform the aggregated metadata into at least one database entry identifying the media asset, and to map the at least one database entry into a graphical database so as to relate the media asset to at least one other media asset represented in the graphical database.Type: ApplicationFiled: October 15, 2015Publication date: April 20, 2017Inventors: Miquel Angel Farre Guiu, Marc Junyent Martin, Jordi Pont-Tuset, Pablo Beltran, Nimesh Narayan, Leonid Sigal, Aljoscha Smolic, Anthony M. Accardo
-
SYSTEMS AND METHODS FOR AUTOMATIC KEY FRAME EXTRACTION AND STORYBOARD INTERFACE GENERATION FOR VIDEO
Publication number: 20170091558Abstract: A storyboard interface displaying key frames of a video may be presented to a user. Individual key frames may represent individual shots of the video. Shots may be grouped based on similarity. Key frames may be displayed in a chronological order of the corresponding shots. Key frames of grouped shots may be spatially correlated within the storyboard interface. For example, shots of a common group may be spatially correlated so that they may be easily discernable as a group even though the shots may not be temporally consecutive and/or or even temporally close to each other in the timeframe of the video itself.Type: ApplicationFiled: December 12, 2016Publication date: March 30, 2017Inventors: Aljoscha Smolic, Marc Junyent Martin, Jordi Pont-Tusert, Alexandre Chapiro, Miquel Angel Farre Guiu -
Publication number: 20170076153Abstract: There is provided a method that includes receiving a video having video shots, and creating video shot groups based on similarities between the video shots, where each video shot group of the video shot groups includes one or more of the video shots and has different ones of the video shots than other video shot groups. The method further includes creating at least one video supergroup including at least one video shot group of the video shot groups based on interactions among the one or more of the video shots in each of the video shot groups, and divide the at least one video supergroup into connected video supergroups, each connected video supergroup of the connected video supergroups including one or more of the video shot groups based on the interactions among the one or more of video shots in each of the video shot groups.Type: ApplicationFiled: December 3, 2015Publication date: March 16, 2017Inventors: Miquel Angel Farre Guiu, Pablo Beltran Sanchidrian, Aljoscha Smolic
-
Systems and methods for automatic key frame extraction and storyboard interface generation for video
Patent number: 9552520Abstract: A storyboard interface displaying key frames of a video may be presented to a user. Individual key frames may represent individual shots of the video. Shots may be grouped based on similarity. Key frames may be displayed in a chronological order of the corresponding shots. Key frames of grouped shots may be spatially correlated within the storyboard interface. For example, shots of a common group may be spatially correlated so that they may be easily discernable as a group even though the shots may not be temporally consecutive and/or or even temporally close to each other in the timeframe of the video itself.Type: GrantFiled: July 7, 2015Date of Patent: January 24, 2017Assignees: Disney Enterprises, Inc., ETH ZurichInventors: Aljoscha Smolic, Marc Junyent Martin, Jordi Pont-Tuset, Alexandre Chapiro, Miquel Angel Farre Guiu -
SYSTEMS AND METHODS FOR AUTOMATIC KEY FRAME EXTRACTION AND STORYBOARD INTERFACE GENERATION FOR VIDEO
Publication number: 20170011264Abstract: A storyboard interface displaying key frames of a video may be presented to a user. Individual key frames may represent individual shots of the video. Shots may be grouped based on similarity. Key frames may be displayed in a chronological order of the corresponding shots. Key frames of grouped shots may be spatially correlated within the storyboard interface. For example, shots of a common group may be spatially correlated so that they may be easily discernable as a group even though the shots may not be temporally consecutive and/or or even temporally close to each other in the timeframe of the video itself.Type: ApplicationFiled: July 7, 2015Publication date: January 12, 2017Inventors: Aljoscha SMOLIC, Marc Junyent MARTIN, Jordi PONT-TUSET, Alexandre CHAPIRO, Miquel Angel FARRE GUIU -
Patent number: 9530240Abstract: A method including receiving a first image of a scene captured from a first perspective, the first image including an object and a background; segmenting the first image to extract a first two-dimensional contour of the object; approximating a plurality of three-dimensional locations of a plurality of points on the first contour; generating a three-dimensional billboard of the object based on the three-dimensional locations; and projecting the first image onto the three-dimensional billboard.Type: GrantFiled: September 10, 2013Date of Patent: December 27, 2016Assignee: DISNEY ENTERPRISES, INC.Inventors: Oliver Wang, Simone Croci, Marvin White, Aljoscha Smolic, Michael Gay
-
Publication number: 20160373795Abstract: There is provided a server for providing an interactive broadcast. The server includes a memory configured to store a story manager including an event controller, a story controller and an action processor, and a hardware processor configured to execute the story manager. The story manager is configured to provide, using the event controller, an event based on a control script, story elements metadata, one or more user performance analyses, and one or more user preferences. The story manager is also configured to generate, using the story controller, an action command based on the event received from the event controller and a story state. The story manager is further configured to determine, using the action processor, an action corresponding to the action command for initiating one or more control processes for distributing the interactive broadcast.Type: ApplicationFiled: August 18, 2015Publication date: December 22, 2016Inventors: Nikolce Stefanoski, Aljoscha Smolic
-
Publication number: 20160353164Abstract: Novel systems and methods are described for creating, compressing, and distributing video or image content graded for a plurality of displays with different dynamic ranges. In implementations, the created content is “continuous dynamic range” (CDR) content—a novel representation of pixel-luminance as a function of display dynamic range. The creation of the CDR content includes grading a source content for a minimum dynamic range and a maximum dynamic range, and defining a luminance of each pixel of an image or video frame of the source content as a continuous function between the minimum and the maximum dynamic ranges. In additional implementations, a novel graphical user interface for creating and editing the CDR content is described.Type: ApplicationFiled: September 22, 2015Publication date: December 1, 2016Inventors: ALJOSCHA SMOLIC, ALEXANDRE CHAPIRO, SIMONE CROCI, TUNC OZAN AYDIN, NIKOLCE STEFANOSKI, MARKUS GROSS
-
Publication number: 20160313894Abstract: A system is provided for tagging an object in a video having a plurality of frames. The system includes a memory storing a segmentation hierarchy of a first frame of the plurality of frames and having a plurality of elements, a display, and a processor configured to display the first frame including the plurality of elements on the display, receive a first input selecting a first element of the plurality of elements displayed on the display, select a first region of the first frame based on the first input, display the first region of the first frame on the display, receive a second input from the user altering the first region of the first frame displayed on the display, and alter the first region by selecting a second region of the first frame based on the second input from the user and the segmentation hierarchy.Type: ApplicationFiled: April 21, 2015Publication date: October 27, 2016Inventors: Aljoscha Smolic, Jordi Pont-Tuset, Miquel Angel Farre Guiu
-
Patent number: 9445072Abstract: Techniques are disclosed for generating autostereoscopic video content. A multiscopic video frame is received that includes a first image and a second image. The first and second images are analyzed to determine a set of image characteristics. A mapping function is determined based on the set of image characteristics. At least a third image is generated based on the mapping function and added to the multiscopic video frame.Type: GrantFiled: August 31, 2012Date of Patent: September 13, 2016Assignees: DISNEY ENTERPRISES, INC., ETH ZURICH (EIDGENOESSISCHE TECHNISCHE HOCHSCHULE ZURICH)Inventors: Nikolce Stefanoski, Aljoscha Smolic, Manuel Lang, Miquel À Farré, Alexander Hornung, Pedro Christian Espinosa Fricke, Oliver Wang
-
Publication number: 20160261927Abstract: A device and method for receiving video content, generating at least two overlays for the video content, generating an information message containing information enabling a receiver of the video content and of the at least two overlays to selectively display or hide the generated overlays, and transmitting, using a multi-stream transmission including a primary stream and auxiliary streams, the information message, the video content in the primary stream and the at least two overlays in the auxiliary streams.Type: ApplicationFiled: March 31, 2016Publication date: September 8, 2016Inventors: Aljoscha SMOLIC, Nikolce STEFANOSKI, Oliver WANG
-
Patent number: 9363427Abstract: A device and method incorporates features of a temporal contrast sensor with a camera sensor in an imager. The method includes registering the camera sensor with the temporal contrast sensor as a function of a calibration target. The method includes receiving camera sensor data from the camera sensor and temporal contrast sensor data from the temporal contrast sensor. The method includes generating a plurality of images as a function of incorporating the temporal contrast sensor data with the camera sensor data.Type: GrantFiled: August 28, 2013Date of Patent: June 7, 2016Assignee: DISNEY ENTERPRISES, INC.Inventors: Simon Heinzle, Aljoscha Smolic, Oliver Wang, Alexander Sorkine Hornung, Yael Pritch, Henning Zimmer
-
Publication number: 20160057488Abstract: A method including receiving video of an event; generating an overlay for the video; generating an information message containing information enabling a receiver of the video and the overlay to selectively display or hide the overlay; and transmitting the video, the overlay, and the information message. The video is transmitted in a primary stream of a multi-stream transmission including a primary stream and one or more auxiliary streams. The overlay is transmitted in a first one of the auxiliary streams.Type: ApplicationFiled: November 3, 2015Publication date: February 25, 2016Inventors: Aljoscha SMOLIC, NikoIce STEFANOSKl, Oliver WANG
-
Patent number: 9237331Abstract: A closed-loop control system for stereoscopic video capture is provided. At least two motorized lenses are positioned in accordance with specified parameters to capture spatially-disparate images of a scene. The motorized lenses focus light on a corresponding one of the at least two sensors, which generate image streams. One or more processors execute instructions to provide a stream analyzer and a control module. The stream analyzer receives the image streams from the sensors and analyzes the image streams and the specified parameters in real time; the stream analyzer then modifies the image streams and generates metadata. The control module then receives and analyzes the image streams and metadata and transmits updated parameters to a control mechanism that is coupled to the at least two motorized lenses. The control mechanism then modifies operation of the at least two motorized lenses in real time in accordance with the updated parameters.Type: GrantFiled: April 8, 2011Date of Patent: January 12, 2016Assignees: DISNEY ENTERPRISES, INC., ETH ZÜRICH (EIDGENÖESSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)Inventors: Simon Heinzle, Pierre Greisen, Aljoscha Smolic, Wojciech Matusik, Markus Gross
-
Publication number: 20150135212Abstract: A method including receiving video of an event; generating an overlay for the video; generating an information message containing information enabling a receiver of the video and the overlay to selectively display or hide the overlay; and transmitting the video, the overlay, and the information message. The video is transmitted in a primary stream of a multi-stream transmission including a primary stream and one or more auxiliary streams. The overlay is transmitted in a first one of the auxiliary streams.Type: ApplicationFiled: October 9, 2014Publication date: May 14, 2015Inventors: Aljoscha Smolic, Nikolce Stefanoski, Oliver Wang
-
Publication number: 20150070346Abstract: A method including receiving a first image of a scene captured from a first perspective, the first image including an object and a background; segmenting the first image to extract a first two-dimensional contour of the object; approximating a plurality of three-dimensional locations of a plurality of points on the first contour; generating a three-dimensional billboard of the object based on the three-dimensional locations; and projecting the first image onto the three-dimensional billboard.Type: ApplicationFiled: September 10, 2013Publication date: March 12, 2015Applicant: Disney Enterprises, Inc.Inventors: Oliver WANG, Simone CROCI, Marvin WHITE, Aljoscha SMOLIC, Michael GAY
-
Publication number: 20150062351Abstract: A device and method incorporates features of a temporal contrast sensor with a camera sensor in an imager. The method includes registering the camera sensor with the temporal contrast sensor as a function of a calibration target. The method includes receiving camera sensor data from the camera sensor and temporal contrast sensor data from the temporal contrast sensor. The method includes generating a plurality of images as a function of incorporating the temporal contrast sensor data with the camera sensor data.Type: ApplicationFiled: August 28, 2013Publication date: March 5, 2015Applicant: Disney Enterprises Inc.Inventors: Simon HEINZLE, Aljoscha SMOLIC, Oliver WANG, Alexander Sorkine HORNUNG, Yael PRITCH, Henning ZIMMER
-
Publication number: 20130057644Abstract: Techniques are disclosed for generating autostereoscopic video content. A multiscopic video frame is received that includes a first image and a second image. The first and second images are analyzed to determine a set of image characteristics. A mapping function is determined based on the set of image characteristics. At least a third image is generated based on the mapping function and added to the multiscopic video frame.Type: ApplicationFiled: August 31, 2012Publication date: March 7, 2013Applicant: DISNEY ENTERPRISES, INC.Inventors: Nikolce Stefanoski, Aljoscha Smolic, Manuel Lang, Miquel À. Farré, Alexander Hornung, Pedro Christian Espinosa Fricke, Oliver Wang
-
Publication number: 20120182397Abstract: A closed-loop control system for stereoscopic video capture is provided. At least two motorized lenses are positioned in accordance with specified parameters to capture spatially-disparate images of a scene. The motorized lenses focus light on a corresponding one of the at least two sensors, which generate image streams. One or more processors execute instructions to provide a stream analyzer and a control module. The stream analyzer receives the image streams from the sensors and analyzes the image streams and the specified parameters in real time; the stream analyzer then modifies the image streams and generates metadata. The control module then receives and analyzes the image streams and metadata and transmits updated parameters to a control mechanism that is coupled to the at least two motorized lenses. The control mechanism then modifies operation of the at least two motorized lenses in real time in accordance with the updated parameters.Type: ApplicationFiled: April 8, 2011Publication date: July 19, 2012Applicant: Disney Enterprises, Inc.Inventors: Simon Heinzle, Pierre Greisen, Aljoscha Smolic, Wojciech Matusik, Markus Gross