Abstract: Embodiments of the present disclosure provide a method and apparatus for capturing video, an electronic device and a computer-readable storage medium. The method includes: receiving a video capture trigger operation from a user via a video playing interface for an original video; superimposing a video capture window on the video playing interface, in response to the video capture trigger operation; receiving a video capture operation from the user via the video playing interface: and capture a user video in response to the video capture operation, and displaying the user video via the video capture window. According to the embodiments of the present disclosure, a user only needs to perform operations related to capturing a user video on the video playing interface, thereby implementing a function of combining video, and the operation process is simple and fast. The user video can represent the user's feelings, comments, or viewing reactions to the original video.
Abstract: An information processing apparatus includes: a processor is configured to: read an image of at least one of a first surface or a second surface of a medium with the medium folded; discriminate whether the image is of the first surface or the second surface based on an identification image in the image; specify, based on the image, a surface on which a folding deviation occurs; and determine a correction direction of the folding deviation according to whether the surface on which the folding deviation occurs is the first surface or the second surface.
Abstract: A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.
Type:
Grant
Filed:
June 14, 2019
Date of Patent:
January 24, 2023
Assignee:
ROKU, INC.
Inventors:
Mihailo M. Stojancic, Sunil Suresh Kulkarni, Shashank Merchant, Jose Pio Pereira, Oleksiy Bolgarov
Abstract: A video player for a portable multifunction device is disclosed. In some embodiments, a list of video items is displayed in a portrait orientation of a touch screen display of a portable electronic device. Upon user selection of a respective video item in the list, the user selected video item is automatically displayed in a landscape orientation of the touch screen display.
Type:
Grant
Filed:
May 26, 2021
Date of Patent:
October 25, 2022
Assignee:
Apple Inc.
Inventors:
Freddy Allen Anzures, Gregory Christie, Scott Forstall, Charles J. Pisula