Abstract: Innovations in encoder-side decisions that use the results of hash-based block matching are presented. For example, some of the innovations relate to ways of building hash tables that include some (but not all) uniform blocks. Other innovations relate to ways of determining motion vector resolution based on results of hash-based block matching. Still other innovations relate to scene change detection, including long-term reference picture selection and picture quality determination during encoding.
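As an illustration of a hash table that includes some, but not all, uniform blocks, a minimal Python sketch follows. The abstract does not state which uniform blocks are kept, so the rule used here (keep a uniform block only when its top-left corner is aligned to the block size) is purely an assumption, as are the names `is_uniform` and `build_hash_table`, the use of CRC32 as the block hash, and the simplified definition of "uniform" as all pixels being equal.

```python
import zlib
from collections import defaultdict


def is_uniform(block):
    """Simplified assumption: a block is uniform if every pixel is equal."""
    first = block[0][0]
    return all(pixel == first for row in block for pixel in row)


def build_hash_table(frame, size):
    """Map a block hash to the (x, y) positions of candidate blocks.

    frame is a list of rows of 8-bit pixel values (0..255).
    Hypothetical filtering rule: uniform blocks are kept only when
    their top-left corner is aligned to a multiple of `size`.
    """
    table = defaultdict(list)
    height, width = len(frame), len(frame[0])
    for y in range(height - size + 1):
        for x in range(width - size + 1):
            block = [row[x:x + size] for row in frame[y:y + size]]
            if is_uniform(block) and (x % size != 0 or y % size != 0):
                continue  # skip unaligned uniform blocks
            key = zlib.crc32(bytes(p for row in block for p in row))
            table[key].append((x, y))
    return table
```

Filtering of this kind keeps a flat region from flooding a single hash bucket with near-identical candidates while still leaving a few entries that can be matched.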
Abstract: A mask wearing status alarming method, a mobile device, and a computer readable storage medium are provided. The method includes: performing face detection on an image to determine face areas, each including a target determined to be a face; determining a mask wearing status of the target in each face area; in response to determining the mask wearing status to be a not-masked-well status or an unmasked status, confirming the mask wearing status of the target in each face area using a trained face confirmation model to remove the face areas in which the target was mistakenly determined to be a face, and determining a face pose in each of the remaining face areas to remove the face areas whose face pose does not meet a preset condition; and releasing an alert corresponding to the mask wearing status of the target in each of the remaining face areas.
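The order of the filtering steps in this abstract can be sketched in Python as below. This is only an outline under stated assumptions: the detector, the mask classifier, the face confirmation model, and the pose check are passed in as hypothetical callables (`classify_mask`, `is_real_face`, `pose_ok`), and the status strings are invented labels, since the abstract names none of them.

```python
from dataclasses import dataclass

# Hypothetical status labels; the abstract only names the two alerting states.
ALERT_STATUSES = {"not_masked_well", "unmasked"}


@dataclass
class FaceArea:
    box: tuple            # (x, y, width, height) of the detected area
    status: str = "unknown"


def areas_to_alert(areas, classify_mask, is_real_face, pose_ok):
    """Return the face areas that should trigger an alert.

    classify_mask, is_real_face and pose_ok are assumed callables that
    stand in for the trained models described in the abstract.
    """
    remaining = []
    for area in areas:
        area.status = classify_mask(area)
        if area.status not in ALERT_STATUSES:
            continue  # properly masked: no confirmation needed
        if not is_real_face(area):
            continue  # false detection removed by the confirmation model
        if not pose_ok(area):
            continue  # pose does not meet the preset condition
        remaining.append(area)
    return remaining
```

Running the confirmation model and the pose check only on areas already flagged as not masked well or unmasked keeps the extra work off the common, properly masked case.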
Abstract: A file generation apparatus generates a file which includes material data used for generation of a virtual viewpoint image that is based on a multi-viewpoint image and type information for specifying a type of the material data, and outputs the generated file.
Abstract: A method to correct a digital image to reverse the effect of signal diffusion among pixels of the digital image. For a target pixel j of the digital image, a set of signal values and a set of signal amplitudes are received, each corresponding to a set of kernel pixels i surrounding and including the target pixel j. For each kernel pixel i, a weighting coefficient is computed based on the signal amplitude of that kernel pixel i and on the signal amplitude of the target pixel j. A linear combination of signal values corresponding to the set of kernel pixels i is computed, wherein the signal value for each pixel i is weighted by the weighting coefficient corresponding to that pixel i. The linear combination is stored in volatile memory of an electronic device as a corrected signal value for the target pixel j.
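The weighted linear combination described above can be written out as a short Python sketch. The abstract does not give the form of the weighting coefficient, so `example_weight` is a placeholder chosen only to make the example runnable; `correct_pixel` and its parameter names are likewise assumptions.

```python
def correct_pixel(values, amplitudes, target_index, weight_fn):
    """Corrected value for target pixel j as a weighted sum over kernel pixels i.

    values[i] and amplitudes[i] belong to kernel pixel i; the kernel
    includes the target pixel itself at position target_index.
    weight_fn(amp_i, amp_j) returns the weighting coefficient for pixel i.
    """
    amp_j = amplitudes[target_index]
    return sum(
        weight_fn(amp_i, amp_j) * value
        for value, amp_i in zip(values, amplitudes)
    )


def example_weight(amp_i, amp_j):
    """Placeholder weight, NOT the patented formula: share of amp_i in amp_i + amp_j."""
    total = amp_i + amp_j
    return amp_i / total if total > 0 else 0.0


# Three kernel pixels with equal amplitudes; the middle one is the target.
corrected = correct_pixel([1.0, 2.0, 3.0], [1.0, 1.0, 1.0], 1, example_weight)
```

Because each weight depends on both the kernel pixel's amplitude and the target pixel's amplitude, the set of coefficients has to be recomputed for every target pixel rather than taken from one fixed kernel.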
Abstract: A video management system (VMS) may search for one or more objects and/or events in one or more video streams, and may receive time-stamped metadata that may identify one or more objects and/or events occurring in the corresponding video stream as well as an identifier that uniquely identifies the corresponding video stream. A user may enter a search query into a video query engine, wherein the video query engine includes one or more cognitive models. The VMS may apply the search query to the time-stamped metadata via the video query engine to search for one or more objects and/or events in the one or more video streams that match the search query, and may return a search result to the user.
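A minimal Python sketch of applying a query to time-stamped metadata follows. It replaces the cognitive models of the video query engine with plain label matching, which is a deliberate simplification, and the record layout (`MetadataRecord` with `stream_id`, `timestamp`, `labels`) is an assumed structure rather than one taken from the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MetadataRecord:
    stream_id: str         # identifier that uniquely identifies the video stream
    timestamp: float       # seconds from the start of the stream
    labels: frozenset      # objects and/or events detected at that time


def search(records, wanted_label, start, end):
    """Return records in [start, end] whose labels contain wanted_label."""
    return [
        record
        for record in records
        if wanted_label in record.labels and start <= record.timestamp <= end
    ]


records = [
    MetadataRecord("cam-1", 12.0, frozenset({"person"})),
    MetadataRecord("cam-2", 30.5, frozenset({"car", "person"})),
    MetadataRecord("cam-1", 95.0, frozenset({"car"})),
]
hits = search(records, "car", 0.0, 60.0)
```

Each hit carries both the stream identifier and the timestamp, so a result can point the user to a specific moment in a specific stream.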
Abstract: Disclosed are systems and methods for portrait animation. An example method includes receiving, by a computing device, a scenario video, where the scenario video includes at least one input frame and the at least one input frame includes a first face, receiving, by the computing device, a target image, where the target image includes a second face, determining, by the computing device and based on the at least one input frame and the target image, two-dimensional (2D) deformations of the second face in the target image, where the 2D deformations, when applied to the second face, modify the second face to imitate at least a facial expression of the first face, and applying, by the computing device, the 2D deformations to the target image to obtain at least one output frame of an output video.
Type: Grant
Filed: May 24, 2022
Date of Patent: January 31, 2023
Assignee: Snap Inc.
Inventors: Eugene Krokhalev, Aleksandr Mashrabov, Pavel Savchenkov
Abstract: A method including loading a data file into a memory of a mobile computing device, displaying a portion of the data file on a display of the mobile computing device, detecting user motion data corresponding to a motion of the mobile computing device by a user, the motion being indicative of the user's desire to view a further portion of the data file, and displaying the further portion of the data file on the display as a function of the user motion data.
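Displaying a further portion "as a function of the user motion data" can be illustrated with a small Python sketch. The mapping used here (a tilt reading scaled by a gain and clamped to the bounds of the file) is an assumption made for illustration; the abstract does not specify the sensor, the units, or the function, and `next_offset` and `gain` are hypothetical names.

```python
def next_offset(offset, tilt, total_lines, visible_lines, gain=10.0):
    """Return the first line to display after a motion sample.

    offset: index of the first line currently shown
    tilt: assumed motion reading, positive to scroll forward
    gain: hypothetical scale factor from tilt to lines scrolled
    """
    max_offset = max(0, total_lines - visible_lines)
    proposed = offset + round(gain * tilt)
    # Clamp so the view never runs past either end of the file.
    return min(max_offset, max(0, proposed))
```

For a 100-line file with 20 visible lines, a tilt of 0.5 from the top moves the view to line 5, and any further motion stops at line 80, the last position that still fills the display.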