Patents by Inventor Muhammad Raffay Hamid
Muhammad Raffay Hamid has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12211222Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.Type: GrantFiled: October 4, 2023Date of Patent: January 28, 2025Assignee: Amazon Technologies, Inc.Inventors: Xiaohan Nie, Muhammad Raffay Hamid
-
Publication number: 20240346686Abstract: Systems, devices, and methods are provided for depth-guided structure from motion. A system may obtain a plurality of image frames from a digital content item that corresponds to a scene and determine, based at least in part on a correspondence search, a set of 2-D keypoints for the plurality of image frames. A depth estimator may be used to determine a plurality of dense depth map for the plurality of image frames. The set of 2-D keypoints and the plurality of dense depth maps may be used to determine a corresponding set of depth priors. Initialization and/or depth-regularized optimization may be performed using the keypoints and depth priors.Type: ApplicationFiled: June 20, 2024Publication date: October 17, 2024Applicant: Amazon Technologies, Inc.Inventors: Xiaohan Nie, Michael Thomas Pecchia, Leo Chan, Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Sheng Liu
-
Patent number: 12073625Abstract: Systems and methods are provided herein for generating optimized video segments. A derivative video segment (e.g., a scene) can be identified from derivative video content (e.g., a movie trailer). The segment may be used a query to search video content (e.g., the movie) for the segment. Once found, an optimized video segment may be generated from the video content. The optimized video segment may have a different start time and/or end time than those corresponding to the original segment. Once optimized, the video segment may be presented to a user or stored for subsequent content recommendations.Type: GrantFiled: July 18, 2023Date of Patent: August 27, 2024Assignee: Amazon Technologies, Inc.Inventors: Najmeh Sadoughi Nourabadi, Kewen Chen, Tu Anh Ho, Christina Botkins, Dongqing Zhang, Muhammad Raffay Hamid
-
Patent number: 12067779Abstract: A plurality of similar video pairs may be determined based on one or more similarity information types. Each video pair of the plurality of similar video pairs may include a first respective video and a second respective video. For each video pair, one or more similar scene pairs may be determined. Each of the one or more similar scene pairs may include a respective first scene from the first respective video and a second respective scene from the second respective video. An encoder may be trained using a contrastive learning model that contrasts a plurality of similar scene pairs with a plurality of random scenes. The plurality of similar scene pairs may include the one or more scene pairs for each video pair. One or more scene features of one or more other scenes of one or more other videos may be determined using the encoder.Type: GrantFiled: February 9, 2022Date of Patent: August 20, 2024Assignee: Amazon Technologies, Inc.Inventors: Shixing Chen, Xiang Hao, Xiaohan Nie, Muhammad Raffay Hamid
-
Patent number: 12056949Abstract: Techniques are disclosed for detecting an uncovered portion of a body of a person in a frame of video content. In an example, a first machine learning model of a computing system may output a first score for the frame based on a map that identifies a region of the frame associated with an uncovered body part type. Depending on a value of the first score, a second machine learning model that includes a neural network architecture may further analyze the frame to output a second score. The first score and second score may be merged to produce a third score for the frame. A plurality of scores may be determined, respectively, for frames of the video content, and a maximum score may be selected. The video content may be selected for presentation on a display for further evaluation based on the maximum score.Type: GrantFiled: March 29, 2021Date of Patent: August 6, 2024Assignee: Amazon Technologies, Inc.Inventors: Xiaohang Sun, Mohamed Kamal Omar, Alexander Ratnikov, Ahmed Aly Saad Ahmed, Tai-Ching Li, Travis Silvers, Hanxiao Deng, Muhammad Raffay Hamid, Ivan Ryndin
-
Patent number: 12046002Abstract: Systems, devices, and methods are provided for depth guided structure from motion. A system may obtain a plurality of image frames from a digital content item that corresponds to a scene and determine, based at least in part on a correspondence search, a set of 2-D keypoints for the plurality of image frames. A depth estimator may be used to determine a plurality of dense depth map for the plurality of image frames. The set of 2-D keypoints and the plurality of dense depth maps may be used to determine a corresponding set of depth priors. Initialization and/or depth-regularized optimization may be performed using the keypoints and depth priors.Type: GrantFiled: March 1, 2022Date of Patent: July 23, 2024Assignee: Amazon Technologies, Inc.Inventors: Xiaohan Nie, Michael Thomas Pecchia, Leo Chan, Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Sheng Liu
-
Patent number: 11928880Abstract: Techniques are disclosed for detecting an uncovered portion of a body of a person in a frame of video content. In an example, a first machine learning model of a computing system may output a first score for the frame based on a map that identifies a region of the frame associated with an uncovered body part type. Depending on a value of the first score, a second machine learning model that includes a neural network architecture may further analyze the frame to output a second score. The first score and second score may be merged to produce a third score for the frame. A plurality of scores may be determined, respectively, for frames of the video content, and a maximum score may be selected. The video content may be selected for presentation on a display for further evaluation based on the maximum score.Type: GrantFiled: March 29, 2021Date of Patent: March 12, 2024Assignee: Amazon Technologies, Inc.Inventors: Xiaohang Sun, Mohamed Kamal Omar, Alexander Ratnikov, Ahmed Aly Saad Ahmed, Tai-Ching Li, Travis Silvers, Hanxiao Deng, Muhammad Raffay Hamid, Ivan Ryndin
-
Patent number: 11900700Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift, and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.Type: GrantFiled: February 27, 2023Date of Patent: February 13, 2024Assignee: Amazon Technologies, Inc.Inventors: Tamojit Chatterjee, Mayank Sharma, Muhammad Raffay Hamid, Sandeep Joshi
-
Publication number: 20240029278Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.Type: ApplicationFiled: October 4, 2023Publication date: January 25, 2024Inventors: Xiaohan Nie, Muhammad Raffay Hamid
-
Patent number: 11869537Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for language agnostic automated voice activity detection. Example methods may include determining an audio file associated with video content, generating audio segments using the audio file, the audio segments including a first segment and a second segment, and determining that the first segment includes first voice activity. Methods may include determining that the second segment comprises second voice activity, determining that voice activity is present between a first timestamp associated with the first segment and a second timestamp associated with the second segment, and generating text data representing the voice activity that is present between the first timestamp and the second timestamp.Type: GrantFiled: November 10, 2021Date of Patent: January 9, 2024Assignee: Amazon Technologies, Inc.Inventors: Mayank Sharma, Sandeep Joshi, Muhammad Raffay Hamid
-
Patent number: 11829413Abstract: Techniques for temporal localization of mature content in long-form videos using only video-level labels are described.Type: GrantFiled: September 23, 2020Date of Patent: November 28, 2023Assignee: Amazon Technologies, Inc.Inventors: Xiang Hao, Jingxiang Chen, Vernon Germano, Muhammad Raffay Hamid, Lakshay Sharma
-
Patent number: 11816849Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.Type: GrantFiled: September 30, 2022Date of Patent: November 14, 2023Assignee: Amazon Technologies, Inc.Inventors: Xiaohan Nie, Muhammad Raffay Hamid
-
Patent number: 11790695Abstract: Devices, systems, and methods are provided for enhanced video annotations using image analysis. A method may include identifying, by a first device, first faces of first video frames, and second faces of second video frames. The method may include determining a first score for the first video frames, the first score indicative of a first number of faces to label, the first number of faces represented by the first video frames, and determining a second score for the second video frames, the second score indicative of a second number of faces to label. The method may include selecting the first video frames for face labeling, and receiving a first face label for the first face. The method may include generating a second face label for the second faces. The method may include sending the first face label and the second face label to a second device for presentation.Type: GrantFiled: May 17, 2021Date of Patent: October 17, 2023Assignee: Amazon Technologies, Inc.Inventors: Abhinav Aggarwal, Yash Pandya, Laxmi Shivaji Ahire, Lokesh Amarnath Ravindranathan, Manivel Sethu, Muhammad Raffay Hamid
-
Patent number: 11776273Abstract: Techniques for automatic scene change detection are described. As one example, a computer-implemented method includes receiving a request to train an ensemble of machine learning models on a training dataset of videos having labels that indicate scene changes to detect a scene change in a video, partitioning each video file of the training dataset of videos into a plurality of shots, training the ensemble of machine learning models into a trained ensemble of machine learning models based at least in part on the plurality of shots of the training dataset of videos and the labels that indicate scene changes, receiving an inference request for an input video, partitioning the input video into a plurality of shots, generating, by the trained ensemble of machine learning models, an inference of one or more scene changes in the input video based at least in part on the plurality of shots of the input video, and transmitting the inference to a client application or to a storage location.Type: GrantFiled: November 30, 2020Date of Patent: October 3, 2023Assignee: Amazon Technologies, Inc.Inventors: Shixing Chen, Muhammad Raffay Hamid, Vimal Bhat, Shiva Krishnamurthy
-
Patent number: 11763564Abstract: Systems and methods are provided herein for generating optimized video segments. A derivative video segment (e.g., a scene) can be identified from derivative video content (e.g., a movie trailer). The segment may be used a query to search video content (e.g., the movie) for the segment. Once found, an optimized video segment may be generated from the video content. The optimized video segment may have a different start time and/or end time than those corresponding to the original segment. Once optimized, the video segment may be presented to a user or stored for subsequent content recommendations.Type: GrantFiled: March 29, 2021Date of Patent: September 19, 2023Assignee: Amazon Technologies, Inc.Inventors: Najmeh Sadoughi Nourabadi, Kewen Chen, Tu Anh Ho, Christina Botkins, Dongqing Zhang, Muhammad Raffay Hamid
-
Publication number: 20230282006Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift, and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.Type: ApplicationFiled: February 27, 2023Publication date: September 7, 2023Applicant: Amazon Technologies, Inc.Inventors: Tamojit Chatterjee, Mayank Sharma, Muhammad Raffay Hamid, Sandeep Joshi
-
Patent number: 11748988Abstract: Techniques for automatic scene change detection in a video are described. As one example, a computer-implemented method includes extracting features of a query shot and its neighboring shots of a first set of shots without labels with a query model, determining a key shot of the neighboring shots which is most similar to the query shot based at least in part on the features of the query shot and its neighboring shots, extracting features of the key shot with a key model, training the query model into a trained query model based at least in part on a comparison of the features of the query shot and the features of the key shot, extracting features of a second set of shots with labels with the trained query model, and training a temporal model into a trained temporal model based at least in part on the features extracted from the second set of shots and the labels of the second set of shots.Type: GrantFiled: April 21, 2021Date of Patent: September 5, 2023Assignee: Amazon Technologies, Inc.Inventors: Shixing Chen, Xiaohan Nie, David Jiatian Fan, Dongqing Zhang, Vimal Bhat, Muhammad Raffay Hamid
-
Patent number: 11734930Abstract: Methods and apparatus are described for generating compelling preview clips of media presentations. Compelling clips are identified based on the extent to which human faces are shown and/or the loudness of the audio associated with the clips. One or more of these compelling clips are then provided to a client device for playback.Type: GrantFiled: May 9, 2022Date of Patent: August 22, 2023Assignee: Amazon Technologies, Inc.Inventors: Kewen Chen, Tu Anh Ho, Muhammad Raffay Hamid, Shixing Chen
-
Patent number: 11657850Abstract: Techniques are described for automating virtual placements in video content.Type: GrantFiled: December 9, 2020Date of Patent: May 23, 2023Assignee: Amazon Technologies, Inc.Inventors: Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Yongjun Wu, Yash Chaturvedi, Steven James Cox, Travis Silvers, Amit S. Jain, Amjad Y. A. Abu Jbara, Prasanth Saraswatula
-
Patent number: 11625928Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.Type: GrantFiled: September 1, 2020Date of Patent: April 11, 2023Assignee: AMAZON TECHNOLOGIES, INC.Inventors: Tamojit Chatterjee, Mayank Sharma, Muhammad Raffay Hamid, Sandeep Joshi