Patents by Inventor Muhammad Raffay Hamid

Muhammad Raffay Hamid has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for video-based sports field registration

Patent number: 12211222

Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.

Type: Grant

Filed: October 4, 2023

Date of Patent: January 28, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Xiaohan Nie, Muhammad Raffay Hamid
DEPTH-GUIDED STRUCTURE-FROM-MOTION TECHNIQUES

Publication number: 20240346686

Abstract: Systems, devices, and methods are provided for depth-guided structure from motion. A system may obtain a plurality of image frames from a digital content item that corresponds to a scene and determine, based at least in part on a correspondence search, a set of 2-D keypoints for the plurality of image frames. A depth estimator may be used to determine a plurality of dense depth map for the plurality of image frames. The set of 2-D keypoints and the plurality of dense depth maps may be used to determine a corresponding set of depth priors. Initialization and/or depth-regularized optimization may be performed using the keypoints and depth priors.

Type: Application

Filed: June 20, 2024

Publication date: October 17, 2024

Applicant: Amazon Technologies, Inc.

Inventors: Xiaohan Nie, Michael Thomas Pecchia, Leo Chan, Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Sheng Liu
Techniques for generating optimized video segments utilizing a visual search

Patent number: 12073625

Abstract: Systems and methods are provided herein for generating optimized video segments. A derivative video segment (e.g., a scene) can be identified from derivative video content (e.g., a movie trailer). The segment may be used a query to search video content (e.g., the movie) for the segment. Once found, an optimized video segment may be generated from the video content. The optimized video segment may have a different start time and/or end time than those corresponding to the original segment. Once optimized, the video segment may be presented to a user or stored for subsequent content recommendations.

Type: Grant

Filed: July 18, 2023

Date of Patent: August 27, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Najmeh Sadoughi Nourabadi, Kewen Chen, Tu Anh Ho, Christina Botkins, Dongqing Zhang, Muhammad Raffay Hamid
Contrastive learning of scene representation guided by video similarities

Patent number: 12067779

Abstract: A plurality of similar video pairs may be determined based on one or more similarity information types. Each video pair of the plurality of similar video pairs may include a first respective video and a second respective video. For each video pair, one or more similar scene pairs may be determined. Each of the one or more similar scene pairs may include a respective first scene from the first respective video and a second respective scene from the second respective video. An encoder may be trained using a contrastive learning model that contrasts a plurality of similar scene pairs with a plurality of random scenes. The plurality of similar scene pairs may include the one or more scene pairs for each video pair. One or more scene features of one or more other scenes of one or more other videos may be determined using the encoder.

Type: Grant

Filed: February 9, 2022

Date of Patent: August 20, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Shixing Chen, Xiang Hao, Xiaohan Nie, Muhammad Raffay Hamid
Frame-based body part detection in video clips

Patent number: 12056949

Abstract: Techniques are disclosed for detecting an uncovered portion of a body of a person in a frame of video content. In an example, a first machine learning model of a computing system may output a first score for the frame based on a map that identifies a region of the frame associated with an uncovered body part type. Depending on a value of the first score, a second machine learning model that includes a neural network architecture may further analyze the frame to output a second score. The first score and second score may be merged to produce a third score for the frame. A plurality of scores may be determined, respectively, for frames of the video content, and a maximum score may be selected. The video content may be selected for presentation on a display for further evaluation based on the maximum score.

Type: Grant

Filed: March 29, 2021

Date of Patent: August 6, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Xiaohang Sun, Mohamed Kamal Omar, Alexander Ratnikov, Ahmed Aly Saad Ahmed, Tai-Ching Li, Travis Silvers, Hanxiao Deng, Muhammad Raffay Hamid, Ivan Ryndin
Depth-guided structure-from-motion techniques

Patent number: 12046002

Abstract: Systems, devices, and methods are provided for depth guided structure from motion. A system may obtain a plurality of image frames from a digital content item that corresponds to a scene and determine, based at least in part on a correspondence search, a set of 2-D keypoints for the plurality of image frames. A depth estimator may be used to determine a plurality of dense depth map for the plurality of image frames. The set of 2-D keypoints and the plurality of dense depth maps may be used to determine a corresponding set of depth priors. Initialization and/or depth-regularized optimization may be performed using the keypoints and depth priors.

Type: Grant

Filed: March 1, 2022

Date of Patent: July 23, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Xiaohan Nie, Michael Thomas Pecchia, Leo Chan, Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Sheng Liu
Frame-based body part detection in video clips

Patent number: 11928880

Abstract: Techniques are disclosed for detecting an uncovered portion of a body of a person in a frame of video content. In an example, a first machine learning model of a computing system may output a first score for the frame based on a map that identifies a region of the frame associated with an uncovered body part type. Depending on a value of the first score, a second machine learning model that includes a neural network architecture may further analyze the frame to output a second score. The first score and second score may be merged to produce a third score for the frame. A plurality of scores may be determined, respectively, for frames of the video content, and a maximum score may be selected. The video content may be selected for presentation on a display for further evaluation based on the maximum score.

Type: Grant

Filed: March 29, 2021

Date of Patent: March 12, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Xiaohang Sun, Mohamed Kamal Omar, Alexander Ratnikov, Ahmed Aly Saad Ahmed, Tai-Ching Li, Travis Silvers, Hanxiao Deng, Muhammad Raffay Hamid, Ivan Ryndin
Language agnostic drift correction

Patent number: 11900700

Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift, and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.

Type: Grant

Filed: February 27, 2023

Date of Patent: February 13, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Tamojit Chatterjee, Mayank Sharma, Muhammad Raffay Hamid, Sandeep Joshi
SYSTEMS AND METHODS FOR VIDEO-BASED SPORTS FIELD REGISTRATION

Publication number: 20240029278

Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.

Type: Application

Filed: October 4, 2023

Publication date: January 25, 2024

Inventors: Xiaohan Nie, Muhammad Raffay Hamid
Language agnostic automated voice activity detection

Patent number: 11869537

Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for language agnostic automated voice activity detection. Example methods may include determining an audio file associated with video content, generating audio segments using the audio file, the audio segments including a first segment and a second segment, and determining that the first segment includes first voice activity. Methods may include determining that the second segment comprises second voice activity, determining that voice activity is present between a first timestamp associated with the first segment and a second timestamp associated with the second segment, and generating text data representing the voice activity that is present between the first timestamp and the second timestamp.

Type: Grant

Filed: November 10, 2021

Date of Patent: January 9, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Mayank Sharma, Sandeep Joshi, Muhammad Raffay Hamid
Temporal localization of mature content in long-form videos using only video-level labels

Patent number: 11829413

Abstract: Techniques for temporal localization of mature content in long-form videos using only video-level labels are described.

Type: Grant

Filed: September 23, 2020

Date of Patent: November 28, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Xiang Hao, Jingxiang Chen, Vernon Germano, Muhammad Raffay Hamid, Lakshay Sharma
Systems and methods for video-based sports field registration

Patent number: 11816849

Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.

Type: Grant

Filed: September 30, 2022

Date of Patent: November 14, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Xiaohan Nie, Muhammad Raffay Hamid
Enhanced video annotation using image analysis

Patent number: 11790695

Abstract: Devices, systems, and methods are provided for enhanced video annotations using image analysis. A method may include identifying, by a first device, first faces of first video frames, and second faces of second video frames. The method may include determining a first score for the first video frames, the first score indicative of a first number of faces to label, the first number of faces represented by the first video frames, and determining a second score for the second video frames, the second score indicative of a second number of faces to label. The method may include selecting the first video frames for face labeling, and receiving a first face label for the first face. The method may include generating a second face label for the second faces. The method may include sending the first face label and the second face label to a second device for presentation.

Type: Grant

Filed: May 17, 2021

Date of Patent: October 17, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Abhinav Aggarwal, Yash Pandya, Laxmi Shivaji Ahire, Lokesh Amarnath Ravindranathan, Manivel Sethu, Muhammad Raffay Hamid
Ensemble of machine learning models for automatic scene change detection

Patent number: 11776273

Abstract: Techniques for automatic scene change detection are described. As one example, a computer-implemented method includes receiving a request to train an ensemble of machine learning models on a training dataset of videos having labels that indicate scene changes to detect a scene change in a video, partitioning each video file of the training dataset of videos into a plurality of shots, training the ensemble of machine learning models into a trained ensemble of machine learning models based at least in part on the plurality of shots of the training dataset of videos and the labels that indicate scene changes, receiving an inference request for an input video, partitioning the input video into a plurality of shots, generating, by the trained ensemble of machine learning models, an inference of one or more scene changes in the input video based at least in part on the plurality of shots of the input video, and transmitting the inference to a client application or to a storage location.

Type: Grant

Filed: November 30, 2020

Date of Patent: October 3, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Shixing Chen, Muhammad Raffay Hamid, Vimal Bhat, Shiva Krishnamurthy
Techniques for generating optimized video segments utilizing a visual search

Patent number: 11763564

Abstract: Systems and methods are provided herein for generating optimized video segments. A derivative video segment (e.g., a scene) can be identified from derivative video content (e.g., a movie trailer). The segment may be used a query to search video content (e.g., the movie) for the segment. Once found, an optimized video segment may be generated from the video content. The optimized video segment may have a different start time and/or end time than those corresponding to the original segment. Once optimized, the video segment may be presented to a user or stored for subsequent content recommendations.

Type: Grant

Filed: March 29, 2021

Date of Patent: September 19, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Najmeh Sadoughi Nourabadi, Kewen Chen, Tu Anh Ho, Christina Botkins, Dongqing Zhang, Muhammad Raffay Hamid
LANGUAGE AGNOSTIC DRIFT CORRECTION

Publication number: 20230282006

Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift, and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.

Type: Application

Filed: February 27, 2023

Publication date: September 7, 2023

Applicant: Amazon Technologies, Inc.

Inventors: Tamojit Chatterjee, Mayank Sharma, Muhammad Raffay Hamid, Sandeep Joshi
Shot contras five self-supervised learning of a plurality of machine learning models for video analysis applications

Patent number: 11748988

Abstract: Techniques for automatic scene change detection in a video are described. As one example, a computer-implemented method includes extracting features of a query shot and its neighboring shots of a first set of shots without labels with a query model, determining a key shot of the neighboring shots which is most similar to the query shot based at least in part on the features of the query shot and its neighboring shots, extracting features of the key shot with a key model, training the query model into a trained query model based at least in part on a comparison of the features of the query shot and the features of the key shot, extracting features of a second set of shots with labels with the trained query model, and training a temporal model into a trained temporal model based at least in part on the features extracted from the second set of shots and the labels of the second set of shots.

Type: Grant

Filed: April 21, 2021

Date of Patent: September 5, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Shixing Chen, Xiaohan Nie, David Jiatian Fan, Dongqing Zhang, Vimal Bhat, Muhammad Raffay Hamid
Leveraging computer vision and machine learning to identify compelling scenes

Patent number: 11734930

Abstract: Methods and apparatus are described for generating compelling preview clips of media presentations. Compelling clips are identified based on the extent to which human faces are shown and/or the loudness of the audio associated with the clips. One or more of these compelling clips are then provided to a client device for playback.

Type: Grant

Filed: May 9, 2022

Date of Patent: August 22, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Kewen Chen, Tu Anh Ho, Muhammad Raffay Hamid, Shixing Chen
Virtual product placement

Patent number: 11657850

Abstract: Techniques are described for automating virtual placements in video content.

Type: Grant

Filed: December 9, 2020

Date of Patent: May 23, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Yongjun Wu, Yash Chaturvedi, Steven James Cox, Travis Silvers, Amit S. Jain, Amjad Y. A. Abu Jbara, Prasanth Saraswatula
Language agnostic drift correction

Patent number: 11625928

Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.

Type: Grant

Filed: September 1, 2020

Date of Patent: April 11, 2023

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Tamojit Chatterjee, Mayank Sharma, Muhammad Raffay Hamid, Sandeep Joshi

1 2 3 next