Patents by Inventor Muhammad Raffay Hamid

Muhammad Raffay Hamid has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11928880
    Abstract: Techniques are disclosed for detecting an uncovered portion of a body of a person in a frame of video content. In an example, a first machine learning model of a computing system may output a first score for the frame based on a map that identifies a region of the frame associated with an uncovered body part type. Depending on a value of the first score, a second machine learning model that includes a neural network architecture may further analyze the frame to output a second score. The first score and second score may be merged to produce a third score for the frame. A plurality of scores may be determined, respectively, for frames of the video content, and a maximum score may be selected. The video content may be selected for presentation on a display for further evaluation based on the maximum score.
    Type: Grant
    Filed: March 29, 2021
    Date of Patent: March 12, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Xiaohang Sun, Mohamed Kamal Omar, Alexander Ratnikov, Ahmed Aly Saad Ahmed, Tai-Ching Li, Travis Silvers, Hanxiao Deng, Muhammad Raffay Hamid, Ivan Ryndin
  • Patent number: 11900700
    Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift, and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.
    Type: Grant
    Filed: February 27, 2023
    Date of Patent: February 13, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Tamojit Chatterjee, Mayank Sharma, Muhammad Raffay Hamid, Sandeep Joshi
  • Publication number: 20240029278
    Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.
    Type: Application
    Filed: October 4, 2023
    Publication date: January 25, 2024
    Inventors: Xiaohan Nie, Muhammad Raffay Hamid
  • Patent number: 11869537
    Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for language agnostic automated voice activity detection. Example methods may include determining an audio file associated with video content, generating audio segments using the audio file, the audio segments including a first segment and a second segment, and determining that the first segment includes first voice activity. Methods may include determining that the second segment comprises second voice activity, determining that voice activity is present between a first timestamp associated with the first segment and a second timestamp associated with the second segment, and generating text data representing the voice activity that is present between the first timestamp and the second timestamp.
    Type: Grant
    Filed: November 10, 2021
    Date of Patent: January 9, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Mayank Sharma, Sandeep Joshi, Muhammad Raffay Hamid
  • Patent number: 11829413
    Abstract: Techniques for temporal localization of mature content in long-form videos using only video-level labels are described.
    Type: Grant
    Filed: September 23, 2020
    Date of Patent: November 28, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Xiang Hao, Jingxiang Chen, Vernon Germano, Muhammad Raffay Hamid, Lakshay Sharma
  • Patent number: 11816849
    Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.
    Type: Grant
    Filed: September 30, 2022
    Date of Patent: November 14, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Xiaohan Nie, Muhammad Raffay Hamid
  • Patent number: 11790695
    Abstract: Devices, systems, and methods are provided for enhanced video annotations using image analysis. A method may include identifying, by a first device, first faces of first video frames, and second faces of second video frames. The method may include determining a first score for the first video frames, the first score indicative of a first number of faces to label, the first number of faces represented by the first video frames, and determining a second score for the second video frames, the second score indicative of a second number of faces to label. The method may include selecting the first video frames for face labeling, and receiving a first face label for the first face. The method may include generating a second face label for the second faces. The method may include sending the first face label and the second face label to a second device for presentation.
    Type: Grant
    Filed: May 17, 2021
    Date of Patent: October 17, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Abhinav Aggarwal, Yash Pandya, Laxmi Shivaji Ahire, Lokesh Amarnath Ravindranathan, Manivel Sethu, Muhammad Raffay Hamid
  • Patent number: 11776273
    Abstract: Techniques for automatic scene change detection are described. As one example, a computer-implemented method includes receiving a request to train an ensemble of machine learning models on a training dataset of videos having labels that indicate scene changes to detect a scene change in a video, partitioning each video file of the training dataset of videos into a plurality of shots, training the ensemble of machine learning models into a trained ensemble of machine learning models based at least in part on the plurality of shots of the training dataset of videos and the labels that indicate scene changes, receiving an inference request for an input video, partitioning the input video into a plurality of shots, generating, by the trained ensemble of machine learning models, an inference of one or more scene changes in the input video based at least in part on the plurality of shots of the input video, and transmitting the inference to a client application or to a storage location.
    Type: Grant
    Filed: November 30, 2020
    Date of Patent: October 3, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Shixing Chen, Muhammad Raffay Hamid, Vimal Bhat, Shiva Krishnamurthy
  • Patent number: 11763564
    Abstract: Systems and methods are provided herein for generating optimized video segments. A derivative video segment (e.g., a scene) can be identified from derivative video content (e.g., a movie trailer). The segment may be used a query to search video content (e.g., the movie) for the segment. Once found, an optimized video segment may be generated from the video content. The optimized video segment may have a different start time and/or end time than those corresponding to the original segment. Once optimized, the video segment may be presented to a user or stored for subsequent content recommendations.
    Type: Grant
    Filed: March 29, 2021
    Date of Patent: September 19, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Najmeh Sadoughi Nourabadi, Kewen Chen, Tu Anh Ho, Christina Botkins, Dongqing Zhang, Muhammad Raffay Hamid
  • Publication number: 20230282006
    Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift, and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.
    Type: Application
    Filed: February 27, 2023
    Publication date: September 7, 2023
    Applicant: Amazon Technologies, Inc.
    Inventors: Tamojit Chatterjee, Mayank Sharma, Muhammad Raffay Hamid, Sandeep Joshi
  • Patent number: 11748988
    Abstract: Techniques for automatic scene change detection in a video are described. As one example, a computer-implemented method includes extracting features of a query shot and its neighboring shots of a first set of shots without labels with a query model, determining a key shot of the neighboring shots which is most similar to the query shot based at least in part on the features of the query shot and its neighboring shots, extracting features of the key shot with a key model, training the query model into a trained query model based at least in part on a comparison of the features of the query shot and the features of the key shot, extracting features of a second set of shots with labels with the trained query model, and training a temporal model into a trained temporal model based at least in part on the features extracted from the second set of shots and the labels of the second set of shots.
    Type: Grant
    Filed: April 21, 2021
    Date of Patent: September 5, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Shixing Chen, Xiaohan Nie, David Jiatian Fan, Dongqing Zhang, Vimal Bhat, Muhammad Raffay Hamid
  • Patent number: 11734930
    Abstract: Methods and apparatus are described for generating compelling preview clips of media presentations. Compelling clips are identified based on the extent to which human faces are shown and/or the loudness of the audio associated with the clips. One or more of these compelling clips are then provided to a client device for playback.
    Type: Grant
    Filed: May 9, 2022
    Date of Patent: August 22, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Kewen Chen, Tu Anh Ho, Muhammad Raffay Hamid, Shixing Chen
  • Patent number: 11657850
    Abstract: Techniques are described for automating virtual placements in video content.
    Type: Grant
    Filed: December 9, 2020
    Date of Patent: May 23, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Yongjun Wu, Yash Chaturvedi, Steven James Cox, Travis Silvers, Amit S. Jain, Amjad Y. A. Abu Jbara, Prasanth Saraswatula
  • Patent number: 11625928
    Abstract: Systems, methods, and computer-readable media are disclosed for language-agnostic subtitle drift detection and correction. A method may include determining subtitles and/or captions from media content (e.g., videos), the subtitles and/or captions corresponding to dialog in the media content. The subtitles may be broken up into segments which may be analyzed to determine a likelihood of drift (e.g., a likelihood that the subtitles are out of synchronization with the dialog in the media content) for each segment. For segments with a high likelihood of drift, the subtitles may be incrementally adjusted to determine an adjustment that eliminates and/or reduces the amount of drift and the drift in the segment may be corrected based on the drift amount detected. A linear regression model and/or human blocks determined by human operators may be used to otherwise optimize drift correction.
    Type: Grant
    Filed: September 1, 2020
    Date of Patent: April 11, 2023
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Tamojit Chatterjee, Mayank Sharma, Muhammad Raffay Hamid, Sandeep Joshi
  • Patent number: 11617008
    Abstract: Methods, systems, and computer-readable media for media classification using local and global audio features are disclosed. A media classification system determines local features of an audio input using an audio event detector model that is trained to detect a plurality of audio event classes descriptive of objectionable content. The local features are extracted using maximum values of audio event scores for individual audio event classes. The media classification system determines one or more global features of the audio input using the audio event detector model. The global feature(s) are extracted using averaging of clip-level descriptors of a plurality of clips of the audio input. The media classification system determines a content-based rating for media comprising the audio input based (at least in part) on the local features of the audio input and based (at least in part) on the global feature(s) of the audio input.
    Type: Grant
    Filed: March 30, 2021
    Date of Patent: March 28, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Tarun Gupta, Mayank Sharma, Xiang Hao, Muhammad Raffay Hamid, Zhitao Qiu
  • Patent number: 11589116
    Abstract: Techniques are disclosed for detecting a type of prurient activity shown by a portion of video content. In an example, a machine learning model of a computer system may receive a second portion of video content, the machine learning model including a neural network that is trained to analyze a temporal dimension of the second portion. The machine learning model determines a score indicating a likelihood that the video content shows the type of prurient activity based in part on applying a three-dimensional filter to the second portion of the video content. The computer system then generates a video clip that includes at least the portion of the video content showing the type of prurient activity based on the score, and provides the video clip for display.
    Type: Grant
    Filed: May 3, 2021
    Date of Patent: February 21, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Mohamed Kamal Omar, Xiaohang Sun, Ivan Ryndin, Tai-Ching Li, Alexander Ratnikov, Muhammad Raffay Hamid, Ahmed Aly Saad Ahmed, Travis Silvers, Hanxiao Deng
  • Publication number: 20230023419
    Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.
    Type: Application
    Filed: September 30, 2022
    Publication date: January 26, 2023
    Inventors: Xiaohan Nie, Muhammad Raffay Hamid
  • Patent number: 11532111
    Abstract: Techniques for a comic book feature are described herein. A visual data stream of a video may be parsed into a plurality of frames. Scene boundaries may be determined to generate a scene using the plurality of frames where a scene includes a subset of frames. A key frame may be determined for the scene using the subset of frames. An audio portion of an audio data stream of the video may be identified that maps to the subset of frames based on time information. The key frame may be converted to a comic image based on an algorithm. First dimensions and placement for a data object may be determined for the comic image. The data object may include the audio portion for the comic image. A comic panel may be generated for the comic image that incorporates the data object using the determined first dimensions and the placement.
    Type: Grant
    Filed: June 10, 2021
    Date of Patent: December 20, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Dongqing Zhang, Muhammad Raffay Hamid, Xiaohan Nie, Shixing Chen
  • Patent number: 11468578
    Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.
    Type: Grant
    Filed: September 14, 2020
    Date of Patent: October 11, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Xiaohan Nie, Muhammad Raffay Hamid
  • Publication number: 20220180898
    Abstract: Techniques are described for automating virtual placements in video content.
    Type: Application
    Filed: December 9, 2020
    Publication date: June 9, 2022
    Inventors: Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Yongjun Wu, Yash Chaturvedi, Steven James Cox, Travis Silvers, Amit S. Jain, Amjad Y. A. Abu Jbara, Prasanth Saraswatula