Patents by Inventor Mubarak Shah

Mubarak Shah has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Self-supervised privacy preservation action recognition system

Patent number: 12142053

Abstract: This invention relates to a self-supervised privacy preservation action recognition system leveraging a learnable transformation anonymization function. The system is designed to process videos by removing spatial cues to protect privacy while retaining critical information for action recognition. The anonymization function, based on an encoder-decoder model, undergoes iterative training on a dataset to optimize the balance between obscuring privacy-sensitive information and preserving capability of recognizing actions. This training involves freezing and adjusting the weights of an action recognition branch and a self-supervised privacy removal branch to refine the model's effectiveness. The outcome is an anonymized video with minimized privacy information leakage, suitable for action analysis without privacy labels.

Type: Grant

Filed: March 29, 2024

Date of Patent: November 12, 2024

Assignee: University of Central Florida Research Foundation, Inc.

Inventors: Ishan Rajendrakumar Dave, Chen Chen, Mubarak Shah
Self-Supervised Privacy Preservation Action Recognition System

Publication number: 20240331389

Abstract: This invention relates to a self-supervised privacy preservation action recognition system leveraging a learnable transformation anonymization function. The system is designed to process videos by removing spatial cues to protect privacy while retaining critical information for action recognition. The anonymization function, based on an encoder-decoder model, undergoes iterative training on a dataset to optimize the balance between obscuring privacy-sensitive information and preserving capability of recognizing actions. This training involves freezing and adjusting the weights of an action recognition branch and a self-supervised privacy removal branch to refine the model's effectiveness. The outcome is an anonymized video with minimized privacy information leakage, suitable for action analysis without privacy labels.

Type: Application

Filed: March 29, 2024

Publication date: October 3, 2024

Inventors: Ishan Rajendrakumar Dave, Chen Chen, Mubarak Shah
CROSS-VIEW IMAGE GEO-LOCALIZATION

Publication number: 20240303770

Abstract: CNN-based methods for cross-view image geo-localization rely on polar transform and fail to model global correlation. A pure transformer-based approach (TransGeo) is described to address these limitations from a different perspective. TransGeo takes full advantage of the strengths of the transformer related to global information modeling and explicit position information encoding. The claimed invention further leverages transformer input's flexibility and discloses an attention-guided non-uniform cropping method so that uninformative image patches are removed with a negligible drop in performance to reduce computation cost. The saved computation can be reallocated to increase resolution only for informative patches, resulting in performance improvement with no additional computation cost. This “attend and zoom-in” strategy is highly similar to human behavior when observing images.

Type: Application

Filed: January 17, 2024

Publication date: September 12, 2024

Inventors: Sijie ZHU, Chen CHEN, Mubarak SHAH
Methods of real-time spatio-temporal activity detection and categorization from untrimmed video segments

Patent number: 11468676

Abstract: Methods of detecting and categorizing an action in an untrimmed video segment regardless of the scale of the action and the close proximity of other actions. The methods improve upon the prior art which either require trimmed video segments including only a single activity depicted therein, or untrimmed video segments including relatively few actions, persons, or objects of interest, thereby directing the classification. Instead, the methods utilize a plurality of tubelets used to represent discreet actions, persons, and objects of interest within the comprehensive untrimmed video segment. The tubelets are localized to correct for pixel-level foreground-background biases, which are then turned into short spatio-temporal action tubelets that are passed to a classification network to obtain multi-label predictions. After classification, the tubelets are be linked together to obtain the final detections with varying lengths, and the method merges the short action tubelets into final action detections.

Type: Grant

Filed: January 7, 2022

Date of Patent: October 11, 2022

Assignee: University of Central Florida Research Foundation, Inc.

Inventors: Yogesh Singh Rawat, Mubarak Shah, Aayush Jung Bahadur Rana, Praveen Tirupattur, Mamshad Nayeem Rizve
METHODS OF REAL-TIME SPATIO-TEMPORAL ACTIVITY DETECTION AND CATEGORIZATION FROM UNTRIMMED VIDEO SEGMENTS

Publication number: 20220222940

Abstract: Methods of detecting and categorizing an action in an untrimmed video segment regardless of the scale of the action and the close proximity of other actions. The methods improve upon the prior art which either require trimmed video segments including only a single activity depicted therein, or untrimmed video segments including relatively few actions, persons, or objects of interest, thereby directing the classification. Instead, the methods utilize a plurality of tubelets used to represent discreet actions, persons, and objects of interest within the comprehensive untrimmed video segment. The tubelets are localized to correct for pixel-level foreground-background biases, which are then turned into short spatio-temporal action tubelets that are passed to a classification network to obtain multi-label predictions. After classification, the tubelets are be linked together to obtain the final detections with varying lengths, and the method merges the short action tubelets into final action detections.

Type: Application

Filed: January 7, 2022

Publication date: July 14, 2022

Inventors: Yogesh Singh Rawat, Mubarak Shah, Aayush Jung Bahadur Rana, Praveen Tirupattur, Mamshad Nayeem Rizve
System and method for human pose estimation in unconstrained video

Patent number: 10509957

Abstract: A system and method for estimating a sequence of human poses in an unconstrained video. In the present invention, a unified two stage, tree-based, optimization problem is solved for which an efficient and exact solution exists. While the proposed method finds an exact solution, it does not sacrifice the ability to model the spatial and temporal constraints between body parts in the video frames on the unconstrained video.

Type: Grant

Filed: February 6, 2017

Date of Patent: December 17, 2019

Assignee: University of Central Florida Research Foundation, Inc.

Inventors: Dong Zhang, Mubarak Shah
Multi-source, multi-scale counting in dense crowd images

Patent number: 9946952

Abstract: A method for counting individuals in an image containing a dense, uniform or non-uniform crowd. The current invention leverages multiple sources of information to compute an estimate of the number of individuals present in a dense crowd visible in a single image. This approach relies on multiple sources, such as low confidence head detections, repetition of texture elements (using SIFT), and frequency-domain analysis to estimate counts, along with confidence associated with observing individuals in an image region. Additionally, a global consistency constraint can be employed on counts using Markov Random Field. This caters for disparity in counts in local neighborhoods and across scales. The methodology was tested on a new dataset of fifty (50) crowd images containing over 64,000 annotated humans, with the head counts ranging from 94 to 4,543. Efficient and accurate results were attained.

Type: Grant

Filed: June 25, 2014

Date of Patent: April 17, 2018

Assignee: University of Central Florida Research Foundation, Inc.

Inventors: Haroon Idrees, Imran Saleemi, Mubarak Shah
Multi-Source, Multi-Scale Counting in Dense Crowd Images

Publication number: 20180005071

Abstract: A method for counting individuals in an image containing a dense, uniform or non-uniform crowd. The current invention leverages multiple sources of information to compute an estimate of the number of individuals present in a dense crowd visible in a single image. This approach relies on multiple sources, such as low confidence head detections, repetition of texture elements (using SIFT), and frequency-domain analysis to estimate counts, along with confidence associated with observing individuals in an image region. Additionally, a global consistency constraint can be employed on counts using Markov Random Field. This caters for disparity in counts in local neighborhoods and across scales. The methodology was tested on a new dataset of fifty (50) crowd images containing over 64,000 annotated humans, with the head counts ranging from 94 to 4,543. Efficient and accurate results were attained.

Type: Application

Filed: June 25, 2014

Publication date: January 4, 2018

Inventors: Haroon Idrees, Imran Saleemi, Mubarak Shah
SYSTEM AND METHOD FOR HUMAN POSE ESTIMATION IN UNCONSTRAINED VIDEO

Publication number: 20170228587

Abstract: A system and method for estimating a sequence of human poses in an unconstrained video. In the present invention, a unified two stage, tree-based, optimization problem is solved for which an efficient and exact solution exists. While the proposed method finds an exact solution, it does not sacrifice the ability to model the spatial and temporal constraints between body parts in the video frames on the unconstrained video.

Type: Application

Filed: February 6, 2017

Publication date: August 10, 2017

Inventors: Dong Zhang, Mubarak Shah
Video face recognition

Patent number: 9336433

Abstract: The present invention is a complete end-to-end video face recognition system. The invention performs a track-by-track labeling of all tracks within a video. A novel algorithm Mean Sequence SRC (MSSRC) is applied to a face track to perform video face recognition using a joint optimization to leverage all of the available video data and the knowledge that the face track frames belong to the same individual. Additionally the system constructs a probabilistic affinity graph combining appearance and co-occurrence similarities to model the relationship between face tracks in a video. Finally, using this relationship graph, random walk analysis is employed to propagate strong class predictions among similar face tracks, while dampening weak predictions.

Type: Grant

Filed: July 24, 2014

Date of Patent: May 10, 2016

Assignee: UNIVERSITY OF CENTRAL FLORIDA RESEARCH FOUNDATION, INC.

Inventors: Enrique Ortiz, Mubarak Shah, Alan Wright
Method and system for modeling and processing fMRI image data using a bag-of-words approach

Patent number: 9072496

Abstract: Systems and methods for processing image data are provided. A computer implemented method for processing image data, comprises gathering 4-D image data from a subject, extracting time series data, and spatial and degree data of each voxel of the subject, deriving at least one feature from the time series data, deriving at least one feature from the spatial and degree data, combining the at least one feature from the time series data, and the at least one feature from the spatial and degree data to generate combined data, and inputting the combined data to a classifier, wherein the classifier outputs a classification based on the combined data.

Type: Grant

Filed: February 1, 2013

Date of Patent: July 7, 2015

Assignees: International Business Machines Corporation, University of Central Florida Research Foundation, Inc.

Inventors: Ravishankar Rao, Soumyabrata Dey, Mubarak Shah, Solmaz Berkan
Systems and methods for modeling three-dimensional objects from two-dimensional images

Patent number: 8363926

Abstract: In one embodiment, a system and method for modeling a three-dimensional object includes capturing two-dimensional images of the object from multiple different viewpoints to obtain multiple views of the object, estimating slices of the object that lie in parallel planes that cut through the object, and computing a surface of the object from the estimated slices.

Type: Grant

Filed: February 5, 2009

Date of Patent: January 29, 2013

Assignee: University of Central Florida Research Foundation, Inc.

Inventors: Saad M. Khan, Pingkun Yan, Mubarak Shah
Systems and methods for three-dimensionally modeling moving objects

Publication number: 20110007072

Abstract: In one embodiment, a system and method for three-dimensionally modeling a moving object pertain to capturing sequential images of the moving object from multiple different viewpoints to obtain multiple views of the moving object, identifying silhouettes of the moving object in each view, determining the location in each view of a temporal occupancy point for each silhouette boundary pixel, each temporal occupancy point being the estimated localization of a three-dimensional scene point that gave rise to its associated silhouette boundary pixel, generating blurred occupancy images that comprise silhouettes of the moving object composed of the temporal occupancy points, deblurring the blurred occupancy images to generate deblurred occupancy maps of the moving object, and reconstructing the moving object by performing visual hull intersection using the blurred occupancy maps to generate a three-dimensional model of the moving object.

Type: Application

Filed: July 9, 2009

Publication date: January 13, 2011

Applicant: University of Central Florida Research Foundation, Inc.

Inventors: Saad M. Khan, Mubarak Shah
SYSTEMS AND METHODS FOR MODELING THREE-DIMENSIONAL OBJECTS FROM TWO- DIMENSIONAL IMAGES

Publication number: 20090304265

Abstract: In one embodiment, a system and method for modeling a three-dimensional object includes capturing two-dimensional images of the object from multiple different viewpoints to obtain multiple views of the object, estimating slices of the object that lie in parallel planes that cut through the object, and computing a surface of the object from the estimated slices.

Type: Application

Filed: February 5, 2009

Publication date: December 10, 2009

Inventors: Saad M. Khan, Pingkun Yan, Mubarak Shah
Tracking across multiple cameras with disjoint views

Patent number: 7450735

Abstract: Tracking and surveillance methods and systems for monitoring objects passing in front of non-overlapping cameras. Invention finds corresponding tracks from different cameras and works out which object passing in front of the camera(s) made the tracks, in order to track the object from camera to camera. The invention uses an algorithm to learn inter-camera spatial temporal probability using Parzen windows, learns inter-camera appearance probabilities using distribution of Bhattacharyya distances between appearance models, establishes correspondences based on Maximum A Posteriori (MAP) framework combining both spatial temporal and appearance probabilities, and updates learned probabilities throughout the lifetime of the system.

Type: Grant

Filed: October 16, 2004

Date of Patent: November 11, 2008

Assignee: University of Central Florida Research Foundation, Inc.

Inventors: Mubarak Shah, Omar Javed, Khurram Shafique, Zeeshan Rasheed
Algorithm for monitoring head/eye motion for driver alertness with one camera

Patent number: 6927694

Abstract: Visual methods and systems are described for detecting alertness and vigilance of persons under conditions of fatigue, lack of sleep, and exposure to mind altering substances such as alcohol and drugs. In particular, the intention can have particular applications for truck drivers, bus drivers, train operators, pilots and watercraft controllers and stationary heavy equipment operators, and students and employees during either daytime or nighttime conditions. The invention robustly tracks a person's head and facial features with a single on-board camera with a fully automatic system, that can initialize automatically, and can reinitialize when it need's to and provide outputs in realtime. The system can classify rotation in all viewing direction, detects' eye/mouth occlusion, detects' eye blinking, and recovers the 3D(three dimensional) gaze of the eyes. In addition, the system is able to track both through occlusion like eye blinking and also through occlusion like rotation.

Type: Grant

Filed: August 16, 2002

Date of Patent: August 9, 2005

Assignee: Research Foundation of the University of Central Florida

Inventors: Raymond Paul Smith, Mubarak Shah, Niels da Vitoria Lobo