Patents by Inventor Hisham Cholakkal

Hisham Cholakkal has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method for attention-aware relation mixer for person search

Patent number: 12260674

Abstract: A video system and method for person search includes video cameras for capturing video images, a display device, and a computer system. The computer system including a deep learning network to determine person images, from among the video images, matching a target query person. The deep learning network having a person detection branch, a person re-identification branch, and an attention-aware relation mixer connected to the person detection branch and to the person re-identification branch. The attention-aware relation mixer including a relation mixer having a spatial and channel mixer that performs spatial attention followed by spatial mixing (tokenized multi-layered perceptron) and channel attention followed by channel mixing (channel multi-layered perceptron), and a joint spatio-channel attention layer that utilizes 3D attention weights to modulate 3D spatio-channel region of interest features and aggregate the features with output of the relation mixer.

Type: Grant

Filed: November 9, 2022

Date of Patent: March 25, 2025

Assignee: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Mustansar Fiaz, Hisham Cholakkal, Sanath Narayan, Rao Muhammad Anwer, Fahad Khan
System and method of cross-modulated dense local fusion for few-shot image generation

Patent number: 12100082

Abstract: An apparatus, computer readable storage medium and method of generating a diverse set of images from few-shot images, includes a parameter input receiving values for control parameters to control an extent to which each reference image impacts a newly generated image. The apparatus involves an image generation deep learning network for generating an image for each of the values for the control parameters. The deep learning network has an encoder, a transformer-based fusion block, and a decoder. The transformer-based fusion block includes a mapping network that computes meta-weights from features extracted from the reference images and the control parameters, and a cross-attention block to generate modulation weights based on the meta-weights. An output displays high-quality and diverse images generated based on the values for the control parameter.

Type: Grant

Filed: November 9, 2022

Date of Patent: September 24, 2024

Assignee: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Amandeep Kumar, Ankan Kumar Bhunia, Hisham Cholakkal, Sanath Narayan, Rao Muhammad Anwer, Fahad Khan
System and method for handwriting generation

Patent number: 11756244

Abstract: A system and computer readable storage medium for automated handwriting generation, including a text input device for inputting a text query having at least one textual word string, an image input device for inputting a handwriting sample with characters in a writing style of a user, and a computer implemented deep learning transformer model including an encoder network and a decoder network in which each are a hybrid of convolution and multi-head self-attention networks. The encoder produces a sequence of style feature embeddings from the input handwriting sample. The decoder takes the sequence of style feature embeddings in order to convert the at least one textual word string into a generated handwritten image having substantially same writing style as the handwriting sample. An output device to output the generated handwriting image.

Type: Grant

Filed: July 19, 2022

Date of Patent: September 12, 2023

Assignee: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Khan
Dense and discriminative neural network architectures for improved object detection and instance segmentation

Patent number: 11244188

Abstract: This disclosure relates to improved techniques for performing computer vision functions, including common object detection and instance segmentation. The techniques described herein utilize neural network architectures to perform these functions in various types of images, such as natural images, UAV images, satellite images, and other images. The neural network architecture can include a dense location regression network that performs object localization and segmentation functions, at least in part, by generating offset information for multiple sub-regions of candidate object proposals, and utilizing this dense offset information to derive final predictions for locations of target objects. The neural network architecture also can include a discriminative region-of-interest (RoI) pooling network that performs classification of the localized objects, at least in part, by sampling various sub-regions of candidate proposals and performing adaptive weighting to obtain discriminative features.

Type: Grant

Filed: April 10, 2020

Date of Patent: February 8, 2022

Assignee: Inception Institute of Artificial Intelligence, Ltd.

Inventors: Hisham Cholakkal, Jiale Cao, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao
Dense and Discriminative Neural Network Architectures for Improved Object Detection and Instance Segmentation

Publication number: 20210319242

Abstract: This disclosure relates to improved techniques for performing computer vision functions, including common object detection and instance segmentation. The techniques described herein utilize neural network architectures to perform these functions in various types of images, such as natural images, UAV images, satellite images, and other images. The neural network architecture can include a dense location regression network that performs object localization and segmentation functions, at least in part, by generating offset information for multiple sub-regions of candidate object proposals, and utilizing this dense offset information to derive final predictions for locations of target objects. The neural network architecture also can include a discriminative region-of-interest (Rol) pooling network that performs classification of the localized objects, at least in part, by sampling various sub-regions of candidate proposals and performing adaptive weighting to obtain discriminative features.

Type: Application

Filed: April 10, 2020

Publication date: October 14, 2021

Inventors: Hisham Cholakkal, Jiale Cao, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao
Object counting and instance segmentation using neural network architectures with image-level supervision

Patent number: 10453197

Abstract: This disclosure relates to improved techniques for performing computer vision functions including common object counting and instance segmentation. The techniques described herein utilize a neural network architecture to perform these functions. The neural network architecture can be trained using image-level supervision techniques that utilize a loss function to jointly train an image classification branch and a density branch of the neural network architecture. The neural network architecture constructs per-category density maps that can be used to generate analysis information comprising global object counts and locations of objects in images.

Type: Grant

Filed: February 18, 2019

Date of Patent: October 22, 2019

Assignee: Inception Institute of Artificial Intelligence, Ltd.

Inventors: Hisham Cholakkal, Guolei Sun, Fahad Shahbaz Khan, Ling Shao

System and method for attention-aware relation mixer for person search

System and method of cross-modulated dense local fusion for few-shot image generation

System and method for handwriting generation

Dense and discriminative neural network architectures for improved object detection and instance segmentation

Dense and Discriminative Neural Network Architectures for Improved Object Detection and Instance Segmentation

Object counting and instance segmentation using neural network architectures with image-level supervision