Patents by Inventor Salman Khan

Salman Khan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method for self-distilled vision transformer for domain generalization

Patent number: 12288384

Abstract: An apparatus and method for a machine learning engine for domain generalization which trains a vision transformer neural network using a training dataset including at least two domains for diagnosis of a medical condition. Image patches and class tokens are processed through a sequence of feature extraction transformer blocks to obtain a predicted class token. In parallel, intermediate class tokens are extracted as outputs of each of the feature extraction transformer blocks, where each transformer block is a sub-model. One sub-model is randomly sampled from the sub-models to obtain a sampled intermediate class token. The intermediate class token is used to make a sub-model prediction. The vision transformer neural network is optimized based on a difference between the predicted class token and the sub-model prediction. Inferencing is performed for a target medical image in a target domain that is different from the at least two domains.
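
The training signal described in this abstract can be illustrated with a minimal Python sketch. It assumes the "difference" is a KL divergence between temperature-softened class distributions, with the final prediction acting as the reference. The abstract does not specify either choice, and the function names, temperature value and toy logits are hypothetical.

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of class scores."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def self_distillation_loss(final_logits, intermediate_logits, rng, temperature=3.0):
    """final_logits: class scores derived from the last block's class token.
    intermediate_logits: one list of class scores per intermediate block.
    Returns the index of the randomly sampled block and the loss value."""
    idx = rng.randrange(len(intermediate_logits))  # randomly sample one sub-model
    reference = softmax(final_logits, temperature)
    sub_model = softmax(intermediate_logits[idx], temperature)
    # Assumption: KL divergence stands in for the unspecified "difference".
    return idx, kl_divergence(reference, sub_model)

rng = random.Random(0)
final = [2.0, 0.5, -1.0]
blocks = [[0.1, 0.0, -0.1], [1.0, 0.2, -0.5], [1.8, 0.4, -0.9]]
sampled_block, loss = self_distillation_loss(final, blocks, rng)
```

In a real vision transformer the scores would come from a classifier applied to each block's class token; plain lists are used here so the sketch stays self-contained.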

Type: Grant

Filed: December 19, 2022

Date of Patent: April 29, 2025

Assignee: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Maryam Sultana, Muhammad Muzammal Naseer, Muhammad Haris Khan, Salman Khan, Fahad Shahbaz Khan

System and method of bridging the gap between object and image-level representations for open-vocabulary detection

Patent number: 12288372

Abstract: An object detection system and method in which a machine learning engine is configured with a region-based knowledge distillation stage that generates region embeddings from a training image having bounding boxes. A linear layer learns a region-level vision-language mapping for projecting feature embeddings from the training image to a common feature space shared by text embeddings to obtain the region embeddings. An image-level supervision stage generates pseudo-box labels for a classification training image and region embeddings from the training image having bounding boxes and corresponding class labels and the classification training image having an image-level label as input. Pseudo-box labels are determined on the classification training image as an image-level vision-language mapping. A weight transfer function conditions the image-level vision-language mapping on the learned region-level vision-language mapping.

Type: Grant

Filed: December 20, 2022

Date of Patent: April 29, 2025

Assignee: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Hanoona Abdul Rasheed Bangalath, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, Fahad Shahbaz Khan

SYSTEM AND METHOD FOR CONTRASTIVE AFFINITY LEARNING VIA AUXILIARY PROMPTS FOR GENERALIZED NOVEL CATEGORY DISCOVERY

Publication number: 20250078546

Abstract: A system and method of fine-grained image category discovery with few human annotations includes a camera and a trained machine learning model, which predicts a label for an object in a captured image and outputs the predicted label. The machine learning model is trained by contrastive affinity learning, including retrieving images having an object, a warm-up stage in which semi-supervised contrastive learning is performed based on projected features of a class token and an ensembled prompt, respectively. In a contrastive affinity learning stage, a student model and an exponentially moving averaged teacher model are forwarded with different augmented views of the retrieved images. Teacher embeddings are enqueued into a token-specific memory. A semi-supervised contrastive loss is computed on a current batch and a contrastive affinity learning loss for student embeddings and the teacher embeddings with pseudo-labels from an affinity graph dynamically generated by semi-supervised affinity generation.
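
Two mechanics named in this abstract, the exponentially moving averaged teacher and the memory of teacher embeddings, can be sketched in a few lines of Python. This is an illustration under assumptions: the momentum value, the queue capacity and the names `ema_update` and `EmbeddingMemory` are hypothetical, and a single queue is shown where the abstract describes a token-specific memory.

```python
from collections import deque

def ema_update(teacher, student, momentum=0.99):
    """Move each teacher weight a small step toward the matching student weight."""
    return [momentum * t + (1.0 - momentum) * s for t, s in zip(teacher, student)]

class EmbeddingMemory:
    """Fixed-size FIFO queue of teacher embeddings; the oldest entries are dropped."""

    def __init__(self, capacity):
        self.queue = deque(maxlen=capacity)

    def enqueue(self, embeddings):
        # Append a batch of embeddings; deque(maxlen=...) evicts from the front.
        self.queue.extend(embeddings)

    def __len__(self):
        return len(self.queue)

teacher_weights = ema_update([1.0, 0.0], [0.0, 1.0], momentum=0.9)
memory = EmbeddingMemory(capacity=2)
memory.enqueue([[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]])  # only the last two are kept
```

The contrastive losses and the affinity graph are not modelled here, since the abstract does not give enough detail to reproduce them.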

Type: Application

Filed: September 5, 2023

Publication date: March 6, 2025

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Sheng ZHANG, Salman KHAN, Zhiqiang SHEN, Muzammal NASEER, Guangyi CHEN, Fahad KHAN

Frame structure in NR

Patent number: 12245262

Abstract: The present application at least describes a wireless transmit/receive unit (WTRU) including a processor and non-transitory memory including stored instructions which are executed by the processor. The instructions include receiving, from a base station, configuration information indicating a set of channel state information interference channel measurement (CSI-ICM) resources for the WTRU. The configuration information indicates a period and an offset for the set of CSI-ICM resources. The instructions also include receiving, from the base station, downlink control information (DCI) indicating a subset of the set of CSI-ICM resources. The instructions further include measuring the subset of CSI-ICM resources. The instructions even further include transmitting feedback to the base station based on the subset of CSI-ICM resources. The instructions yet even further include receiving a MAC control element (CE) indicating an activation of a CSI report from the base station.
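
As an illustration only, the period/offset configuration and the DCI-selected subset could be modelled as below. The slot rule (slot index modulo period equals offset) and the bitmap form of the DCI indication are assumptions made for this sketch; they are not taken from the patent or from any 3GPP signalling format, and the function names are hypothetical.

```python
def resource_occasions(period, offset, num_slots):
    """Slots in [0, num_slots) where a periodically configured resource occurs.
    Assumption: a resource occurs in slot n when n % period == offset."""
    if period <= 0 or not 0 <= offset < period:
        raise ValueError("require period > 0 and 0 <= offset < period")
    return [n for n in range(num_slots) if n % period == offset]

def select_subset(resource_ids, dci_bitmap):
    """Pick the resources whose bit is set in a (hypothetical) DCI bitmap."""
    if len(resource_ids) != len(dci_bitmap):
        raise ValueError("bitmap length must match the configured resource set")
    return [rid for rid, bit in zip(resource_ids, dci_bitmap) if bit]

slots = resource_occasions(period=5, offset=2, num_slots=20)
to_measure = select_subset([10, 11, 12, 13], [1, 0, 0, 1])
```

The measurement, the feedback transmission and the MAC CE activation are left out; they depend on details the abstract does not state.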

Type: Grant

Filed: November 16, 2023

Date of Patent: March 4, 2025

Assignee: InterDigital Patent Holdings, Inc.

Inventors: Lakshmi R. Iyer, Allan Y. Tsai, Tianyi Xu, Guodong Zhang, Pascal M. Adjakple, Ahmed Elsamadouny, Salman Khan, Yifan Li

A TRAIN-TIME LOSS IN A SYSTEM AND METHOD FOR CALIBRATING OBJECT DETECTION

Publication number: 20250061697

Abstract: A system and method of training a deep neural network for object detection in an object detection system. The object detection system including a camera and a controller including the DNN. The method including capturing an image by the camera, receiving the image, predicting, using the DNN, a bounding box and corresponding class label, evaluating the prediction with a total loss function including an object detection loss function, a box regression loss function, and a calibration loss function that takes into account precision and confidence. The method outputs a calibrated image with the object bounding box, the corresponding label, and a respective confidence score, in which the confidence score is a probability associated with the predicted class label.
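
The three-term total loss can be sketched as follows. The abstract says only that the calibration term takes precision and confidence into account; the specific form used here (absolute gap between mean confidence and batch precision), the weighting factor and all names are assumptions for illustration.

```python
def calibration_loss(confidences, correct):
    """Absolute gap between mean predicted confidence and observed precision.
    confidences: predicted class probabilities, one per detection.
    correct: 1 if the detection matched a ground-truth object, else 0."""
    if not confidences or len(confidences) != len(correct):
        raise ValueError("need equally sized, non-empty inputs")
    precision = sum(correct) / len(correct)
    mean_confidence = sum(confidences) / len(confidences)
    return abs(mean_confidence - precision)

def total_loss(detection_loss, box_loss, confidences, correct, weight=1.0):
    """Detection loss + box regression loss + weighted calibration term."""
    return detection_loss + box_loss + weight * calibration_loss(confidences, correct)

# An over-confident batch: high scores but only half the detections are right.
loss = total_loss(0.7, 0.3, [0.9, 0.9, 0.9, 0.9], [1, 1, 0, 0], weight=0.5)
```

A batch whose mean confidence equals its precision contributes zero calibration penalty under this form, which is the behaviour a calibration term is meant to encourage.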

Type: Application

Filed: December 1, 2023

Publication date: February 20, 2025

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Muhammad Akhtar MUNIR, Muhammad Haris KHAN, Salman KHAN, Fahad Shahbaz KHAN

RADIO PDCCH TO FACILITATE NUMEROLOGY OPERATIONS

Publication number: 20250039887

Abstract: New radio download numerology allocation information may be obtained through master information block data, system information block data, radio resource control signals, or signals or a physical downlink numerology indication channel, and used along with a reference signal detected in a search space to obtain resource element positions in an antenna port reference signal in a resource block that belongs to a particular band slice according to a reference signal allocation scheme for a band slice numerology. A physical download control may then be decoded based upon one or more resource elements of the reference signal, allowing the connection of, e.g., an enhanced mobile broadband, massive machine type communication, or ultra-reliable/low-latency application to a communications network thereby. Alternatively, multiple physical downlink control channels may be blindly demodulated at each of a number of calculated reference signal locations, and one channel selected based on passing a cyclic redundancy check.
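
The blind-demodulation alternative in the last sentence amounts to trying each candidate location and keeping the one whose payload passes a CRC. A minimal sketch follows; it uses Python's `zlib.crc32` as a stand-in checksum, whereas real control channels use their own CRC polynomial and scrambling, and the candidate format and function names are hypothetical.

```python
import zlib

CRC_BYTES = 4  # length of the stand-in CRC-32 appended to each payload

def attach_crc(payload: bytes) -> bytes:
    """Append a CRC-32 of the payload (stand-in for the real channel CRC)."""
    return payload + zlib.crc32(payload).to_bytes(CRC_BYTES, "big")

def crc_passes(candidate: bytes) -> bool:
    """True if the trailing bytes match the CRC of the leading payload."""
    if len(candidate) < CRC_BYTES:
        return False
    payload, received = candidate[:-CRC_BYTES], candidate[-CRC_BYTES:]
    return zlib.crc32(payload).to_bytes(CRC_BYTES, "big") == received

def blind_select(candidates):
    """candidates: iterable of (location, demodulated_bytes).
    Returns (location, payload) for the first candidate passing the CRC, else None."""
    for location, data in candidates:
        if crc_passes(data):
            return location, data[:-CRC_BYTES]
    return None

valid = attach_crc(b"control-info")
corrupted = bytes([valid[0] ^ 0x01]) + valid[1:]  # single bit error
result = blind_select([(0, corrupted), (3, valid)])
```

Here the corrupted candidate at location 0 is rejected and the candidate at location 3 is selected, which mirrors the "one channel selected based on passing a cyclic redundancy check" step.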

Type: Application

Filed: September 26, 2024

Publication date: January 30, 2025

Applicant: InterDigital Patent Holdings, Inc.

Inventors: Allan Y. Tsai, Lakshmi R. Iyer, Guodong Zhang, Qian Zhang, Pascal M. Adjakple, Qing Li, Joseph M. Murray, Tianyi Xu, Wei Chen, Ahmed ElSamadouny, Salman Khan, Yifan Li

Impact features

Patent number: 12168475

Abstract: A vehicle platform with a variety of impact safety features including front and rear impact features as well as side impact features designed to protect the passenger compartment as well as the battery compartment and vehicle chassis components. Some features may include crumple zone components, deflectors and modular energy absorption units.

Type: Grant

Filed: December 30, 2021

Date of Patent: December 17, 2024

Assignee: Canoo Technologies Inc.

Inventors: Alexi Charbonneau, Naesung Lyu, Mahesh Chinchani, Salman Khan, Yufeng Long, Morteza Kiani, Daniel McCarron, William Rohr, Aniruddha Ranade

Radio PDCCH to facilitate numerology operations

Patent number: 12150146

Abstract: New radio download numerology allocation information may be obtained through master information block data, system information block data, radio resource control signals, or signals or a physical downlink numerology indication channel, and used along with a reference signal detected in a search space to obtain resource element positions in an antenna port reference signal in a resource block that belongs to a particular band slice according to a reference signal allocation scheme for a band slice numerology. A physical download control may then be decoded based upon one or more resource elements of the reference signal, allowing the connection of, e.g., an enhanced mobile broadband, massive machine type communication, or ultra-reliable/low-latency application to a communications network thereby. Alternatively, multiple physical downlink control channels may be blindly demodulated at each of a number of calculated reference signal locations, and one channel selected based on passing a cyclic redundancy check.

Type: Grant

Filed: March 31, 2023

Date of Patent: November 19, 2024

Assignee: InterDigital Patent Holdings, Inc.

Inventors: Allan Y. Tsai, Lakshmi R. Iyer, Guodong Zhang, Qian Zhang, Pascal M. Adjakple, Qing Li, Joseph M. Murray, Tianyi Xu, Wei Chen, Ahmed ElSamadouny, Salman Khan, Yifan Li

SYSTEM AND METHOD FOR 3D MEDICAL IMAGE SEGMENTATION

Publication number: 20240362788

Abstract: A system for 3D medical image segmentation includes a medical imaging device for obtaining a plurality of 2D images forming a volumetric image, processing circuitry, and a display. The processing circuitry is configured with a first stage to divide the volumetric image into 3D image patches, a hierarchical encoder-decoder structure in which resolution of features of the 3D image patches is decreased by a factor of two in each of a plurality of stages of the encoder, an encoder output connected to the decoder via skip connections, and a convolutional block to produce a voxel-wise final segmentation mask. The encoder includes a plurality of efficient paired attention blocks each with a spatial attention branch and a channel attention branch that learn respective spatial and channel attention feature maps. The display displays the final segmentation mask.
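
The halving of feature resolution at each encoder stage can be made concrete with a short Python sketch. The patch size, the number of stages and the function name are hypothetical, and the sketch assumes the halving applies to all three spatial axes, which the abstract does not state explicitly.

```python
def encoder_stage_shapes(patch_shape, num_stages):
    """Feature-map size (depth, height, width) after each encoder stage,
    assuming every stage halves the resolution along each axis."""
    depth, height, width = patch_shape
    shapes = []
    for _ in range(num_stages):
        depth, height, width = depth // 2, height // 2, width // 2
        shapes.append((depth, height, width))
    return shapes

stage_shapes = encoder_stage_shapes((32, 64, 64), num_stages=3)
```

A decoder with skip connections would consume these maps in reverse order, upsampling back toward the input resolution before the final convolutional block produces the voxel-wise mask.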

Type: Application

Filed: April 26, 2023

Publication date: October 31, 2024

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Abdelrahman SHAKER, Muhammad MAAZ, Hanoona RASHEED, Salman KHAN, Fahad Shahbaz KHAN

METHOD AND APPARATUS FOR DETERMINING MORPHOLOGY OF A HUMAN BREAST AND/OR PROVIDING A MANUFACTURED GARMENT

Publication number: 20240362877

Abstract: A computer-implemented method and apparatus for determining morphology of a human breast, the method comprising: obtaining at least one image of a subject; extracting features of at least a portion of the subject's body from the at least one image, wherein the features correspond to a model of standard human anatomy; generating a three-dimensional model of the subject's body based on the extracted features and the model of standard human anatomy; and determining a morphological parameter of the subject's breast from the three-dimensional model of the subject's body. In another aspect, a computer implemented method is presented for adjusting a virtual garment to fit the three-dimensional model and determining a parameter based on the adjusted virtual garment. In a further aspect, there is provided a method for providing a manufactured wearable garment.

Type: Application

Filed: July 7, 2022

Publication date: October 31, 2024

Inventors: Prashant Aparajeya, Vandita Shukla, Salman Khan, Frederic Leymarie, Tigran Hakobyan, Tang Thuy Trang Ngo

MULTI-MODAL PROMPT LEARNING FOR REPRESENTATION TRANSFER ON IMAGE RECOGNITION TASKS

Publication number: 20240220722

Abstract: A method and system for multi-modal prompt learning of vision-language models. Encodings of image-text pairs can be combined with image prompts and text prompts before being input into an image encoder and text encoder of a vision-language model respectively. The image prompt can be generated using the text prompt using a vision-language coupling function to encourage synergy between the two prompts. The combination of encodings and prompts can be fed through the transformer layers of the encoders, and the output of each layer can be combined with a new prompt before entering the next layer, up until a specific depth. The subsequent transformer layers can process the output and generate a final representation for the image and text which can then be used for downstream tasks.

Type: Application

Filed: December 28, 2022

Publication date: July 4, 2024

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Muhammad Uzair KHATTAK, Hanoona Abdul Rasheed BANGALATH, Muhammad MAAZ, Salman KHAN, Fahad Shahbaz KHAN

SYSTEM AND METHOD OF BRIDGING THE GAP BETWEEN OBJECT AND IMAGE-LEVEL REPRESENTATIONS FOR OPEN-VOCABULARY DETECTION

Publication number: 20240203085

Abstract: An object detection system and method in which a machine learning engine is configured with a region-based knowledge distillation stage that generates region embeddings from a training image having bounding boxes. A linear layer learns a region-level vision-language mapping for projecting feature embeddings from the training image to a common feature space shared by text embeddings to obtain the region embeddings. An image-level supervision stage generates pseudo-box labels for a classification training image and region embeddings from the training image having bounding boxes and corresponding class labels and the classification training image having an image-level label as input. Pseudo-box labels are determined on the classification training image as an image-level vision-language mapping. A weight transfer function conditions the image-level vision-language mapping on the learned region-level vision-language mapping.

Type: Application

Filed: December 20, 2022

Publication date: June 20, 2024

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Hanoona Abdul Rasheed BANGALATH, Muhammad MAAZ, Muhammad Uzair KHATTAK, Salman KHAN, Fahad Shahbaz KHAN

SYSTEM AND METHOD FOR SELF-DISTILLED VISION TRANSFORMER FOR DOMAIN GENERALIZATION

Publication number: 20240203098

Abstract: An apparatus and method for a machine learning engine for domain generalization which trains a vision transformer neural network using a training dataset including at least two domains for diagnosis of a medical condition. Image patches and class tokens are processed through a sequence of feature extraction transformer blocks to obtain a predicted class token. In parallel, intermediate class tokens are extracted as outputs of each of the feature extraction transformer blocks, where each transformer block is a sub-model. One sub-model is randomly sampled from the sub-models to obtain a sampled intermediate class token. The intermediate class token is used to make a sub-model prediction. The vision transformer neural network is optimized based on a difference between the predicted class token and the sub-model prediction. Inferencing is performed for a target medical image in a target domain that is different from the at least two domains.

Type: Application

Filed: December 19, 2022

Publication date: June 20, 2024

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Maryam SULTANA, Muhammad Muzammal NASEER, Muhammad Haris KHAN, Salman KHAN, Fahad Shahbaz KHAN

SYSTEM AND METHOD FOR EFFICIENTLY AMALGAMATED CNN-TRANSFORMER ARCHITECTURE FOR MOBILE VISION APPLICATIONS

Publication number: 20240193404

Abstract: An edge computing system, computer readable storage medium and method for object detection, including processing circuitry. The processing circuitry is configured with a hybrid CNN and vision transformer backbone network in an object detection deep learning network. The backbone network receives an image, and includes a first convolutional encoder to extract local features from feature maps of the image, a second stage having consecutive second convolutional encoders, a positional encoding layer, split depth-wise transpose attention (SDTA) encoders, consecutive convolutional encoders, a third stage and a fourth stage SDTA encoder. Each of the SDTA encoders performs multi-headed self-attention by applying a dot product operation across channel dimensions in order to compute cross-covariance across channels to generate attention feature maps.
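
The channel-wise attention in the final sentence differs from ordinary self-attention in that the score matrix has one row and column per channel rather than per token. A single-head sketch in plain Python follows; the temperature parameter, the absence of normalisation and projection layers, and the function names are simplifying assumptions, not details from the patent.

```python
import math

def softmax(row):
    """Numerically stable softmax over one row of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def channel_attention(q, k, v, temperature=1.0):
    """q, k, v: lists of C channel rows, each holding N token values.
    Returns the C x C attention matrix and the C x N attended output."""
    # Dot products are taken over the token axis, so scores[i][j] measures
    # how channel i of q co-varies with channel j of k (a C x C matrix).
    scores = [
        [sum(a * b for a, b in zip(qi, kj)) / temperature for kj in k]
        for qi in q
    ]
    attn = [softmax(row) for row in scores]
    num_tokens = len(v[0])
    # Each output channel is a weighted mix of the value channels.
    out = [
        [sum(attn[i][j] * v[j][n] for j in range(len(v))) for n in range(num_tokens)]
        for i in range(len(q))
    ]
    return attn, out

features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # C = 3 channels, N = 2 tokens
attn, out = channel_attention(features, features, features)
```

Because the score matrix is C x C, its cost grows with the number of channels rather than with the square of the number of tokens, which is the usual motivation for attention of this kind on resource-constrained devices.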

Type: Application

Filed: December 9, 2022

Publication date: June 13, 2024

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Muhammad MAAZ, Abdelrahman SHAKER, Hisham CHOLAKKAL, Salman KHAN, Syed Waqas ZAMIR, Rao Muhammad ANWER, Fahad Shahbaz KHAN

SYSTEM AND METHOD FOR SELF-SUPERVISED VIDEO TRANSFORMER

Publication number: 20240169692

Abstract: A system, computer readable medium and method trains a video transformer, using a machine learning engine, for human action recognition in a video. The method includes sampling video clips with varying temporal resolutions in global views and sampling the video clips from different spatiotemporal windows in local views. The machine learning engine is configured to match the global and local views in a framework of student-teacher networks to learn cross-view correspondence between local and global views, and to learn motion correspondence between varying temporal resolutions. The video transformer can output for display video clips in a manner that emphasizes attention to the recognized human action.

Type: Application

Filed: November 21, 2022

Publication date: May 23, 2024

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Kanchana RANASINGHE, Muhammad Muzammal NASEER, Salman KHAN, Fahad KHAN

SYSTEM AND METHOD FOR VIDEO INSTANCE SEGMENTATION VIA MULTI-SCALE SPATIO-TEMPORAL SPLIT ATTENTION TRANSFORMER

Publication number: 20240161334

Abstract: A system, method, computer readable storage medium for a computer vision system includes at least one video camera, and video processor circuitry. The method includes inputting a stream of video data and generating a sequence of image frames, segmenting and tracking, by the video analysis apparatus, object instances in the stream of video data, including receiving the sequence of image frames, analyzing the sequence of image frames using a video instance segmentation transformer to obtain a video instance mask sequence from the sequence of image frames, the transformer having a backbone network, a transformer encoder-decoder, and an instance matching and segmentation block. The encoder contains a multi-scale spatio-temporal split attention module to capture spatio-temporal feature relationships at multiple scales across multiple frames. The decoder contains a temporal attention block for enhancing a temporal consistency of transformer queries. The method includes displaying the video instance mask sequence.

Type: Application

Filed: November 9, 2022

Publication date: May 16, 2024

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Omkar THAWAKAR, Sanath NARAYAN, Hisham CHOLAKKAL, Rao Muhammad ANWER, Muhammad HARIS, Salman KHAN, Fahad KHAN

OPTICAL ENCRYPTION CAMERA

Publication number: 20240154784

Abstract: An optical encryption camera includes a sensor array and a filter positioned over the sensor array to receive light prior to the sensor array. The filter includes a multiplexing mask and a scaling mask in sequence. The multiplexing mask and the scaling mask combine to provide an encryption key to encrypt image data prior to capture.

Type: Application

Filed: October 31, 2023

Publication date: May 9, 2024

Inventors: Francesco Pittaluga, Xiang Yu, Salman Khan

SYSTEM AND METHOD FOR BURST IMAGE RESTORATION AND ENHANCEMENT

Publication number: 20240135496

Abstract: A mobile device and mobile application, in which the mobile device includes a camera having an image capture circuit operating in a mode to capture a RAW image burst, and processing circuitry, including a neural network engine, to generate a single enhanced image from the RAW image burst. The neural network engine executing program instructions including an edge boosting feature alignment stage to remove inter-frame spatial and color misalignment from the RAW image burst to obtain aligned burst frames, a pseudo-burst feature fusion stage to create a set of pseudo-burst features that combine complementary information from the aligned burst frames, and an adaptive group upsampling stage to progressively increase spatial resolution while merging the set of pseudo-burst features and output the single enhanced image. The mobile application and mobile device perform super-resolution, low-light image enhancement, and burst denoising using a RAW image burst.

Type: Application

Filed: October 19, 2022

Publication date: April 25, 2024

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Akshay DUDHANE, Syed Waqas ZAMIR, Salman KHAN, Fahad Shahbaz KHAN

SYSTEM AND METHOD FOR HANDWRITING GENERATION

Publication number: 20230316603

Abstract: A system and computer readable storage medium for automated handwriting generation, including a text input device for inputting a text query having at least one textual word string, an image input device for inputting a handwriting sample with characters in a writing style of a user, and a computer implemented deep learning transformer model including an encoder network and a decoder network in which each are a hybrid of convolution and multi-head self-attention networks. The encoder produces a sequence of style feature embeddings from the input handwriting sample. The decoder takes the sequence of style feature embeddings in order to convert the at least one textual word string into a generated handwritten image having substantially same writing style as the handwriting sample. An output device to output the generated handwriting image.

Type: Application

Filed: July 19, 2022

Publication date: October 5, 2023

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Ankan Kumar BHUNIA, Salman KHAN, Hisham CHOLAKKAL, Rao Muhammad ANWER, Fahad KHAN

System for Aggregating, Analyzing, and Reporting Medical Information

Publication number: 20230317278

Abstract: In one embodiment, a system for aggregating, analyzing, and reporting medical information includes a front end module for managing a user interface, a back end module for exchanging patient information with a clinic record system and obtaining one or more medical images therefrom, a machine learning/artificial intelligence (ML/AI) engine for analyzing said one or more medical images and generating analysis results, and a report generator for generating a report that includes the analysis results. The ML/AI engine can include an anatomical plane classifier such as a 20+2 classifier, and the anatomical structure classifier can apply semantic segmentation.

Type: Application

Filed: April 5, 2022

Publication date: October 5, 2023

Applicant: BioticsAI

Inventors: Hisham Elgammal, Robhy Bustami, Chaskin Saroff, Salman Khan
