Patents Assigned to Inception Institute of Artificial Intelligence, Ltd
-
Patent number: 12026930
Abstract: A saliency detection method explicitly models the complementary information between appearance (color) and depth information in images. A mutual-information minimization is used as a regularizer to reduce the redundancy between appearance features from RGB and geometric features from depth in the latent space. The latent features of the appearance and geometric modalities are then fused to achieve multi-modal feature fusion for saliency detection.
Type: Grant
Filed: September 14, 2021
Date of Patent: July 2, 2024
Assignee: Inception Institute of Artificial Intelligence Ltd
Inventors: Deng-Ping Fan, Jing Zhang, Yuchao Dai, Xin Yu, Yiran Zhong, Nick Barnes, Ling Shao
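As a rough illustration of the idea in this abstract (not the patented design), the sketch below uses separate RGB and depth encoders, a redundancy penalty on their latent features (a simple cross-correlation proxy standing in for the mutual-information minimization regularizer), and concatenation-based fusion for the saliency head. The module sizes, shapes, and loss weight are assumptions.

```python
# Minimal PyTorch sketch: two encoders, a redundancy penalty between their
# latents, and concatenation-based multi-modal fusion for saliency scoring.
import torch
import torch.nn as nn

class RGBDSaliencyNet(nn.Module):
    def __init__(self, latent_dim=128):
        super().__init__()
        self.rgb_encoder = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(32, latent_dim))
        self.depth_encoder = nn.Sequential(
            nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(32, latent_dim))
        self.fusion_head = nn.Linear(2 * latent_dim, 1)  # fused multi-modal prediction

    def forward(self, rgb, depth):
        z_rgb = self.rgb_encoder(rgb)        # appearance latent
        z_depth = self.depth_encoder(depth)  # geometric latent
        fused = torch.cat([z_rgb, z_depth], dim=1)
        return self.fusion_head(fused), z_rgb, z_depth

def redundancy_penalty(z_a, z_b):
    # Cross-correlation proxy: discourages the two latent spaces from carrying
    # redundant information (stands in for the MI-minimization regularizer).
    z_a = (z_a - z_a.mean(0)) / (z_a.std(0) + 1e-6)
    z_b = (z_b - z_b.mean(0)) / (z_b.std(0) + 1e-6)
    corr = (z_a.T @ z_b) / z_a.shape[0]
    return corr.pow(2).mean()

model = RGBDSaliencyNet()
score, z_rgb, z_depth = model(torch.randn(4, 3, 64, 64), torch.randn(4, 1, 64, 64))
reg = 0.1 * redundancy_penalty(z_rgb, z_depth)  # added to the saliency task loss during training
```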
-
Patent number: 11960576
Abstract: Videos captured in low-light conditions can be processed to identify an activity being performed in the video. The processing may use both the video and audio streams to identify the activity. The video portion is processed to generate a darkness-aware feature, which may be used to modulate the features generated from the audio and video streams. The audio features may be used to generate a video attention feature, and the video features may be used to generate an audio attention feature. These attention features may also be used in modulating the audio and video features. The modulated audio and video features may then be used to predict an activity occurring in the video.
Type: Grant
Filed: July 20, 2021
Date of Patent: April 16, 2024
Assignee: Inception Institute of Artificial Intelligence Ltd
Inventors: Yunhua Zhang, Xiantong Zhen, Ling Shao, Cees G. M. Snoek
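A hedged sketch of the cross-modal scheme the abstract describes: a darkness-aware feature computed from the video modulates both streams, and each modality produces an attention vector applied to the other. The tiny linear encoders, sigmoid gating, and feature dimensions are illustrative assumptions, not the patented architecture.

```python
# Sketch: darkness-aware gating plus cross-modal attention for low-light
# activity recognition from pre-extracted audio and video features.
import torch
import torch.nn as nn

class LowLightAVNet(nn.Module):
    def __init__(self, dim=64, num_classes=10):
        super().__init__()
        self.video_enc = nn.Linear(512, dim)   # assumed video backbone feature size
        self.audio_enc = nn.Linear(128, dim)   # assumed audio backbone feature size
        self.darkness = nn.Sequential(nn.Linear(512, dim), nn.Sigmoid())
        self.v_to_a_attn = nn.Sequential(nn.Linear(dim, dim), nn.Sigmoid())
        self.a_to_v_attn = nn.Sequential(nn.Linear(dim, dim), nn.Sigmoid())
        self.classifier = nn.Linear(2 * dim, num_classes)

    def forward(self, video_feat, audio_feat):
        v = self.video_enc(video_feat)
        a = self.audio_enc(audio_feat)
        dark = self.darkness(video_feat)   # darkness-aware gate derived from video
        v, a = v * dark, a * dark          # modulate both modalities
        v = v * self.a_to_v_attn(a)        # audio-derived attention on video
        a = a * self.v_to_a_attn(v)        # video-derived attention on audio
        return self.classifier(torch.cat([v, a], dim=1))

logits = LowLightAVNet()(torch.randn(2, 512), torch.randn(2, 128))
```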
-
Patent number: 11694442
Abstract: Repetitive activities can be captured in audio-video (AV) content. The AV content can be processed to predict the number of repetitive activities it contains. The accuracy of the predicted number may be improved, especially for AV content with challenging conditions, by basing the predictions on both the audio and video portions of the AV content.
Type: Grant
Filed: June 18, 2021
Date of Patent: July 4, 2023
Assignee: Inception Institute of Artificial Intelligence Ltd
Inventors: Yunhua Zhang, Cees G. M. Snoek, Ling Shao
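A minimal sketch (not the patented architecture) of predicting a count from both streams: per-modality count heads whose outputs are combined by a learned reliability weight, so the audio can compensate when the video is degraded. The feature sizes and the gating rule are assumptions.

```python
# Sketch: audio-visual repetition counting with a learned reliability gate.
import torch
import torch.nn as nn

class AVRepetitionCounter(nn.Module):
    def __init__(self, v_dim=512, a_dim=128, hidden=64):
        super().__init__()
        self.video_head = nn.Sequential(nn.Linear(v_dim, hidden), nn.ReLU(), nn.Linear(hidden, 1))
        self.audio_head = nn.Sequential(nn.Linear(a_dim, hidden), nn.ReLU(), nn.Linear(hidden, 1))
        self.reliability = nn.Sequential(nn.Linear(v_dim + a_dim, 1), nn.Sigmoid())

    def forward(self, video_feat, audio_feat):
        count_v = self.video_head(video_feat)   # count estimate from sight
        count_a = self.audio_head(audio_feat)   # count estimate from sound
        w = self.reliability(torch.cat([video_feat, audio_feat], dim=1))
        return w * count_v + (1 - w) * count_a  # fused count prediction

counts = AVRepetitionCounter()(torch.randn(2, 512), torch.randn(2, 128))
```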
-
Patent number: 11687835
Abstract: A transformer-based vision-linguistic (VL) model and training technique use a number of different image patches covering the same portion of an image, along with a text description of the image, to train the model. The model and pre-training techniques may be used in domain-specific training of the model. The model can be used for fine-grained image-text tasks in the fashion domain.
Type: Grant
Filed: February 26, 2021
Date of Patent: June 27, 2023
Assignee: Inception Institute of Artificial Intelligence Ltd
Inventors: Deng-Ping Fan, Mingchen Zhuge, Ling Shao
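A hedged sketch of the input layout implied by the abstract: the same image is cut into patch grids at several scales, each patch is embedded, and the patch tokens are concatenated with text tokens before a standard transformer encoder. The patch sizes, dimensions, and toy vocabulary are assumptions for illustration only.

```python
# Sketch: multi-scale patch tokens plus text tokens fed to one transformer encoder.
import torch
import torch.nn as nn

class MultiScalePatchVL(nn.Module):
    def __init__(self, dim=128, patch_sizes=(32, 16, 8), vocab=1000):
        super().__init__()
        # One patch-embedding conv per scale; each covers the full image.
        self.patch_embeds = nn.ModuleList(
            [nn.Conv2d(3, dim, kernel_size=p, stride=p) for p in patch_sizes])
        self.text_embed = nn.Embedding(vocab, dim)
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)

    def forward(self, image, text_ids):
        patch_tokens = [e(image).flatten(2).transpose(1, 2) for e in self.patch_embeds]
        tokens = torch.cat(patch_tokens + [self.text_embed(text_ids)], dim=1)
        return self.encoder(tokens)   # joint image-text representation

out = MultiScalePatchVL()(torch.randn(2, 3, 64, 64), torch.randint(0, 1000, (2, 12)))
```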
-
Patent number: 11410449
Abstract: This disclosure relates to improved techniques for performing human parsing functions using neural network architectures. The neural network architecture can model human objects in images using a hierarchical graph of interconnected nodes that correspond to anatomical features at various levels. Multi-level inference information can be generated for each of the nodes using separate inference processes. The multi-level inference information for each node can be combined or fused to generate final predictions for each of the nodes. Parsing results may be generated based on the final predictions.
Type: Grant
Filed: October 17, 2019
Date of Patent: August 9, 2022
Assignee: Inception Institute of Artificial Intelligence, Ltd.
Inventors: Wenguan Wang, Jianbing Shen, Zhijie Zhang, Ling Shao
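A hedged sketch of the hierarchical idea: body parts organized as a small tree (body, upper/lower body, then finer parts), with each node scored once by a direct branch and once via its parent, and the two fused per node. The specific hierarchy, feature sizes, and fusion rule are illustrative assumptions.

```python
# Sketch: per-node predictions over a part hierarchy, fused top-down.
import torch
import torch.nn as nn

HIERARCHY = {"body": ["upper", "lower"],
             "upper": ["head", "torso", "arms"],
             "lower": ["legs", "feet"]}

class HierarchicalParser(nn.Module):
    def __init__(self, feat_dim=256):
        super().__init__()
        nodes = {"body"} | {c for cs in HIERARCHY.values() for c in cs}
        self.direct = nn.ModuleDict({n: nn.Linear(feat_dim, 1) for n in nodes})
        self.from_parent = nn.ModuleDict({n: nn.Linear(1, 1) for n in nodes})

    def forward(self, feat):
        scores = {"body": self.direct["body"](feat)}
        # Top-down pass: fuse each node's direct score with information
        # propagated from its parent node.
        for parent, children in HIERARCHY.items():
            for child in children:
                direct = self.direct[child](feat)
                inherited = self.from_parent[child](scores[parent])
                scores[child] = 0.5 * (direct + inherited)  # simple fusion
        return scores

scores = HierarchicalParser()(torch.randn(2, 256))
```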
-
Patent number: 11244188
Abstract: This disclosure relates to improved techniques for performing computer vision functions, including common object detection and instance segmentation. The techniques described herein utilize neural network architectures to perform these functions in various types of images, such as natural images, UAV images, satellite images, and other images. The neural network architecture can include a dense location regression network that performs object localization and segmentation functions, at least in part, by generating offset information for multiple sub-regions of candidate object proposals and utilizing this dense offset information to derive final predictions for the locations of target objects. The neural network architecture also can include a discriminative region-of-interest (RoI) pooling network that performs classification of the localized objects, at least in part, by sampling various sub-regions of candidate proposals and performing adaptive weighting to obtain discriminative features.
Type: Grant
Filed: April 10, 2020
Date of Patent: February 8, 2022
Assignee: Inception Institute of Artificial Intelligence, Ltd.
Inventors: Hisham Cholakkal, Jiale Cao, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao
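A hedged sketch of the two ideas named in the abstract, operating on toy tensors: dense offsets predicted for a grid of sub-regions inside each proposal and aggregated into a box refinement, plus adaptive weighting of the sub-region features before classification. The grid size, dimensions, and aggregation rule are assumptions.

```python
# Sketch: dense sub-region offset regression plus weighted sub-region pooling.
import torch
import torch.nn as nn

class DenseProposalHead(nn.Module):
    def __init__(self, feat_dim=256, num_classes=21):
        super().__init__()
        self.offset = nn.Linear(feat_dim, 4)      # per-sub-region box offsets
        self.weight = nn.Linear(feat_dim, 1)      # per-sub-region discriminativeness
        self.classifier = nn.Linear(feat_dim, num_classes)

    def forward(self, subregion_feats):           # (proposals, sub-regions, feat_dim)
        offsets = self.offset(subregion_feats)    # dense offsets for every sub-region
        box_refine = offsets.mean(dim=1)          # aggregate into one box prediction
        w = torch.softmax(self.weight(subregion_feats), dim=1)
        pooled = (w * subregion_feats).sum(dim=1) # adaptively weighted RoI feature
        return box_refine, self.classifier(pooled)

# 8 proposals, each described by a 3x3 grid of sub-region features.
box, cls = DenseProposalHead()(torch.randn(8, 9, 256))
```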
-
Patent number: 10453197
Abstract: This disclosure relates to improved techniques for performing computer vision functions, including common object counting and instance segmentation. The techniques described herein utilize a neural network architecture to perform these functions. The neural network architecture can be trained using image-level supervision techniques that utilize a loss function to jointly train an image classification branch and a density branch of the neural network architecture. The neural network architecture constructs per-category density maps that can be used to generate analysis information comprising global object counts and locations of objects in images.
Type: Grant
Filed: February 18, 2019
Date of Patent: October 22, 2019
Assignee: Inception Institute of Artificial Intelligence, Ltd.
Inventors: Hisham Cholakkal, Guolei Sun, Fahad Shahbaz Khan, Ling Shao
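A hedged sketch of the two-branch layout: an image-classification branch and a per-category density branch share a backbone, and summing a density map over the spatial grid yields the global count for that category. The backbone, sizes, and loss mix are illustrative assumptions.

```python
# Sketch: joint classification and density branches for image-level-supervised counting.
import torch
import torch.nn as nn

class CountingNet(nn.Module):
    def __init__(self, num_classes=20):
        super().__init__()
        self.backbone = nn.Sequential(nn.Conv2d(3, 32, 3, padding=1), nn.ReLU())
        self.cls_branch = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                                        nn.Linear(32, num_classes))
        self.density_branch = nn.Conv2d(32, num_classes, 1)  # per-category density maps

    def forward(self, image):
        f = self.backbone(image)
        logits = self.cls_branch(f)                 # image-level classification
        density = torch.relu(self.density_branch(f))
        counts = density.sum(dim=(2, 3))            # global count per category
        return logits, density, counts

logits, density, counts = CountingNet()(torch.randn(2, 3, 64, 64))
# During training, a classification loss on `logits` and a counting loss on
# `counts` would be combined to jointly supervise both branches.
```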
-
Patent number: 10297070
Abstract: This disclosure relates to improved techniques for synthesizing three-dimensional (3D) scenes. The techniques can utilize a neural network architecture to analyze images for detecting objects, classifying scenes and objects, and determining degree-of-freedom information for objects in the images. These tasks can be performed, at least in part, using inter-object and object-scene dependency information that captures the spatial correlations and dependencies among objects in the images, as well as the correlations and relationships of objects to the scenes associated with the images. 3D scenes corresponding to the images can then be synthesized using the inferences provided by the neural network architecture.
Type: Grant
Filed: October 16, 2018
Date of Patent: May 21, 2019
Assignee: Inception Institute of Artificial Intelligence, Ltd
Inventors: Fan Zhu, Li Liu, Jin Xie, Fumin Shen, Ling Shao, Yi Fang
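A hedged sketch of one way dependency information could be used at inference time: detection confidences for candidate objects are re-weighted by object-object and object-scene compatibility tables before a 3D layout is assembled. The class lists, table values, and re-weighting rule are hypothetical and only illustrate the kind of dependency reasoning the abstract describes.

```python
# Sketch: re-scoring object detections with inter-object and object-scene dependencies.
import torch

OBJECTS = ["chair", "table", "monitor", "bed"]
SCENES = ["office", "bedroom"]

# Hypothetical compatibility tables (rows: objects; columns: objects / scenes).
obj_obj = torch.tensor([[1.0, 0.9, 0.6, 0.1],
                        [0.9, 1.0, 0.7, 0.2],
                        [0.6, 0.7, 1.0, 0.1],
                        [0.1, 0.2, 0.1, 1.0]])
obj_scene = torch.tensor([[0.9, 0.3],
                          [0.8, 0.4],
                          [0.9, 0.1],
                          [0.1, 0.9]])

def rescore(det_scores, scene_probs):
    # det_scores: raw detector confidences, one per object class
    # scene_probs: scene-classification probabilities, one per scene class
    context = obj_obj @ det_scores       # support from co-occurring objects
    scene_fit = obj_scene @ scene_probs  # agreement with the predicted scene
    return det_scores * context * scene_fit

scores = rescore(torch.tensor([0.7, 0.6, 0.8, 0.05]), torch.tensor([0.9, 0.1]))
```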