Patents by Inventor Zhiqiang SHEN

Zhiqiang SHEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method for solving missing annotation object detection

Patent number: 12266156

Abstract: Disclosed herein is a system and method for improving the accuracy of an object detector when trained with a dataset having a significant number of missing annotations. The method uses a novel Background Recalibration Loss (BRL) which adjusts the gradient direction according to its own activation to reduce the adverse effect of error signals by replacing the negative branch of the focal loss with a mirror of the positive branch when the activation is below a confusion threshold.

Type: Grant

Filed: February 14, 2022

Date of Patent: April 1, 2025

Assignee: Carnegie Mellon University

Inventors: Marios Savvides, Zhiqiang Shen, Fangyi Chen, Han Zhang
SYSTEM AND METHOD FOR CONTRASTIVE AFFINITY LEARNING VIA AUXILIARY PROMPTS FOR GENERALIZED NOVEL CATEGORY DISCOVERY

Publication number: 20250078546

Abstract: A system and method of fine-grained image category discovery with few human annotations includes a camera and a trained machine learning model, which predicts a label for an object in a captured image and outputs the predicted label. The machine learning model is trained by contrastive affinity learning, including retrieving images having an object, a warm-up stage in which semi-supervised contrastive learning is performed based on projected features of a class token and an ensembled prompt, respectively. In a contrastive affinity learning stage, a student model and an exponentially moving averaged teacher model are forwarded with different augmented views of the retrieved images. Teacher embeddings are enqueued into a token-specific memory. A semi-supervised contrastive loss is computed on a current batch and a contrastive affinity learning loss for student embeddings and the teacher embeddings with pseudo-labels from a affinity graph dynamically generated by semi-supervised affinity generation.

Type: Application

Filed: September 5, 2023

Publication date: March 6, 2025

Applicant: Mohamed bin Zayed University of Artificial Intelligence

Inventors: Sheng ZHANG, Salman KHAN, Zhiqiang SHEN, Muzammal NASEER, Guangyi CHEN, Fahad KHAN
System and method for the discovering efficient random neural networks

Patent number: 12182707

Abstract: Disclosed herein is a system and method for novel neural architecture search using a random graph network backbone to facilitate the creation of an efficient network structure. The method utilizes reinforcement learning algorithms to build a complex relationship between intra-connections (i.e., links between blocks in a random graph network) and extra-connections (i.e., links among blocks across the random graphs network) for discovering an efficient random neural architecture.

Type: Grant

Filed: February 18, 2022

Date of Patent: December 31, 2024

Assignee: Carnegie Mellon University

Inventors: Marios Savvides, Thanh Hai Phan, Zhiqiang Shen, Akshay Chawla
Feature pyramids for object detection

Patent number: 11954175

Abstract: Disclosed herein is an improvement to prior art feature pyramids for general object detection that inserts a simple norm calibration (NC) operation between the feature pyramids and detection head to alleviate and balance the norm bias caused by feature pyramid network (FPN) and which leverages an enhanced multi-feature selective strategy (MS) during training to assign the ground-truth to one or more levels of the feature pyramid.

Type: Grant

Filed: July 28, 2021

Date of Patent: April 9, 2024

Assignee: Carnegie Mellon University

Inventors: Fangyi Chen, Chenchen Zhu, Zhiqiang Shen, Han Zhang, Marios Savvides
SYSTEM AND METHOD FOR THE DISCOVERING EFFICIENT RANDOM NEURAL NETWORKS

Publication number: 20240095523

Abstract: Disclosed herein is a system and method for novel neural architecture search using a random graph network backbone to facilitate the creation of an efficient network structure. The method utilizes reinforcement learning algorithms to build a complex relationship between intra-connections (i.e., links between blocks in a random graph network) and extra-connections (i.e., links among blocks across the random graphs network) for discovering an efficient random neural architecture.

Type: Application

Filed: February 18, 2022

Publication date: March 21, 2024

Inventors: Marios Savvides, Thanh Hai Phan, Zhiqiang Shen, Askhay Chawla
SOFT ANCHOR POINT OBJECT DETECTION

Publication number: 20240071029

Abstract: Disclosed herein is a method of soft anchor-point detection (SAPD), which implements a concise, single-stage anchor-point detector with both faster speed and higher accuracy. Also disclosed is a novel training strategy with two softened optimization techniques: soft-weighted anchor points and soft-selected pyramid levels.

Type: Application

Filed: January 24, 2022

Publication date: February 29, 2024

Inventors: Chenchen ZHU, Marios Savvides, Zhiqiang Shen, Fangyi Chen
SYSTEM AND METHOD FOR DOMAIN ADAPTIVE OBJECT DETECTION VIA GRADIENT DETACH BASED STACKED COMPLEMENTARY LOSSES

Publication number: 20240054775

Abstract: Disclosed herein an effective detach strategy which suppresses the flow of gradients from context sub-networks through the detection backbone path to obtain a more discriminative context by forcing the representation of context sub-network to be dissimilar from the detection network. A sub-network is defined to generate the context information from early layers of the detection backbone. Because instance and context focus on perceptually different parts of an image, the representations from either of them should also be discrepant. In addition, a stacked complementary loss is generated to and backpropagated to the detection network.

Type: Application

Filed: January 31, 2022

Publication date: February 15, 2024

Inventors: Zhiqiang Shen, Harsh Maheshwari, Marios Savvides
Image generation method, electronic device, and storage medium

Patent number: 11900648

Abstract: An image generation method includes: extracting a content feature of a first image; respectively extracting an overall image style feature of a second image, and an object style feature of a partial image block, which includes an object, in the second image; determining a target style feature at least according to the overall image style feature and the object style feature; and generating a third image according to the content feature and the target style feature.

Type: Grant

Filed: October 26, 2021

Date of Patent: February 13, 2024

Assignees: SENSETIME GROUP LIMITED, HONDA MOTOR CO. LTD.

Inventors: Zhiqiang Shen, Mingyang Huang, Jianping Shi, Hideki Matsunaga
AIR BLOWING STRUCTURE AND FAN

Publication number: 20240035482

Abstract: The present disclosure relates to a blowing structure and a fan. The blowing structure includes a main body, a base, a control panel, a second drive member, and a first drive member. The main body is provided with a blowing channel, and the main body is rotatably connected to the base. The control panel is disposed on the main body. The second drive member is disposed in the blowing channel. The first drive member is connected to the main body, and the first drive member is configured to adjust an angle of rotation of the main body with respect to the base. The control panel is electrically connected to the second drive member and the first drive member.

Type: Application

Filed: November 5, 2021

Publication date: February 1, 2024

Applicant: GREE ELECTRIC APPLIANCES, INC. OF ZHUHAI

Inventors: Zhiqiang SHEN, Yong WANG, Shuang FENG, Jiyan WANG, Ruipeng FAN
BINARY NEURAL NETWORKS WITH GENERALIZED ACTIVATION FUNCTIONS

Publication number: 20240013032

Abstract: Disclosed herein is a design for a 1-bit CNN that closes the performance gap between binary neural networks and real-valued networks on challenging large-scale datasets. The design starts with a high-performance baseline network. Blocks with identity shortcuts which bypass 1-bit generic convolutions are adopted to replace the convolutions in the baseline network. Reshaping and shifting of activation functions is introduced. Finally, a distributional loss to further is adopted enforce the binary network to learn similar output distributions as those of a real-valued network.

Type: Application

Filed: January 25, 2022

Publication date: January 11, 2024

Inventors: Zhiqiang Shen, Zechun Liu, Marios Savvides
IMPROVED FINE-TUNING STRATEGY FOR FEW SHOT LEARNING

Publication number: 20230368038

Abstract: Disclosed herein is a method providing a flexible way to transfer knowledge from base to novel classes in a few shot learning scenario. The invention introduces a partial transfer paradigm for the few-shot classification task in which a model is first trained on the base classes. Then, instead of transferring the learned representation by freezing the whole backbone network, an efficient evolutionary search method is used to automatically determine which layer or layers need to be frozen and which will be fine-tuned on the support set of the novel class.

Type: Application

Filed: January 24, 2022

Publication date: November 16, 2023

Inventors: Zhiqiang Shen, Zechun Liu, Marios Savvides
Techniques for dense video descriptions

Patent number: 11790644

Abstract: Techniques and apparatus for generating dense natural language descriptions for video content are described. In one embodiment, for example, an apparatus may include at least one memory and logic, at least a portion of the logic comprised in hardware coupled to the at least one memory, the logic to receive a source video comprising a plurality of frames, determine a plurality of regions for each of the plurality of frames, generate at least one region-sequence connecting the determined plurality of regions, apply a language model to the at least one region-sequence to generate description information comprising a description of at least a portion of content of the source video. Other embodiments are described and claimed.

Type: Grant

Filed: January 6, 2022

Date of Patent: October 17, 2023

Assignee: INTEL CORPORATION

Inventors: Yurong Chen, Jianguo Li, Zhou Su, Zhiqiang Shen
SYSTEM AND METHOD FOR SOLVING MISSING ANNOTATION OBJECT DETECTION

Publication number: 20220262101

Abstract: Disclosed herein is a system and method for improving the accuracy of an object detector when trained with a dataset having a significant number of missing annotations. The method uses a novel Background Recalibration Loss (BRL) which adjusts the gradient direction according to its own activation to reduce the adverse effect of error signals by replacing the negative branch of the focal loss with a mirror of the positive branch when the activation is below a confusion threshold.

Type: Application

Filed: February 14, 2022

Publication date: August 18, 2022

Inventors: Marios Savvides, Zhiqiang Shen, Fangyi Chen, Han Zhang
TECHNIQUES FOR DENSE VIDEO DESCRIPTIONS

Publication number: 20220180127

Abstract: Techniques and apparatus for generating dense natural language descriptions for video content are described. In one embodiment, for example, an apparatus may include at least one memory and logic, at least a portion of the logic comprised in hardware coupled to the at least one memory, the logic to receive a source video comprising a plurality of frames, determine a plurality of regions for each of the plurality of frames, generate at least one region-sequence connecting the determined plurality of regions, apply a language model to the at least one region-sequence to generate description information comprising a description of at least a portion of content of the source video. Other embodiments are described and claimed.

Type: Application

Filed: January 6, 2022

Publication date: June 9, 2022

Applicant: INTEL CORPORATION

Inventors: Yurong CHEN, Jianguo LI, Zhou SU, Zhiqiang SHEN
Techniques for dense video descriptions

Patent number: 11263489

Abstract: Techniques and apparatus for generating dense natural language descriptions for video content are described. In one embodiment, for example, an apparatus may include at least one memory and logic, at least a portion of the logic comprised in hardware coupled to the at least one memory, the logic to receive a source video comprising a plurality of frames, determine a plurality of regions for each of the plurality of frames, generate at least one region-sequence connecting the determined plurality of regions, apply a language model to the at least one region-sequence to generate description information comprising a description of at least a portion of content of the source video. Other embodiments are described and claimed.

Type: Grant

Filed: June 29, 2017

Date of Patent: March 1, 2022

Assignee: INTEL CORPORATION

Inventors: Yurong Chen, Jianguo Li, Zhou Su, Zhiqiang Shen
IMAGE GENERATION METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM

Publication number: 20220044054

Abstract: An image generation method includes: extracting a content feature of a first image; respectively extracting an overall image style feature of a second image, and an object style feature of a partial image block, which includes an object, in the second image; determining a target style feature at least according to the overall image style feature and the object style feature; and generating a third image according to the content feature and the target style feature.

Type: Application

Filed: October 26, 2021

Publication date: February 10, 2022

Applicants: SENSETIME GROUP LIMITED, HONDA MOTOR CO. LTD.

Inventors: Zhiqiang SHEN, Mingyang HUANG, Jianping SHI, Hideki MATSUNAGA
FEATURE PYRAMIDS FOR OBJECT DETECTION

Publication number: 20220044073

Abstract: Disclosed herein is an improvement to prior art feature pyramids for general object detection that inserts a simple norm calibration (NC) operation between the feature pyramids and detection head to alleviate and balance the norm bias caused by feature pyramid network (FPN) and which leverages an enhanced multi-feature selective strategy (MS) during training to assign the ground-truth to one or more levels of the feature pyramid.

Type: Application

Filed: July 28, 2021

Publication date: February 10, 2022

Inventors: Fangyi Chen, Chenchen Zhu, Zhiqiang Shen, Han Zhang, Marios Savvides
TECHNIQUES FOR DENSE VIDEO DESCRIPTIONS

Publication number: 20210142115

Abstract: Techniques and apparatus for generating dense natural language descriptions for video content are described. In one embodiment, for example, an apparatus may include at least one memory and logic, at least a portion of the logic comprised in hardware coupled to the at least one memory, the logic to receive a source video comprising a plurality of frames, determine a plurality of regions for each of the plurality of frames, generate at least one region-sequence connecting the determined plurality of regions, apply a language model to the at least one region-sequence to generate description information comprising a description of at least a portion of content of the source video. Other embodiments are described and claimed.

Type: Application

Filed: June 29, 2017

Publication date: May 13, 2021

Applicant: INTEL CORPORATION

Inventors: Yurong CHEN, Jianguo LI, Zhou SU, Zhiqiang SHEN