Patents by Inventor Weilin Huang

Weilin Huang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Retail inventory shrinkage reduction via action recognition

Patent number: 11636744

Abstract: This disclosure includes technologies for action recognition in general. The disclosed system may automatically detect various types of actions in a video, including reportable actions that cause shrinkage in a practical application for loss prevention in the retail industry. Further, appropriate responses may be invoked if a reportable action is recognized. In some embodiments, a three-branch architecture may be used in a machine learning model for action and/or activity recognition. The three-branch architecture may include a main branch for action recognition, an auxiliary branch for learning/identifying an actor (e.g., human parsing) related to an action, and an auxiliary branch for learning/identifying a scene related to an action. In this three-branch architecture, the knowledge of the actor and the scene may be integrated in two different levels for action and/or activity recognition.

Type: Grant

Filed: February 5, 2021

Date of Patent: April 25, 2023

Assignee: Shenzhen Malong Technologies Co., Ltd.

Inventors: Weilin Huang, Shiwen Zhang, Limin Wang, Sheng Guo, Matthew Robert Scott
Decoupling category-wise independence and relevance with self-attention for multi-label image classification

Patent number: 11494616

Abstract: Methods and systems are provided for generating a multi-label classification system. The multi-label classification system can use a multi-label classification neural network system to identify one or more labels for an image. The multi-label classification system can explicitly take into account the relationship between classes in identifying labels. A relevance sub-network of the multi-label classification neural network system can capture relevance information between the classes. Such a relevance sub-network can decouple independence between classes to focus learning on relevance between the classes.

Type: Grant

Filed: July 11, 2019

Date of Patent: November 8, 2022

Assignee: Shenzhen Malong Technologies Co., Ltd.

Inventors: Sheng Guo, Weilin Huang, Matthew Robert Scott, Luchen Liu
RETAIL SYSTEM AND METHODS WITH VISUAL OBJECT TRACKING

Publication number: 20210319420

Abstract: This disclosure includes technologies for object tracking in general. The disclosed system can detect the event type based on one or more tracked objects. Further, appropriate responses may be invoked based on the event type.

Type: Application

Filed: April 12, 2020

Publication date: October 14, 2021

Inventors: Yuechen YU, Yilei XIONG, Weilin HUANG, Matthew Robert SCOTT
PROGRESSIVE VERIFICATION SYSTEM AND METHODS

Publication number: 20210279505

Abstract: This disclosure includes technologies for automated verification via digital image transformation or analysis in several progressive stages, which may include a stage of retrieval-model-based mismatch detection, a stage of cross-class mismatch detection, or a stage of inner-class mismatch detection. Further, the disclosed system is designed to progressively execute the verification process from a generality-attentive manner to a specificity-attentive manner. Finally, the disclosed system is designed to launch appropriate responses based on the verification outcome.

Type: Application

Filed: March 9, 2020

Publication date: September 9, 2021

Inventors: Yujie ZHONG, Zelu DENG, Weilin HUANG, Matthew Robert SCOTT
4D CONVOLUTIONAL NEURAL NETWORKS FOR VIDEO RECOGNITION

Publication number: 20210248377

Abstract: This disclosure includes technologies for video recognition in general. The disclosed system can automatically detect various types of actions in a video, including reportable actions that cause shrinkage in a practical application for loss prevention in the retail industry. The temporal evolution of spatio-temporal features in the video are used for action recognition. Such features may be learned via a 4D convolutional operation, which is adapted to model low-level features based on a residual 4D block. Further, appropriate responses may be invoked if a reportable action is recognized.

Type: Application

Filed: June 9, 2020

Publication date: August 12, 2021

Inventors: Weilin HUANG, Shiwen ZHANG, Sheng GUO, Limin WANG, Matthew Robert SCOTT
RETAIL INVENTORY SHRINKAGE REDUCTION VIA ACTION RECOGNITION

Publication number: 20210248885

Abstract: This disclosure includes technologies for action recognition in general. The disclosed system may automatically detect various types of actions in a video, including reportable actions that cause shrinkage in a practical application for loss prevention in the retail industry. Further, appropriate responses may be invoked if a reportable action is recognized. In some embodiments, a three-branch architecture may be used in a machine learning model for action and/or activity recognition. The three-branch architecture may include a main branch for action recognition, an auxiliary branch for learning/identifying an actor (e.g., human parsing) related to an action, and an auxiliary branch for learning/identifying a scene related to an action. In this three-branch architecture, the knowledge of the actor and the scene may be integrated in two different levels for action and/or activity recognition.

Type: Application

Filed: February 5, 2021

Publication date: August 12, 2021

Inventors: Weilin HUANG, Shiwen ZHANG, Limin WANG, Sheng GUO, Matthew Robert SCOTT
CHANNEL INTERACTION NETWORKS FOR IMAGE CATEGORIZATION

Publication number: 20210248421

Abstract: This disclosure includes computer vision technologies for image categorization, such as used for product recognition. In one embodiment, the disclosed system uses a channel interaction network to learn stronger fine-grained features and to distinguish the subtle differences between two similar images. Additionally, the disclosed channel interaction network may be integrated into an existing feature extractor network to boost its performance for image categorization.

Type: Application

Filed: July 12, 2020

Publication date: August 12, 2021

Inventors: Yu GAO, Xintong HAN, Weilin HUANG, Matthew Robert SCOTT
Complexity-based progressive training for machine vision models

Patent number: 11062180

Abstract: Methods and systems for training machine vision models (MVMs) with “noisy” training datasets are described. A noisy set of images is received, where labels for some of the images are “noisy” and/or incorrect. A progressively-sequenced learning curriculum is designed for the noisy dataset, where the images that are easiest to learn machine-vision knowledge from are sequenced near the beginning of the curriculum and images that are harder to learn machine-vision knowledge from are sequenced later in the curriculum. An MVM is trained via providing the sequenced curriculum to a supervised learning method, so that the MVM learns from the easiest examples first and the harder training examples later, i.e., the MVM progressively accumulates knowledge from simplest to most complex. To sequence the curriculum, the training images are embedded in a feature space and the “complexity” of each image is determined via density distributions and clusters in the feature space.

Type: Grant

Filed: July 18, 2018

Date of Patent: July 13, 2021

Assignee: Shenzhen Malong Technologies Co., Ltd.

Inventors: Sheng Guo, Weilin Huang, Haozhi Zhang, Chenfan Zhuang, Dengke Dong, Matthew R. Scott, Dinglong Huang
Appearance-flow-based image generation

Patent number: 11055888

Abstract: Aspects of this disclosure include technologies for appearance-flow-based image generation. In applications for pose-guided person image generation or virtual try-on, the disclosed system can model the appearance flow between source and target clothing regions. Further, a cascaded appearance flow estimation network is used to progressively refine the appearance flow. The resulting appearance flow can properly encode the geometric changes between the source and the target for image generation.

Type: Grant

Filed: August 27, 2019

Date of Patent: July 6, 2021

Assignee: Shenzhen Malong Technologies Co., Ltd.

Inventors: Xintong Han, Xiaojun Hu, Weilin Huang, Matthew Robert Scott
CROSS-BATCH MEMORY FOR EMBEDDING LEARNING

Publication number: 20210182686

Abstract: This disclosure includes computer vision technologies, specifically for embeddings and metric learning. In various practical applications, such as product recognition, image retrieval, face recognition, etc., the disclosed technologies use a cross-batch memory mechanism to memorize prior embeddings, so that a pair-based learning model can mine more pairs across multiple mini-batches or even over the whole dataset. The disclosed technologies not only boost the performance for various applications, but considerably improve the computation itself with its memory-efficient approach.

Type: Application

Filed: June 24, 2020

Publication date: June 17, 2021

Inventors: Xun WANG, Haozhi ZHANG, Weilin HUANG, Matthew Robert SCOTT
AUTOMATED CUSTOMIZATION WITH COMPATIBLE OBJECTS

Publication number: 20210160018

Abstract: This disclosure includes technologies for ranking or generating compatible objects. In retail-oriented applications, the disclosed technologies can rank products based on their respective compatibilities with contextual products, both in shape and appearance, and facilitate users to select products compatible with contextual products or surrounding conditions. In design-oriented applications, the disclosed technologies can generate diverse objects compatible with contextual objects or surrounding conditions.

Type: Application

Filed: November 26, 2019

Publication date: May 27, 2021

Inventors: Xintong HAN, Weilin HUANG, Matthew Robert SCOTT
Complexity-Based Progressive Training for Machine Vision Models

Publication number: 20210125001

Abstract: Methods and systems for training machine vision models (MVMs) with “noisy” training datasets are described. A noisy set of images is received, where labels for some of the images are “noisy” and/or incorrect. A progressively-sequenced learning curriculum is designed for the noisy dataset, where the images that are easiest to learn machine-vision knowledge from are sequenced near the beginning of the curriculum and images that are harder to learn machine-vision knowledge from are sequenced later in the curriculum. An MVM is trained via providing the sequenced curriculum to a supervised learning method, so that the MVM learns from the easiest examples first and the harder training examples later, i.e., the MVM progressively accumulates knowledge from simplest to most complex. To sequence the curriculum, the training images are embedded in a feature space and the “complexity” of each image is determined via density distributions and clusters in the feature space.

Type: Application

Filed: July 18, 2018

Publication date: April 29, 2021

Inventors: Sheng Guo, Weilin Huang, Haozhi Zhang, Chenfan Zhuang, Dengke Dong, Matthew R. Scott, Dinlong Huang
IRREGULAR SCAN DETECTION FOR RETAIL SYSTEMS

Publication number: 20210117949

Abstract: Aspects of this disclosure include technologies for detecting irregular scans, specifically when a retail system fails to collect the genuine information of a product. The disclosed retail system retrieves images covering a specific region, e.g., the designated scanning area. Further the disclosed retail system uses neural networks to detect the product from such images and track a moving path of the product over the specific region. Irregular scans may be detected when the tracked product in the images does not match what is collected by the scanner of the retail system.

Type: Application

Filed: December 5, 2019

Publication date: April 22, 2021

Inventors: Sheng GUO, Haozhi ZHANG, Xun WANG, Weilin HUANG, Matthew Robert SCOTT
CHARACTER-BASED TEXT DETECTION AND RECOGNITION

Publication number: 20210110189

Abstract: Aspects of this disclosure include technologies for character-based text detection and recognition. The disclosed single-stage model is configured for joint text detection and word recognition in natural images. In the disclosed solution, a character recognition branch is integrated into a word detection model. This results in an end-to-end trainable model that can implement text detection and word recognition jointly. Further, the disclosed technical solution includes an iterative character detection method, which is configured to generate character-level bounding boxes on real-world images by using synthetic data first.

Type: Application

Filed: November 4, 2019

Publication date: April 15, 2021

Inventors: Weilin HUANG, Matthew Robert SCOTT, Linjie XING
APPEARANCE-FLOW-BASED IMAGE GENERATION

Publication number: 20210065418

Abstract: Aspects of this disclosure include technologies for appearance-flow-based image generation. In applications for pose-guided person image generation or virtual try-on, the disclosed system can model the appearance flow between source and target clothing regions. Further, a cascaded appearance flow estimation network is used to progressively refine the appearance flow. The resulting appearance flow can properly encode the geometric changes between the source and the target for image generation.

Type: Application

Filed: August 27, 2019

Publication date: March 4, 2021

Inventors: Xintong HAN, Xiaojun HU, Weilin HUANG, Matthew Robert SCOTT
MISLABELED PRODUCT DETECTION

Publication number: 20210049400

Abstract: Aspects of this disclosure include technologies for detecting mislabeled products. In one embodiment, the disclosed system will capture an image of a product when the MRL of the product is scanned or being scanned. After recognizing the product in the image, the size of the area containing the product may be calculated. Subsequently, the disclosed system can determine whether the MRL mismatches the product in the image if this size of the area containing the product does not match the standard size associated with the MRL.

Type: Application

Filed: November 2, 2020

Publication date: February 18, 2021

Inventors: MATTHEW ROBERT SCOTT, DINGLONG HUANG, LE YIN, SHENG GUO, HAOZHI ZHANG, WEILIN HUANG
Dual-Stream Pyramid Registration Network

Publication number: 20210049733

Abstract: Aspects of this disclosure include technologies for object registration based on a dual-stream pyramid registration network, which is configured to compute multi-scale deformation fields from dual feature pyramids. The disclosed technologies further enable the multi-scale deformation fields to be refined in a coarse-to-fine manner, resulting in the capability for handling significant deformations between two objects, such as large displacements in spatial domain or slice space. Further, the disclosed technologies enable various functions based on the registered objects, such as automatic labeling, image comparison and differentiation, and medical image registration and navigation.

Type: Application

Filed: August 13, 2019

Publication date: February 18, 2021

Inventors: Miao KANG, Xiaojun HU, Weilin HUANG, Matthew Robert SCOTT
Weakly Supervised Image Segmentation Via Curriculum Learning

Publication number: 20210027098

Abstract: Weakly supervised instance segmentation refers to the task of training a system to detect object locations and segment instances of the detected objects, where the training data includes only images and image-level labels. This disclosure includes an enhanced pipeline and enhanced training methods that progressively mine pixel-wise labels, when trained via image-level labels. Four cascaded modules are employed, including: a multi-label classification module, an object detection module, an instance refinement module, and instance segmentation module. The modules share a common backbone. The cascaded pipeline is trained alternatively with a curriculum learning strategy which generalizes image level supervision to pixel level supervision, and a post validation training stage, which runs in the inverse order. In the curriculum learning stage, a proposal refinement sub-module is employed to locate object parts and finding key pixels during classification.

Type: Application

Filed: July 29, 2019

Publication date: January 28, 2021

Inventors: Weifeng Ge, Sheng Guo, Weilin Huang, Matthew Robert Scott
Lunch box

Patent number: D977318

Type: Grant

Filed: June 28, 2022

Date of Patent: February 7, 2023

Inventor: Weilin Huang
Lunch box

Patent number: D1017339

Type: Grant

Filed: July 19, 2023

Date of Patent: March 12, 2024

Inventor: Weilin Huang

1 2 next