Patents by Inventor Wai Kai Arvin TANG

Wai Kai Arvin TANG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

APPARATUS AND METHOD FOR PRODUCING LABELLED CURSIVE HANDWRITTEN TEXT SAMPLES WITH CONTEXT STYLE VARIABILITY SUITABLE FOR FORMING TRAINING DATA FOR TRAINING MACHINE LEARNING NEURAL NETWORKS FOR CURSIVE HANDWRITING RECOGNITION

Publication number: 20250087007

Abstract: The present invention provides an apparatus for producing labelled cursive handwritten text samples with context style variability, and method of training and using the same. The apparatus comprises: a word embedding units configured for encoding an input text to obtain an input text word embedding; a word embedding unit configured for encoding a handwriting context style description into a context style description embedding; a transformer configured for transforming the context style description embedding to obtain a handwriting context style feature embedding; a feature embedding combiner configured for combining the handwriting context style feature embedding and the input text word embeddings to form a combined feature embedding; a generator configured for generating a synthetic image containing characters occurred in the input text with the handwriting context style defined in the handwriting context style description and context style variability introduced by the generator.

Type: Application

Filed: September 8, 2023

Publication date: March 13, 2025

Inventors: Wai Kai Arvin TANG, Sergio Rodolfo CRUZ GOMEZ
METHOD AND APPARATUS FOR DOCUMENT ANALYSIS THROUGH LAYER SEPARATION BY MACHINE LEARNING WITH CROSS-LAYER REASONING

Publication number: 20250078557

Abstract: A method for processing electronic documents, comprising: receiving an electronic document; recognizing one or more content components in the electronic document; identifying a content type for each of the recognized content components; creating, by a layer separator, one or more logical layers from the recognized content components such that each of the logical layer contains only the content components of the same content type; and invoking a content-type specific content handler for each of the logical layers created. The layer separator comprises a machine learning (ML) model based on a modified U-Net convolutional neural network and trained to classify the content types of the content components. The modified U-Net CNN is improved over traditional U-Net CNN with transformers at each layer to achieve high recovery rate.

Type: Application

Filed: September 1, 2023

Publication date: March 6, 2025

Inventors: Wai Kai Arvin TANG, Ping Yin KOON
METHOD FOR OPTIMIZING WORKFLOW-BASED NEURAL NETWORK INCLUDING ATTENTION LAYER

Publication number: 20240303499

Abstract: A method for optimizing a workflow-based neural network including an attention layer is provided. The method comprises: training the workflow-based neural network to predict a result from input elements under a prediction model with the attention layer assigning attention placements and weights, based on an original attention function, to the input elements; obtaining an original attention mask pattern and a proposed attention mask pattern; creating an attention mask updating function based on the original attention mask pattern and the proposed attention mask pattern; and combining the attention mask updating function with the original attention function to form an updated attention function.

Type: Application

Filed: March 7, 2023

Publication date: September 12, 2024

Inventors: Wai Kai Arvin TANG, Kai Kin CHAN
Apparatus and method for recognizing image-based content presented in a structured layout

Patent number: 11403488

Abstract: A method for extracting information from a table includes steps as follows. Characters of a table are extracted. The characters are merged into n-gram characters. The n-gram characters are merged into words and text lines through a two-stage GNN mode. The two-stage GNN mode comprises sub steps as: spatial features, semantic features, CNN image features are extracted from a target source; a first GNN stage is processed to output graph embedding spatial features from the spatial features; and a second GNN stage is processed to output graph embedding semantic features and graph embedding CNN image features from the semantic features and the CNN image features, respectively. The text lines are merged into cells. The cells are grouped into rows, columns, and key-value pairs based on one or more adjacency matrices, a row relationship among the cells, a column relationship among the cells, and a key-value relationship among the cells.

Type: Grant

Filed: March 19, 2020

Date of Patent: August 2, 2022

Assignee: Hong Kong Applied Science and Technology Research Institute Company Limited

Inventors: Wai Kai Arvin Tang, Jinning Yang
Method and system of image based anomaly localization for vehicles through generative contextualized adversarial network

Patent number: 11328402

Abstract: The present invention provides an anomaly detection method and apparatus based on a neural network which can be trained on undamaged normal vehicle images and able to detect unknown/unseen vehicle damages of stochastic types and extents from images which are taken in various contexts. The provided method and apparatus are implemented with functional units which are trained to perform the anomaly detection under a GCAN model with a training dataset containing images of undamaged vehicles, intact-vehicle frame images and augmented vehicle frame images of the vehicles.

Type: Grant

Filed: September 29, 2020

Date of Patent: May 10, 2022

Assignee: Hong Kong Applied Science and Technology Research Institute Company Limited

Inventors: Wai Kai Arvin Tang, Pak Kan Wong
METHOD AND SYSTEM OF IMAGE BASED ANOMALY LOCALIZATION FOR VEHICLES THROUGH GENERATIVE CONTEXTUALIZED ADVERSARIAL NETWORK

Publication number: 20220101508

Abstract: The present invention provides an anomaly detection method and apparatus based on a neural network which can be trained on undamaged normal vehicle images and able to detect unknown/unseen vehicle damages of stochastic types and extents from images which are taken in various contexts. The provided method and apparatus are implemented with functional units which are trained to perform the anomaly detection under a GCAN model with a training dataset containing images of undamaged vehicles, intact-vehicle frame images and augmented vehicle frame images of the vehicles.

Type: Application

Filed: September 29, 2020

Publication date: March 31, 2022

Inventors: Wai Kai Arvin TANG, Pak Kan WONG
Apparatus and Method for Recognizing Image-Based Content Presented in a Structured Layout

Publication number: 20210295101

Abstract: A method for extracting information from a table includes steps as follows. Characters of a table are extracted. The characters are merged into n-gram characters. The n-gram characters are merged into words and text lines through a two-stage GNN mode. The two-stage GNN mode comprises sub steps as: spatial features, semantic features, CNN image features are extracted from a target source; a first GNN stage is processed to output graph embedding spatial features from the spatial features; and a second GNN stage is processed to output graph embedding semantic features and graph embedding CNN image features from the semantic features and the CNN image features, respectively. The text lines are merged into cells. The cells are grouped into rows, columns, and key based on one or more adjacency matrices, a row relationship among the cells, a column relationship among the cells, and a key-value relationship among the cells.

Type: Application

Filed: March 19, 2020

Publication date: September 23, 2021

Inventors: Wai Kai Arvin TANG, Jinning YANG
Flexible integrating recognition and semantic processing

Patent number: 10810467

Abstract: A method for character recognition and semantic for natural language processing comprising extracting a sequence of feature vectors from a sequence of input character images by a convolutional neural network (CNN) feature extractor. The sequence of feature vectors comprises a plurality of feature vectors, each feature vector representing an approximate-match of its corresponding input character in the sequence of input character images. The method further comprises applying a sequential classifier sequentially as a sliding window of a size of a plurality consecutive feature vectors upon the sequence of feature vectors from a first feature vector in the sequence of feature vectors to the last feature vector in the sequence of feature vectors; and recognizing an output character for a targeted feature vector among the applied-upon consecutive feature vectors within the sliding window as it is sliding across the sequence of feature vectors.

Type: Grant

Filed: November 15, 2018

Date of Patent: October 20, 2020

Assignee: Hong Kong Applied Science and Technology Research Institute Company Limited

Inventor: Wai Kai Arvin Tang
FLEXIBLE INTEGRATING RECOGNITION AND SEMANTIC PROCESSING

Publication number: 20190156156

Abstract: A method for character recognition and semantic for natural language processing comprising extracting a sequence of feature vectors from a sequence of input character images by a convolutional neural network (CNN) feature extractor. The sequence of feature vectors comprises a plurality of feature vectors, each feature vector representing an approximate-match of its corresponding input character in the sequence of input character images. The method further comprises applying a sequential classifier sequentially as a sliding window of a size of a plurality consecutive feature vectors upon the sequence of feature vectors from a first feature vector in the sequence of feature vectors to the last feature vector in the sequence of feature vectors; and recognizing an output character for a targeted feature vector among the applied-upon consecutive feature vectors within the sliding window as it is sliding across the sequence of feature vectors.

Type: Application

Filed: November 15, 2018

Publication date: May 23, 2019

Inventor: Wai Kai Arvin TANG