Patents by Inventor Yanshuai CAO
Yanshuai CAO has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12056605
Abstract: A system, electronic device and method for improved neural network training are provided. The electronic device includes: a processor; a memory storing a generative adversarial network (GAN) that learns from unlabeled data by engaging a generative model in an adversarial game with a discriminator; and one or more programs stored in the memory and configured to be executed by the processor, the one or more programs including instructions for training the GAN using a regularizer that encourages the discriminator to properly use its capacity and to produce hidden representations with high entropy.
Type: Grant
Filed: October 26, 2018
Date of Patent: August 6, 2024
Assignee: ROYAL BANK OF CANADA
Inventors: Yanshuai Cao, Yik Chau Lui, Weiguang Ding, Ruitong Huang
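The entropy regularizer described above can be illustrated with a toy sketch. The function names and the use of a softmax over hidden activations are illustrative assumptions, not the patented formulation; the idea is simply that higher-entropy hidden representations score higher, so subtracting this term (scaled) from the discriminator loss would encourage them.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of activations.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def entropy(probs):
    # Shannon entropy in nats; skips zero-probability entries.
    return -sum(p * math.log(p) for p in probs if p > 0)

def entropy_regularizer(hidden_batch):
    # Average entropy of softmax-normalised hidden activations across a batch.
    # Subtracting a scaled version of this from the discriminator loss
    # rewards high-entropy hidden representations.
    return sum(entropy(softmax(h)) for h in hidden_batch) / len(hidden_batch)
```

A uniform hidden activation attains the maximum entropy log(n), while a strongly peaked one scores lower, so the regularizer pushes against collapsed representations.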
-
Patent number: 12033083
Abstract: Variational autoencoders (VAEs) have been shown to be effective in modeling complex data distributions. Conventional VAEs operate on fully-observed data during training; learning a VAE model from partially-observed data remains an open problem. A modified VAE framework is proposed that can learn from partially-observed data conditioned on the fully-observed mask. A model described in various embodiments is capable of learning a proper proposal distribution based on the missing data. The framework is evaluated on both high-dimensional multimodal data and low-dimensional tabular data.
Type: Grant
Filed: May 22, 2020
Date of Patent: July 9, 2024
Assignee: ROYAL BANK OF CANADA
Inventors: Yu Gong, Jiawei He, Thibaut Durand, Megha Nawhal, Yanshuai Cao, Gregory Mori, Seyed Hossein Hajimirsadeghi
-
Publication number: 20240152780
Abstract: Methods, systems, and techniques for density ratio estimation of data that includes a covariate variable (W) and a treatment variable (T). The density ratio estimation may be performed using a transformer-based architecture, and the density ratio may be used to control confounding bias in the data. An electronic representation of the data is obtained. At first and second self-attention layers, respectively, covariate variable embeddings based on the data representing the covariate variable and treatment variable embeddings based on the data representing the treatment variable are determined. Cross-attention embeddings based on the covariate and treatment variable embeddings are then determined at a cross-attention layer. At a linear layer and based on the cross-attention embeddings, a density ratio is estimated. The self-attention layers, cross-attention layer, and linear layer are trained using a loss function that determines a loss between an output of the linear layer and the density ratio.
Type: Application
Filed: October 20, 2023
Publication date: May 9, 2024
Inventors: Keyi Tang, Yanshuai Cao
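As a rough illustration of the cross-attention step, here is a minimal scaled dot-product attention in plain Python, with treatment-variable embeddings as the queries and covariate-variable embeddings as the keys and values. The function name, the single-head formulation, and the list-of-lists representation are assumptions for exposition, not the filed architecture.

```python
import math

def attention(queries, keys, values):
    # Scaled dot-product attention over lists of equal-length vectors.
    # Each query attends over all keys; the output mixes the values
    # with softmax-normalised attention weights.
    d = len(keys[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        m = max(scores)
        w = [math.exp(s - m) for s in scores]
        z = sum(w)
        w = [x / z for x in w]
        out.append([sum(wi * v[j] for wi, v in zip(w, values))
                    for j in range(len(values[0]))])
    return out
```

With a single key the attention weight is 1 and the output is that key's value; with tied scores the output is the average of the values, which shows how the cross-attention layer blends covariate information per treatment embedding.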
-
Patent number: 11914955
Abstract: A computer implemented method is described for conducting text sequence machine learning, the method comprising: receiving, at an encoder machine learning data architecture, an input sequence x=[x1, x2, . . . , xn] to produce a feature vector for a series of hidden states hx=[h1, h2, . . . , hn], wherein the feature vector for the series of hidden states hx is generated by performing pooling over a temporal dimension of all hidden states output by the encoder machine learning data architecture; and extracting, from the series of hidden states hx, a mean and a variance parameter, and encapsulating the mean and the variance parameter as an approximate posterior data structure.
Type: Grant
Filed: May 21, 2020
Date of Patent: February 27, 2024
Assignee: ROYAL BANK OF CANADA
Inventors: Teng Long, Yanshuai Cao, Jackie C. K. Cheung
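A minimal sketch of the pooling-and-extraction step: it assumes mean pooling over the temporal dimension and assumes the pooled vector is split in half into a mean and a log-variance parameter (the abstract does not specify how the two parameters are extracted, so the split is an illustrative choice).

```python
def encode_posterior(hidden_states):
    # hidden_states: list of T hidden-state vectors, each of even dimension d.
    # Mean-pool over the temporal dimension, then split the pooled vector
    # into a mean and a log-variance half, forming the approximate posterior
    # parameters (the split is an assumption for illustration).
    t = len(hidden_states)
    d = len(hidden_states[0])
    pooled = [sum(h[j] for h in hidden_states) / t for j in range(d)]
    half = d // 2
    mu, log_var = pooled[:half], pooled[half:]
    return mu, log_var
```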
-
Patent number: 11763100
Abstract: A system is provided comprising a processor and a memory storing instructions which configure the processor to process an original sentence structure through an encoder neural network to decompose the original sentence structure into an original semantics component and an original syntax component, process the original syntax component through a syntax variational autoencoder (VAE) to obtain a syntax mean vector and a syntax covariance matrix, obtain a sampled syntax vector from a syntax Gaussian posterior parameterized by the syntax mean vector and the syntax covariance matrix, process the original semantics component through a semantics VAE to obtain a semantics mean vector and a semantics covariance matrix, obtain a sampled semantics vector from a semantics Gaussian posterior parameterized by the semantics mean vector and the semantics covariance matrix, and process the sampled syntax vector and the sampled semantics vector through a decoder neural network to compose a new sentence.
Type: Grant
Filed: May 22, 2020
Date of Patent: September 19, 2023
Assignee: ROYAL BANK OF CANADA
Inventors: Peng Xu, Yanshuai Cao, Jackie C. K. Cheung
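The "obtain a sampled vector from a Gaussian posterior" steps are typically implemented with the reparameterisation trick; a minimal sketch follows, assuming a diagonal covariance expressed as a log-variance vector (the function name and the seeded generator are illustrative, not from the patent).

```python
import math
import random

def sample_gaussian(mean, log_var, rng=None):
    # Reparameterisation trick: z = mu + sigma * eps, with eps ~ N(0, 1).
    # Assumes a diagonal Gaussian given by a mean vector and a
    # log-variance vector; rng is seeded only to make the sketch repeatable.
    rng = rng or random.Random(0)
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mean, log_var)]
```

As the log-variance goes to negative infinity the sample collapses to the mean, which is a quick sanity check on the parameterisation.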
-
Patent number: 11763129
Abstract: A system, electronic device and method for improved neural network training are provided. The improved system is adapted for tracking long range dependence in sequential data during training, and includes bootstrapping a lower bound on the mutual information (MI) over groups of variables (segments or sentences) and subsequently applying the bound to encourage high MI.
Type: Grant
Filed: March 4, 2020
Date of Patent: September 19, 2023
Assignee: ROYAL BANK OF CANADA
Inventors: Yanshuai Cao, Peng Xu
-
Patent number: 11755916
Abstract: An improved computer implemented method and corresponding systems and computer readable media for improving performance of a deep neural network are provided to mitigate effects related to catastrophic forgetting in neural network learning. In an embodiment, the method includes storing, in memory, logits of a set of samples from a previous set of tasks (D1); and maintaining classification information from the previous set of tasks by utilizing the logits for matching during training on a new set of tasks (D2).
Type: Grant
Filed: September 5, 2019
Date of Patent: September 12, 2023
Assignee: ROYAL BANK OF CANADA
Inventors: Yanshuai Cao, Ruitong Huang, Junfeng Wen
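A minimal sketch of the logit-matching idea: a mean-squared penalty between the network's current logits on the stored samples and the logits recorded when training on D1. The exact matching loss is not specified in the abstract, so the squared-error form and the function name are assumptions.

```python
def logit_matching_loss(new_logits, stored_logits):
    # Mean squared error between the logits the current network produces
    # on remembered D1 samples and the logits stored before training on D2.
    # Adding this (scaled) to the D2 task loss discourages the network
    # from drifting away from its previous classification behaviour.
    total, n = 0.0, 0
    for cur, old in zip(new_logits, stored_logits):
        for c, o in zip(cur, old):
            total += (c - o) ** 2
            n += 1
    return total / n
```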
-
Patent number: 11615305
Abstract: A variational hyper recurrent neural network (VHRNN) can be trained by, for each step in sequential training data: determining a prior probability distribution for a latent variable from a prior network of the VHRNN using an initial hidden state; determining a hidden state from a recurrent neural network (RNN) of the VHRNN using an observation state, the latent variable and the initial hidden state; determining an approximate posterior probability distribution for the latent variable from an encoder network of the VHRNN using the observation state and the initial hidden state; determining a generating probability distribution for the observation state from a decoder network of the VHRNN using the latent variable and the initial hidden state; and maximizing a variational lower bound of a marginal log-likelihood of the training data. The trained VHRNN can be used to generate sequential data.
Type: Grant
Filed: May 22, 2020
Date of Patent: March 28, 2023
Assignee: ROYAL BANK OF CANADA
Inventors: Ruizhi Deng, Yanshuai Cao, Bo Chang, Marcus Brubaker
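The variational lower bound maximised above contains a KL-divergence term between the approximate posterior and the prior at each step; for diagonal Gaussians that term has a closed form, sketched here. Parameterising both distributions by mean and log-variance vectors is a common convention and an assumption on my part, not a detail from the abstract.

```python
import math

def kl_diag_gaussians(mu_q, logvar_q, mu_p, logvar_p):
    # Closed-form KL(q || p) between two diagonal Gaussians, the
    # regularisation term that appears in the variational lower bound
    # alongside the reconstruction log-likelihood.
    kl = 0.0
    for mq, lq, mp, lp in zip(mu_q, logvar_q, mu_p, logvar_p):
        kl += 0.5 * (lp - lq + (math.exp(lq) + (mq - mp) ** 2) / math.exp(lp) - 1.0)
    return kl
```

The KL vanishes when posterior and prior coincide and grows as the posterior mean moves away from the prior mean, which is the pressure that keeps the latent variable close to the learned prior.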
-
Patent number: 11562244
Abstract: Systems, methods, and computer readable media are described to train a compressed neural network with high robustness. The neural network is first adversarially pre-trained with both original data and data perturbed by adversarial attacks for some epochs; then "unimportant" weights or filters are pruned through criteria based on their magnitudes or another method (e.g., a Taylor approximation of the loss function); and the pruned neural network is retrained with both clean and perturbed data for more epochs.
Type: Grant
Filed: February 7, 2019
Date of Patent: January 24, 2023
Assignee: ROYAL BANK OF CANADA
Inventors: Luyu Wang, Weiguang Ding, Ruitong Huang, Yanshuai Cao, Yik Chau Lui
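A toy sketch of the magnitude-based pruning criterion, assuming a flat list of weights and a target sparsity fraction; real implementations prune per-layer tensors or whole filters, and the function name is illustrative.

```python
def prune_by_magnitude(weights, sparsity):
    # Zero out roughly the fraction `sparsity` of weights with the
    # smallest absolute values; the survivors are kept unchanged and
    # would then be retrained on clean and perturbed data.
    flat = sorted(abs(w) for w in weights)
    k = int(len(flat) * sparsity)
    threshold = flat[k - 1] if k > 0 else -1.0
    return [0.0 if abs(w) <= threshold else w for w in weights]
```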
-
Patent number: 11551041
Abstract: A method for acquiring measurements for a data structure corresponding to an array of variables includes: selecting a subset of elements from the data structure; measuring a sampled value for each of the selected subset of elements; storing each of the sampled values in a K-nearest-neighbour (KNN) database and labelling each sampled value as certain; generating a predicted value data structure in which each predicted element takes the value of its nearest neighbour based on the values stored in the KNN database; for each predicted element, retrieving the predicted element's X nearest neighbours for the sampled value in the KNN database and, when a value of the X nearest neighbours is the same as the predicted element, labelling the predicted element as certain, otherwise labelling it as uncertain; and repeating until all elements are labelled as certain.
Type: Grant
Filed: October 31, 2018
Date of Patent: January 10, 2023
Assignee: ROYAL BANK OF CANADA
Inventors: Weiguang Ding, Ruitong Huang, Luyu Wang, Yanshuai Cao
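A minimal one-dimensional sketch of the certainty-labelling step, assuming scalar positions, exact-match agreement among the X nearest sampled neighbours, and a plain list standing in for the KNN database (all illustrative simplifications of the claimed method).

```python
def label_predictions(sampled, unsampled_positions, x):
    # sampled: list of (position, value) measurements already taken.
    # For each unsampled position, predict the value of its nearest
    # sampled neighbour; label the prediction 'certain' only if all of
    # its x nearest sampled neighbours share that value, else 'uncertain'.
    out = {}
    for p in unsampled_positions:
        neighbours = sorted(sampled, key=lambda pv: abs(pv[0] - p))[:x]
        predicted = neighbours[0][1]
        certain = all(v == predicted for _, v in neighbours)
        out[p] = (predicted, 'certain' if certain else 'uncertain')
    return out
```

In the full method, positions still labelled uncertain would be measured next, and the loop repeats until every element is certain.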
-
Publication number: 20220129450
Abstract: A computer system and method for answering a natural language question is provided. The system comprises at least one processor and a memory storing instructions which when executed by the processor configure the processor to perform the method. The method comprises receiving a natural language question, generating a SQL query based on the natural language question, generating an explanation regarding a solution to the natural language question as answered by the SQL query, and presenting the solution and the explanation.
Type: Application
Filed: October 22, 2021
Publication date: April 28, 2022
Inventors: Yanshuai CAO, Peng XU, Keyi TANG, Wei YANG, Wenjie ZI, Teng LONG, Jackie Chit Kit CHEUNG, Chenyang HUANG, Lili MOU, Hamidreza SHAHIDI, Ákos KÁDÁR
-
Patent number: 11270072
Abstract: Systems and methods of automatically generating a coherence score for text data are provided. The approach includes receiving a plurality of string tokens representing decomposed portions of the target text data object. A trained neural network is provided that has been trained against a plurality of corpuses of training text across a plurality of topics. The string tokens are arranged to extract string tokens representing adjacent sentence pairs of the target text data object. For each adjacent sentence pair, the neural network generates a local coherence score representing a coherence level of the adjacent sentence pair; the local coherence scores are then aggregated across all adjacent sentence pairs to generate a global coherence score for the target text data object.
Type: Grant
Filed: October 31, 2019
Date of Patent: March 8, 2022
Assignee: ROYAL BANK OF CANADA
Inventors: Yanshuai Cao, Peng Z. Xu, Hamidreza Saghir, Jin Sung Kang, Teng Long, Jackie C. K. Cheung
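The aggregation step can be sketched as follows, assuming a plug-in local scorer and simple averaging; the abstract does not fix the aggregation function, so the mean and the function names here are illustrative.

```python
def global_coherence(sentences, local_scorer):
    # Score each adjacent sentence pair with a supplied local scorer
    # (standing in for the trained neural network), then aggregate the
    # local scores into a single document-level coherence score.
    pairs = list(zip(sentences, sentences[1:]))
    scores = [local_scorer(a, b) for a, b in pairs]
    return sum(scores) / len(scores)
```

With a toy scorer that returns 1.0 when two sentences share a word and 0.0 otherwise, a document whose halves are unrelated scores lower than one with consistent wording.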
-
Publication number: 20210319338
Abstract: A machine learning failure discriminator machine is described, along with corresponding systems, methods, and non-transitory computer readable media. The approach operates in relation to an iterative machine learning model and includes a phased approach to extract p-values from the iterative machine learning model based on modified versions of the training or validation data sets. The p-values are then used to identify whether various null hypotheses can be rejected, and accordingly, to generate an output data structure indicative of an estimated failure reason, if any. The output data structure may be made available on an API or on a graphical user interface.
Type: Application
Filed: April 9, 2021
Publication date: October 14, 2021
Inventors: Yik Chau LUI, Yanshuai CAO
-
Patent number: 11080292
Abstract: A platform, device and process are provided for visual construction of operations for data querying, in particular visual construction of nested operations. The visual construction is a display of one or more projected data spaces enabling a selection of data indicators on the display. The selection is conducted graphically on the visual construction, and the system is configured to translate the selection to generate and conduct a query operating visually on the visualized (e.g., projected) data space. The visual data space includes distinct views of the plurality of multi-dimensionality data points mapped to reduced-dimensionality data points, with a transformation function associated with each view. The selections are used to augment the multi-dimensionality data points with one or more additional dimensions to track the selections and to perform operations and visualizations.
Type: Grant
Filed: November 13, 2018
Date of Patent: August 3, 2021
Assignee: ROYAL BANK OF CANADA
Inventors: Yanshuai Cao, Luyu Wang
-
Patent number: 11062179
Abstract: An electronic device for neural network training includes at least one processor and one or more memories configured to provide or train a generative adversarial network (GAN) using a generator and a discriminator for: receiving a plurality of training cases; and training the generative adversarial network, based on the plurality of training cases, to classify the training cases; wherein the generator generates hard negative examples for the discriminator.
Type: Grant
Filed: November 2, 2018
Date of Patent: July 13, 2021
Assignee: ROYAL BANK OF CANADA
Inventors: Avishek Bose, Yanshuai Cao
-
Publication number: 20200372369
Abstract: Variational autoencoders (VAEs) have been shown to be effective in modeling complex data distributions. Conventional VAEs operate on fully-observed data during training; learning a VAE model from partially-observed data remains an open problem. A modified VAE framework is proposed that can learn from partially-observed data conditioned on the fully-observed mask. A model described in various embodiments is capable of learning a proper proposal distribution based on the missing data. The framework is evaluated on both high-dimensional multimodal data and low-dimensional tabular data.
Type: Application
Filed: May 22, 2020
Publication date: November 26, 2020
Inventors: Yu GONG, Jiawei HE, Thibaut DURAND, Megha NAWHAL, Yanshuai CAO, Gregory MORI, Seyed Hossein HAJIMIRSADEGHI
-
Publication number: 20200372225
Abstract: A computer system and method for machine text generation is provided. The system comprises at least one processor and a memory storing instructions which when executed by the processor configure the processor to perform the method.
Type: Application
Filed: May 22, 2020
Publication date: November 26, 2020
Inventors: Peng XU, Yanshuai CAO, Jackie C. K. CHEUNG
-
Publication number: 20200372214
Abstract: A computer implemented method is described for conducting text sequence machine learning, the method comprising: receiving, at an encoder machine learning data architecture, an input sequence x=[x1, x2, . . . , xn] to produce a feature vector for a series of hidden states hx=[h1, h2, . . . , hn], wherein the feature vector for the series of hidden states hx is generated by performing pooling over a temporal dimension of all hidden states output by the encoder machine learning data architecture; and extracting, from the series of hidden states hx, a mean and a variance parameter, and encapsulating the mean and the variance parameter as an approximate posterior data structure.
Type: Application
Filed: May 21, 2020
Publication date: November 26, 2020
Inventors: Teng LONG, Yanshuai CAO, Jackie C. K. CHEUNG
-
Publication number: 20200372352
Abstract: A variational hyper recurrent neural network (VHRNN) can be trained by, for each step in sequential training data: determining a prior probability distribution for a latent variable from a prior network of the VHRNN using an initial hidden state; determining a hidden state from a recurrent neural network (RNN) of the VHRNN using an observation state, the latent variable and the initial hidden state; determining an approximate posterior probability distribution for the latent variable from an encoder network of the VHRNN using the observation state and the initial hidden state; determining a generating probability distribution for the observation state from a decoder network of the VHRNN using the latent variable and the initial hidden state; and maximizing a variational lower bound of a marginal log-likelihood of the training data. The trained VHRNN can be used to generate sequential data.
Type: Application
Filed: May 22, 2020
Publication date: November 26, 2020
Inventors: Ruizhi DENG, Yanshuai CAO, Bo CHANG, Marcus BRUBAKER
-
Patent number: 10819724
Abstract: There is provided a neural network system for detection of domain generation algorithm (DGA) generated domain names, the neural network system comprising: an input receiver configured for receiving domain names from one or more input sources; a convolutional neural network unit including one or more convolutional layers, the convolutional unit configured for receiving the domain names as input text and processing the input text through the one or more convolutional layers; a recurrent neural network unit including one or more long short-term memory layers, the recurrent neural network unit configured to process the output from the convolutional neural network unit to perform pattern recognition; and a classification unit including one or more classification layers, the classification unit configured to receive output data from the recurrent neural network unit to perform a determination of whether the input text or portions of the input text are DGA-generated or benign domain names.
Type: Grant
Filed: April 3, 2018
Date of Patent: October 27, 2020
Assignee: ROYAL BANK OF CANADA
Inventors: Ashkan Amiri, Bryce Croll, Cory Fong, Athinthra Krishnaswamy Sethurajan, Vikash Yadav, Sylvester King Chun Chiang, Zhengyi Qin, Cathal Smyth, Yik Chau Lui, Yanshuai Cao
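Before reaching the convolutional layers, each domain name must be turned into a fixed-length numeric sequence; a plausible character-id encoding is sketched below. The alphabet, the padding scheme, and the sequence length are assumptions for illustration, not details from the patent.

```python
def encode_domain(domain, max_len=32):
    # Map a domain name to a fixed-length sequence of integer character
    # ids (0 is reserved for padding), suitable as input to an embedding
    # layer feeding the convolutional front end. Characters outside the
    # assumed alphabet are dropped; long names are truncated.
    alphabet = "abcdefghijklmnopqrstuvwxyz0123456789-."
    ids = [alphabet.index(c) + 1 for c in domain.lower() if c in alphabet]
    ids = ids[:max_len]
    return ids + [0] * (max_len - len(ids))
```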