Abstract: A computer-based system trains a neural network by solving a double layer optimization problem. The system includes an input interface to receive an input to the neural network and labels of the input to the neural network; a processor to solve a double layer optimization to produce parameters of the neural network; and an output interface to output the parameters of the neural network. The double layer optimization includes an optimization of a first layer subject to an optimization of a second layer. The optimization of the first layer minimizes a difference between an output of the neural network processing the input and the labels of the input, while the optimization of the second layer minimizes a distance between a non-negative output vector of each layer and a corresponding input vector to each layer. The input vector of a current layer is a linear transformation of the non-negative output vector of the previous layer.
Type:
Grant
Filed:
November 16, 2017
Date of Patent:
November 9, 2021
Assignee:
Mitsubishi Electric Research Laboratories, Inc.
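The structure the abstract describes can be sketched numerically: the inner ("second layer") problem, minimizing the distance between a non-negative activation and a linear transformation of the previous layer's output, has the non-negative projection (a ReLU) as its closed-form solution, while the outer ("first layer") problem fits the network output to the labels. The toy data, layer widths, and the finite-difference outer minimization below are all illustrative assumptions, not the patent's actual method.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed toy data: 20 inputs with 4 features, one-hot labels over 2 classes.
X = rng.normal(size=(20, 4))
Y = np.eye(2)[rng.integers(0, 2, size=20)]

sizes = [4, 6, 2]  # assumed layer widths
Ws = [rng.normal(scale=0.5, size=(m, n)) for n, m in zip(sizes, sizes[1:])]

def inner_layer(W, a_prev):
    # Inner ("second layer") problem: min_{a >= 0} ||a - W a_prev||^2.
    # Its closed-form solution is the non-negative projection, i.e. a ReLU.
    return np.maximum(0.0, a_prev @ W.T)

def forward(Ws, X):
    a = X
    for W in Ws:
        a = inner_layer(W, a)
    return a

def outer_loss(Ws, X, Y):
    # Outer ("first layer") problem: squared difference between the
    # network output and the labels.
    return np.mean((forward(Ws, X) - Y) ** 2)

loss0 = outer_loss(Ws, X, Y)

# Crude outer minimization by central-difference gradient descent (sketch only).
lr, eps = 0.05, 1e-5
for step in range(100):
    for W in Ws:
        g = np.zeros_like(W)
        for idx in np.ndindex(W.shape):
            old = W[idx]
            W[idx] = old + eps
            up = outer_loss(Ws, X, Y)
            W[idx] = old - eps
            dn = outer_loss(Ws, X, Y)
            W[idx] = old
            g[idx] = (up - dn) / (2 * eps)
        W -= lr * g

loss1 = outer_loss(Ws, X, Y)
```

The point of the sketch is the nesting: each forward evaluation solves the per-layer inner problems exactly, and only the outer label-fitting objective is iterated.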
Abstract: Machine learning classification models which are robust against label noise are provided. Noise may be modelled explicitly as “label flips”, where incorrect binary labels are “flipped” relative to their ground truth value. Distributions of label flips may be modelled as prior and posterior distributions in a flexible architecture for machine learning systems. An arbitrary classification model may be provided within the system. The classification model is made more robust to label noise by operation of the prior and posterior distributions. Particular prior and approximating posterior distributions are disclosed.
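A minimal sketch of the label-flip idea: place a Bernoulli prior on whether each observed binary label was flipped, and fit the classifier by marginalizing over that latent flip, so the observed-label likelihood mixes the clean prediction with its complement. The logistic model, the flip rate `rho`, and the plain gradient-descent fit are all assumptions for illustration; they stand in for the "arbitrary classification model" and the particular distributions the patent discloses.

```python
import numpy as np

rng = np.random.default_rng(1)

rho = 0.2  # assumed prior probability that a label is flipped
X = rng.normal(size=(200, 3))
w_true = np.array([1.5, -2.0, 0.5])
y_clean = (X @ w_true > 0).astype(float)
flips = rng.random(200) < rho
y_noisy = np.where(flips, 1 - y_clean, y_clean)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30, 30)))

def noisy_nll(w, X, y, rho):
    # Marginalize over the latent flip: P(observed = 1) mixes the model's
    # clean prediction q with its complement 1 - q.
    q = sigmoid(X @ w)
    p1 = (1 - rho) * q + rho * (1 - q)
    p = np.where(y == 1, p1, 1 - p1)
    return -np.mean(np.log(p + 1e-12))

w = np.zeros(3)
nll0 = noisy_nll(w, X, y_noisy, rho)

# Fit by gradient descent on the flip-aware likelihood (sketch only).
lr = 0.3
for _ in range(800):
    q = sigmoid(X @ w)
    p1 = (1 - rho) * q + rho * (1 - q)
    sign = np.where(y_noisy == 1, 1.0, -1.0)
    p = np.where(y_noisy == 1, p1, 1 - p1)
    # Chain rule: d p1 / d q = 1 - 2*rho, d q / d z = q * (1 - q).
    grad = -np.mean(((sign * (1 - 2 * rho) * q * (1 - q)) / (p + 1e-12))[:, None] * X, axis=0)
    w -= lr * grad

nll1 = noisy_nll(w, X, y_noisy, rho)
acc = np.mean((sigmoid(X @ w) > 0.5) == (y_clean == 1))
```

Because the likelihood accounts for flips, the fitted model can track the clean decision boundary even though it only ever sees the noisy labels.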
Abstract: An online system uses multiple machine learning models to select content for providing to a user of the online system. Specifically, the online system trains a general model that takes a first set of features as input and outputs predictions at a general level. The online system further trains a residual model that takes a second set of features as input. The residual model predicts a residual (e.g., an error) of the predictions outputted by the general model. Therefore, the predicted residual from the residual model is combined with the prediction from the general model in order to correct for the over-generality of the general model. The online system may use the combined prediction to send content to users.
Type:
Grant
Filed:
September 29, 2017
Date of Patent:
August 31, 2021
Assignee:
Facebook, Inc.
Inventors:
Andrew Donald Yates, Gunjit Singh, Kurt Dodge Runke
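The general-plus-residual scheme can be sketched with two simple models: fit a general model on the first feature set, fit a residual model on the second feature set to predict the general model's error, and add the two predictions. The synthetic data and least-squares models below are illustrative assumptions standing in for the system's actual models and features.

```python
import numpy as np

rng = np.random.default_rng(2)

# Assumed synthetic setup: a coarse feature x1 drives the general trend,
# while a finer feature x2 explains what the general model misses.
n = 500
x1 = rng.normal(size=(n, 1))   # first feature set (general model)
x2 = rng.normal(size=(n, 1))   # second feature set (residual model)
y = 3.0 * x1[:, 0] + 1.5 * x2[:, 0] + rng.normal(scale=0.1, size=n)

def fit_linear(F, t):
    # Least-squares fit with a bias column; an illustrative stand-in
    # for whatever model class the system actually trains.
    A = np.hstack([F, np.ones((len(F), 1))])
    coef, *_ = np.linalg.lstsq(A, t, rcond=None)
    return lambda G: np.hstack([G, np.ones((len(G), 1))]) @ coef

general = fit_linear(x1, y)                 # general model on feature set 1
residual = fit_linear(x2, y - general(x1))  # residual model predicts the error
combined = general(x1) + residual(x2)       # corrected prediction

err_general = np.mean((y - general(x1)) ** 2)
err_combined = np.mean((y - combined) ** 2)
```

The residual model never sees the labels directly as its target, only the general model's error, so adding its output corrects the over-generality of the first prediction.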
Abstract: A method, computer readable medium, and system are disclosed for implementing a temporal ensembling model for training a deep neural network. The method for training the deep neural network includes the steps of receiving a set of training data comprising a plurality of input vectors for a deep neural network and training the deep neural network utilizing the set of training data by: analyzing the plurality of input vectors with the deep neural network to generate a plurality of prediction vectors and, for each input vector, computing a loss term associated with that input vector by combining a supervised component and an unsupervised component according to a weighting function and updating a target prediction vector associated with that input vector.
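The loss bookkeeping the abstract describes can be sketched as follows: keep an exponential moving average of each sample's prediction vector as its target, combine a supervised cross-entropy term (on the labeled samples) with an unsupervised mean-squared term against the targets using a ramp-up weighting function, and update the targets each epoch. The random stand-in predictions, momentum `alpha`, and ramp schedule are assumptions for illustration; a real run would use the network's actual softmax outputs.

```python
import numpy as np

rng = np.random.default_rng(3)

# Assumed toy setup: 6 samples, 3 classes; only the first 2 are labeled.
n, k = 6, 3
labels = np.array([0, 2])   # labels for the first two samples
alpha = 0.6                 # EMA momentum for the target prediction vectors
Z = np.zeros((n, k))        # accumulated ensemble of predictions

def weight(epoch, ramp=10):
    # Unsupervised weighting function: ramps up over the first epochs.
    return min(1.0, epoch / ramp)

for epoch in range(1, 5):
    # Stand-in for the network's softmax outputs this epoch.
    preds = rng.dirichlet(np.ones(k), size=n)

    # Bias-corrected temporal-ensemble targets from previous epochs.
    targets = Z / (1 - alpha ** (epoch - 1)) if epoch > 1 else preds

    # Supervised component: cross-entropy on the labeled samples.
    sup = -np.mean(np.log(preds[np.arange(2), labels] + 1e-12))
    # Unsupervised component: MSE between predictions and targets.
    unsup = np.mean((preds - targets) ** 2)
    loss = sup + weight(epoch) * unsup

    # Update the accumulated target prediction vectors.
    Z = alpha * Z + (1 - alpha) * preds
```

The unsupervised term is what lets unlabeled samples contribute: they are penalized only for disagreeing with their own accumulated target predictions.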