Patents by Inventor Mingxing Tan

Mingxing Tan has filed for patents to protect the following inventions. This listing includes both pending patent applications and patents already granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220108204
    Abstract: A computer-implemented method of generating scale-permuted models can produce models with improved accuracy and reduced computational cost at evaluation. The method can include defining, by a computing system including one or more computing devices, a search space including a plurality of candidate permutations of a plurality of candidate feature blocks, each of the plurality of candidate feature blocks having a respective scale. The method can include performing, by the computing system, a plurality of search iterations by a search algorithm to select a scale-permuted model from the search space, the scale-permuted model based at least in part on a candidate permutation of the plurality of candidate permutations. (A brief code sketch of this search idea follows this entry.)
    Type: Application
    Filed: October 1, 2020
    Publication date: April 7, 2022
    Inventors: Xianzhi Du, Yin Cui, Tsung-Yi Lin, Quoc V. Le, Pengchong Jin, Mingxing Tan, Golnaz Ghiasi, Xiaodan Song
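The abstract above describes searching over permutations of scale-annotated feature blocks. Below is a minimal, hypothetical Python sketch of that idea, with a simple random-search loop standing in for the (unspecified) search algorithm; the block names, scales, and `evaluate` proxy are all invented for illustration.

```python
import random

# Hypothetical candidate feature blocks, each with a scale (feature-map
# resolution relative to the input). Invented for illustration.
CANDIDATE_BLOCKS = [("block_a", 1/4), ("block_b", 1/8),
                    ("block_c", 1/16), ("block_d", 1/32)]

def evaluate(permutation):
    """Stand-in scoring function: a real system would build and train the
    scale-permuted model and measure accuracy and compute cost."""
    # Toy proxy: reward permutations that interleave coarse and fine scales.
    scales = [s for _, s in permutation]
    return sum(abs(a - b) for a, b in zip(scales, scales[1:]))

def search(num_iterations=100, seed=0):
    rng = random.Random(seed)
    best_perm, best_score = None, float("-inf")
    for _ in range(num_iterations):
        perm = CANDIDATE_BLOCKS[:]   # one candidate permutation
        rng.shuffle(perm)            # sample from the permutation search space
        score = evaluate(perm)
        if score > best_score:
            best_perm, best_score = perm, score
    return best_perm, best_score

if __name__ == "__main__":
    perm, score = search()
    print("selected scale-permuted ordering:", [name for name, _ in perm])
```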
  • Publication number: 20220101090
    Abstract: The present disclosure is directed to an automated neural architecture search approach for designing new neural network architectures such as, for example, resource-constrained mobile CNN models. In particular, the present disclosure provides systems and methods to perform neural architecture search using a novel factorized hierarchical search space that permits layer diversity throughout the network, thereby striking the right balance between flexibility and search space size. The resulting neural architectures can run faster and use fewer computing resources (e.g., less processing power, less memory usage, less power consumption, etc.) while remaining competitive with or even exceeding the performance (e.g., accuracy) of current state-of-the-art mobile-optimized models. (A sketch of such a factorized space follows this entry.)
    Type: Application
    Filed: October 6, 2021
    Publication date: March 31, 2022
    Inventors: Mingxing Tan, Quoc V. Le, Bo Chen, Vijay Vasudevan, Ruoming Pang
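A minimal sketch of what a factorized hierarchical search space can look like: each block of the network draws its own layer choices independently, so the total space is a product of small per-block spaces while still permitting layer diversity. The specific choice dimensions below (`op`, `kernel`, `expansion`, `num_layers`) are illustrative assumptions, not the patented space.

```python
import random

# Hypothetical per-block choices in a factorized hierarchical search space:
# each block picks its own op, kernel size, expansion ratio, and depth,
# so layers can differ across the network while the space stays tractable.
BLOCK_SPACE = {
    "op": ["conv", "dwconv", "mbconv"],
    "kernel": [3, 5],
    "expansion": [1, 3, 6],
    "num_layers": [1, 2, 3],
}

def sample_architecture(num_blocks=5, rng=random):
    """Sample one architecture: an independent choice per block, per dimension."""
    return [
        {key: rng.choice(choices) for key, choices in BLOCK_SPACE.items()}
        for _ in range(num_blocks)
    ]

if __name__ == "__main__":
    random.seed(0)
    for i, block in enumerate(sample_architecture()):
        print(f"block {i}: {block}")
```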
  • Publication number: 20220019869
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining an architecture for a task neural network that is configured to perform a particular machine learning task on a target set of hardware resources. When deployed on a target set of hardware, such as a collection of datacenter accelerators, the task neural network may be capable of performing the particular machine learning task with enhanced accuracy and speed. (A sketch of a hardware-aware search objective follows this entry.)
    Type: Application
    Filed: September 30, 2020
    Publication date: January 20, 2022
    Inventors: Sheng Li, Norman Paul Jouppi, Quoc V. Le, Mingxing Tan, Ruoming Pang, Liqun Cheng, Andrew Li
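One common way to fold target-hardware speed into an architecture-search objective is a soft latency penalty of the form accuracy × (latency/target)^w, as popularized by MnasNet-style work. The sketch below uses that formulation purely for illustration; the objective actually used in this application may differ, and the target and exponent values are assumptions.

```python
def reward(accuracy, latency_ms, target_ms=10.0, w=-0.07):
    """Hardware-aware search objective (MnasNet-style, assumed here):
    scale measured accuracy by a soft penalty on measured latency
    relative to a target budget. w < 0 penalizes slow candidates."""
    return accuracy * (latency_ms / target_ms) ** w

# A faster-than-target candidate gets a small bonus; a slower one is penalized,
# so the search can trade a little accuracy for meeting the hardware budget.
print(reward(0.80, 8.0))   # under budget: slightly above 0.80
print(reward(0.82, 15.0))  # over budget: pulled below 0.82
```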
  • Publication number: 20210383223
    Abstract: The present disclosure provides a differentiable joint hyper-parameter and architecture search approach, with some implementations including the idea of discretizing the continuous space into a linear combination over multiple categorical bases. One example element of the proposed approach is the use of weight sharing across all architecture- and hyper-parameters, which enables it to search efficiently over the large joint search space. Experimental results on MobileNet/ResNet/EfficientNet/BERT show that the proposed systems significantly improve the accuracy by up to 2% on ImageNet and the F1 by up to 0.4 on SQuAD, with search cost comparable to training a single model. Compared to other AutoML methods, such as random search or Bayesian methods, the proposed techniques can achieve better accuracy with 10× less compute cost. (A sketch of this relaxation follows this entry.)
    Type: Application
    Filed: June 3, 2021
    Publication date: December 9, 2021
    Inventors: Mingxing Tan, Xuanyi Dong, Wei Yu, Quoc V. Le, Daiyi Peng
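The core relaxation described in the abstract (representing a continuous hyper-parameter as a linear combination over a categorical basis) can be sketched in a few lines: a softmax over learnable logits mixes the basis values, keeping the choice differentiable so it can be optimized by gradient descent alongside the model. The basis values and variable names below are hypothetical.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())  # shift for numerical stability
    return e / e.sum()

# Hypothetical categorical basis for one continuous hyper-parameter
# (e.g. a learning rate). The searched value is a softmax-weighted
# mixture, differentiable in the logits `alpha`.
basis = np.array([1e-4, 3e-4, 1e-3, 3e-3])
alpha = np.zeros_like(basis)       # learnable logits, updated by gradients

mixed_lr = softmax(alpha) @ basis  # linear combination over the basis
print(f"relaxed learning rate: {mixed_lr:.2e}")
```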
  • Publication number: 20210383237
    Abstract: Generally, the present disclosure is directed to the training of robust neural network models by using smooth activation functions. Systems and methods according to the present disclosure may generate and/or train neural network models with improved robustness while incurring little or no accuracy penalty or added computational cost; in some examples, accuracy may even improve. A smooth activation function may replace an original activation function in a machine-learned model when backpropagating a loss function through the model. Optionally, one activation function may be used in the model at inference time, and a replacement activation function may be used when backpropagating a loss function through the model. The replacement activation function may be used to update learnable parameters of the model and/or to generate adversarial examples for training the model. (A sketch of this substitution follows this entry.)
    Type: Application
    Filed: June 3, 2021
    Publication date: December 9, 2021
    Inventors: Mingxing Tan, Cihang Xie, Boqing Gong, Quoc V. Le
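A small PyTorch sketch of the substitution the abstract describes: keep the original activation (here ReLU) in the forward pass, but backpropagate through a smooth surrogate (here softplus, whose derivative is the sigmoid). The specific pairing of ReLU with softplus is an assumption for illustration, not necessarily the patented choice.

```python
import torch

class ReLUWithSoftplusGrad(torch.autograd.Function):
    """Forward pass uses ReLU; backward pass uses the smooth softplus
    derivative (sigmoid) in place of ReLU's step-function gradient.
    A sketch of swapping in a smooth activation only when backpropagating."""

    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)
        return torch.relu(x)

    @staticmethod
    def backward(ctx, grad_output):
        (x,) = ctx.saved_tensors
        return grad_output * torch.sigmoid(x)  # d/dx softplus(x) = sigmoid(x)

x = torch.randn(4, requires_grad=True)
ReLUWithSoftplusGrad.apply(x).sum().backward()
print(x.grad)  # sigmoid(x): smooth everywhere, unlike ReLU's gradient
```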
  • Publication number: 20210133578
    Abstract: A method for determining a final architecture for a neural network to perform a particular machine learning task is described.
    Type: Application
    Filed: January 8, 2021
    Publication date: May 6, 2021
    Inventors: Mingxing Tan, Quoc V. Le
  • Patent number: 10909457
    Abstract: A method for determining a final architecture for a neural network to perform a particular machine learning task is described.
    Type: Grant
    Filed: January 23, 2020
    Date of Patent: February 2, 2021
    Assignee: Google LLC
    Inventors: Mingxing Tan, Quoc V. Le
  • Publication number: 20200234132
    Abstract: A method for determining a final architecture for a neural network to perform a particular machine learning task is described.
    Type: Application
    Filed: January 23, 2020
    Publication date: July 23, 2020
    Inventors: Mingxing Tan, Quoc V. Le
  • Publication number: 20200143227
    Abstract: The present disclosure is directed to an automated neural architecture search approach for designing new neural network architectures such as, for example, resource-constrained mobile CNN models. In particular, the present disclosure provides systems and methods to perform neural architecture search using a novel factorized hierarchical search space that permits layer diversity throughout the network, thereby striking the right balance between flexibility and search space size. The resulting neural architectures can run faster and use fewer computing resources (e.g., less processing power, less memory usage, less power consumption, etc.) while remaining competitive with or even exceeding the performance (e.g., accuracy) of current state-of-the-art mobile-optimized models.
    Type: Application
    Filed: January 28, 2019
    Publication date: May 7, 2020
    Inventors: Mingxing Tan, Quoc Le, Bo Chen, Vijay Vasudevan, Ruoming Pang