Patents by Inventor An V. Le

An V. Le has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11460112
    Abstract: A seal assembly includes a seal body, a spring disposed adjacent to the seal body, and a seal ring disposed adjacent to the seal body. The seal body and the seal ring can include a plastic polymer material. The seal assembly can be a subcomponent of a hydraulic strut in the landing gear of an aircraft.
    Type: Grant
    Filed: February 26, 2019
    Date of Patent: October 4, 2022
    Assignee: SAINT-GOBAIN PERFORMANCE PLASTICS CORPORATION
    Inventors: Jon M. Lenhert, Robert T. Racicot, Kha V. Le
  • Patent number: 11455514
    Abstract: A method for determining a placement for machine learning model operations across multiple hardware devices includes receiving data specifying machine learning operations, and determining a placement that assigns each of the operations specified by the data to a respective device from the multiple hardware devices. A brief illustrative code sketch follows this entry.
    Type: Grant
    Filed: August 28, 2019
    Date of Patent: September 27, 2022
    Assignee: Google LLC
    Inventors: Benoit Steiner, Anna Darling Goldie, Jeffrey Adgate Dean, Hieu Hy Pham, Azalia Mirhoseini, Quoc V. Le
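
Informally, the placement task in the abstract above maps each operation in a computation graph to a device. The patent covers a learned placement; the sketch below is only a naive greedy baseline for contrast, with hypothetical op names and cost estimates.

```python
# Hypothetical greedy baseline, NOT the learned placement of the patent:
# put each operation on the device with the least accumulated cost.
def place_ops(op_costs, num_devices):
    """op_costs: dict mapping op name -> estimated runtime cost."""
    load = [0.0] * num_devices
    placement = {}
    for op, cost in sorted(op_costs.items(), key=lambda kv: -kv[1]):
        device = min(range(num_devices), key=lambda d: load[d])
        placement[op] = device          # assign op to a respective device
        load[device] += cost
    return placement

print(place_ops({"matmul": 5.0, "conv": 8.0, "softmax": 1.0}, num_devices=2))
```
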
  • Publication number: 20220301298
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training an image representation neural network.
    Type: Application
    Filed: March 17, 2022
    Publication date: September 22, 2022
    Inventors: Tsung-Yi Lin, Barret Zoph, Ekin Dogus Cubuk, Golnaz Ghiasi, Quoc V. Le
  • Patent number: 11449684
    Abstract: Systems and methods are provided that train a machine-learned language encoding model through the use of a contrastive learning task. In particular, the present disclosure describes a contrastive learning task where the encoder learns to distinguish input tokens from plausible alternatives. In some implementations, on each training example the proposed method masks out some subset (e.g., 15%) of the original input tokens, replaces the masked tokens with samples from a “generator” (e.g., which may be a small masked language model), and then trains the encoder to predict whether each token comes from the original data or is a replacement produced by the generator. A brief illustrative code sketch follows this entry.
    Type: Grant
    Filed: September 21, 2020
    Date of Patent: September 20, 2022
    Assignee: GOOGLE LLC
    Inventors: Thang Minh Luong, Quoc V. Le, Kevin Stefan Clark
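
A minimal sketch of the data-preparation side of the replaced-token-detection task described above, assuming a uniform random sampler as a stand-in for the small masked-language-model "generator"; the token list and vocabulary are invented.

```python
import random

random.seed(0)

def make_example(tokens, vocab, mask_rate=0.15):
    # Pick ~15% of positions, replace each with a "generator" sample, and
    # record whether each resulting token is original (0) or replaced (1).
    corrupted, labels = [], []
    for tok in tokens:
        if random.random() < mask_rate:
            sample = random.choice(vocab)      # unigram stand-in for the generator
            corrupted.append(sample)
            labels.append(int(sample != tok))  # generator may guess the original
        else:
            corrupted.append(tok)
            labels.append(0)
    return corrupted, labels                   # the encoder learns to predict labels

tokens = "the chef cooked the meal".split()
print(make_example(tokens, vocab=["the", "a", "chef", "ate", "meal"]))
```
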
  • Patent number: 11450096
    Abstract: Systems and methods of the present disclosure can include a computer-implemented method for efficient machine-learned model training. The method can include obtaining a plurality of training samples for a machine-learned model. The method can include, for one or more first training iterations, training, based at least in part on a first regularization magnitude configured to control a relative effect of one or more regularization techniques, the machine-learned model using one or more respective first training samples of the plurality of training samples. The method can include, for one or more second training iterations, training, based at least in part on a second regularization magnitude greater than the first regularization magnitude, the machine-learned model using one or more respective second training samples of the plurality of training samples. A brief illustrative code sketch follows this entry.
    Type: Grant
    Filed: December 29, 2021
    Date of Patent: September 20, 2022
    Assignee: GOOGLE LLC
    Inventors: Mingxing Tan, Quoc V. Le
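
A minimal sketch of the two-phase idea above: early iterations use a small regularization magnitude, later iterations a larger one. Dropout as the example regularizer, the linear ramp, and the toy activations are all assumptions, not the patented system.

```python
import numpy as np

def regularization_magnitude(step, total_steps, start=0.1, end=0.5):
    # Linear ramp (an assumption) from the first to the second magnitude.
    return start + (end - start) * step / max(total_steps - 1, 1)

rng = np.random.default_rng(0)
for step in range(5):
    drop_rate = regularization_magnitude(step, total_steps=5)
    activations = rng.normal(size=4)           # placeholder model activations
    keep = rng.random(4) >= drop_rate          # dropout as the example regularizer
    regularized = activations * keep / (1.0 - drop_rate)
    print(f"step {step}: drop_rate={drop_rate:.2f}, mean={regularized.mean():+.2f}")
```
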
  • Publication number: 20220245928
    Abstract: Systems and methods of the present disclosure can include a computer-implemented method for efficient machine-learned model training. The method can include obtaining a plurality of training samples for a machine-learned model. The method can include, for one or more first training iterations, training, based at least in part on a first regularization magnitude configured to control a relative effect of one or more regularization techniques, the machine-learned model using one or more respective first training samples of the plurality of training samples. The method can include, for one or more second training iterations, training, based at least in part on a second regularization magnitude greater than the first regularization magnitude, the machine-learned model using one or more respective second training samples of the plurality of training samples.
    Type: Application
    Filed: December 29, 2021
    Publication date: August 4, 2022
    Inventors: Mingxing Tan, Quoc V. Le
  • Publication number: 20220230048
    Abstract: Methods, systems, and apparatus, including computer-readable media, for scaling neural network architectures on hardware accelerators. A method includes receiving training data and information specifying target computing resources, and performing, using the training data, a neural architecture search over a search space to identify an architecture for a base neural network. A plurality of scaling parameter values for scaling the base neural network can be identified, which can include repeatedly selecting a plurality of candidate scaling parameter values, and determining a measure of performance for the base neural network scaled according to the plurality of candidate scaling parameter values, in accordance with a plurality of second objectives including a latency objective. An architecture for a scaled neural network can be determined using the architecture of the base neural network scaled according to the plurality of scaling parameter values. A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: February 12, 2021
    Publication date: July 21, 2022
    Inventors: Andrew Li, Sheng Li, Mingxing Tan, Ruoming Pang, Liqun Cheng, Quoc V. Le, Norman Paul Jouppi
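
A loose sketch of the scaling-search loop described above, assuming random sampling of candidate scaling parameters; measure_accuracy and measure_latency are hypothetical stand-ins for actually training and benchmarking the scaled base network.

```python
import random

random.seed(0)

def measure_accuracy(depth, width):     # hypothetical: diminishing returns
    return 1.0 - 1.0 / (depth * width)

def measure_latency(depth, width):      # hypothetical: grows with model size
    return 0.01 * depth * width ** 2

best, best_acc = None, -1.0
for _ in range(100):                    # repeatedly select candidate values
    depth = random.uniform(1.0, 3.0)
    width = random.uniform(1.0, 3.0)
    if measure_latency(depth, width) > 0.08:   # the latency objective
        continue
    acc = measure_accuracy(depth, width)
    if acc > best_acc:
        best, best_acc = (depth, width), acc

print("selected scaling parameters:", best)
```
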
  • Publication number: 20220215209
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a machine learning model. One of the methods includes receiving training data comprising a plurality of unlabeled training inputs and a plurality of labeled training inputs; generating augmented training data, comprising generating, for each of the plurality of unlabeled training inputs, a respective augmented training input by applying a data augmentation technique to the unlabeled training input; and training the machine learning model on the augmented training data. In particular, but not exclusively, the model may be trained for perceptual tasks (e.g. tasks relating to vision or speech). A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: April 24, 2020
    Publication date: July 7, 2022
    Inventors: Thang Minh Luong, Quoc V. Le, Qizhe Xie, Zihang Dai
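
A minimal sketch of the augmentation step above, assuming a toy synonym-swap as the data augmentation technique; training would then mix a supervised loss on the labeled inputs with a consistency loss between each unlabeled input and its augmented version.

```python
SYNONYMS = {"quick": "fast", "happy": "glad"}   # invented augmentation table

def augment(text):
    # Toy synonym-swap standing in for a real data augmentation technique.
    return " ".join(SYNONYMS.get(word, word) for word in text.split())

labeled = [("great movie", 1), ("dull plot", 0)]
unlabeled = ["a quick happy ending"]

# Augmented training data: each unlabeled input paired with its transform.
augmented_pairs = [(u, augment(u)) for u in unlabeled]
print(augmented_pairs)
```
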
  • Publication number: 20220198145
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating author vectors. One of the methods includes obtaining a set of sequences of words, the set of sequences of words comprising a plurality of first sequences of words and, for each first sequence of words, a respective second sequence of words that follows the first sequence of words, wherein each first sequence of words and each second sequence of words has been classified as being authored by a first author; and training a neural network system on the first sequences and the second sequences to determine an author vector for the first author, wherein the author vector characterizes the first author. A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: March 14, 2022
    Publication date: June 23, 2022
    Applicant: GOOGLE LLC
    Inventors: Quoc V. Le, Brian Patrick Strope
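
A toy sketch of the data pairing and the trainable author vector described above, with an invented bag-of-words "predict the following words" objective standing in for the neural network system.

```python
import numpy as np

vocab = {"the": 0, "sea": 1, "was": 2, "calm": 3, "tonight": 4}

def bow(words):
    # Bag-of-words count vector over the toy vocabulary.
    v = np.zeros(len(vocab))
    for w in words:
        v[vocab[w]] += 1.0
    return v

# (first sequence, the second sequence that follows it), same author.
pairs = [("the sea was".split(), "calm tonight".split())]
author_vec = np.zeros(len(vocab))
for _ in range(50):                        # fit the author vector by SGD
    for first, second in pairs:
        pred = bow(first) + author_vec     # toy prediction of what follows
        author_vec -= 0.1 * (pred - bow(second))
print(np.round(author_vec, 2))             # the vector characterizing the author
```
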
  • Publication number: 20220188636
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network using meta pseudo-labels. One of the methods includes training a student neural network using pseudo-labels generated by a teacher neural network that is being trained jointly with the student neural network. A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: December 14, 2021
    Publication date: June 16, 2022
    Inventors: Hieu Hy Pham, Zihang Dai, Qizhe Xie, Quoc V. Le
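
A 1-D toy sketch of the joint teacher/student loop: the teacher pseudo-labels unlabeled data for the student, and the teacher is in turn nudged by the student's error on labeled validation data. Scalar weights, learning rates, and data are all invented stand-ins for real networks.

```python
import numpy as np

rng = np.random.default_rng(0)
x_unlab = rng.normal(size=32)          # unlabeled inputs
x_val = rng.normal(size=32)            # labeled validation inputs
y_val = 2.0 * x_val                    # hidden ground truth (slope = 2)
teacher_w, student_w = 0.5, 0.0        # scalar stand-ins for two networks

for _ in range(200):
    pseudo = teacher_w * x_unlab       # teacher generates pseudo-labels
    student_w -= 0.1 * np.mean((student_w * x_unlab - pseudo) * x_unlab)
    val_grad = np.mean((student_w * x_val - y_val) * x_val)
    teacher_w -= 0.1 * val_grad        # student's validation error updates teacher
print(round(teacher_w, 2), round(student_w, 2))   # both approach 2.0
```
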
  • Publication number: 20220129740
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing inputs using neural networks that include one or more conditional convolutional layers. A conditional convolutional layer has a plurality of kernels and determines a respective input-dependent weight for each of the plurality of kernels and generates an input-dependent kernel by computing a weighted sum of the plurality of kernels in accordance with the respective input-dependent weights. A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: January 23, 2020
    Publication date: April 28, 2022
    Inventors: Brandon Chauloon Yang, Quoc V. Le, Jiquan Ngiam, Gabriel Mintzer Bender
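
A minimal sketch of one conditional convolution step as described above, assuming a global-average-plus-softmax routing function; the shapes, kernel count, and single-channel input are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=(8, 8))                 # single-channel input
kernels = rng.normal(size=(4, 3, 3))        # 4 candidate 3x3 kernels
routing = rng.normal(size=4)                # per-kernel routing parameters

logits = routing * x.mean()                 # input statistic -> kernel scores
w = np.exp(logits) / np.exp(logits).sum()   # input-dependent weights
kernel = np.tensordot(w, kernels, axes=1)   # weighted sum: the adapted kernel

out = np.zeros((6, 6))                      # valid convolution with that kernel
for i in range(6):
    for j in range(6):
        out[i, j] = np.sum(x[i:i+3, j:j+3] * kernel)
print(out.shape, np.round(w, 3))
```
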
  • Publication number: 20220114400
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a machine learning model. One of the methods includes obtaining a training data set for training a machine learning model, the training data set comprising a plurality of training inputs; determining a plurality of data augmentation policies, wherein each data augmentation policy defines a procedure for processing a training input to generate a transformed training input; for each data augmentation policy, training the machine learning model using the data augmentation policy; determining, for each data augmentation policy, a quality measure of the machine learning model that has been trained using the data augmentation policy; and selecting a final data augmentation policy using the quality measures of the machine learning models. A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: December 20, 2021
    Publication date: April 14, 2022
    Inventors: Jonathon Shlens, Quoc V. Le, Ekin Dogus Cubuk, Barret Zoph
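
A short sketch of the select-by-quality loop above, with train_and_evaluate as a hypothetical stand-in for actually training the model under each candidate augmentation policy; the policies themselves are invented.

```python
import random

POLICIES = [
    {"op": "rotate", "magnitude": 10},
    {"op": "rotate", "magnitude": 30},
    {"op": "shear",  "magnitude": 5},
]

def train_and_evaluate(policy):
    # Placeholder quality measure; a real system would train the model
    # with this policy and evaluate it on held-out data.
    random.seed(str(policy))
    return random.uniform(0.8, 0.95)

scores = {i: train_and_evaluate(p) for i, p in enumerate(POLICIES)}
best = max(scores, key=scores.get)
print("final policy:", POLICIES[best], "quality:", round(scores[best], 3))
```
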
  • Publication number: 20220108058
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a computer chip placement. One of the methods includes obtaining netlist data for a computer chip; and generating a computer chip placement, comprising placing a respective macro node at each time step in a sequence comprising a plurality of time steps, the placing comprising, for each time step: generating an input representation for the time step; processing the input representation using a node placement neural network having a plurality of network parameters, wherein the node placement neural network is configured to process the input representation in accordance with current values of the network parameters to generate a score distribution over a plurality of positions on the surface of the computer chip; and assigning the macro node to be placed at the time step to a position from the plurality of positions using the score distribution. A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: December 17, 2021
    Publication date: April 7, 2022
    Inventors: Anna Darling Goldie, Azalia Mirhoseini, Ebrahim Songhori, Wenjie Jiang, Shen Wang, Roger David Carpenter, Young-Joon Lee, Mustafa Nazim Yazgan, Chian-min Richard Ho, Quoc V. Le, James Laudon, Jeffrey Adgate Dean, Kavya Srinivasa Setty, Omkar Pathak
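
A minimal sketch of the sequential placement loop above, with random logits standing in for the node placement neural network's score distribution and a small grid standing in for the chip surface; macro names are invented.

```python
import numpy as np

rng = np.random.default_rng(0)
grid = 4                                     # 4x4 placement grid
occupied = np.zeros(grid * grid, dtype=bool)
placement = {}

for macro in ["macro_a", "macro_b", "macro_c"]:
    logits = rng.normal(size=grid * grid)    # stand-in for the NN's scores
    logits[occupied] = -np.inf               # a position cannot be reused
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    pos = rng.choice(grid * grid, p=probs)   # assign using the distribution
    occupied[pos] = True
    placement[macro] = divmod(pos, grid)     # (row, col) on the chip surface
print(placement)
```
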
  • Publication number: 20220108204
    Abstract: A computer-implemented method of generating scale-permuted models can generate models having improved accuracy and reduced computational requirements at evaluation time. The method can include defining, by a computing system including one or more computing devices, a search space including a plurality of candidate permutations of a plurality of candidate feature blocks, each of the plurality of candidate feature blocks having a respective scale. The method can include performing, by the computing system, a plurality of search iterations by a search algorithm to select a scale-permuted model from the search space, the scale-permuted model based at least in part on a candidate permutation of the plurality of candidate permutations. A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: October 1, 2020
    Publication date: April 7, 2022
    Inventors: Xianzhi Du, Yin Cui, Tsung-Yi Lin, Quoc V. Le, Pengchong Jin, Mingxing Tan, Golnaz Ghiasi, Xiaodan Song
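
A small sketch of the search space and selection step described above, with evaluate as a hypothetical stand-in for training and scoring each candidate scale-permuted model; block names and scales are invented.

```python
import itertools
import random

# Candidate feature blocks, each with a fixed scale (name, scale).
blocks = [("b1", 1 / 4), ("b2", 1 / 8), ("b3", 1 / 16), ("b4", 1 / 32)]
search_space = list(itertools.permutations(blocks))   # candidate permutations

def evaluate(perm):
    # Placeholder for a real training-and-evaluation run on one candidate.
    random.seed(str(perm))
    return random.random()

best = max(search_space, key=evaluate)
print([name for name, _ in best])
```
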
  • Publication number: 20220101082
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating representations of input sequences. One of the methods includes obtaining an input sequence, the input sequence comprising a plurality of inputs arranged according to an input order; processing the input sequence using a first long short term memory (LSTM) neural network to convert the input sequence into an alternative representation for the input sequence; and processing the alternative representation for the input sequence using a second LSTM neural network to generate a target sequence for the input sequence, the target sequence comprising a plurality of outputs arranged according to an output order. A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: December 10, 2021
    Publication date: March 31, 2022
    Inventors: Oriol Vinyals, Quoc V. Le, Ilya Sutskever
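
A minimal sketch of the encode-then-decode control flow above, with a plain tanh RNN cell standing in for the two LSTM networks; dimensions and sequence lengths are invented.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 8
W_enc, W_dec, W_out = (rng.normal(scale=0.3, size=(dim, dim)) for _ in range(3))

def rnn_step(h, x, W):
    # Toy recurrent cell standing in for an LSTM step.
    return np.tanh(W @ h + x)

inputs = [rng.normal(size=dim) for _ in range(5)]     # inputs in input order
h = np.zeros(dim)
for x in inputs:                                      # "first LSTM": encode
    h = rnn_step(h, x, W_enc)                         # h = alternative representation

outputs = []
for _ in range(3):                                    # "second LSTM": decode
    h = rnn_step(h, np.zeros(dim), W_dec)
    outputs.append(W_out @ h)                         # one output of the target sequence
print(len(outputs), outputs[0].shape)
```
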
  • Publication number: 20220101090
    Abstract: The present disclosure is directed to an automated neural architecture search approach for designing new neural network architectures such as, for example, resource-constrained mobile CNN models. In particular, the present disclosure provides systems and methods to perform neural architecture search using a novel factorized hierarchical search space that permits layer diversity throughout the network, thereby striking the right balance between flexibility and search space size. The resulting neural architectures are able to be run relatively faster and using relatively fewer computing resources (e.g., less processing power, less memory usage, less power consumption, etc.), all while remaining competitive with or even exceeding the performance (e.g., accuracy) of current state-of-the-art mobile-optimized models.
    Type: Application
    Filed: October 6, 2021
    Publication date: March 31, 2022
    Inventors: Mingxing Tan, Quoc V. Le, Bo Chen, Vijay Vasudevan, Ruoming Pang
  • Publication number: 20220092387
    Abstract: A computing system for producing an architecture of a pyramid layer is disclosed. The computing system can include a controller model configured to generate new architectures for a pyramid layer that receives a plurality of input feature representations output by a backbone model and, in response, outputs a plurality of output feature representations. The plurality of input feature representations can have a plurality of different input resolutions, and the plurality of output feature representations can have a plurality of different output resolutions. The computing system can be configured to perform a plurality of iterations. For each iteration, the computing system can receive a new pyramid layer architecture as an output of the controller model and evaluate one or more performance characteristics of a machine-learned pyramidal feature model that includes the backbone model and one or more pyramid layers that have the new pyramid layer architecture.
    Type: Application
    Filed: February 25, 2020
    Publication date: March 24, 2022
    Inventors: Quoc V. Le, Golnaz Ghiasi, Tsung-Yi Lin
  • Publication number: 20220083840
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, used to implement a self-training technique for generating neural network (NN) models. A first model is generated in response to training a first NN using labeled data. A respective pseudo label is generated for each item of unlabeled data when items of unlabeled data are processed using the first model. A second NN is used to process each item of a combined dataset to train the second NN. The combined dataset includes items of labeled data and a corresponding item for each respective pseudo label. Attributes of items in the combined dataset are modified to inject noise into the combined dataset when the second NN is trained. A second model is generated after the second NN is trained by processing items in the combined dataset, including processing items that represent the noise injected into the combined dataset. A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: September 11, 2020
    Publication date: March 17, 2022
    Inventors: Thang Minh Luong, Quoc V. Le, Qizhe Xie
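
A 1-D toy sketch of the self-training loop above, with a threshold classifier standing in for the neural networks; noise injection is modeled as Gaussian jitter on the pseudo-labeled inputs, and all data values are invented.

```python
import numpy as np

rng = np.random.default_rng(0)
x_lab = np.array([-2.0, -1.5, 1.5, 2.0])    # labeled inputs
y_lab = np.array([0, 0, 1, 1])
x_unlab = rng.normal(scale=2.0, size=20)    # unlabeled inputs

def fit_threshold(x, y):
    # Midpoint-between-class-means classifier, standing in for "train a NN".
    return (x[y == 0].mean() + x[y == 1].mean()) / 2

t1 = fit_threshold(x_lab, y_lab)            # first model, labeled data only
pseudo = (x_unlab > t1).astype(int)         # pseudo-label each unlabeled item
noisy_unlab = x_unlab + rng.normal(scale=0.3, size=20)   # inject noise
x_all = np.concatenate([x_lab, noisy_unlab])             # combined dataset
y_all = np.concatenate([y_lab, pseudo])
t2 = fit_threshold(x_all, y_all)            # second model
print(round(t1, 3), round(t2, 3))
```
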
  • Patent number: 11275895
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating author vectors. One of the methods includes obtaining a set of sequences of words, the set of sequences of words comprising a plurality of first sequences of words and, for each first sequence of words, a respective second sequence of words that follows the first sequence of words, wherein each first sequence of words and each second sequence of words has been classified as being authored by a first author; and training a neural network system on the first sequences and the second sequences to determine an author vector for the first author, wherein the author vector characterizes the first author.
    Type: Grant
    Filed: March 19, 2020
    Date of Patent: March 15, 2022
    Assignee: GOOGLE LLC
    Inventors: Brian Patrick Strope, Quoc V. Le
  • Publication number: 20220067304
    Abstract: Systems and methods are provided for training and using energy-based language models such as cloze language models. In particular, one aspect of the present disclosure is directed to an energy-based cloze language model for representation learning over text. In some instances, the models provided herein can be referred to as the “Electric” model. Similar to the BERT model, example models proposed herein can be a conditional generative model of tokens given their contexts. However, example models proposed herein do not mask text or output a full distribution over tokens that could occur in a context. Instead, the example proposed models assign a scalar energy score to each input token. Another aspect of the present disclosure provides techniques to train the proposed models to assign low energies to data tokens and high energies to other ones using an algorithm based on noise-contrastive estimation. A brief illustrative code sketch follows this entry.
    Type: Application
    Filed: August 27, 2021
    Publication date: March 3, 2022
    Inventors: Thang Minh Luong, Quoc V. Le, Kevin Stefan Clark
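
A toy sketch of the noise-contrastive training signal described above, with a linear energy function over Gaussian feature vectors standing in for the real text model; it drives "data" samples toward low energy and noise samples toward high energy.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 4
w = np.zeros(dim)                               # parameters of energy(x) = x @ w

data = rng.normal(loc=+1.0, size=(64, dim))     # stands in for data tokens
noise = rng.normal(loc=-1.0, size=(64, dim))    # samples from a noise distribution

for _ in range(200):
    for x, y in [(data, 1.0), (noise, 0.0)]:
        p = 1.0 / (1.0 + np.exp(x @ w))         # sigmoid(-energy): P(sample is data)
        w += 0.5 * ((p - y)[:, None] * x).mean(axis=0)   # logistic NCE-style update

print(f"mean data energy: {(data @ w).mean():+.2f}")    # driven negative (low)
print(f"mean noise energy: {(noise @ w).mean():+.2f}")  # driven positive (high)
```
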