Patents by Inventor Hanxiao Liu
Hanxiao Liu has filed for patents to protect the following inventions. This listing includes both pending patent applications and patents already granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250139431
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing a machine learning task on a network input to generate a network output. In one aspect, one of the systems includes a neural network configured to perform the machine learning task, the neural network including one or more attentive layers that each include a gated attention unit.
Type: Application
Filed: January 30, 2023
Publication date: May 1, 2025
Inventors: Hanxiao Liu, Weizhe Hua, Zihang Dai, Quoc V. Le
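As a concrete illustration, here is a minimal NumPy sketch of one plausible gated attention unit, in which attended values are modulated element-wise by a learned gate before the output projection. All dimensions, activation choices, and the single-head layout are illustrative assumptions, not details taken from the application.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def gated_attention_unit(x, Wu, Wv, Wq, Wk, Wo):
    """One attentive layer with a gated attention unit (illustrative).

    x: (seq_len, d_model). The attention output is gated element-wise by a
    separate projection of the input, so the gate controls how much
    attended content flows on to the output projection.
    """
    u = np.maximum(x @ Wu, 0.0)          # gate branch, (seq, d_inner)
    v = np.maximum(x @ Wv, 0.0)          # value branch, (seq, d_inner)
    q, k = x @ Wq, x @ Wk                # queries/keys, (seq, d_qk)
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]))  # (seq, seq)
    gated = u * (attn @ v)               # element-wise gating of attended values
    return gated @ Wo                    # project back to d_model

# Tiny usage example with random weights.
rng = np.random.default_rng(0)
seq, d, d_inner, d_qk = 6, 8, 16, 4
x = rng.normal(size=(seq, d))
out = gated_attention_unit(
    x,
    rng.normal(size=(d, d_inner)), rng.normal(size=(d, d_inner)),
    rng.normal(size=(d, d_qk)), rng.normal(size=(d, d_qk)),
    rng.normal(size=(d_inner, d)),
)
print(out.shape)  # (6, 8)
```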
-
Publication number: 20250131251
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing a machine learning task on a network input to generate a network output. In one aspect, one of the systems includes a neural network configured to perform the machine learning task, the neural network including one or more expert neural network blocks that each include a router that performs expert-choice routing between multiple expert neural networks.
Type: Application
Filed: January 30, 2023
Publication date: April 24, 2025
Inventors: Hanxiao Liu, Quoc V. Le, Yanqi Zhou, Tao Lei, Yuzhe Zhao, Yanping Huang, Nan Du, Zhifeng Chen, Andrew M. Dai, James Laudon
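For intuition, the sketch below illustrates expert-choice routing, in which each expert selects its top-scoring tokens (up to a fixed capacity) rather than each token selecting experts. The capacity value, expert functions, and shapes are all illustrative assumptions.

```python
import numpy as np

def expert_choice_routing(tokens, router_w, experts, capacity):
    """Expert-choice routing sketch: each expert picks its top-`capacity`
    tokens by router score, instead of each token picking experts."""
    scores = tokens @ router_w                      # (n_tokens, n_experts)
    probs = np.exp(scores) / np.exp(scores).sum(axis=1, keepdims=True)
    out = np.zeros_like(tokens)
    for e, expert_fn in enumerate(experts):
        # Indices of the `capacity` tokens this expert scores highest.
        chosen = np.argsort(-probs[:, e])[:capacity]
        # Each chosen token's expert output is weighted by its routing prob.
        out[chosen] += probs[chosen, e:e + 1] * expert_fn(tokens[chosen])
    return out

rng = np.random.default_rng(0)
n_tokens, d, n_experts = 8, 4, 2
tokens = rng.normal(size=(n_tokens, d))
router_w = rng.normal(size=(d, n_experts))
# Each "expert" here is just a random linear map for illustration.
weights = [rng.normal(size=(d, d)) for _ in range(n_experts)]
experts = [lambda x, w=w: x @ w for w in weights]
print(expert_choice_routing(tokens, router_w, experts, capacity=4).shape)
```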
-
Publication number: 20250124700
Abstract: Methods, systems, and apparatus, including computer-readable media, are described for processing an input image using a convolutional neural network (CNN). The CNN includes a sequence of layer blocks. Each of a first subset of the layer blocks in the sequence is configured to perform operations that include: i) receiving an input feature map for the layer block, ii) generating an expanded feature map from the input feature map using a group convolution, and iii) generating a reduced feature map from the expanded feature map. The input feature map is an h × w feature map with c1 channels. The expanded feature map is an h × w feature map with c2 channels, whereas the reduced feature map is an h × w feature map with c1 channels, where c2 is greater than c1. An output feature map is generated for the layer block from the reduced feature map.
Type: Application
Filed: October 8, 2021
Publication date: April 17, 2025
Inventors: Berkin Akin, Suyog Gupta, Cao Gao, Ping Zhou, Gabriel Mintzer Bender, Hanxiao Liu
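The expand-then-reduce shape contract described above can be sketched as follows; the kernel size, group count, activation, and residual connection are assumptions for illustration, and the grouped convolution is implemented naively for clarity rather than speed.

```python
import numpy as np

def group_conv2d(x, w, groups):
    """Naive grouped 2D convolution, stride 1, 'same' padding.
    x: (h, w, c_in); w: (k, k, c_in // groups, c_out). Output channels are
    split evenly across groups; each group sees only its input slice."""
    h, wd, c_in = x.shape
    k, c_out = w.shape[0], w.shape[-1]
    cig, cog = c_in // groups, c_out // groups
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (pad, pad), (0, 0)))
    out = np.zeros((h, wd, c_out))
    for g in range(groups):
        xs = xp[:, :, g * cig:(g + 1) * cig]
        ws = w[:, :, :, g * cog:(g + 1) * cog]
        for i in range(h):
            for j in range(wd):
                patch = xs[i:i + k, j:j + k, :]       # (k, k, cig)
                out[i, j, g * cog:(g + 1) * cog] = np.tensordot(
                    patch, ws, axes=([0, 1, 2], [0, 1, 2]))
    return out

def expand_reduce_block(x, w_expand, w_reduce, groups):
    """Layer-block sketch: a group convolution expands c1 -> c2 (c2 > c1),
    then a 1x1 projection reduces back to c1; the h x w shape is kept."""
    expanded = np.maximum(group_conv2d(x, w_expand, groups), 0.0)  # (h, w, c2)
    reduced = expanded @ w_reduce                                  # (h, w, c1)
    return x + reduced  # residual connection (an assumption, common in practice)

rng = np.random.default_rng(0)
h = wd = 5; c1, c2, k, groups = 4, 8, 3, 2
x = rng.normal(size=(h, wd, c1))
out = expand_reduce_block(
    x,
    rng.normal(size=(k, k, c1 // groups, c2)),
    rng.normal(size=(c2, c1)),
    groups,
)
print(out.shape)  # (5, 5, 4)
```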
-
Publication number: 20240428071
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing a machine learning task on a network input to generate a network output. One of the systems includes an attention neural network configured to perform the machine learning task. The attention neural network includes one or more attention layers that each include a squared ReLU activation layer, a depth-wise convolution layer, or both.
Type: Application
Filed: September 3, 2024
Publication date: December 26, 2024
Inventors: David Richard So, Quoc V. Le, Hanxiao Liu, Wojciech Andrzej Manke, Zihang Dai, Noam M. Shazeer
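Both named components are simple to state in code. The sketch below shows a squared ReLU activation and a depth-wise 1D convolution over the sequence axis; the causal padding and all shapes are illustrative assumptions.

```python
import numpy as np

def squared_relu(x):
    """Squared ReLU: relu(x) ** 2, applied element-wise."""
    return np.maximum(x, 0.0) ** 2

def depthwise_conv1d(x, kernels):
    """Depth-wise 1D convolution along the sequence axis: every channel is
    convolved with its own length-k kernel (causal padding is an assumption).
    x: (seq, channels); kernels: (k, channels)."""
    seq, k = x.shape[0], kernels.shape[0]
    xp = np.pad(x, ((k - 1, 0), (0, 0)))   # left-pad so the output stays causal
    out = np.zeros_like(x)
    for t in range(seq):
        out[t] = (xp[t:t + k] * kernels).sum(axis=0)
    return out

rng = np.random.default_rng(0)
x = rng.normal(size=(6, 4))                # (seq, channels)
kernels = rng.normal(size=(3, 4))          # one length-3 kernel per channel
print(squared_relu(x).shape, depthwise_conv1d(x, kernels).shape)  # (6, 4) (6, 4)
```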
-
Publication number: 20240386260
Abstract: Methods, systems, and apparatus, including computer-readable media, are described for processing an input image using an integrated circuit that implements a convolutional neural network with a group convolution layer. The processing includes determining a mapping of partitions along a channel dimension of an input feature map to multiply-accumulate cells (MACs) in a computational unit of the circuit and applying a group convolution to the input feature map. Applying the group convolution includes, for each partition: providing weights for the group convolution layer to a subset of MACs based on the mapping; providing, via an input bus of the circuit, an input of the feature map to each MAC in the subset; and computing, at each MAC in the subset, a product using the input and a weight for the group convolution layer. An output feature map is generated for the group convolution layer based on an accumulation of products.
Type: Application
Filed: October 8, 2021
Publication date: November 21, 2024
Inventors: Berkin Akin, Suyog Gupta, Cao Gao, Ping Zhou, Gabriel Mintzer Bender, Hanxiao Liu
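As a rough software analogue, the sketch below simulates how channel partitions of a pointwise group convolution might be mapped onto multiply-accumulate (MAC) cells, with each cell computing one product and products accumulating into an output channel. Every hardware-facing detail here (partition layout, cell counts, pointwise kernel) is an invented simplification, not the circuit described in the application.

```python
import numpy as np

def group_conv_on_macs(inputs, weights, groups):
    """Simulates mapping channel partitions of a pointwise group convolution
    onto MAC cells: each partition's weights go to a subset of cells, the
    input value is broadcast to them over the input bus, and each cell's
    products accumulate into one output channel.

    inputs: (c_in,) one spatial position of the input feature map.
    weights: (groups, c_in // groups, c_out // groups)."""
    cig, cog = weights.shape[1], weights.shape[2]
    out = np.zeros(groups * cog)
    for g in range(groups):                          # one channel partition
        partition = inputs[g * cig:(g + 1) * cig]
        for o in range(cog):                         # one MAC subset per output
            acc = 0.0
            for i in range(cig):
                acc += partition[i] * weights[g, i, o]   # multiply-accumulate
            out[g * cog + o] = acc
    return out

rng = np.random.default_rng(0)
c_in, groups, cog = 8, 2, 3
x = rng.normal(size=c_in)
w = rng.normal(size=(groups, c_in // groups, cog))
print(group_conv_on_macs(x, w, groups))   # (groups * cog,) output channels
```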
-
Publication number: 20240249146
Abstract: A computer-implemented method for automatically determining a neural network architecture represents a neural network architecture as a data structure defining a hierarchical set of directed acyclic graphs in multiple levels. Each graph has an input, an output, and a plurality of nodes between the input and the output. At each level, a corresponding set of the nodes are connected pairwise by directed edges which indicate operations performed on outputs of one node to generate an input to another node. Each level is associated with a corresponding set of operations. At a lowest level, the operations associated with each edge are selected from a set of primitive operations. The method includes repeatedly generating new sample neural network architectures by modifying previous ones and evaluating their fitness. Each modification is performed by selecting a level, selecting two nodes at that level, and modifying, removing, or adding an edge between those nodes according to operations associated with lower levels of the hierarchy.
Type: Application
Filed: January 17, 2024
Publication date: July 25, 2024
Inventors: Chrisantha Thomas Fernando, Karen Simonyan, Koray Kavukcuoglu, Hanxiao Liu, Oriol Vinyals
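A minimal sketch of the mutation step described above, with the hierarchy stored as per-level lists of edge-to-operation maps; the primitive set, motif sizes, and the use of a "none" primitive to model edge removal are illustrative assumptions.

```python
import random

# Hierarchical architecture sketch: level 0 holds primitive op names; at
# level L, each motif is an edge map edges[(i, j)] whose values index into
# the motifs/ops of level L-1. All sizes here are illustrative.
PRIMITIVES = ["identity", "conv3x3", "maxpool", "none"]

def random_motif(num_nodes, num_choices_below, rng):
    return {(i, j): rng.randrange(num_choices_below)
            for i in range(num_nodes) for j in range(i + 1, num_nodes)}

def mutate(hierarchy, rng):
    """One mutation step per the abstract: pick a level, pick two nodes at
    that level, and re-assign the edge between them to an operation drawn
    from the level below (the 'none' primitive stands in for a missing
    edge, which also covers adding and removing edges)."""
    level = rng.randrange(1, len(hierarchy) + 1)      # levels above primitives
    motif = rng.choice(hierarchy[level - 1])
    edge = rng.choice(list(motif))                    # the two selected nodes
    n_below = len(PRIMITIVES) if level == 1 else len(hierarchy[level - 2])
    motif[edge] = rng.randrange(n_below)

rng = random.Random(0)
# Two levels above the primitives: 3 motifs of 4 nodes, then 1 motif of 3 nodes.
hierarchy = [
    [random_motif(4, len(PRIMITIVES), rng) for _ in range(3)],
    [random_motif(3, 3, rng)],
]
mutate(hierarchy, rng)
print(hierarchy[0][0])  # edge -> operation-index map after possible mutation
```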
-
Patent number: 11907853
Abstract: A computer-implemented method for automatically determining a neural network architecture represents a neural network architecture as a data structure defining a hierarchical set of directed acyclic graphs in multiple levels. Each graph has an input, an output, and a plurality of nodes between the input and the output. At each level, a corresponding set of the nodes are connected pairwise by directed edges which indicate operations performed on outputs of one node to generate an input to another node. Each level is associated with a corresponding set of operations. At a lowest level, the operations associated with each edge are selected from a set of primitive operations. The method includes repeatedly generating new sample neural network architectures by modifying previous ones and evaluating their fitness. Each modification is performed by selecting a level, selecting two nodes at that level, and modifying, removing, or adding an edge between those nodes according to operations associated with lower levels of the hierarchy.
Type: Grant
Filed: October 26, 2018
Date of Patent: February 20, 2024
Assignee: DeepMind Technologies Limited
Inventors: Chrisantha Thomas Fernando, Karen Simonyan, Koray Kavukcuoglu, Hanxiao Liu, Oriol Vinyals
-
Publication number: 20230359862
Abstract: A computer-implemented method for performing computer vision with reduced computational cost and improved accuracy can include: obtaining, by a computing system including one or more computing devices, input data comprising an input tensor having one or more dimensions; providing, by the computing system, the input data to a machine-learned convolutional attention network that includes two or more network stages; and, in response, receiving, by the computing system, a machine-learning prediction from the machine-learned convolutional attention network. The convolutional attention network can include at least one attention block with a relative attention mechanism, the relative attention mechanism comprising the sum of a static convolution kernel with an adaptive attention matrix.
Type: Application
Filed: July 19, 2023
Publication date: November 9, 2023
Inventors: Zihang Dai, Mingxing Tan, Quoc V. Le, Hanxiao Liu
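The core of the relative attention mechanism, the pre-softmax sum of a static positional term and the adaptive query-key term, can be sketched compactly; using one learned scalar per relative offset to stand in for the static convolution kernel is a simplifying assumption.

```python
import numpy as np

def relative_attention(q, k, v, rel_bias):
    """Relative attention sketch: the pre-softmax attention matrix is the
    sum of an input-adaptive part (q @ k.T) and a static, translation-
    equivariant term indexed by relative position."""
    seq, d = q.shape
    # rel_bias holds one learned scalar per relative offset -(seq-1)..(seq-1).
    idx = np.arange(seq)
    static = rel_bias[idx[:, None] - idx[None, :] + seq - 1]  # (seq, seq)
    logits = q @ k.T / np.sqrt(d) + static
    logits -= logits.max(axis=-1, keepdims=True)
    attn = np.exp(logits)
    attn /= attn.sum(axis=-1, keepdims=True)
    return attn @ v

rng = np.random.default_rng(0)
seq, d = 6, 4
q, k, v = (rng.normal(size=(seq, d)) for _ in range(3))
rel_bias = rng.normal(size=2 * seq - 1)
print(relative_attention(q, k, v, rel_bias).shape)  # (6, 4)
```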
-
Patent number: 11755883
Abstract: A computer-implemented method for performing computer vision with reduced computational cost and improved accuracy can include: obtaining, by a computing system including one or more computing devices, input data comprising an input tensor having one or more dimensions; providing, by the computing system, the input data to a machine-learned convolutional attention network that includes two or more network stages; and, in response, receiving, by the computing system, a machine-learning prediction from the machine-learned convolutional attention network. The convolutional attention network can include at least one attention block with a relative attention mechanism, the relative attention mechanism comprising the sum of a static convolution kernel with an adaptive attention matrix.
Type: Grant
Filed: May 27, 2022
Date of Patent: September 12, 2023
Assignee: GOOGLE LLC
Inventors: Zihang Dai, Hanxiao Liu, Mingxing Tan, Quoc V. Le
-
Publication number: 20230176840
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for compiler optimizations using a compiler optimization network. One of the methods includes receiving an input program that defines a graph of operation modules, wherein each node in the graph is a respective operation module and each edge between nodes represents one operation module receiving the output generated by another operation module. The input program is processed by a compiler optimization network comprising a graph-embedding network that is configured to encode operation features and operation dependencies of the operation modules of the input program into a graph embedding representation, and a policy network that is configured to generate an optimization action for each of one or more nodes encoded in the graph embedding representation.
Type: Application
Filed: June 7, 2021
Publication date: June 8, 2023
Inventors: Yanqi Zhou, Sudip Roy, Amirali Abdolrashidi, Daniel Lin-Kit Wong, Chao Ma, Qiumin Xu, Hanxiao Liu, Phitchaya Mangpo Phothilimthana, Shen Wang, Anna Darling Goldie, Azalia Mirhoseini, James Laudon
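The two-part structure (a graph embedding, then a per-node policy) can be sketched with a toy message-passing encoder and a linear policy head; the mixing rule, action set, and all shapes are invented stand-ins for the real networks.

```python
import numpy as np

def graph_embed(node_feats, adj, w_msg, steps=2):
    """Toy graph-embedding network: each step mixes a node's features with
    the mean of its neighbors' features (a stand-in for the real encoder)."""
    h = node_feats
    deg = np.maximum(adj.sum(axis=1, keepdims=True), 1)
    for _ in range(steps):
        h = np.tanh((h + adj @ h / deg) @ w_msg)
    return h

def policy(node_embeds, w_policy):
    """Per-node action scores; argmax picks an optimization action (e.g. a
    placement or fusion choice) for each operation module."""
    return (node_embeds @ w_policy).argmax(axis=1)

rng = np.random.default_rng(0)
n_nodes, d_feat, n_actions = 5, 8, 3
feats = rng.normal(size=(n_nodes, d_feat))
adj = (rng.random((n_nodes, n_nodes)) < 0.3).astype(float)  # op dependencies
actions = policy(graph_embed(feats, adj, rng.normal(size=(d_feat, d_feat))),
                 rng.normal(size=(d_feat, n_actions)))
print(actions)  # one optimization action index per node
```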
-
Publication number: 20230154161
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using memory-optimized contrastive learning to train image encoder and text encoder neural networks.
Type: Application
Filed: November 16, 2022
Publication date: May 18, 2023
Inventors: Hieu Hy Pham, Zihang Dai, Golnaz Ghiasi, Hanxiao Liu, Wei Yu, Mingxing Tan, Quoc V. Le
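As one hedged reading of "memory-optimized", the sketch below computes an InfoNCE-style image-text contrastive loss in row chunks so the full similarity matrix never needs to be materialized at once. The chunking scheme and temperature are illustrative guesses, not the patented method.

```python
import numpy as np

def contrastive_loss_chunked(img_emb, txt_emb, temperature=0.07, chunk=2):
    """Image-text contrastive (InfoNCE-style) loss over matched pairs,
    computed in row chunks of the similarity matrix to bound memory."""
    n = img_emb.shape[0]
    img = img_emb / np.linalg.norm(img_emb, axis=1, keepdims=True)
    txt = txt_emb / np.linalg.norm(txt_emb, axis=1, keepdims=True)
    loss = 0.0
    for start in range(0, n, chunk):
        rows = img[start:start + chunk] @ txt.T / temperature   # (chunk, n)
        rows -= rows.max(axis=1, keepdims=True)                 # stability
        log_probs = rows - np.log(np.exp(rows).sum(axis=1, keepdims=True))
        # Matched image/text pairs sit on the diagonal of the full matrix.
        for i in range(rows.shape[0]):
            loss -= log_probs[i, start + i]
    return loss / n

rng = np.random.default_rng(0)
print(contrastive_loss_chunked(rng.normal(size=(4, 8)), rng.normal(size=(4, 8))))
```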
-
Publication number: 20230121404
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for searching for an architecture for an activation-normalization layer to be included in a neural network to replace a set of layers that receive a layer input comprising a plurality of values, apply one or more normalization operations to the values in the layer input to generate a normalized layer input, and apply an element-wise activation function to the normalized layer input to generate a layer output.
Type: Application
Filed: February 8, 2021
Publication date: April 20, 2023
Inventors: Hanxiao Liu, Quoc V. Le, Andrew Brock, Karen Simonyan
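The sketch below shows the kind of layer set being replaced (normalization followed by an element-wise activation) next to one hypothetical fused candidate of the sort such a search might produce; the candidate's expression is invented for illustration and is not a searched result from the application.

```python
import numpy as np

def baseline_norm_act(x, gamma, beta, eps=1e-5):
    """The layer set being replaced: normalize, then apply an element-wise
    activation (layer norm + ReLU shown here)."""
    mu = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    normed = gamma * (x - mu) / np.sqrt(var + eps) + beta
    return np.maximum(normed, 0.0)

def searched_norm_act(x, gamma, beta, v, eps=1e-5):
    """One *hypothetical* searched activation-normalization layer fusing the
    two steps into a single expression; the actual searched architectures
    are defined by the search procedure, not by this sketch."""
    std = np.sqrt(x.var(axis=-1, keepdims=True) + eps)
    return gamma * x / np.maximum(std, v * x) + beta

rng = np.random.default_rng(0)
x = rng.normal(size=(3, 8))
gamma, beta = np.ones(8), np.zeros(8)
print(baseline_norm_act(x, gamma, beta).shape,
      searched_norm_act(x, gamma, beta, 1.0).shape)  # (3, 8) (3, 8)
```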
-
Publication number: 20220405579
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for selecting a neural network to perform a particular machine learning task while satisfying a set of constraints.
Type: Application
Filed: March 3, 2021
Publication date: December 22, 2022
Inventors: Jiahui Yu, Pengchong Jin, Hanxiao Liu, Gabriel Mintzer Bender, Pieter-Jan Kindermans, Mingxing Tan, Xiaodan Song, Ruoming Pang, Quoc V. Le
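A minimal sketch of constraint-filtered selection: discard candidates that violate any constraint, then pick the best performer among the rest. Candidate metrics and constraint names are invented for illustration.

```python
# Hypothetical candidate networks with measured quality and cost metrics.
candidates = [
    {"name": "net_a", "accuracy": 0.76, "latency_ms": 12.0, "params_m": 5.3},
    {"name": "net_b", "accuracy": 0.79, "latency_ms": 21.0, "params_m": 9.1},
    {"name": "net_c", "accuracy": 0.78, "latency_ms": 14.5, "params_m": 6.8},
]
constraints = {"latency_ms": 15.0, "params_m": 8.0}   # upper bounds

# Keep only networks satisfying every constraint, then take the most accurate.
feasible = [c for c in candidates
            if all(c[key] <= bound for key, bound in constraints.items())]
best = max(feasible, key=lambda c: c["accuracy"])
print(best["name"])  # net_c: best accuracy among constraint-satisfying nets
```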
-
Publication number: 20220383119
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing a machine learning task on a network input to generate a network output. One of the systems includes an attention neural network configured to perform the machine learning task. The attention neural network includes one or more attention layers that each include a squared ReLU activation layer, a depth-wise convolution layer, or both.
Type: Application
Filed: May 27, 2022
Publication date: December 1, 2022
Inventors: David Richard So, Quoc V. Le, Jr., Hanxiao Liu, Wojciech Andrzej Manke, Zihang Dai, Noam M. Shazeer
-
Publication number: 20220383069
Abstract: A computer-implemented method for performing computer vision with reduced computational cost and improved accuracy can include: obtaining, by a computing system including one or more computing devices, input data comprising an input tensor having one or more dimensions; providing, by the computing system, the input data to a machine-learned convolutional attention network that includes two or more network stages; and, in response, receiving, by the computing system, a machine-learning prediction from the machine-learned convolutional attention network. The convolutional attention network can include at least one attention block with a relative attention mechanism, the relative attention mechanism comprising the sum of a static convolution kernel with an adaptive attention matrix.
Type: Application
Filed: May 27, 2022
Publication date: December 1, 2022
Inventors: Zihang Dai, Hanxiao Liu, Mingxing Tan, Quoc V. Le
-
Publication number: 20220367052
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing a machine learning task on a network input to generate a network output. In one aspect, one of the systems includes a neural network configured to perform the machine learning task, the neural network including one or more blocks that each include a feedforward spatial transformation unit.
Type: Application
Filed: May 16, 2022
Publication date: November 17, 2022
Inventors: Hanxiao Liu, David Richard So, Quoc V. Le, Zihang Dai
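One plausible form of a feedforward spatial transformation unit splits a channel projection in half, transforms one half along the sequence (spatial) axis, and uses it to gate the other half. The split-and-gate layout and all dimensions below are illustrative assumptions.

```python
import numpy as np

def spatial_gating_block(x, w_in, w_spatial, b_spatial, w_out):
    """Block with a feedforward spatial transformation unit (illustrative):
    the channel projection is split in two, one half is transformed along
    the *spatial* (sequence) axis, and the halves are multiplied to gate."""
    h = np.maximum(x @ w_in, 0.0)                 # (seq, 2 * d_ffn)
    u, v = np.split(h, 2, axis=-1)                # two halves, (seq, d_ffn)
    v = w_spatial @ v + b_spatial[:, None]        # mix across sequence positions
    return (u * v) @ w_out                        # gate, project back to d_model

rng = np.random.default_rng(0)
seq, d, d_ffn = 6, 8, 16
x = rng.normal(size=(seq, d))
out = spatial_gating_block(
    x,
    rng.normal(size=(d, 2 * d_ffn)),
    rng.normal(size=(seq, seq)), np.ones(seq),
    rng.normal(size=(d_ffn, d)),
)
print(out.shape)  # (6, 8)
```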
-
Publication number: 20200293899
Abstract: A computer-implemented method for automatically determining a neural network architecture represents a neural network architecture as a data structure defining a hierarchical set of directed acyclic graphs in multiple levels. Each graph has an input, an output, and a plurality of nodes between the input and the output. At each level, a corresponding set of the nodes are connected pairwise by directed edges which indicate operations performed on outputs of one node to generate an input to another node. Each level is associated with a corresponding set of operations. At a lowest level, the operations associated with each edge are selected from a set of primitive operations. The method includes repeatedly generating new sample neural network architectures by modifying previous ones and evaluating their fitness. Each modification is performed by selecting a level, selecting two nodes at that level, and modifying, removing, or adding an edge between those nodes according to operations associated with lower levels of the hierarchy.
Type: Application
Filed: October 26, 2018
Publication date: September 17, 2020
Inventors: Chrisantha Thomas Fernando, Karen Simonyan, Koray Kavukcuoglu, Hanxiao Liu, Oriol Vinyals