Patents by Inventor Runfei Luo

Runfei Luo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Searching compression profiles for trained neural networks

Patent number: 12314277

Abstract: Compression profiles may be searched for trained neural networks. An iterative compression profile search may be performed response to a search request. Different prospective compression profiles may be generated for trained neural networks according to a search policy. Performance of compressed versions of the trained neural networks according to the compression profiles may be tracked. The search policy may be updated according to an evaluation of the performance of the compression profiles for the compressed versions of the trained neural networks using compression performance criteria. When a search criteria is satisfied, a result for the compression profile search may be provided.

Type: Grant

Filed: June 13, 2023

Date of Patent: May 27, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Ragav Venkatesan, Gurumurthy Swaminathan, Xiong Zhou, Runfei Luo, Vineet Khare
SEARCHING COMPRESSION PROFILES FOR TRAINED NEURAL NETWORKS

Publication number: 20230409584

Abstract: Compression profiles may be searched for trained neural networks. An iterative compression profile search may be performed response to a search request. Different prospective compression profiles may be generated for trained neural networks according to a search policy. Performance of compressed versions of the trained neural networks according to the compression profiles may be tracked. The search policy may be updated according to an evaluation of the performance of the compression profiles for the compressed versions of the trained neural networks using compression performance criteria. When a search criteria is satisfied, a result for the compression profile search may be provided.

Type: Application

Filed: June 13, 2023

Publication date: December 21, 2023

Applicant: Amazon Technologies, Inc.

Inventors: Ragav Venkatesan, Gurumurthy Swaminathan, Xiong Zhou, Runfei Luo, Vineet Khare
Applying compression profiles across similar neural network architectures

Patent number: 11809992

Abstract: Neural networks with similar architectures may be compressed using shared compression profiles. A request to compress a trained neural network may be received and an architecture of the neural network identified. The identified architecture may be compared with the different network architectures mapped to compression profiles to select a compression profile for the neural network. The compression profile may be applied to remove features of the neural network to generate a compressed version of the neural network.

Type: Grant

Filed: March 31, 2020

Date of Patent: November 7, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Gurumurthy Swaminathan, Ragav Venkatesan, Xiong Zhou, Runfei Luo, Vineet Khare
Searching compression profiles for trained neural networks

Patent number: 11755603

Abstract: Compression profiles may be searched for trained neural networks. An iterative compression profile search may be performed response to a search request. Different prospective compression profiles may be generated for trained neural networks according to a search policy. Performance of compressed versions of the trained neural networks according to the compression profiles may be tracked. The search policy may be updated according to an evaluation of the performance of the compression profiles for the compressed versions of the trained neural networks using compression performance criteria. When a search criteria is satisfied, a result for the compression profile search may be provided.

Type: Grant

Filed: March 26, 2020

Date of Patent: September 12, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Ragav Venkatesan, Gurumurthy Swaminathan, Xiong Zhou, Runfei Luo, Vineet Khare
Iterative model training and deployment for automated learning systems

Patent number: 11605021

Abstract: Techniques for iterative model training and deployment for automated learning systems are described. A method of iterative model training and deployment for automated learning systems comprises generating training data based on inference data, provided by a first version of a model hosted at an endpoint of a machine learning service, and feedback data, received from a client application, using an identifier associated with the inference data and the feedback data, generating a second version of the model using the training data, and deploying the model to the endpoint of the machine learning service.

Type: Grant

Filed: September 30, 2019

Date of Patent: March 14, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Vineet Khare, Saurabh Gupta, Yijie Zhuang, Bharathan Balaji, Runfei Luo, Siddhartha Agarwal
Reinforcement learning for training compression policies for machine learning models

Patent number: 11501173

Abstract: A compression policy to produce compression profiles for compressing trained machine learning models may be trained using reinforcement learning. An iterative reinforcement learning may be performed response to a search request. Different prospective compression profiles may be generated for received machine learning models according to a compression policy being trained. Performance of compressed versions of the trained neural networks according to the compression profiles may be caused using data sets used to train the machine learning models. The compression policy may be updated according to reward signal determined from an application of a reward function for performance criteria to performance results of the different versions of the machine learning models. When a search criteria is satisfied, the trained compression policy may be provided.

Type: Grant

Filed: March 26, 2020

Date of Patent: November 15, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Gurumurthy Swaminathan, Ragav Venkatesan, Xiong Zhou, Runfei Luo, Vineet Khare

Searching compression profiles for trained neural networks

SEARCHING COMPRESSION PROFILES FOR TRAINED NEURAL NETWORKS

Applying compression profiles across similar neural network architectures

Searching compression profiles for trained neural networks

Iterative model training and deployment for automated learning systems

Reinforcement learning for training compression policies for machine learning models