Patents by Inventor Deniz Oktay

Deniz Oktay has filed for patents to protect the following inventions. This listing includes both pending patent applications and patents already granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11907818
    Abstract: Example aspects of the present disclosure are directed to systems and methods that learn a compressed representation of a machine-learned model (e.g., neural network) via representation of the model parameters within a reparameterization space during training of the model. In particular, the present disclosure describes an end-to-end model weight compression approach that employs a latent-variable data compression method. The model parameters (e.g., weights and biases) are represented in a “latent” or “reparameterization” space, amounting to a reparameterization. In some implementations, this space can be equipped with a learned probability model, which is used first to impose an entropy penalty on the parameter representation during training, and second to compress the representation using arithmetic coding after training. The proposed approach can thus maximize accuracy and model compressibility jointly, in an end-to-end fashion, with the rate-error trade-off specified by a hyperparameter.
    Type: Grant
    Filed: February 6, 2023
    Date of Patent: February 20, 2024
    Assignee: GOOGLE LLC
    Inventors: Deniz Oktay, Saurabh Singh, Johannes Ballé, Abhinav Shrivastava
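
  The abstract above describes a concrete training scheme: the model weights live in a latent reparameterization space, a learned probability model imposes an entropy penalty on that representation during training, and arithmetic coding compresses it afterward, with a single hyperparameter trading rate against error. Below is a minimal PyTorch sketch of that idea, not the patented implementation: the scalar affine decoder, the factorized Gaussian prior, and the LAMBDA value are illustrative assumptions, and the differentiable rate term stands in for the code length arithmetic coding would achieve.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    LAMBDA = 0.01  # rate-error trade-off hyperparameter (illustrative value)

    class ReparameterizedLinear(nn.Module):
        """Linear layer whose weights are decoded from trained latents."""
        def __init__(self, in_features, out_features):
            super().__init__()
            # Latent representation of the weights: trained, penalized, compressed.
            self.latent = nn.Parameter(0.1 * torch.randn(out_features, in_features))
            # Learned scalar affine decoder (stand-in for a richer transform).
            self.scale = nn.Parameter(torch.ones(1))
            self.shift = nn.Parameter(torch.zeros(1))
            self.bias = nn.Parameter(torch.zeros(out_features))
            # Learned factorized Gaussian probability model over the latents.
            self.prior_log_std = nn.Parameter(torch.zeros(1))

        def rate(self):
            # Negative log-likelihood under the learned prior: a differentiable
            # proxy for the bits arithmetic coding would spend on the latents.
            std = self.prior_log_std.exp()
            return (0.5 * (self.latent / std) ** 2 + self.prior_log_std).sum()

        def forward(self, x):
            weight = self.latent * self.scale + self.shift  # decode latents -> weights
            return F.linear(x, weight, self.bias)

    # Toy regression task to exercise the layer end to end.
    torch.manual_seed(0)
    x = torch.randn(256, 8)
    y = x @ torch.randn(8, 1)

    layer = ReparameterizedLinear(8, 1)
    opt = torch.optim.Adam(layer.parameters(), lr=1e-2)
    for step in range(500):
        opt.zero_grad()
        task_loss = F.mse_loss(layer(x), y)
        loss = task_loss + LAMBDA * layer.rate()  # joint error + entropy objective
        loss.backward()
        opt.step()
    print(f"task loss {task_loss.item():.4f}, rate penalty {layer.rate().item():.2f}")

  After training, the latents would be quantized and entropy-coded under the learned prior; the penalty here only shapes them toward a compressible distribution during training. The remaining entries below share this abstract, so the sketch applies to them as well.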
  • Publication number: 20230186166
    Abstract: Example aspects of the present disclosure are directed to systems and methods that learn a compressed representation of a machine-learned model (e.g., neural network) via representation of the model parameters within a reparameterization space during training of the model. In particular, the present disclosure describes an end-to-end model weight compression approach that employs a latent-variable data compression method. The model parameters (e.g., weights and biases) are represented in a “latent” or “reparameterization” space, amounting to a reparameterization. In some implementations, this space can be equipped with a learned probability model, which is used first to impose an entropy penalty on the parameter representation during training, and second to compress the representation using arithmetic coding after training. The proposed approach can thus maximize accuracy and model compressibility jointly, in an end-to-end fashion, with the rate-error trade-off specified by a hyperparameter.
    Type: Application
    Filed: February 6, 2023
    Publication date: June 15, 2023
    Inventors: Deniz Oktay, Saurabh Singh, Johannes Ballé, Abhinav Shrivastava
  • Patent number: 11574232
    Abstract: Example aspects of the present disclosure are directed to systems and methods that learn a compressed representation of a machine-learned model (e.g., neural network) via representation of the model parameters within a reparameterization space during training of the model. In particular, the present disclosure describes an end-to-end model weight compression approach that employs a latent-variable data compression method. The model parameters (e.g., weights and biases) are represented in a “latent” or “reparameterization” space, amounting to a reparameterization. In some implementations, this space can be equipped with a learned probability model, which is used first to impose an entropy penalty on the parameter representation during training, and second to compress the representation using arithmetic coding after training. The proposed approach can thus maximize accuracy and model compressibility jointly, in an end-to-end fashion, with the rate-error trade-off specified by a hyperparameter.
    Type: Grant
    Filed: May 13, 2020
    Date of Patent: February 7, 2023
    Assignee: GOOGLE LLC
    Inventors: Deniz Oktay, Saurabh Singh, Johannes Ballé, Abhinav Shrivastava
  • Publication number: 20200364603
    Abstract: Example aspects of the present disclosure are directed to systems and methods that learn a compressed representation of a machine-learned model (e.g., neural network) via representation of the model parameters within a reparameterization space during training of the model. In particular, the present disclosure describes an end-to-end model weight compression approach that employs a latent-variable data compression method. The model parameters (e.g., weights and biases) are represented in a “latent” or “reparameterization” space, amounting to a reparameterization. In some implementations, this space can be equipped with a learned probability model, which is used first to impose an entropy penalty on the parameter representation during training, and second to compress the representation using arithmetic coding after training. The proposed approach can thus maximize accuracy and model compressibility jointly, in an end-to-end fashion, with the rate-error trade-off specified by a hyperparameter.
    Type: Application
    Filed: May 13, 2020
    Publication date: November 19, 2020
    Inventors: Deniz Oktay, Saurabh Singh, Johannes Ballé, Abhinav Shrivastava