Abstract: A method and system for training a model for image generation. The model includes a hybrid variational auto-encoder (VAE)—generative adversarial network (GAN) framework. The method includes the steps of: multiple input of an input image into the VAE which outputs in response multiple distinct output image samples, determining the best of the multiple output image samples as a best-of-many sample, the best-of-many sample having the minimum reconstruction cost, and training the model based on a predefined training objective, the predefined training objective integrating the best-of-many sample reconstruction cost and a GAN-based synthetic likelihood term.
Type:
Application
Filed:
May 28, 2019
Publication date:
July 28, 2022
Applicants:
TOYOTA MOTOR EUROPE, MAX-PLANCK-INSTITUT FÜR INFORMATIK
Inventors:
Daniel OLMEDA REINO, Apratim BHATTACHARYYA, Mario FRITZ, Bernt SCHIELE