The Effect of Latent Space Dimension on the Quality of Synthesized Human Face Images

Published online: May 24, 2021 Full Text: PDF (9.32 MiB) DOI: 10.24138/jcomss-2021-0035
Cite this paper
Authors:
Ivana Marin, Sven Gotovac, Mladen Russo, Dunja Božić-Štulić

Abstract

In recent years Generative Adversarial Networks (GANs) have achieved remarkable results in the task of realistic image synthesis. Despite their continued success and advances, there still lacks a thorough understanding of how precisely GANs map random latent vectors to realistic-looking images and how the priors set on the latent space affect the learned mapping. In this work, we analyze the effect of the chosen latent dimension on the final quality of synthesized images of human faces and learned data representations. We show that GANs can generate images plausibly even with latent dimensions significantly smaller than the standard dimensions like 100 or 512. Although one might expect that larger latent dimensions encourage the generation of more diverse and enhanced quality images, we show that an increase of latent dimension after some point does not lead to visible improvements in perceptual image quality nor in quantitative estimates of its generalization abilities.

Keywords

Generative Adversarial Networks, Latent space exploration, Latent dimension, Evaluation, Frechet Inception Distance (FID), Image synthesis
Creative Commons License 4.0
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.