The Effect of Latent Space Dimension on the Quality of Synthesized Human Face Images
Abstract
In recent years Generative Adversarial Networks (GANs) have achieved remarkable results in the task of realistic image synthesis. Despite their continued success and advances, there still lacks a thorough understanding of how precisely GANs map random latent vectors to realistic-looking images and how the priors set on the latent space affect the learned mapping. In this work, we analyze the effect of the chosen latent dimension on the final quality of synthesized images of human faces and learned data representations. We show that GANs can generate images plausibly even with latent dimensions significantly smaller than the standard dimensions like 100 or 512. Although one might expect that larger latent dimensions encourage the generation of more diverse and enhanced quality images, we show that an increase of latent dimension after some point does not lead to visible improvements in perceptual image quality nor in quantitative estimates of its generalization abilities.
Keywords
Generative Adversarial Networks, Latent space exploration, Latent dimension, Evaluation, Frechet Inception Distance (FID), Image synthesisThis work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
I. Marin, S. Gotovac, M. Russo and D. Božić-Štulić, "The Effect of Latent Space Dimension on the Quality of Synthesized Human Face Images," in Journal of Communications Software and Systems, vol. 17, no. 2, pp. 124-133, May 2021, doi: 10.24138/jcomss-2021-0035
@article{marin2021effectlatent, author = {Ivana Marin and Sven Gotovac and Mladen Russo and Dunja Božić-Štulić}, title = {The Effect of Latent Space Dimension on the Quality of Synthesized Human Face Images}, journal = {Journal of Communications Software and Systems}, month = {5}, year = {2021}, volume = {17}, number = {2}, pages = {124--133}, doi = {10.24138/jcomss-2021-0035}, url = {https://doi.org/10.24138/jcomss-2021-0035} }