Abstract
We present an autoencoder that leverages learned representations to better measure similarities in data space. By combining a variational autoencoder (VAE) with a generative adversarial network (GAN) we can use learned feature representations in the GAN discriminator as basis for the VAE reconstruction objective. Thereby, we replace element-wise errors with feature-wise errors to better capture the data distribution while offering invariance towards e.g. translation. We apply our method to images of faces and show that it outperforms VAEs with element-wise similarity measures in terms of visual fidelity. Moreover, we show that the method learns an embedding in which high-level abstract visual features (e.g. wearing glasses) can be modified using simple arithmetic.
Originalsprog | Engelsk |
---|---|
Titel | Proceedings of The 33rd International Conference on Machine Learning |
Redaktører | Maria Florina Balcan, Kilian Q. Weinberger |
Antal sider | 9 |
Publikationsdato | 2016 |
Sider | 1558–1566 |
ISBN (Elektronisk) | 978-151082900-8 |
Status | Udgivet - 2016 |
Begivenhed | 33rd International Conference on Machine Learning - New York, USA Varighed: 19 jun. 2016 → 24 jun. 2016 Konferencens nummer: 33 |
Konference
Konference | 33rd International Conference on Machine Learning |
---|---|
Nummer | 33 |
Land/Område | USA |
By | New York |
Periode | 19/06/2016 → 24/06/2016 |
Navn | JMLR: Workshop and Conference Proceedings |
---|---|
Vol/bind | 48 |