VAEs are among the state-of-the-art generative models, but have recently been overshadowed by GANs, most prominently StyleGAN by Karras et al. Unlike StyleGAN, a VAE can encode as well as decode, which is useful in many downstream tasks. In this work we combine the style-based architecture with a VAE and achieve state-of-the-art reconstruction and generation. Following DFC-VAE by Hou et al., we use a perceptual loss, and we compare our results to that work.
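To make the "style-based" part concrete, below is a minimal, hypothetical PyTorch-style sketch of how a single latent code can modulate each decoder block through adaptive instance normalization (AdaIN), in the spirit of StyleGAN. The class and parameter names here are illustrative only; the actual layer definitions live in `VaeLayers` and may differ.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdaIN(nn.Module):
    """Adaptive instance norm: the latent code predicts per-channel scale/shift."""
    def __init__(self, latent_dim, channels):
        super().__init__()
        self.affine = nn.Linear(latent_dim, 2 * channels)

    def forward(self, x, w):
        scale, shift = self.affine(w).chunk(2, dim=1)
        x = F.instance_norm(x)
        return x * (1 + scale[:, :, None, None]) + shift[:, :, None, None]

class StyleBlock(nn.Module):
    """One decoder block: upsample, convolve, then re-inject the style code."""
    def __init__(self, latent_dim, in_ch, out_ch):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, 3, padding=1)
        self.adain = AdaIN(latent_dim, out_ch)

    def forward(self, x, w):
        x = F.interpolate(x, scale_factor=2, mode="nearest")
        x = F.leaky_relu(self.conv(x), 0.2)
        return self.adain(x, w)
```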
The loss comprises two components (a rough sketch follows this list):
- Reconstruction Loss - a perceptual loss based on pre-trained VGG16 features
- Latent Loss - a KL-divergence term
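As an illustration, the two terms might be combined as follows. This is a hedged PyTorch sketch using `torchvision`'s VGG16; the feature layers and weighting constant are assumptions, not the repository's actual values.

```python
import torch
import torch.nn.functional as F
from torchvision import models

# Frozen VGG16 feature extractor for the perceptual (reconstruction) term.
vgg = models.vgg16(pretrained=True).features[:16].eval()
for p in vgg.parameters():
    p.requires_grad_(False)

def vae_loss(x, x_rec, mu, logvar, kl_weight=1.0):
    # Perceptual reconstruction loss: compare VGG16 features of input and reconstruction.
    rec = F.mse_loss(vgg(x_rec), vgg(x))
    # KL divergence between q(z|x) = N(mu, sigma^2) and the standard normal prior.
    kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    return rec + kl_weight * kl
```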
- `./Model`
  - Layers class: `VaeLayers`
  - The perceptual model class: `PerceptualModel`
  - Our model class: `StyleVae`
- `./Train`
  - The trainer class: `StyleVaeTrainer`
  - Train script: `$ python train.py --load <True|False>`
  - Test script: `$ python test.py`
  - All saved models are stored under `./train_output` for restoring
- `./Data`
  - Dataset class: `Dataset`
  - All data should be saved under `/data/svae/*.png`
Test results can be seen in the visuals section.
We trained the provided model on the FFHQ dataset to produce 256x256 results:
- Adversarial Training - adding an adversarial term to improve generation of hair, fine details, and background, especially in the high-resolution models
- Weighted KL-Divergence - the hierarchical structure of the code injection allows a weighted KL-divergence loss term, letting the VAE encoder apply a different divergence weight to fine vs. coarse features (see the sketch below). This makes sense because fine details such as hair are closer to normally distributed, while coarse features behave differently (e.g., there are not many dark-skinned redheads)
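A hypothetical sketch of such a weighted term, assuming the encoder produces one `(mu, logvar)` pair per injection level; the function name and the example weights are illustrative, not part of the current code.

```python
import torch

def weighted_kl(mus, logvars, level_weights):
    """KL term with a separate weight per code level (coarse -> fine)."""
    total = 0.0
    for mu, logvar, w in zip(mus, logvars, level_weights):
        kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
        total = total + w * kl
    return total

# e.g. regularize coarse levels more strongly than fine ones (illustrative weights):
# loss = rec_loss + weighted_kl(mus, logvars, level_weights=[2.0, 1.0, 0.5, 0.25])
```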
Available soon...