CVAE and VQ-VAE

This is an implementation of the VQ-VAE (Vector Quantized Variational Autoencoder) and Convolutional Varational Autoencoder. from Neural Discrete representation learning for compressing MNIST and Cifar10. The code is based upon pytorch/examples/vae.

pip install -r requirements.txt
python main.py

requirements

Python 3.6 (maybe 3.5 will work as well)
PyTorch 0.4
Additional requirements in requirements.txt

Usage

# For example
python3 main.py --dataset=cifar10 --model=vqvae --data-dir=~/.datasets --epochs=3

Results

All images are taken from the test set. Top row is the original image. Bottom row is the reconstruction.

k - number of elements in the dictionary. d - dimension of elements in the dictionary (number of channels in bottleneck).

MNIST (k=10, d=64)

CIFAR10 (k=128, d=256)

Imagenet (k=512, d=128)

TODO:

Implement Continuous Relaxation Training of Discrete Latent Variable Image Models
Sample using PixelCNN prior
Improve results on cifar - nearest neighbor should be performed to 10 dictionaries rather than 1
Improve results on cifar - replace MSE with NLL
Improve results on cifar - measure bits/dim
Compare architecture with the offical one
Merge VAE and VQ-VAE for MNIST and Cifar to one script

Acknowledgement

tf-vaevae for a good reference.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
images		images
vq_vae		vq_vae
.gitignore		.gitignore
.gitmodules		.gitmodules
1.png		1.png
LICENSE		LICENSE
README.md		README.md
main.py		main.py
read_latent.py		read_latent.py
requirements.txt		requirements.txt
setup.py		setup.py
test.png		test.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CVAE and VQ-VAE

requirements

Usage

Results

TODO:

Acknowledgement

About

Releases

Packages

Contributors 5

Languages

License

AugmentedDesignLab/vqvae

Folders and files

Latest commit

History

Repository files navigation

CVAE and VQ-VAE

requirements

Usage

Results

TODO:

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages