Skip to content

Implementation of various DDPM papers to understand how they work

Notifications You must be signed in to change notification settings

peiqi-liu/DDPMs-Pytorch

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DDPM Pytorch

Pytorch implementation of "Improved Denoising Diffusion Probabilistic Models", "Denoising Diffusion Probabilistic Models" and "Classifier-free Diffusion Guidance"

Project structure

  .
  ├── ddpm_pytorch  # Source files
  │   ├── config    # YAML config files with all the hyperparameters
  │   ├── distributions.py
  │   ├── ema.py  # Pytorch-Lightning implementation of exponential moving average
  │   ├── generate.py  # Entry point to generate a new batch of images given the checkpoint path
  │   ├── __init__.py
  │   ├── model  # UNet and DDPM train/generation processes are here
  │   ├── paths.py  # Path constants
  │   ├── train.py # Entry point to train a new DDPM model
  │   └── variance_scheduler  # DDPM variance scheduler like linear, cosine
  ├── anaconda-project.yml  # anaconda project file
  ├── anaconda-project-lock.yml  # anaconda project lock file
  └── readme.md   # This file

How to train

  1. Install anaconda

  2. Install all the dependencies with the command

    anaconda-project prepare
    
  3. Train the model

    anaconda-project run train-gpu 
    

    By default, the model trained is the DDPM from "Improved Denoising Diffusion Probabilistic Models" paper on MNIST dataset. You can switch to the original DDPM by disabling the vlb with the following command:

    anaconda-project run train model.vlb=False
    

    You can also train the DDPM with the Classifier-free Diffusion Guidance by changing the model:

    anaconda-project run train model=unet_class_conditioned
    

How to generate

  1. Train a model (See previous section)

  2. Generate a new batch of images

    anaconda-project run generate -r RUN
    

    The other options are: [--seed SEED] [--device DEVICE] [--batch-size BATCH_SIZE] [-w W] [--scheduler {linear,cosine,tan}] [-T T]

Configure the training

Under ddpm_pytorch/config there are several yaml files containing the training parameters such as model class and paramters, noise steps, scheduler and so on. Note that the hyperparameters in such files are taken from the papers "Improved Denoising Diffusion Probabilistic Models" and "Denoising Diffusion Probabilistic Models"

defaults:
  - model: unet_paper  # take the model config from model/unet_paper.yaml
  - scheduler: cosine  # use the cosine scheduler from scheduler/cosine.yaml
  - dataset: mnist
  - optional model_dataset: ${model}-${dataset}  # set particular hyper parameters for specific couples (model, dataset)
  - optional model_scheduler: ${model}-${scheduler} # set particular hyper parameters for specific couples (model, scheduler)

batch_size: 128 # train batch size
noise_steps: 4_000  # noising steps; the T in "Improved Denoising Diffusion Probabilistic Models" and "Denoising Diffusion Probabilistic Models"
accelerator: null  # training hardware; for more details see pytorch lightning
devices: null  # training devices to use; for more details see pytorch lightning
gradient_clip_val: 0.0  # 0.0 means gradient clip disabled
gradient_clip_algorithm: norm  # gradient clip has two values: 'norm' or 'value
ema: true  # use Exponential Moving Average implemented in ddpm_pytorch/ema.py
ema_decay: 0.99  # decay factor of EMA

hydra:
  run:
    dir: saved_models/${now:%Y_%m_%d_%H_%M_%S}

Add custom dataset

To add a custom dataset, you need to create a new class that inherits from torch.utils.data.Dataset and implement the len and getitem methods. Then, you need to add the config file to the ddpm_pytorch/config/dataset folder with a similar structure of mnist.yaml

width: 28  # meta info about the dataset
height: 28
channels: 1   # number of image channels
num_classes: 10  # number of classes
files_location: ~/.cache/torchvision_dataset  # location where to store the dataset, in case to be downloaded
train:  #dataset.train is instantiated with this config
  _target_: torchvision.datasets.MNIST  # Dataset class. Following arguments are passed to the dataset class constructor
  root: ${dataset.files_location}
  train: true
  download: true
  transform:
    _target_: torchvision.transforms.ToTensor
val:  #dataset.val is instantiated with this config
  _target_: torchvision.datasets.MNIST # Same dataset of train, but the validation split
  root: ${dataset.files_location}
  train: false
  download: true
  transform:
    _target_: torchvision.transforms.ToTensor

Examples of custom training

Disable the variational lower bound, hence training like in "Denoising Diffusion Probabilistic Models" with linear scheduler and in GPU

anaconda-project run train scheduler=linear accelerator='gpu' model.vlb=False noise_steps=1000

Classifier-free Guidance

Use the labels for Diffusion Guidance, as in "Classifier-free Diffusion Guidance" with the following command

anaconda-project run train model=unet_class_conditioned noise_steps=1000

Anaconda-project

mac-os lock file

  1. Remove cudatoolkit from anaconda-project.yml file at the bottom of the file, under env_specs -> default -> packages
  2. De-comment - osx-64 under env_specs -> default -> platforms
  3. Delete anaconda-project-lock.yml file
  4. Run anaconda-project prepare to generate the new lock file

CPU-only environment

To have an alternative a PyTorch CPU-only environment, de-comment the following lines at the bottom of anaconda-project.yml under env_specs

#  pytorch-cpu:
#    packages:
#    - cpuonly
#    channels:
#    - pytorch
#    platforms:
#    - linux-64
#    - win-64

About

Implementation of various DDPM papers to understand how they work

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%