This is an ongoing study group occuring the EleutherAI Discord server. You can join the server over here, then head to the "Diffusion Reading Group" thread under the #reading-groups channel.
Here is a playlist of the previous session recordings.
- Diffusion Reading Group at EleutherAI
- Week 1: DDPM paper
- Week 2: Score-based generative modeling
- Week 3: NCSNv2 and Score SDEs
- Week 4: Score SDEs and DDIM
- Week 5: IDDPM, ADM, and Classifier-Free Guidance
- Week 6: Review
- Week 7: Classifier-Free Guidance, VDMs, and Denoising Diffusion GANs
- Week 8: Perception-Prioritized Training, Elucidating Design Spaces
- Week 9: DDPM paper, EDM paper code walk-thrus
- Week 10: Paella
- Week 11: Progressive Distillation, Distillation of Guided Models
- Week 12: SDEDit
- Week 13: Latent Diffusion and Stable Diffusion
- Week 14: Q&A with Robin Rombach
- Week 15: Soft Diffusion
- Week 16 & 17: Flow Matching
- Week 18: Consistency Models
- Week 19: Conditional Flow Matching
- Week 20: Inverse Heat Dissipation
- Week 21: Poisson Flow Generative Models
- Week 22: Min-SNR Weighting Strategy
- Week 23: PFGM++
- Week 24: Blurring Diffusion Models
- Week 25: ControlNet
- Week 26: DDPO (RLHF Diffusion)
- Week 27: Diffusion Transformers
- Week 28: simple diffusion
- Week 29: Wuerstchen
- Week 30: Palette
- List of papers to cover:
- Readings:
- Classifier-Free Diffusion Guidance
- Sander Dieleman's blog post on guidance
- Variational Diffusion Models
- Variational Diffusion Model Colab Notebook
- Calvin Luo's blog post
- Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
- Vahdat and Kreis's blog post
- Arash Vahdat's DLCT talk slides
- Recording
- Slides
- Readings:
- Readings:
- Recording
- DDPM Colab notebook
- Readings:
- Recording
- Slides - Progressive Distillation - Presented by marii
- Slides - Distillation of Guided Models - Presented by Griffin Floto
- Readings:
- Same as Week 13
- Recording
- Readings:
- Recording
- Slides
- Slides - other formats
- Readings:
- Recording
- Slides
- Slides - other formats
- Readings:
- Recording
Denoising Diffusion Probabilistic ModelsGenerative Modeling by Estimating Gradients of the Data DistributionImproved techniques for training score-based generative modelsScore-Based Generative Modeling through Stochastic Differential EquationsDenoising Diffusion Implicit ModelsImproved Denoising Diffusion Probabilistic ModelsDiffusion Models Beat GANs on Image SynthesisClassifier-Free Diffusion GuidanceVariational Diffusion ModelsTackling the Generative Learning Trilemma with Denoising Diffusion GANsPerception Prioritized Training of Diffusion ModelsElucidating the Design Space of Diffusion-Based Generative ModelsFast Text-Conditional Discrete Denoising on Vector-Quantized Latent SpacesProgressive Distillation for Fast Sampling of Diffusion ModelsOn Distillation of Guided Diffusion ModelsSDEdit: Guided Image Synthesis and Editing with Stochastic Differential EquationsHigh-Resolution Image Synthesis with Latent Diffusion ModelsStable DiffusionSoft Diffusion: Score Matching for General CorruptionsFlow Matching for Generative ModelingConsistency ModelsConditional Flow Matching: Simulation-Free Dynamic Optimal TransportGenerative Modelling With Inverse Heat DissipationPoisson Flow Generative ModelsEfficient Diffusion Training via Min-SNR Weighting StrategyPFGM++: Unlocking the Potential of Physics-Inspired Generative ModelsBlurring Diffusion ModelsAdding Conditional Control to Text-to-Image Diffusion Models (ControlNet)Training Diffusion Models with Reinforcement LearningScalable Diffusion Models with Transformerssimple diffusion: End-to-end diffusion for high resolution imagesWuerstchen: Efficient Pretraining of Text-to-Image ModelsPalette: Image-to-image diffusion models- Reflected Diffusion Models
- SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
- Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise
- Pyramidal Denoising Diffusion Probabilistic Models
- Cascaded Diffusion Models for High Fidelity Image Generation
- Image super-resolution via iterative refinement
- Denoising Diffusion Probabilistic Models for Robust Image Super-Resolution in the Wild
- I$^2$SB: Image-to-Image Schr"odinger Bridge
- GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
- Hierarchical Text-Conditional Image Generation with CLIP Latents
- Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
- eDiffi: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
- Semi-Parametric Neural Image Synthesis
- On the Importance of Noise Scheduling for Diffusion Models
- Pseudo Numerical Methods for Diffusion Models on Manifolds
- DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Step
- DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models
- GENIE: Higher-Order Denoising Diffusion Solvers
- Journey to the BAOAB-limit: finding effective MCMC samplers for score-based models
- Riemannian Score-Based Generative Modeling
- DiffWave: A Versatile Diffusion Model for Audio Synthesis
- Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
- Video diffusion models
- MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation
- Imagen Video: High Definition Video Generation with Diffusion Models
- Make-A-Video: Text-to-Video Generation without Text-Video Data
- DreamFusion: Text-to-3D using 2D Diffusion
- Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation
- Magic3D: High-Resolution Text-to-3D Content Creation
- Diffusion-LM Improves Controllable Text Generation.
- Autoregressive Diffusion Models
- Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning
- Continuous diffusion for categorical data
- DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
- Vector quantized diffusion model for text-to-image synthesis
- Improved Vector Quantized Diffusion Models
- DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
- Diffusion Models already have a Semantic Latent Space
- Understanding ddpm latent codes through optimal transport
- Diffusion Schrödinger Bridge with Applications to Score-Based Generative Modeling
- Dual Diffusion Implicit Bridges for Image-to-Image Translation
- Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
- Zero-shot Image-to-Image Translation