v0.1.0
What's Changed
- Allow masking padding tokens in cross attention layers by @jazcollins in #94
- Fix typo in pyproject.toml by @eltociear in #92
- Autoencoder implementation and training by @coryMosaicML in #79
- Hotfix missing lpips requirement by @Skylion007 in #98
- Fixes for doing inference with masked padding by @coryMosaicML in #99
- Add script for running gradio demo from a local checkpoint by @coryMosaicML in #100
- Better StreamingDataset defaults while preserving old shuffle settings by @snarayan21 in #95
- Remove rounding in aspect ratio bucketing transform by @Landanjs in #111
- Add sample SDXL yamls and update README by @jazcollins in #112
- LogDiffusionImages Features and Refactors by @Landanjs in #104
- Bump gradio demo version by @coryMosaicML in #114
- Make custom autoencoders work with SD2 and SDXL models by @coryMosaicML in #102
- Add algorithm to control randomness over different eval times by @coryMosaicML in #115
- Add ruff linter by @Skylion007 in #122
- Add image-only dataset and a script for adding LLaVA-generated captions to a streaming dataset by @coryMosaicML in #118
- Code cleanup by @coryMosaicML in #120
- Update to latest transformers, diffusers, and other packages by @coryMosaicML in #125
- Text encoder refactor by @Landanjs in #124
- Add option to specify image output key in image dataloader factory by @coryMosaicML in #129
- Add explicit per-block FSDP wrapping for SDXL by @coryMosaicML in #127
- Make local paths optional by @A-Jacobson in #128
- Fix masked padding bug by @A-Jacobson in #130
- Arbitrary aspect ratio buckets by @Landanjs in #126
- Add option to set per-channel mean and standard deviation of the autoencoder latents when training the UNet by @coryMosaicML in #132
- Test PR by @Landanjs in #134
- Arbitrary aspect ratio bucket boundaries by @Landanjs in #133
- Fix bug to enable fp16 by @RR4787 in #136
- Only download CLIP on rank 0 when doing eval by @coryMosaicML in #135
New Contributors
- @eltociear made their first contribution in #92
- @snarayan21 made their first contribution in #95
- @RR4787 made their first contribution in #136
Full Changelog: v0.0.1...v0.1.0