-
Notifications
You must be signed in to change notification settings - Fork 894
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SANA support #1807
Comments
Sana is very interesting. However, I am concerned that the weight license is CC BY-NC-SA 4.0. It seems that a pull request for optimization is being developed in the Sana repository, so I will look forward to that first. |
What's the difference to flux.1dev? Isn't it research only too? |
From my understanding, the output of FLUX.1 dev is not covered by the license of FLUX.1 dev. https://github.com/black-forest-labs/flux/blob/main/model_licenses/LICENSE-FLUX1-dev
|
Interesting thank you, I always heard differently on reddit, my bad for not reading it myself. If Nvidia doesn't change the license eventually there will be a pony sana. Having things already ready for that could be an advantage. Thank you anyway for the great work you did so far |
Looks like training code under Apache now, not sure |
NVlabs/Sana#54 |
@Muinez add very good code for MAR bucketing and simplified local train: i add code for model load/save and fix some small bugs And example how to use it and train small model from zero in bf16 |
Is it possible to improve the way the dataset cache is generated? When I tried the official code, it used all the system RAM and froze my PC. |
official code dont have cache |
Oh, maybe I'm misunderstanding something. |
current status looks like success train from zero in bf16 its not official training code. |
but i have seen a guy that said it runns with 24gb vram if you use vae cashing is this true or false? |
true, i train batch 256 0.6b and batch 24 1.6b on 48 gpu with vae cache |
May you please add minimal SANA support?
https://github.com/NVlabs/Sana
SANA train implementation very not optimal(
NVlabs/Sana#49
GPU intensive, has no cache/ Multi Aspect Ratio / Adafactor support(
We love you, mr Kohya!
The text was updated successfully, but these errors were encountered: