Adding Validation Loss to detect Overtraining #193
Good idea.
Thank you for the suggestion! I will consider implementing validation loss, but it will take some time...
I think this would be a good means to determine learning rates. There's a lot of speculation as to the proper rate, but with validation, we should be able to identify appropriate learning rates empirically. The question then would be whether such a value holds only when training something specific (e.g. a style), or is good for everything. I think there's a lot of potential to be gained with this.
So... is this still being considered, or was it scrapped? @kohya-ss
Sorry it has taken so long. This is on the task list, but I have not been able to get to it due to higher-priority tasks. It also requires enhancing the dataset classes, so it will take some time...
I have implemented this in a fork I originally created for the new noise scheduling functions. I won't create a PR because the massive refactoring I have done is largely experimental, but you can check it out here: https://github.com/AMorporkian/kohya_ss/tree/hypertune. I'm doing a run with wandb right now to test LoRA settings. Sorry for the absolutely terrible commits; I was just messing around and the messages are not at all descriptive. If there isn't any movement on this by next week, I'll take some time to write proper code that integrates better.
This seems like a really cool idea. I have to do a lot of different training runs to figure out the optimum settings for a given training set. Even when working in epochs, the number of training images still seems to affect the results somewhat.
I borrowed some of your ideas (random_split and collate updates) and put together a copy/paste validation loss. That avoided having to create a custom dataset to get the ball rolling. If anyone else is interested in testing, see #914. Expanding into separate validation datasets would be the next step, but this works well with minimal changes.
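For anyone who wants to experiment before a PR lands, here is a minimal sketch of the split-based setup described above — not the actual #914 code. It assumes a PyTorch `Dataset` and a hypothetical `compute_loss(model, batch)` helper wrapping the trainer's noise-prediction loss; `random_split` and `DataLoader` are the only real APIs used.

```python
import torch
from torch.utils.data import DataLoader, random_split

def make_loaders(dataset, val_fraction=0.05, batch_size=4, collate_fn=None, seed=42):
    """Carve a small validation subset out of the existing training dataset."""
    val_len = max(1, int(len(dataset) * val_fraction))
    gen = torch.Generator().manual_seed(seed)  # reproducible split across runs
    train_set, val_set = random_split(
        dataset, [len(dataset) - val_len, val_len], generator=gen
    )
    return (
        DataLoader(train_set, batch_size=batch_size, shuffle=True, collate_fn=collate_fn),
        DataLoader(val_set, batch_size=batch_size, shuffle=False, collate_fn=collate_fn),
    )

@torch.no_grad()
def validation_loss(model, val_loader, compute_loss):
    """Mean loss over the held-out split, with gradients disabled."""
    model.eval()
    losses = [compute_loss(model, batch).item() for batch in val_loader]
    model.train()
    return sum(losses) / max(1, len(losses))
```

One detail worth noting: keeping the split seed fixed matters, because otherwise the validation images change between runs and the losses are not comparable.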
Any updates on this? :D
I saw that this repo has added the ability to use a validation loss to help figure out the optimal amount of training. Might be an interesting addition.
https://github.com/victorchall/EveryDream2trainer/blob/main/doc/VALIDATION.md
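The core idea in that doc is to evaluate on a held-out split at regular intervals and treat a validation loss that stops improving as the overtraining signal. A hypothetical sketch of that loop, with `run_epoch` and `validation_loss` as placeholders for the real trainer hooks (none of these names come from EveryDream2 or this repo):

```python
def train_with_overtraining_check(run_epoch, validation_loss, max_epochs, patience=3):
    """Stop training once validation loss fails to improve `patience` times in a row."""
    best = float("inf")
    stale = 0
    for epoch in range(max_epochs):
        run_epoch(epoch)          # one full pass over the training split
        val = validation_loss()   # mean loss on data the model never trains on
        if val < best - 1e-6:     # small tolerance against run-to-run noise
            best, stale = val, 0
        else:
            stale += 1            # train loss can keep falling while this rises
        if stale >= patience:
            print(f"epoch {epoch}: validation loss stuck at best {best:.4f} "
                  f"for {patience} evals; likely overtraining")
            break
```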