
[Questions about training] #2

Open
tomguluson92 opened this issue Nov 11, 2024 · 5 comments
Comments

@tomguluson92
Contributor

Dear authors,

Thanks for your brilliant work! A few questions came up during my tests:

  1. For --fixed_time_steps 1 2 5 10, which value should I use: 1, 2, 5, or 10? And what does this parameter mean?
  2. What are the irt and tgt datasets? How do I generate them?
@Hongcheng-Gao
Member

Hongcheng-Gao commented Nov 12, 2024

Thank you for your interest in our work.

  1. In our current paper, we did not fix the timestep sampling during meta-unlearning (fix_timesteps=False). However, we found that since the optimization of meta-unlearning randomly samples a timestep each time, this led to very long optimization times and considerable randomness in the performance. Therefore, to make optimization more stable and achieve faster convergence, we implemented a strategy of using fixed timesteps during meta training. The fixed_time_steps parameter is used to set these fixed timesteps in the meta component. We are currently exploring whether smaller or larger timesteps work better, and if using more timesteps would be beneficial. We will update the specific timestep selection strategy soon.

  2. The irt dataset contains concepts unrelated to the concept being unlearned, while the target (tgt) dataset contains concepts related to it. For example, if you want to unlearn nudity-related content, the tgt dataset would contain image-text pairs like women/skin, while the irt dataset would contain content completely unrelated to nudity, like dogs/cats. Both can be generated with gen_images.py.
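The two timestep-sampling strategies described in point 1 can be sketched as follows. This is an illustrative helper, not the repository's actual API: the function name, `FIXED_TIME_STEPS` default, and `NUM_TRAIN_TIMESTEPS` value are assumptions.

```python
import random

# Assumed defaults: the values passed via --fixed_time_steps, and a
# typical DDPM schedule length of 1000 (illustrative, not confirmed).
FIXED_TIME_STEPS = [1, 2, 5, 10]
NUM_TRAIN_TIMESTEPS = 1000


def sample_timestep(fixed_time_steps=None, rng=random):
    """Pick the diffusion timestep for one meta-unlearning step.

    With fixed_time_steps=None, the timestep is drawn uniformly at
    random (the paper's fix_timesteps=False setting); otherwise one of
    the fixed values is sampled, which the authors report makes the
    optimization more stable and faster to converge.
    """
    if fixed_time_steps is None:
        return rng.randrange(NUM_TRAIN_TIMESTEPS)
    return rng.choice(fixed_time_steps)
```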

We will be updating the README and code soon to make everything clearer. Thank you for your attention.
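As a rough sketch of the tgt/irt split described above, the datasets could be organized as two labeled prompt lists before image generation. The prompts, labels, and helper below are assumptions for illustration, not the repository's actual data format or the interface of gen_images.py.

```python
# Hypothetical prompt lists for unlearning the concept "nudity":
# tgt = related concepts, irt = irrelevant (unrelated) concepts.
tgt_prompts = [
    "a photo of a woman",
    "a close-up of human skin",
]
irt_prompts = [
    "a photo of a dog",
    "a photo of a cat",
]


def build_pairs(prompts, group):
    """Attach a group label so downstream code can tell tgt from irt."""
    return [{"prompt": p, "group": group} for p in prompts]


# Each entry would later be paired with a generated image to form the
# image-text pairs the answer describes.
dataset = build_pairs(tgt_prompts, "tgt") + build_pairs(irt_prompts, "irt")
```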

@tomguluson92
Contributor Author

Thanks for the feedback. Does this look like it is running correctly? It takes nearly 28 A100/H100 GPU hours to finish training.

(screenshot of the training run)

@Hongcheng-Gao
Member

Hi, congratulations: the program in your screenshot is running normally. However, we noticed from the screenshot that you have fixed_time_step enabled. When using fixed_time_step, there is no need to train for 1500 steps; the 1500-step limit was originally set as the maximum training steps for runs where fixed_time_step is not used. 100-200 steps should be enough for fixed_time_step=True.

Also, since we are still in the ablation-study phase for fixed_time_step=True, the current default fixed timesteps (1, 2, 5, 10) may not give good performance. We are currently exploring which specific timestep values work best.

@tomguluson92
Contributor Author

So should I set fixed_time_step=False and train for 200 steps?

@Hongcheng-Gao
Member

100-200 steps should be enough for fixed_time_step=True. If you use fixed_time_step=False, you should train for 1000-1500 steps. The better choice is fixed_time_step=False with 1000-1500 steps.
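The step budgets suggested in this thread can be summarized in a small helper. This function is purely illustrative (it is not part of the repository), and the exact numbers are the thread's rough guidance, not tuned values.

```python
def max_train_steps(fixed_time_step: bool) -> int:
    """Step budget suggested in this thread: roughly 100-200 steps
    suffice with fixed timesteps, while random timestep sampling
    (fixed_time_step=False) needs about 1000-1500 steps.

    Returns the upper end of each suggested range.
    """
    return 200 if fixed_time_step else 1500
```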
