Training with gradient checkpointing? #289

benjamin-marie · 2024-10-10T16:44:06Z

Is there a way to benchmark training with gradient checkpointing enabled?

I tried to pass
"gradient_checkpointing": True,
"gradient_checkpointing_kwargs": {"use_reentrant": True},

as training_arguments of TrainingConfig, but it doesn't seem to have any effect.

IlyasMoutawwakil · 2024-11-25T14:23:24Z

Everything in training_arguments is passed directly to the TrainingArguments instance, which is passed to the trainer.
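
To make the forwarding concrete, here is a minimal sketch of the pattern. It is not the benchmark's actual code: `StandInTrainingArguments` and `build_training_arguments` are hypothetical names, and the stand-in dataclass only mimics the three fields discussed here so the snippet runs without transformers installed.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class StandInTrainingArguments:
    """Hypothetical stand-in for transformers.TrainingArguments (illustration only)."""

    output_dir: str = "tmp_trainer"
    gradient_checkpointing: bool = False
    gradient_checkpointing_kwargs: Optional[Dict[str, Any]] = None


def build_training_arguments(training_arguments: Dict[str, Any]) -> StandInTrainingArguments:
    # Hypothetical helper: forward every key unchanged to the arguments object.
    return StandInTrainingArguments(**training_arguments)


# The same keys as in the question above.
training_arguments = {
    "gradient_checkpointing": True,
    "gradient_checkpointing_kwargs": {"use_reentrant": True},
}

args = build_training_arguments(training_arguments)
print(args.gradient_checkpointing, args.gradient_checkpointing_kwargs)
```

With this kind of forwarding, a misspelled key fails with a TypeError instead of being dropped silently. So if the run starts without errors, the values most likely did reach the arguments object, and the place to look is whether checkpointing is actually active on the model during training.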

benjamin-marie · 2024-11-25T15:01:09Z

Thanks for your reply. This is what I was doing. I'll try again. If it still doesn't work, I'll share my full code.
