Back to pull request #912

[`core` / `DDP`] Fix RM trainer + DDP + quantization + propagate `gradient_checkpointing_kwargs` in SFT & DPO #2024

Sign in to view logs

Run time

Learn about OS pricing on GitHub Actions

Job	Run time
check_code_quality (3.9)	21s
tests (3.8, ubuntu-latest)	6m 59s
tests (3.8, windows-latest)	9m 55s
tests (3.9, ubuntu-latest)	6m 9s
tests (3.9, windows-latest)	10m 43s
tests (3.10, ubuntu-latest)	8m 15s
tests (3.10, windows-latest)	9m 10s
	51m 32s