Give example on how to handle gradient accumulation with cross-entropy #574
Annotations
2 warnings
run-tests (latest)
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|
run-tests (minimum)
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|