-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add timer impl #44
add timer impl #44
Conversation
17f1bca
to
124d082
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looking good, just a few small questions / comments
436be20
to
39576f6
Compare
will carry this change over to SFT, DPO, PPO and then undraft. thanks for taking an initial look! |
undrafted! i'm testing it as we speak but let me know what you think |
Signed-off-by: Gerald Shen <[email protected]>
Signed-off-by: Gerald Shen <[email protected]>
Signed-off-by: Gerald Shen <[email protected]>
Signed-off-by: Gerald Shen <[email protected]>
Signed-off-by: Gerald Shen <[email protected]>
Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]>
Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]>
Signed-off-by: Gerald Shen <[email protected]>
Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]>
Signed-off-by: Gerald Shen <[email protected]>
Signed-off-by: Gerald Shen <[email protected]>
Signed-off-by: Gerald Shen <[email protected]>
Signed-off-by: Gerald Shen <[email protected]>
9413df0
to
8545153
Compare
Signed-off-by: Gerald Shen <[email protected]>
Signed-off-by: Gerald Shen <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm, just a minor suggestion
Co-authored-by: Olivier Delalleau <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good to merge!
* add timer class Signed-off-by: Gerald Shen <[email protected]> * fixup! add timer class Signed-off-by: Gerald Shen <[email protected]> * change check progress function Signed-off-by: Gerald Shen <[email protected]> * add timer to rm Signed-off-by: Gerald Shen <[email protected]> * add timer to supervised trainer Signed-off-by: Gerald Shen <[email protected]> * Update examples/nlp/gpt/train_reward_model.py Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]> * Update nemo_aligner/utils/distributed.py Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]> * add logging when finished Signed-off-by: Gerald Shen <[email protected]> * Update nemo_aligner/utils/distributed.py Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]> * add timer into sft examples Signed-off-by: Gerald Shen <[email protected]> * add timer onto dpo Signed-off-by: Gerald Shen <[email protected]> * add check progress onto PPO Signed-off-by: Gerald Shen <[email protected]> * add timer onto ppo Signed-off-by: Gerald Shen <[email protected]> * update changelog Signed-off-by: Gerald Shen <[email protected]> * update changelog Signed-off-by: Gerald Shen <[email protected]> * Update CHANGELOG.md Co-authored-by: Olivier Delalleau <[email protected]> --------- Signed-off-by: Gerald Shen <[email protected]> Co-authored-by: Olivier Delalleau <[email protected]> Co-authored-by: trias702 <[email protected]>
* add timer class Signed-off-by: Gerald Shen <[email protected]> * fixup! add timer class Signed-off-by: Gerald Shen <[email protected]> * change check progress function Signed-off-by: Gerald Shen <[email protected]> * add timer to rm Signed-off-by: Gerald Shen <[email protected]> * add timer to supervised trainer Signed-off-by: Gerald Shen <[email protected]> * Update examples/nlp/gpt/train_reward_model.py Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]> * Update nemo_aligner/utils/distributed.py Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]> * add logging when finished Signed-off-by: Gerald Shen <[email protected]> * Update nemo_aligner/utils/distributed.py Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]> * add timer into sft examples Signed-off-by: Gerald Shen <[email protected]> * add timer onto dpo Signed-off-by: Gerald Shen <[email protected]> * add check progress onto PPO Signed-off-by: Gerald Shen <[email protected]> * add timer onto ppo Signed-off-by: Gerald Shen <[email protected]> * update changelog Signed-off-by: Gerald Shen <[email protected]> * update changelog Signed-off-by: Gerald Shen <[email protected]> * Update CHANGELOG.md Co-authored-by: Olivier Delalleau <[email protected]> --------- Signed-off-by: Gerald Shen <[email protected]> Co-authored-by: Olivier Delalleau <[email protected]> Co-authored-by: trias702 <[email protected]>
* add timer class Signed-off-by: Gerald Shen <[email protected]> * fixup! add timer class Signed-off-by: Gerald Shen <[email protected]> * change check progress function Signed-off-by: Gerald Shen <[email protected]> * add timer to rm Signed-off-by: Gerald Shen <[email protected]> * add timer to supervised trainer Signed-off-by: Gerald Shen <[email protected]> * Update examples/nlp/gpt/train_reward_model.py Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]> * Update nemo_aligner/utils/distributed.py Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]> * add logging when finished Signed-off-by: Gerald Shen <[email protected]> * Update nemo_aligner/utils/distributed.py Co-authored-by: Olivier Delalleau <[email protected]> Signed-off-by: Gerald Shen <[email protected]> * add timer into sft examples Signed-off-by: Gerald Shen <[email protected]> * add timer onto dpo Signed-off-by: Gerald Shen <[email protected]> * add check progress onto PPO Signed-off-by: Gerald Shen <[email protected]> * add timer onto ppo Signed-off-by: Gerald Shen <[email protected]> * update changelog Signed-off-by: Gerald Shen <[email protected]> * update changelog Signed-off-by: Gerald Shen <[email protected]> * Update CHANGELOG.md Co-authored-by: Olivier Delalleau <[email protected]> --------- Signed-off-by: Gerald Shen <[email protected]> Co-authored-by: Olivier Delalleau <[email protected]> Co-authored-by: trias702 <[email protected]>
What does this PR do ?
Add a one line overview of what this PR aims to accomplish.
Changelog
Usage
# Add a code snippet demonstrating how to use this
Before your PR is "Ready for review"
Pre checks:
Checklist when contributing a new algorithm
max_steps=-1
andvalidation
?Additional Information