-
Notifications
You must be signed in to change notification settings - Fork 140
Pull requests: huggingface/nanotron
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CICD] Add a timeout for unit tests and measure their execution time
#270
opened Jan 17, 2025 by
xrsrke
Loading…
[Feature] Support resume ZeRO1 in a new data parallelism size
#263
opened Dec 10, 2024 by
xrsrke
Loading…
Add shuffling in Nanotron for subsequent epochs when data is repeated
#247
opened Nov 24, 2024 by
Lauler
Loading…
Fix: Update wrong typing on the function get_local_ranks
#194
opened Jun 10, 2024 by
morgangiraud
Loading…
Move MoE Implementation into src/, add Load Balancing Losses
#192
opened Jun 6, 2024 by
haeggee
Loading…
1 task done
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.