Actions: huggingface/nanotron

Showing runs from all workflows
1,889 workflow runs

quick fix
Run FA2-related unit tests #715: Pull request #258 opened by eliebak
December 2, 2024 17:09 6m 58s eliebak:quick-fix-save-lr
quick fix
Code Quality #618: Pull request #258 opened by eliebak
December 2, 2024 17:09 20s eliebak:quick-fix-save-lr
quick fix
Run non-FA2-related unit tests #715: Pull request #258 opened by eliebak
December 2, 2024 17:09 15m 19s eliebak:quick-fix-save-lr
config_dp
Secret Leaks #111: Commit 38fcb8b pushed by NouamaneTazi
December 2, 2024 16:02 19s nouamane/bench2
by default, do not quantize the first and last layer
Secret Leaks #110: Commit 79341ea pushed by xrsrke
November 30, 2024 12:13 17s xrsrke/fp8_for_nanotron
Fix wrong initialization of lr scheduler
Code Quality #617: Pull request #256 opened by kylematoba
November 29, 2024 23:09 Action required kylematoba:lr_scheduler_fix
Fix wrong initialization of lr scheduler
Run FA2-related unit tests #714: Pull request #256 opened by kylematoba
November 29, 2024 23:09 Action required kylematoba:lr_scheduler_fix
Fix wrong initialization of lr scheduler
Run non-FA2-related unit tests #714: Pull request #256 opened by kylematoba
November 29, 2024 23:09 Action required kylematoba:lr_scheduler_fix
[NEW] Llama3.2 weight converters 🦙
Code Quality #616: Pull request #255 opened by TJ-Solergibert
November 28, 2024 23:04 Action required TJ-Solergibert:llama3.2_conversion
[NEW] Llama3.2 weight converters 🦙
Run non-FA2-related unit tests #713: Pull request #255 opened by TJ-Solergibert
November 28, 2024 23:04 Action required TJ-Solergibert:llama3.2_conversion
[NEW] Llama3.2 weight converters 🦙
Run FA2-related unit tests #713: Pull request #255 opened by TJ-Solergibert
November 28, 2024 23:04 Action required TJ-Solergibert:llama3.2_conversion
new changes
Secret Leaks #108: Commit b764b97 pushed by xrsrke
November 28, 2024 16:36 18s xrsrke/fp8_for_nanotron
refactor + add test for fp8 initialization
Secret Leaks #107: Commit fbbbf4d pushed by xrsrke
November 28, 2024 14:56 15s xrsrke/fp8_for_nanotron
resuming checkpoint without lr schedule or optimizer state
Code Quality #615: Pull request #253 opened by eliebak
November 28, 2024 08:47 23s eliebak:add-load-lr-flag
resuming checkpoint without lr schedule or optimizer state
Run non-FA2-related unit tests #712: Pull request #253 opened by eliebak
November 28, 2024 08:47 15m 29s eliebak:add-load-lr-flag
resuming checkpoint without lr schedule or optimizer state
Run FA2-related unit tests #712: Pull request #253 opened by eliebak
November 28, 2024 08:47 6m 4s eliebak:add-load-lr-flag
Add shuffling in Nanotron for subsequent epochs when data is repeated
Run non-FA2-related unit tests #711: Pull request #247 synchronize by Lauler
November 28, 2024 08:18 Action required Lauler:nanoset-shuffle-epochs-data
Add shuffling in Nanotron for subsequent epochs when data is repeated
Code Quality #614: Pull request #247 synchronize by Lauler
November 28, 2024 08:18 Action required Lauler:nanoset-shuffle-epochs-data
Add shuffling in Nanotron for subsequent epochs when data is repeated
Run FA2-related unit tests #711: Pull request #247 synchronize by Lauler
November 28, 2024 08:18 Action required Lauler:nanoset-shuffle-epochs-data
Add shuffling in Nanotron for subsequent epochs when data is repeated
Run non-FA2-related unit tests #710: Pull request #247 synchronize by Lauler
November 28, 2024 08:04 Action required Lauler:nanoset-shuffle-epochs-data
Add shuffling in Nanotron for subsequent epochs when data is repeated
Run FA2-related unit tests #710: Pull request #247 synchronize by Lauler
November 28, 2024 08:04 Action required Lauler:nanoset-shuffle-epochs-data
Add shuffling in Nanotron for subsequent epochs when data is repeated
Code Quality #613: Pull request #247 synchronize by Lauler
November 28, 2024 08:04 Action required Lauler:nanoset-shuffle-epochs-data
Fix initial_lr when resuming training
Run FA2-related unit tests #709: Pull request #243 reopened by Lauler
November 27, 2024 21:11 Action required Lauler:resume-lr-fix
Fix initial_lr when resuming training
Run non-FA2-related unit tests #709: Pull request #243 reopened by Lauler
November 27, 2024 21:11 Action required Lauler:resume-lr-fix