Skip to content

Actions: huggingface/trl

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,307 workflow runs
4,307 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

🪜 Stepwise supervision dataset type
Tests #6112: Pull request #2148 synchronize by qgallouedec
November 15, 2024 17:15 24m 36s step-dataset
November 15, 2024 17:15 24m 36s
🪜 Stepwise supervision dataset type
Tests #6111: Pull request #2148 synchronize by qgallouedec
November 15, 2024 16:51 26m 58s step-dataset
November 15, 2024 16:51 26m 58s
[GKD] add ULD type loss to GKD Trainer
Tests #6108: Pull request #2263 synchronize by kashif
November 15, 2024 14:50 24m 39s kashif:uld
November 15, 2024 14:50 24m 39s
⚖️ Add use_soft_judge option to WinRateCallback (#2347)
Tests #6107: Commit b8c9d9c pushed by kashif
November 15, 2024 14:49 24m 22s main
November 15, 2024 14:49 24m 22s
🧪 [Experimental] Train LeRobot policy with TRL
Tests #6105: Pull request #2359 opened by qgallouedec
November 15, 2024 14:02 24m 44s lerobot
November 15, 2024 14:02 24m 44s
🪜 Stepwise supervision dataset type
Tests #6104: Pull request #2148 synchronize by qgallouedec
November 15, 2024 13:56 23m 17s step-dataset
November 15, 2024 13:56 23m 17s
🪜 Stepwise supervision dataset type
Tests #6103: Pull request #2148 synchronize by qgallouedec
November 15, 2024 13:52 23m 9s step-dataset
November 15, 2024 13:52 23m 9s
⚖️ Add use_soft_judge option to WinRateCallback
Tests #6102: Pull request #2347 synchronize by kashif
November 15, 2024 13:51 22m 46s kashif:soft-winrate
November 15, 2024 13:51 22m 46s
🪜 Stepwise supervision dataset type
Tests #6101: Pull request #2148 synchronize by qgallouedec
November 15, 2024 13:50 23m 52s step-dataset
November 15, 2024 13:50 23m 52s
⚖️ Add use_soft_judge option to WinRateCallback
Tests #6100: Pull request #2347 synchronize by qgallouedec
November 15, 2024 12:58 22m 47s kashif:soft-winrate
November 15, 2024 12:58 22m 47s
⚖️ Add use_soft_judge option to WinRateCallback
Tests #6099: Pull request #2347 synchronize by kashif
November 15, 2024 12:54 24m 3s kashif:soft-winrate
November 15, 2024 12:54 24m 3s
⚖️ Add use_soft_judge option to WinRateCallback
Tests #6098: Pull request #2347 synchronize by kashif
November 15, 2024 12:54 23m 13s kashif:soft-winrate
November 15, 2024 12:54 23m 13s
⚖️ Add use_soft_judge option to WinRateCallback
Tests #6097: Pull request #2347 synchronize by kashif
November 15, 2024 12:54 23m 35s kashif:soft-winrate
November 15, 2024 12:54 23m 35s
🪜 Stepwise supervision dataset type
Tests #6095: Pull request #2148 synchronize by qgallouedec
November 15, 2024 11:50 22m 32s step-dataset
November 15, 2024 11:50 22m 32s
🔀 Add MergeModelCallBack
Tests #6094: Pull request #2282 synchronize by August-murr
November 15, 2024 10:33 22m 50s August-murr:MergeModelCallBack
November 15, 2024 10:33 22m 50s
Add VAS to TRL
Tests #6088: Pull request #2195 synchronize by idanshen
November 15, 2024 00:55 Action required idanshen:main
November 15, 2024 00:55 Action required
Add VAS to TRL
Tests #6087: Pull request #2195 synchronize by idanshen
November 14, 2024 20:50 Action required idanshen:main
November 14, 2024 20:50 Action required
⚖️ Add use_soft_judge option to WinRateCallback
Tests #6086: Pull request #2347 synchronize by kashif
November 14, 2024 19:44 24m 33s kashif:soft-winrate
November 14, 2024 19:44 24m 33s
🔀 Add MergeModelCallBack
Tests #6085: Pull request #2282 synchronize by August-murr
November 14, 2024 18:13 24m 4s August-murr:MergeModelCallBack
November 14, 2024 18:13 24m 4s
⚖️ Add use_soft_judge option to WinRateCallback
Tests #6079: Pull request #2347 synchronize by kashif
November 14, 2024 16:51 28m 16s kashif:soft-winrate
November 14, 2024 16:51 28m 16s
📉 Add PEFT support for PPOTrainer
Tests #6078: Pull request #2344 synchronize by ccs96307
November 14, 2024 12:18 25m 44s ccs96307:feature-ppo-peft
November 14, 2024 12:18 25m 44s
⚖️ Add use_soft_judge option to WinRateCallback
Tests #6077: Pull request #2347 synchronize by kashif
November 14, 2024 11:54 21m 44s kashif:soft-winrate
November 14, 2024 11:54 21m 44s
⚖️ Add use_soft_judge option to WinRateCallback
Tests #6076: Pull request #2347 synchronize by kashif
November 14, 2024 11:44 22m 26s kashif:soft-winrate
November 14, 2024 11:44 22m 26s
Add SRPO algorithm.
Tests #6075: Pull request #1772 synchronize by qgallouedec
November 14, 2024 11:32 21m 12s frasermince:frasermince/srpo
November 14, 2024 11:32 21m 12s
⚖️ Add use_soft_judge option to WinRateCallback
Tests #6074: Pull request #2347 synchronize by kashif
November 14, 2024 11:05 23m 23s kashif:soft-winrate
November 14, 2024 11:05 23m 23s