Pull requests: microsoft/DeepSpeed
Fix: ensure deepspeed.initialize can only be called once.
#6848 opened Dec 10, 2024 by traincheck-team
[inf] Add config var to enable keeping module on host
#6846 opened Dec 10, 2024 by oelayan7
Remove pin from transformers version and fix Processing/Threading issues in tests
#6822 opened Dec 5, 2024 by loadams
Avoid poisoning process with CUDA calls as soon as importing
#6810 opened Nov 29, 2024 by HollowMan6
Zero2: avoid graph breaks in torch.compile by using param_idx
#6803 opened Nov 28, 2024 by nelyahu
Add the missing view operations from sequence parallel(async).
#6750 opened Nov 14, 2024 by inkcherry
Training ops kernels: Speeding up the Llama-based MoE architectures
#6734 opened Nov 8, 2024 by RezaYazdaniAminabadi • Draft
Allow launcher to include --include=node3, not just --include=node3:1,2,3,4,5,6,7,8
#6698 opened Nov 1, 2024 by stephen-nju
Reduce the device bubble introduced by heavy loop synchronization in coalesced fetch/release(z3_leaf_module)
#6694 opened Oct 31, 2024 by inkcherry