-
-
Notifications
You must be signed in to change notification settings - Fork 893
Issues: axolotl-ai-cloud/axolotl
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
load_from_disk for rl tpye training
enhancement
New feature or request
#2192
opened Dec 15, 2024 by
leeparkuky
5 tasks done
'AdamW' object has no attribute 'optim_bits'
bug
Something isn't working
#2191
opened Dec 15, 2024 by
e-p-armstrong
1 task done
APOLLO optimizer
enhancement
New feature or request
#2175
opened Dec 11, 2024 by
fblgit
5 tasks done
When starting with DPO datasets, failed error with TypeError.
bug
Something isn't working
waiting for reporter
#2174
opened Dec 11, 2024 by
Yuto-24
6 of 8 tasks
Error During Model Saving QLORA + FSDP
bug
Something isn't working
waiting on upstream
#2149
opened Dec 7, 2024 by
ghsama
6 of 8 tasks
Show sample batch content
enhancement
New feature or request
#2145
opened Dec 7, 2024 by
fzyzcjy
5 tasks done
Support ORPO/DPO Liger losses (and LigerORPOTrainer)
enhancement
New feature or request
#2141
opened Dec 6, 2024 by
ccdv-ai
5 tasks done
Poential memory leak for axolotl v0.5.2 pretrain streaming datasets with liger kernel
bug
Something isn't working
#2108
opened Nov 30, 2024 by
deter3
6 of 8 tasks
Various bugs with ORPO
bug
Something isn't working
#2105
opened Nov 26, 2024 by
ccdv-ai
6 of 8 tasks
Mistral Nemo LoRA training has super high grad_norm
bug
Something isn't working
#2095
opened Nov 21, 2024 by
Nero10578
6 of 8 tasks
chat_template masking is broken with Mistral Small (possibly others)
bug
Something isn't working
under review
#2089
opened Nov 19, 2024 by
kubernetes-bad
6 of 8 tasks
RuntimeError: CUDA error: unknown error
When attempting to fine-tune llama-3 models
bug
#2071
opened Nov 17, 2024 by
cjfreeze
7 of 8 tasks
Deepspeed zero3 + LoRA: RuntimeError: Only Tensors of floating point and complex dtype can require gradients
bug
Something isn't working
waiting on upstream
wip
#2068
opened Nov 16, 2024 by
bursteratom
6 of 8 tasks
Logging behavior since GA fix
bug
Something isn't working
under review
waiting for reporter
#2004
opened Oct 30, 2024 by
ccdv-ai
6 of 8 tasks
Support for Sequence / Context Parallelism
enhancement
New feature or request
#1972
opened Oct 15, 2024 by
dwzhu-pku
5 tasks done
Should New feature or request
under review
tokenizer_legacy
be default as false
?
enhancement
#1955
opened Oct 10, 2024 by
tongyx361
5 tasks done
fix_untrained_tokens doesn't work with zero-3
bug
Something isn't working
#1944
opened Oct 4, 2024 by
winglian
6 of 8 tasks
mistrall small support
enhancement
New feature or request
#1922
opened Sep 21, 2024 by
win4r
5 tasks done
Different training losses when flash_attention is on/off
bug
Something isn't working
#1918
opened Sep 18, 2024 by
zhangchen-xu
6 of 8 tasks
pretrain doesn't work on json\jsonl
bug
Something isn't working
#1895
opened Sep 5, 2024 by
SicariusSicariiStuff
6 of 8 tasks
Training with a large json dataset (>650K) throw error:pyarrow.lib.ArrowInvalid: offset overflow while concatenating arrays
bug
Something isn't working
#1888
opened Sep 3, 2024 by
bofei5675
6 of 8 tasks
MixLoRA finetuning
enhancement
New feature or request
#1880
opened Aug 28, 2024 by
winglian
5 tasks done
Unable to load ORPO dataset in a *.json file
bug
Something isn't working
#1868
opened Aug 26, 2024 by
SicariusSicariiStuff
6 of 8 tasks
ORPO results in Something isn't working
Cannot flatten integer dtype tensors
bug
#1838
opened Aug 20, 2024 by
maziyarpanahi
6 of 8 tasks
Mistral Nemo 12B training CUDA Out of memory only when enabling EVAL. On 2x3090Ti FSDP.
bug
Something isn't working
#1813
opened Aug 6, 2024 by
Nero10578
6 of 8 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.