-
Notifications
You must be signed in to change notification settings - Fork 63
Issues: huggingface/optimum-neuron
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
HF_HUB_OFFLINE
environment variable not being honoured for Neuron cache
bug
#741
opened Nov 24, 2024 by
unography
2 of 4 tasks
AttributeError: can't set attribute 'deepspeed_plugin'
bug
Something isn't working
#735
opened Nov 14, 2024 by
anushka0415
2 of 4 tasks
Size mismatch while loading consolidated checkpoints trained with Tensor parallelism for custom LLama Model
bug
Something isn't working
#734
opened Nov 9, 2024 by
unography
3 of 4 tasks
Could not find a matching NEFF for your HLO in this directory
bug
Something isn't working
#730
opened Oct 30, 2024 by
SteliosGian
4 tasks
SPECULATE option error
enhancement
New feature or request
Stale
#722
opened Oct 23, 2024 by
SteliosGian
1 of 4 tasks
training loss while fine-tuning llama 3.1 with lora is very high compared to rtx 3090
bug
Something isn't working
#721
opened Oct 18, 2024 by
anilozlu
4 tasks done
can't compile llama-3-8B or llama-3.1-8B with lora if batch size is more than 1
bug
Something isn't working
#709
opened Oct 5, 2024 by
anilozlu
3 of 4 tasks
Codellama generates wierd tokens with TGI 0.0.24
bug
Something isn't working
#704
opened Sep 25, 2024 by
pinak-p
1 of 4 tasks
ValueError: The NeuronTrainer only accept NeuronTrainingArguments, but <class 'optimum.neuron.training_args.Seq2SeqNeuronTrainingArguments'> was provided.
bug
Something isn't working
#693
opened Sep 6, 2024 by
industrialeaf
2 of 4 tasks
Cannot host Llama-3-8B exported by optimum-neuron with TGI contianer using optimum-neuron(0.0.24) and neuron-sdk(2.19.1)
bug
Something isn't working
#684
opened Aug 25, 2024 by
cszhz
2 of 4 tasks
Training output reports incorrect num examples when using DDP
bug
Something isn't working
Stale
#683
opened Aug 24, 2024 by
syl-taylor-aws
2 of 4 tasks
text-generation-inference docker builds are not reproducible due to missing Cargo.lock causing builds to fail on previous versions
bug
Something isn't working
#677
opened Aug 8, 2024 by
charlesmelby
1 of 4 tasks
MPMD errors when enabling pipeline parallel for fine-tuning llama 3 8B model
bug
Something isn't working
Stale
#674
opened Jul 31, 2024 by
bingchen-liu
2 of 4 tasks
Underloaded Neuron Cores with Llama3
bug
Something isn't working
Stale
#672
opened Jul 30, 2024 by
dlptv
2 of 4 tasks
Llama 3 8B fine tuning shows nan value as loss
bug
Something isn't working
Stale
#660
opened Jul 20, 2024 by
BaiqingL
2 of 4 tasks
Llama3-8B finetuning shows runtime error of TDRV:v2_cc_execute
bug
Something isn't working
Stale
#658
opened Jul 17, 2024 by
jianyinglangaws
dataclasses.FrozenInstanceError: cannot assign to field
Stale
#220
opened Sep 9, 2023 by
samir-souza
Previous Next
ProTip!
Adding no:label will show everything without a label.