huggingface / optimum-neuron Public

Issues: huggingface/optimum-neuron

30 Open 256 Closed

Issues list

HF_HUB_OFFLINE environment variable not being honoured for Neuron cache bug

Something isn't working

#741 opened Nov 24, 2024 by unography

2 of 4 tasks

AttributeError: can't set attribute 'deepspeed_plugin' bug

#735 opened Nov 14, 2024 by anushka0415

2 of 4 tasks

Size mismatch while loading consolidated checkpoints trained with Tensor parallelism for custom LLama Model bug

#734 opened Nov 9, 2024 by unography

3 of 4 tasks

Could not find a matching NEFF for your HLO in this directory bug

#730 opened Oct 30, 2024 by SteliosGian

4 tasks

SPECULATE option error enhancement

New feature or request

Stale

#722 opened Oct 23, 2024 by SteliosGian

1 of 4 tasks

training loss while fine-tuning llama 3.1 with lora is very high compared to rtx 3090 bug

#721 opened Oct 18, 2024 by anilozlu

4 tasks done

can't compile llama-3-8B or llama-3.1-8B with lora if batch size is more than 1 bug

#709 opened Oct 5, 2024 by anilozlu

3 of 4 tasks

Move neuron_parallel_compile outside of bash script

#706 opened Sep 26, 2024 by jgray-aws

Codellama generates weird tokens with TGI 0.0.24 bug

#704 opened Sep 25, 2024 by pinak-p

1 of 4 tasks

ValueError: The NeuronTrainer only accept NeuronTrainingArguments, but <class 'optimum.neuron.training_args.Seq2SeqNeuronTrainingArguments'> was provided. bug

#693 opened Sep 6, 2024 by industrialeaf

2 of 4 tasks

Cannot host Llama-3-8B exported by optimum-neuron with TGI container using optimum-neuron (0.0.24) and neuron-sdk (2.19.1) bug

#684 opened Aug 25, 2024 by cszhz

2 of 4 tasks

Training output reports incorrect num examples when using DDP bug

Stale

#683 opened Aug 24, 2024 by syl-taylor-aws

2 of 4 tasks

Enable use of IterableDataset when training with DDP

#681 opened Aug 23, 2024 by syl-taylor-aws

text-generation-inference docker builds are not reproducible due to missing Cargo.lock causing builds to fail on previous versions bug

#677 opened Aug 8, 2024 by charlesmelby

1 of 4 tasks

Add support for new Black Forest's model (Flux)

#676 opened Aug 6, 2024 by mrrfr

MPMD errors when enabling pipeline parallel for fine-tuning llama 3 8B model bug

Stale

#674 opened Jul 31, 2024 by bingchen-liu

2 of 4 tasks

Underloaded Neuron Cores with Llama3 bug

Stale

#672 opened Jul 30, 2024 by dlptv

2 of 4 tasks

Add support for Llama3.1

#664 opened Jul 24, 2024 by dacorvo

Llama 3 8B fine tuning shows nan value as loss bug

Stale

#660 opened Jul 20, 2024 by BaiqingL

2 of 4 tasks

Llama3-8B finetuning shows runtime error of TDRV:v2_cc_execute bug

Stale

#658 opened Jul 17, 2024 by jianyinglangaws

Speculative Sampling support for TGI Stale

#289 opened Nov 2, 2023 by mmcclean-aws

Optimizer state save / load with ZeRO-1 + TP Stale

#273 opened Oct 25, 2023 by michaelbenayoun

Support for RLHF Stale

#261 opened Oct 13, 2023 by mmcclean-aws

Loss & metric discrepancy with predict_with_generate Stale

#255 opened Oct 10, 2023 by bocchris-aws

dataclasses.FrozenInstanceError: cannot assign to field Stale

#220 opened Sep 9, 2023 by samir-souza
