Is it possible to full finetune Llama3-8B on 2xA100 (40GB VRAM per GPU)? #1166
-
Is it possible to full finetune Llama3-8B on 2xA100 (40GB VRAM per GPU)? If so, how can I do it?
Answered by pbontrager · Jul 12, 2024
Replies: 1 comment
-
I believe that you could even finetune the 8B model on only one A100. But for two GPUs, use the 8B_full config, which is for the distributed recipes. You can launch with `tune run --nnodes=1 --nproc-per-node=2 full_finetune_distributed --config llama3/8B_full`. You can also `tune cp llama3/8B_full <my_config>` to be able to modify the config with a batch size, learning rate, etc. that is optimal for your compute setup.
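For reference, here is a minimal sketch of that workflow. The local filename is just an example, and the override keys (`batch_size`, `gradient_accumulation_steps`, `optimizer.lr`) are assumptions about what the stock `llama3/8B_full` config exposes and may differ between torchtune versions, so check your copied config before relying on them.

```bash
# Copy the stock config so it can be edited locally
# (the destination filename here is arbitrary).
tune cp llama3/8B_full my_8B_full.yaml

# Launch the distributed full-finetune recipe on the 2 GPUs.
tune run --nnodes=1 --nproc-per-node=2 full_finetune_distributed \
    --config my_8B_full.yaml

# Alternatively, override individual fields from the command line instead of
# editing the file (key names assume the stock llama3/8B_full config layout).
tune run --nnodes=1 --nproc-per-node=2 full_finetune_distributed \
    --config llama3/8B_full \
    batch_size=1 \
    gradient_accumulation_steps=16 \
    optimizer.lr=2e-5
```

Lowering `batch_size` and compensating with gradient accumulation is the usual first lever when you are close to the 40 GB limit on each A100.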
Answer selected by anhnh2002