put back bf16 for baseline bnb
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
fabianlim committed Oct 13, 2024
1 parent 50e7e20 commit a50ff63
Showing 2 changed files with 2 additions and 1 deletion.
2 changes: 1 addition & 1 deletion plugins/accelerated-peft/README.md
@@ -11,7 +11,7 @@ Plugin | Description | Depends | Loading | Augmentation | Callbacks


### Key Points
- - fix upcasting (resulting in slowdown) issue for `bnb` plugin, originally discovered by inventors of [Unsloth](https://unsloth.ai/blog/mistral-benchmark).
+ - fix upcasting (resulting in slowdown) issue for `bnb` plugin, originally discovered by inventors of [Unsloth](https://unsloth.ai/blog/mistral-benchmark). **NOTE**: we recommend using *mixed precision* when using 4bit quant for better performance, as per our benchmarks.
- `bnb` properly configured to work with FSDP following [this guide](https://huggingface.co/docs/bitsandbytes/main/en/fsdp_qlora).
- `triton_v2` kernels are not yet properly integrated into huggingface optimum.
- `triton_v2` kernels are [the only 4bit kernels that work for training](https://github.com/AutoGPTQ/AutoGPTQ/issues/633).
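The mixed-precision recommendation added above can be illustrated with a minimal sketch. This is not the repository's benchmark code; the model name, output directory, and hyperparameters are placeholders. The idea is to pair 4-bit `bnb` quantization (with a `bfloat16` compute dtype) with `bf16` mixed precision in the trainer arguments, mirroring the `torch_dtype: bfloat16` / `bf16: True` pair in the benchmark scenario changed below.

```python
# Minimal sketch, assuming a standard transformers + bitsandbytes setup
# (placeholder model name and hyperparameters, not the repo's benchmark script).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig, TrainingArguments

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,  # keep 4-bit matmuls in bf16 rather than upcasting to fp32
)

model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",            # placeholder model
    quantization_config=bnb_config,
    torch_dtype=torch.bfloat16,             # corresponds to `torch_dtype: bfloat16` in the scenario
)

training_args = TrainingArguments(
    output_dir="outputs",                   # placeholder
    learning_rate=2e-4,
    bf16=True,                              # mixed precision, as re-enabled by this commit
)
```

Note that `bf16=True` enables autocast mixed precision during training, whereas `torch_dtype` only controls the dtype that the non-quantized weights are loaded in; the two are set together here.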
1 change: 1 addition & 0 deletions scripts/benchmarks/scenarios.yaml
@@ -70,6 +70,7 @@ scenarios:
    framework_config:
      - baseline-peft-bnb
    arguments:
+     bf16: True
      learning_rate: 2e-4
      torch_dtype: bfloat16
      peft_method: lora
