Skip to content

Commit

Permalink
revert a previous change to fix fsdp
Browse files Browse the repository at this point in the history
  • Loading branch information
winglian committed Apr 11, 2024
1 parent 590e412 commit 22259fe
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/axolotl/utils/models.py
Original file line number Diff line number Diff line change
Expand Up @@ -484,7 +484,7 @@ def load_model(
"bnb_4bit_quant_type": "nf4",
"bnb_4bit_quant_storage": torch.bfloat16,
}
if not cfg.deepspeed:
if cfg.model_config_type in ["jamba", "qwen2_moe"] and not cfg.deepspeed:
# for some reason, this causes the loss to be off by an order of magnitude
# but deepspeed needs this still in bfloat16
bnb_config["bnb_4bit_quant_storage"] = torch.float32
Expand Down

0 comments on commit 22259fe

Please sign in to comment.