v1.2.0: Bug Squashing & Fixes across the board
Core
- enable
find_executable_batch_size
on XPU by @faaany in #3236 - Use
numpy._core
instead ofnumpy.core
by @qgallouedec in #3247 - Add warnings and fallback for unassigned devices in infer_auto_device_map by @Nech-C in #3066
- Allow for full dynamo config passed to Accelerator by @muellerzr in #3251
- [WIP] FEAT Decorator to purge accelerate env vars by @BenjaminBossan in #3252
- [
data_loader
] Optionally also propagate set_epoch to batch sampler by @tomaarsen in #3246 - use XPU instead of GPU in the
accelerate config
prompt text by @faaany in #3268
Big Modeling
- Fix
align_module_device
, ensure only cpu tensors forget_state_dict_offloaded_model
by @kylesayrs in #3217 - Remove hook for bnb 4-bit by @SunMarc in #3223
- [docs] add instruction to install bnb on non-cuda devices by @faaany in #3227
- Take care of case when "_tied_weights_keys" is not an attribute by @fabianlim in #3226
- Update deferring_execution.md by @max-yue in #3262
- Revert default behavior of
get_state_dict_from_offload
by @kylesayrs in #3253 - Fix: Resolve #3060,
preload_module_classes
is lost for nested modules by @wejoncy in #3248
DeepSpeed
- Select the DeepSpeedCPUOptimizer based on the original optimizer class. by @eljandoubi in #3255
- support for wrapped schedulefree optimizer when using deepspeed by @winglian in #3266
Documentation
-
Replaced set/check breakpoint with set/check trigger in the troubleshooting documentation by @relh in #3259
-
Fixed multiple typos for Tutorials and Guides docs by @henryhmko in #3274
New Contributors
- @winglian made their first contribution in #3266
- @max-yue made their first contribution in #3262
- @as12138 made their first contribution in #3261
- @relh made their first contribution in #3259
- @wejoncy made their first contribution in #3248
- @henryhmko made their first contribution in #3274
Full Changelog
- Fix
align_module_device
, ensure only cpu tensors forget_state_dict_offloaded_model
by @kylesayrs in #3217 - remove hook for bnb 4-bit by @SunMarc in #3223
- enable
find_executable_batch_size
on XPU by @faaany in #3236 - take care of case when "_tied_weights_keys" is not an attribute by @fabianlim in #3226
- [docs] update code in tracking documentation by @faaany in #3235
- Add warnings and fallback for unassigned devices in infer_auto_device_map by @Nech-C in #3066
- [
data_loader
] Optionally also propagate set_epoch to batch sampler by @tomaarsen in #3246 - [docs] add instruction to install bnb on non-cuda devices by @faaany in #3227
- Use
numpy._core
instead ofnumpy.core
by @qgallouedec in #3247 - Allow for full dynamo config passed to Accelerator by @muellerzr in #3251
- [WIP] FEAT Decorator to purge accelerate env vars by @BenjaminBossan in #3252
- use XPU instead of GPU in the
accelerate config
prompt text by @faaany in #3268 - support for wrapped schedulefree optimizer when using deepspeed by @winglian in #3266
- Update deferring_execution.md by @max-yue in #3262
- Fix: Resolve #3257 by @as12138 in #3261
- Replaced set/check breakpoint with set/check trigger in the troubleshooting documentation by @relh in #3259
- Select the DeepSpeedCPUOptimizer based on the original optimizer class. by @eljandoubi in #3255
- Revert default behavior of
get_state_dict_from_offload
by @kylesayrs in #3253 - Fix: Resolve #3060,
preload_module_classes
is lost for nested modules by @wejoncy in #3248 - [docs] update set-seed by @faaany in #3228
- [docs] fix typo by @faaany in #3221
- [docs] use real path for
checkpoint
by @faaany in #3220 - Fixed multiple typos for Tutorials and Guides docs by @henryhmko in #3274
Code Diff
Release diff: v1.1.1...v1.2.0