v0.8.3: Patch release
What's Changed
- [deepspeed/autotuner] Bug fix for skipping mbs on gas by @rahilbathwal5 in #2171
- Fix issue between our abstract accelerator and colossalai's version of op_builder by @jeffra in #2963
- [zero] prevent poor configs from running w. zero-offload by @jeffra in #2971
- Fix Meta Tensor checkpoint load for OPT models by @lekurile in #2990
- ckpt: create directories in checkpoint_engine by @adammoody in #2988
- Fix buffer size for pipeline parallel and communication schedule by @tohtana in #2862
- [docs] add new paper to readme/docs by @jeffra in #3018
- fix language by @stas00 in #3019
- BF Optimizer Attribute Checks by @jomayeri in #3022
- [logger] implement `logger.warning_once` by @stas00 in #3021
- Convert model parameters from generator to list. by @jomayeri in #3017
- Improve loss overflow logs by @Quentin-Anthony in #3008
- Fix Broken Links by @satpalsr in #3048
New Contributors
Full Changelog: v0.8.2...v0.8.3