v0.11.1: Patch release
What's Changed
- Fix bug in bfloat16 optimizer related to checkpointing by @okoge-kaz in #4434
- Move tensors to device if mp is not enabled by @deepcharm in #4461
- Fix torch import causing release build failure by @mrwyattii in #4468
- Add lm_head and embed_out tensor parallel by @Yejing-Lai in #3962
- Fix release workflow by @mrwyattii in #4483
New Contributors
- @okoge-kaz made their first contribution in #4434
- @deepcharm made their first contribution in #4461
Full Changelog: v0.11.0...v0.11.1