v0.2.13
What changes?
Fixes auto-gptq kernel CUDA within base container.
Add support for all vLLM models. Update the vllm to latest stable commit.
Full Changelog: v0.2.12...v0.2.13
Fixes auto-gptq kernel CUDA within base container.
Add support for all vLLM models. Update the vllm to latest stable commit.
Full Changelog: v0.2.12...v0.2.13