-
Notifications
You must be signed in to change notification settings - Fork 496
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
添加对deepseek v3的支持 #2736
Comments
deepseek v3 架构和 deepseek v2.5 是一致的。目前这个模型太大了,我们需要一些时间完成测试。 |
了解了,感谢🙏 |
我下载了int4版本,在384G显存的机器上勉强能载入,但出来的全是乱码。估计要二台384G显存的机器来分布推理。 |
int4 的模型权重是哪个的? |
have you tried this? a version of 2bit https://huggingface.co/unsloth/DeepSeek-V3-GGUF/tree/main |
This issue is stale because it has been open for 7 days with no activity. |
Feature request / 功能建议
添加对deepseek v3的支持
Motivation / 动机
deepseek v3已开源,希望能跟进一下,非常感谢
Your contribution / 您的贡献
摩搭链接https://modelscope.cn/models/deepseek-ai/DeepSeek-V3/summary
The text was updated successfully, but these errors were encountered: