We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cuda 12.2 python3.10 transformers 4.47.0
xinference version :1.0.1
conda activate XXX export XINFERENCE_HOME=/data/xinference nohup xinference-supervisor -H $IP_ADDR
#start worker conda activate XXX export XINFERENCE_HOME=/data/xinference export XINFERENCE_ENDPOINT=$IP_ADDR nohup xinference-worker -e "$IP_ADDR:$PORT" -H $IP_ADDR 我有三个节点,分别是4卡,2卡,4卡,启动顺序也是4卡,2卡,4卡 当我启动三个不同的模型时,第一个模型指定了4卡,正常启动,第二个模型启动时最多只能选择2卡,将2卡的节点占用之后才能在启动第三个模型时选择4卡并在4卡的节点上运行。
The text was updated successfully, but these errors were encountered:
都是在web上启动的
Sorry, something went wrong.
比较稳定的做法可能是通过 worker_ip 指定运行节点。
之前有别人出现过这种情况吗?有没有优化的办法?未来会不会有优化?感谢!
This issue is stale because it has been open for 7 days with no activity.
No branches or pull requests
cuda 12.2
python3.10
transformers 4.47.0
xinference version :1.0.1
start supervisor
conda activate XXX
export XINFERENCE_HOME=/data/xinference
nohup xinference-supervisor -H $IP_ADDR
#start worker
conda activate XXX
export XINFERENCE_HOME=/data/xinference
export XINFERENCE_ENDPOINT=$IP_ADDR
nohup xinference-worker -e "$IP_ADDR:$PORT" -H $IP_ADDR
我有三个节点,分别是4卡,2卡,4卡,启动顺序也是4卡,2卡,4卡
当我启动三个不同的模型时,第一个模型指定了4卡,正常启动,第二个模型启动时最多只能选择2卡,将2卡的节点占用之后才能在启动第三个模型时选择4卡并在4卡的节点上运行。
The text was updated successfully, but these errors were encountered: