-
Notifications
You must be signed in to change notification settings - Fork 8.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Obs AI Assistant] Use hardware optimised version of Elser for Linux x86-64 #205852
Labels
blocker
bug
Fixes for quality problems that affect the customer experience
Team:Obs AI Assistant
Observability AI Assistant
Comments
Pinging @elastic/obs-ai-assistant (Team:Obs AI Assistant) |
sorenlouv
added
the
bug
Fixes for quality problems that affect the customer experience
label
Jan 8, 2025
sorenlouv
added a commit
that referenced
this issue
Jan 8, 2025
Closes #205852 When installing the Obs knowledge base it will always install the model `.elser_model_2`. For Linux with an x86-64 CPU an optimised version of Elser exists (`elser_model_2_linux-x86_64`). We should use that when possible. After this change the inference endpoint will use `.elser_model_2_linux-x86_64` on supported hardware: ![image](https://github.com/user-attachments/assets/fedc6700-877a-47ab-a3b8-055db53407d0)
kibanamachine
pushed a commit
to kibanamachine/kibana
that referenced
this issue
Jan 8, 2025
) Closes elastic#205852 When installing the Obs knowledge base it will always install the model `.elser_model_2`. For Linux with an x86-64 CPU an optimised version of Elser exists (`elser_model_2_linux-x86_64`). We should use that when possible. After this change the inference endpoint will use `.elser_model_2_linux-x86_64` on supported hardware: ![image](https://github.com/user-attachments/assets/fedc6700-877a-47ab-a3b8-055db53407d0) (cherry picked from commit ad3b988)
sorenlouv
added a commit
to sorenlouv/kibana
that referenced
this issue
Jan 8, 2025
) Closes elastic#205852 When installing the Obs knowledge base it will always install the model `.elser_model_2`. For Linux with an x86-64 CPU an optimised version of Elser exists (`elser_model_2_linux-x86_64`). We should use that when possible. After this change the inference endpoint will use `.elser_model_2_linux-x86_64` on supported hardware: ![image](https://github.com/user-attachments/assets/fedc6700-877a-47ab-a3b8-055db53407d0) (cherry picked from commit ad3b988)
CAWilson94
pushed a commit
to CAWilson94/kibana
that referenced
this issue
Jan 13, 2025
) Closes elastic#205852 When installing the Obs knowledge base it will always install the model `.elser_model_2`. For Linux with an x86-64 CPU an optimised version of Elser exists (`elser_model_2_linux-x86_64`). We should use that when possible. After this change the inference endpoint will use `.elser_model_2_linux-x86_64` on supported hardware: ![image](https://github.com/user-attachments/assets/fedc6700-877a-47ab-a3b8-055db53407d0)
viduni94
pushed a commit
to viduni94/kibana
that referenced
this issue
Jan 23, 2025
) Closes elastic#205852 When installing the Obs knowledge base it will always install the model `.elser_model_2`. For Linux with an x86-64 CPU an optimised version of Elser exists (`elser_model_2_linux-x86_64`). We should use that when possible. After this change the inference endpoint will use `.elser_model_2_linux-x86_64` on supported hardware: ![image](https://github.com/user-attachments/assets/fedc6700-877a-47ab-a3b8-055db53407d0)
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
blocker
bug
Fixes for quality problems that affect the customer experience
Team:Obs AI Assistant
Observability AI Assistant
Regression introduced in 8.17.0
When installing the Observability knowledge base an inference endpoint with the Elser model is created. There are two versions of Elser: a cross platform version, and an optimised version. The problem is that the AI Assistant always uses the cross-platform version (
.elser_model_2
).While the cross platform version works it is suboptimal. What's worse is that Security and Search installs the optimised version meaning users will end of with two versions of Elser. This could increase the number of ML nodes used and will therefore increase costs.
Trial users are only given a very small ML node that can only fit a single Elser model so things may entirely break here.
Solution
Use the optimised version of Elser where applicable
The text was updated successfully, but these errors were encountered: