Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Obs AI Assistant] Use hardware optimised version of Elser for Linux x86-64 #205852

Closed
sorenlouv opened this issue Jan 8, 2025 · 1 comment · Fixed by #205851
Closed

[Obs AI Assistant] Use hardware optimised version of Elser for Linux x86-64 #205852

sorenlouv opened this issue Jan 8, 2025 · 1 comment · Fixed by #205851
Assignees
Labels
blocker bug Fixes for quality problems that affect the customer experience Team:Obs AI Assistant Observability AI Assistant

Comments

@sorenlouv
Copy link
Member

sorenlouv commented Jan 8, 2025

Regression introduced in 8.17.0

When installing the Observability knowledge base an inference endpoint with the Elser model is created. There are two versions of Elser: a cross platform version, and an optimised version. The problem is that the AI Assistant always uses the cross-platform version (.elser_model_2).

While the cross platform version works it is suboptimal. What's worse is that Security and Search installs the optimised version meaning users will end of with two versions of Elser. This could increase the number of ML nodes used and will therefore increase costs.

Trial users are only given a very small ML node that can only fit a single Elser model so things may entirely break here.

Solution

Use the optimised version of Elser where applicable

@sorenlouv sorenlouv added the Team:Obs AI Assistant Observability AI Assistant label Jan 8, 2025
@elasticmachine
Copy link
Contributor

Pinging @elastic/obs-ai-assistant (Team:Obs AI Assistant)

@sorenlouv sorenlouv self-assigned this Jan 8, 2025
@sorenlouv sorenlouv added the bug Fixes for quality problems that affect the customer experience label Jan 8, 2025
sorenlouv added a commit that referenced this issue Jan 8, 2025
Closes #205852

When installing the Obs knowledge base it will always install the model
`.elser_model_2`.
For Linux with an x86-64 CPU an optimised version of Elser exists
(`elser_model_2_linux-x86_64`). We should use that when possible.

After this change the inference endpoint will use
`.elser_model_2_linux-x86_64` on supported hardware:

![image](https://github.com/user-attachments/assets/fedc6700-877a-47ab-a3b8-055db53407d0)
kibanamachine pushed a commit to kibanamachine/kibana that referenced this issue Jan 8, 2025
)

Closes elastic#205852

When installing the Obs knowledge base it will always install the model
`.elser_model_2`.
For Linux with an x86-64 CPU an optimised version of Elser exists
(`elser_model_2_linux-x86_64`). We should use that when possible.

After this change the inference endpoint will use
`.elser_model_2_linux-x86_64` on supported hardware:

![image](https://github.com/user-attachments/assets/fedc6700-877a-47ab-a3b8-055db53407d0)

(cherry picked from commit ad3b988)
sorenlouv added a commit to sorenlouv/kibana that referenced this issue Jan 8, 2025
)

Closes elastic#205852

When installing the Obs knowledge base it will always install the model
`.elser_model_2`.
For Linux with an x86-64 CPU an optimised version of Elser exists
(`elser_model_2_linux-x86_64`). We should use that when possible.

After this change the inference endpoint will use
`.elser_model_2_linux-x86_64` on supported hardware:

![image](https://github.com/user-attachments/assets/fedc6700-877a-47ab-a3b8-055db53407d0)

(cherry picked from commit ad3b988)
CAWilson94 pushed a commit to CAWilson94/kibana that referenced this issue Jan 13, 2025
)

Closes elastic#205852

When installing the Obs knowledge base it will always install the model
`.elser_model_2`.
For Linux with an x86-64 CPU an optimised version of Elser exists
(`elser_model_2_linux-x86_64`). We should use that when possible.

After this change the inference endpoint will use
`.elser_model_2_linux-x86_64` on supported hardware:

![image](https://github.com/user-attachments/assets/fedc6700-877a-47ab-a3b8-055db53407d0)
viduni94 pushed a commit to viduni94/kibana that referenced this issue Jan 23, 2025
)

Closes elastic#205852

When installing the Obs knowledge base it will always install the model
`.elser_model_2`.
For Linux with an x86-64 CPU an optimised version of Elser exists
(`elser_model_2_linux-x86_64`). We should use that when possible.

After this change the inference endpoint will use
`.elser_model_2_linux-x86_64` on supported hardware:

![image](https://github.com/user-attachments/assets/fedc6700-877a-47ab-a3b8-055db53407d0)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
blocker bug Fixes for quality problems that affect the customer experience Team:Obs AI Assistant Observability AI Assistant
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants