[NL-to-ESQL] update internal documentation #205853

pgayvallet · 2025-01-08T10:08:41Z

Summary

Fix #205606

Re-generate the internal ES|QL documentation using the generation script (+ human review)
Add more scenario to the NL-to-ESQL evaluation suite
Some prompt engineering
- improving the system instructions / functions summary
- add more examples to the summary
- adapt a few opinionated examples for some specific functions

Evaluation

average based on 4 runs for each model/branch tuple
the new tests were locally added to main to run against the same suite and properly evaluate the difference

Model	before (main)	after (PR)	delta
GPT-4o	90.9	97.74	+ 6.84
Claude 3.5 Sonnet v2	88.58	96.49	+7.91
Gemini 1.5-pro-002	88.17	94.19	+6.02

Overall, the prompt engineering somewhat significantly improved the generation efficiency.

pgayvallet · 2025-01-08T10:08:54Z

/ci

elasticmachine · 2025-01-08T12:18:21Z

Pinging @elastic/appex-ai-infra (Team:AI Infra)

…-2025-01

pgayvallet · 2025-01-08T12:48:18Z

x-pack/platform/plugins/shared/inference/server/tasks/nl_to_esql/system_message.txt

added new (and missing) functions

added one-sentence description to most functions (except type conversion)

added more examples

added comments explaining reasoning on some examples

legrego

Nice work! Overall LGTM with a few non-blocking nits

legrego · 2025-01-08T14:00:10Z

x-pack/platform/plugins/shared/inference/server/tasks/nl_to_esql/doc_base/suggestions.ts

@@ -13,6 +13,11 @@ const suggestions: Suggestion[] = [
      return ['BUCKET'];
    }
  },
+  (keywords) => {
+    if (keywords.includes('TO_DATETIME')) {


I always suspected that AI was just a bunch of if statements 😄

Sometimes we have to cheat a bit!

x-pack/platform/plugins/shared/inference/server/tasks/nl_to_esql/esql_docs/esql-byte_length.txt

x-pack/platform/plugins/shared/inference/server/tasks/nl_to_esql/esql_docs/esql-categorize.txt

x-pack/platform/plugins/shared/inference/server/tasks/nl_to_esql/esql_docs/esql-hash.txt

x-pack/platform/plugins/shared/inference/server/tasks/nl_to_esql/esql_docs/esql-limit.txt

x-pack/platform/plugins/shared/inference/server/tasks/nl_to_esql/esql_docs/esql-st_xmax.txt

x-pack/platform/plugins/shared/inference/server/tasks/nl_to_esql/esql_docs/esql-st_ymin.txt

…-2025-01

pgayvallet · 2025-01-08T19:17:05Z

/ci

elasticmachine · 2025-01-08T20:44:50Z

💛 Build succeeded, but was flaky

Buildkite Build
Commit: b287ac1

Failed CI Steps

Post-Build

Metrics [docs]

‼️ ERROR: no builds found for mergeBase sha [9bdc995]

History

💔 Build #265262 failed b287ac1
💛 Build #265117 was flaky e8dd216

kibanamachine · 2025-01-09T07:04:46Z

Starting backport for target branches: 8.x

https://github.com/elastic/kibana/actions/runs/12685167949

## Summary Fix elastic#205606 - Re-generate the internal ES|QL documentation using the generation script (+ human review) - Add more scenario to the NL-to-ESQL evaluation suite - Some prompt engineering - improving the system instructions / functions summary - add more examples to the summary - adapt a few opinionated examples for some specific functions ## Evaluation - average based on 4 runs for each model/branch tuple - the new tests were locally added to main to run against the same suite and properly evaluate the difference | Model | before (main) | after (PR) | delta | | ------------- | ------------- | ------------- | ------------- | | GPT-4o | 90.9 | 97.74 | + 6.84 | | Claude 3.5 Sonnet v2 | 88.58 | 96.49 | +7.91 | | Gemini 1.5-pro-002 | 88.17 | 94.19 | +6.02 | Overall, the prompt engineering somewhat significantly improved the generation efficiency. (cherry picked from commit 5b96912)

kibanamachine · 2025-01-09T07:09:56Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

# Backport This will backport the following commits from `main` to `8.x`: - [[NL-to-ESQL] update internal documentation (#205853)](#205853)  ### Questions ? Please refer to the [Backport tool documentation](https://github.com/sqren/backport)  Co-authored-by: Pierre Gayvallet <[email protected]>

pgayvallet added 7 commits January 7, 2025 14:07

update documentation

57f7ee3

update system message

0913bdd

documentation tweaks / add examples

cbf03ad

system message tweak

1f4e023

add TO_DATETIME -> DATE_PARSE suggestion

4824db7

fix client

57a64b0

add tests to the evaluation suite

5f0279a

pgayvallet added release_note:skip Skip the PR/issue when compiling release notes v9.0.0 backport:version Backport to applied version labels Team:AI Infra AppEx AI Infrastructure Team v8.18.0 labels Jan 8, 2025

pgayvallet marked this pull request as ready for review January 8, 2025 12:18

pgayvallet requested a review from a team as a code owner January 8, 2025 12:18

pgayvallet added 2 commits January 8, 2025 13:35

Merge remote-tracking branch 'upstream/main' into kbn-205606-esql-doc…

7aea01e

…-2025-01

typo

e8dd216

pgayvallet commented Jan 8, 2025

View reviewed changes

legrego approved these changes Jan 8, 2025

View reviewed changes

pgayvallet added 2 commits January 8, 2025 17:19

minor doc fixes

c26ddb0

Merge remote-tracking branch 'upstream/main' into kbn-205606-esql-doc…

b287ac1

…-2025-01

pgayvallet merged commit 5b96912 into elastic:main Jan 9, 2025
8 checks passed

kibanamachine mentioned this pull request Jan 9, 2025

[8.x] [NL-to-ESQL] update internal documentation (#205853) #205995

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NL-to-ESQL] update internal documentation #205853

[NL-to-ESQL] update internal documentation #205853

pgayvallet commented Jan 8, 2025 •

edited by kibanamachine

Loading

pgayvallet commented Jan 8, 2025

elasticmachine commented Jan 8, 2025

pgayvallet Jan 8, 2025

legrego left a comment

legrego Jan 8, 2025

pgayvallet Jan 8, 2025

pgayvallet commented Jan 8, 2025

elasticmachine commented Jan 8, 2025 •

edited

Loading

kibanamachine commented Jan 9, 2025

kibanamachine commented Jan 9, 2025

[NL-to-ESQL] update internal documentation #205853

[NL-to-ESQL] update internal documentation #205853

Conversation

pgayvallet commented Jan 8, 2025 • edited by kibanamachine Loading

Summary

Evaluation

pgayvallet commented Jan 8, 2025

elasticmachine commented Jan 8, 2025

pgayvallet Jan 8, 2025

Choose a reason for hiding this comment

legrego left a comment

Choose a reason for hiding this comment

legrego Jan 8, 2025

Choose a reason for hiding this comment

pgayvallet Jan 8, 2025

Choose a reason for hiding this comment

pgayvallet commented Jan 8, 2025

elasticmachine commented Jan 8, 2025 • edited Loading

💛 Build succeeded, but was flaky

Failed CI Steps

Metrics [docs]

History

kibanamachine commented Jan 9, 2025

kibanamachine commented Jan 9, 2025

💚 All backports created successfully

Questions ?

pgayvallet commented Jan 8, 2025 •

edited by kibanamachine

Loading

elasticmachine commented Jan 8, 2025 •

edited

Loading