stream the output #52

Merged: 2 commits into fedora-copr:main on Aug 19, 2024

Conversation

@xsuchy (Member) commented Aug 6, 2024

so it is more interactive

@xsuchy marked this pull request as draft on August 6, 2024, 17:05
@xsuchy (Member, Author) commented Aug 6, 2024

Currently, logdetective waits a dozen minutes and then prints the whole result en bloc. I wanted to turn streaming on, but with streaming enabled the model keeps answering and never finishes.
I tried everything I could think of, but it still does not work.
Does anyone have an idea how to make it stop at the right time?
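
For context, a minimal sketch of what token streaming looks like with llama-cpp-python (the library behind the GGUF models in the README); the prompt, stop string, and token limit are illustrative assumptions, not the PR's actual settings:

```python
from llama_cpp import Llama

# Illustrative model path; logdetective resolves the real one from --model.
llm = Llama(model_path="Meta-Llama-3-8B-Instruct.Q5_K_S.gguf")

stream = llm.create_completion(
    "Explain this build failure: ...",
    max_tokens=512,       # hard upper bound so generation cannot run forever
    stop=["<|eot_id|>"],  # assumed stop string; Llama-3 models often need it
    stream=True,
)

for chunk in stream:
    choice = chunk["choices"][0]
    print(choice["text"], end="", flush=True)
    # On the final chunk, finish_reason is "stop" or "length".
    if choice.get("finish_reason") is not None:
        break
print()
```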

@xsuchy marked this pull request as ready for review on August 7, 2024, 11:03
@xsuchy changed the title from "Draft: stream the output" to "stream the output" on Aug 7, 2024
@xsuchy (Member, Author) commented Aug 7, 2024

I made streaming the default, but added a `--no-stream` option to work around broken models like llama3. Rather than adding heuristics for when to turn streaming off, I chose to document the broken model in the README.

Ready for re-review.
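
A minimal sketch of how such a flag can be wired up with argparse; the option name matches the PR, but the surrounding CLI setup is an assumption, not logdetective's actual code:

```python
import argparse

parser = argparse.ArgumentParser(prog="logdetective")
parser.add_argument("url", help="path or URL of the log to analyze")
parser.add_argument(
    "--no-stream",
    dest="stream",
    action="store_false",
    help="print the explanation at once instead of streaming it "
         "(workaround for models whose streamed output never stops)",
)
parser.set_defaults(stream=True)

args = parser.parse_args()
# args.stream is True by default and False when --no-stream is given.
```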

@TomasTomecek (Collaborator) left a comment


Tested locally and works really well!

However, there is a big gap (~30 seconds) between the 'Explanation:' text being printed and the first output appearing. We may want to investigate what is happening there (is the model being loaded into memory and initialized?) and print some kind of progress indicator.

LGTM, very nice!
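
One way the suggested progress indicator could look, as a rough sketch (this is only the reviewer's idea, not something the PR implements): a background spinner thread that runs while the model loads.

```python
import itertools
import sys
import threading
import time

def spinner(stop_event: threading.Event) -> None:
    """Spin on stderr until the event is set."""
    for frame in itertools.cycle("|/-\\"):
        if stop_event.is_set():
            break
        sys.stderr.write(f"\rLoading model... {frame}")
        sys.stderr.flush()
        time.sleep(0.1)
    sys.stderr.write("\r" + " " * 30 + "\r")  # clear the spinner line

stop = threading.Event()
thread = threading.Thread(target=spinner, args=(stop,), daemon=True)
thread.start()

time.sleep(3)  # stand-in for the slow model load and initialization

stop.set()
thread.join()
```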

README.md Outdated
@@ -55,6 +55,10 @@ Example you want to use a different model:
logdetective https://example.com/logs.txt --model https://huggingface.co/QuantFactory/Meta-Llama-3-8B-Instruct-GGUF/resolve/main/Meta-Llama-3-8B-Instruct.Q5_K_S.gguf?download=true
logdetective https://example.com/logs.txt --model QuantFactory/Meta-Llama-3-8B-Instruct-GGUF

Note that streaming with some models (notably Meta-Llama-3 is broken) is brokend and can be workarounded by `no-stream` option:
Collaborator:

s/brokend/broken/

Member (Author):

Fixed.

xsuchy added 2 commits on August 19, 2024, 08:32:

- so it is more interactive
- addressing: logdetective/logdetective.py:12: R0915[too-many-statements]: main: Too many statements (55/50)
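
The second commit resolves pylint's R0915 by moving statements out of `main` into helpers. A generic sketch of that refactor pattern; the helper names are hypothetical, not logdetective's actual functions:

```python
import argparse

def build_parser() -> argparse.ArgumentParser:
    """CLI definition extracted from main to shrink its statement count."""
    parser = argparse.ArgumentParser(prog="logdetective")
    parser.add_argument("url", help="path or URL of the log to analyze")
    return parser

def run_analysis(url: str) -> str:
    """Placeholder for the actual log download and model inference."""
    return f"analysis of {url}"

def main() -> None:
    # main now only orchestrates; the heavy lifting lives in helpers,
    # which keeps it under pylint's 50-statement limit (R0915).
    args = build_parser().parse_args()
    print(run_analysis(args.url))

if __name__ == "__main__":
    main()
```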
@jpodivin merged commit 525b456 into fedora-copr:main on Aug 19, 2024
2 checks passed