You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The text was updated successfully, but these errors were encountered:
lreardon
changed the title
Word-by-word timestamps restart when multithreading
Token-level timestamps reset at breakpoints when multithreading
Jan 3, 2025
lreardon
changed the title
Token-level timestamps reset at breakpoints when multithreading
Token-level timestamps reset at breakpoints when multi-threading
Jan 3, 2025
My invocation of the command is as follows:
$PROJECT_ROOT/build/bin/whisper-cli -m $PROJECT_ROOT/models/ggml-$model_name.bin -f audio/$audio_name/preprocessed.wav -ojf -of outputs/$audio_name.transcribed -p X
-ojf
outputs to json "full" (i.e. with more data).X
is the number of threads to use for transcription.When
X > 1
, then at the file breakpoints, the token-level timestamps reset to0
. However, the phrase-level timestamps remain accurate.A sample output is attached, where
X = 4
:lin-alg-lect.transcribed.json
The text was updated successfully, but these errors were encountered: