| author | yum <yum.food.vr@gmail.com> | 2023-06-27 16:01:16 -0700 |
|---|---|---|
| committer | yum <yum.food.vr@gmail.com> | 2023-06-27 16:01:16 -0700 |
| commit | 6638993e313773ba6ca8bdb6d7690b798d41f0d4 (patch) | |
| tree | 75ce815ed4cca607ad237a39fd4ee5c313fb8504 /Scripts/generate_fonts.py | |
| parent | 241813a5af11093c6b86e70ada729788c1f0dee6 (diff) | |
Add UI for fuzzy commit threshold
Recap: the STT has an algorithm that tries to determine when a
transcript is "stable" enough to commit. If the criterion is too loose,
accuracy suffers; if it is too strict, the audio buffer eventually fills.
To mitigate the problem, I check whether the last N transcripts are
within some Levenshtein edit distance of each other. The fuzzy matching
lets us forgive small instabilities, like differences in capitalization
or punctuation, while rejecting large ones.
The default threshold of 8 seems to hit the sweet spot between accuracy
and performance, but it will likely be tuned in the future.
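
The check described above could look roughly like the sketch below. It is illustrative only and not taken from this commit. The names `levenshtein` and `is_stable`, the window size `n=3`, the pairwise comparison, and the inclusive `<=` test are all assumptions. Only the default threshold of 8 comes from the message.

```python
from itertools import combinations


def levenshtein(a: str, b: str) -> int:
    """Edit distance counting single-character inserts, deletes, substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string on the inner loop
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        cur = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or free match
            ))
        prev = cur
    return prev[-1]


def is_stable(transcripts, n=3, threshold=8):
    """Hypothetical commit check: are the last n transcripts mutually close?

    Assumes "within some edit distance of each other" means every pair in
    the window is at most `threshold` edits apart.
    """
    if len(transcripts) < n:
        return False  # not enough history to judge stability
    recent = transcripts[-n:]
    return all(levenshtein(x, y) <= threshold
               for x, y in combinations(recent, 2))
```

With this sketch, a window such as `["Hello world", "hello world.", "Hello, world"]` would count as stable, since each pair differs only by a few case or punctuation edits. A transcript that grew by a whole phrase would not.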
Diffstat (limited to 'Scripts/generate_fonts.py')
0 files changed, 0 insertions, 0 deletions
