summaryrefslogtreecommitdiffstats
path: root/app/stt.py
Commit message (Collapse)AuthorAge
* Set target loudness to -16, and enable segment metadata logging by defaultHEADmasteryum2025-07-23
|
* Update avg_logprob cutoff, fix sounds, fix electron buildyum2025-07-23
|
* add segment metadata logging featureyum2025-07-23
| | | | | | | | | | | Segment metadata can now be logged to a json as the app runs. The goal is to identify the params that heavily correlate with hallucinations. Also: * use 7zip for compression in build, speeding things up * log dll download progress every few seconds * shrink package
* bugfixesyum2025-07-23
| | | | | | * fix model acquisition * fix local beepsnd * fix volume control
* More stuffyum2025-05-30
| | | | | | | | | | - add desktop and vr input threads - add audio feedback for input - add volume control for audio feedback - add UI for custom chatbox/built in chatbox - add ability to dismiss built in chatbox (sync empty messages) - limit lines in python console - limit length of each transcript
* More stuffyum2025-05-30
| | | | | | | | - fix unicode output from python terminal - fix cpu inference - add filters - add beam search params to UI - DRY up config definition in UI
* More polishyum2025-05-30
| | | | | | | | | | - Filters actually get applied now, huge accuracy boost - Use silero-vad python library instead of rolling our own - Expose prompt parameter - Auto setup venv on launch - Clean up python output - Auto acquire all dependencies on launch - Add icon
* More UI workyum2025-05-29
| | | | | | | | | | 1. main STT app works in new project structure 2. UI dumps mics on startup to populate mic list 3. add missing deps (hf-xet, wave) 4. normalize audio volume when transcribing. Probably still wrong tbqh. 5. add checkbox to save audio segments & improve logic so only segments with speech get saved. 6. add default config settings
* Move core app logic into folderyum2025-05-29