| Commit message (Collapse) | Author | Age |
- update cursorignore
- add hallucination filter training & inference code
- put logging into a central module
- segment metadata logging occurs before filtering
- segment metadata logging is on by default
- check in embedded python setup script
- include trained hallucination filter model
|
Segment metadata can now be logged to a JSON file as the app runs. The goal
is to identify the parameters that correlate strongly with hallucinations.
Also:
* use 7zip for compression in build, speeding things up
* log dll download progress every few seconds
* shrink package
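
The segment metadata logging described above could be sketched roughly as follows. This is only an illustration, not the app's actual code: the function names `log_segment_metadata` and `read_segment_metadata`, the `logged_at` field, the example field `avg_logprob`, and the one-JSON-object-per-line file layout are all assumptions.

```python
import json
import time


def log_segment_metadata(path, metadata):
    """Append one segment's metadata to `path` as a single JSON line.

    Hypothetical helper: the real app may use a different file layout.
    """
    record = dict(metadata)
    # Add a timestamp unless the caller already supplied one.
    record.setdefault("logged_at", time.time())
    with open(path, "a", encoding="utf-8") as f:
        f.write(json.dumps(record, ensure_ascii=False) + "\n")


def read_segment_metadata(path):
    """Load every logged record back into a list of dicts for analysis."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]
```

Appending one record per line keeps each write cheap while the app runs, and the file can later be loaded in full to look for parameters that track hallucinated segments.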
|
* fix model acquisition
* fix local beepsnd
* fix volume control
|
- add desktop and vr input threads
- add audio feedback for input
- add volume control for audio feedback
- add UI for custom chatbox/built-in chatbox
- add ability to dismiss built-in chatbox (sync empty messages)
- limit lines in python console
- limit length of each transcript
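
One way the two limits above (console lines and transcript length) might be implemented is sketched below. The names `BoundedConsole` and `truncate_transcript`, the default of 500 lines, and the choice to keep the most recent text rather than the oldest are assumptions, not taken from the commit.

```python
from collections import deque


class BoundedConsole:
    """Keep only the most recent `max_lines` lines of console output."""

    def __init__(self, max_lines=500):
        # A deque with maxlen drops its oldest entry automatically when full.
        self._lines = deque(maxlen=max_lines)

    def append(self, text):
        # Split multi-line chunks so each stored entry is exactly one line.
        for line in text.splitlines():
            self._lines.append(line)

    def text(self):
        """Return the retained lines joined for display."""
        return "\n".join(self._lines)


def truncate_transcript(transcript, max_chars):
    """Keep the most recent `max_chars` characters of a transcript."""
    if max_chars <= 0:
        # Guard: transcript[-0:] would return the whole string.
        return ""
    return transcript[-max_chars:]
```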
|
- fix Unicode output from Python terminal
- fix CPU inference
- add filters
- add beam search params to UI
- DRY up config definition in UI
|
- Filters actually get applied now, huge accuracy boost
- Use silero-vad python library instead of rolling our own
- Expose prompt parameter
- Auto setup venv on launch
- Clean up python output
- Auto acquire all dependencies on launch
- Add icon
|
1. main STT app works in new project structure
2. UI dumps mics on startup to populate mic list
3. add missing deps (hf-xet, wave)
4. normalize audio volume when transcribing. Probably still wrong tbqh.
5. add checkbox to save audio segments & improve logic so only segments
with speech get saved.
6. add default config settings
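
For item 4, a simple peak normalization is one possible approach; the commit does not say which method the app uses. In the sketch below, `normalize_peak`, the `target_peak` default of 0.9, and the float sample range of roughly -1.0 to 1.0 are assumptions.

```python
def normalize_peak(samples, target_peak=0.9):
    """Scale float samples so the loudest one has magnitude `target_peak`.

    Hypothetical helper: assumes samples are floats in roughly [-1.0, 1.0].
    """
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        # Silence or empty input: nothing to scale, and avoids dividing by zero.
        return list(samples)
    gain = target_peak / peak
    return [s * gain for s in samples]
```

Peak normalization is sensitive to a single loud click in the segment, so an RMS-based gain may behave better on noisy microphone input.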
|
HEAVILY VIBE CODED!