TaSTT.git - Free self-hosted STT for VRChat.

	Commit message (Collapse)	Author	Age
*	Delete unused filesv1.0.0-beta00	yum	2025-07-23
\|
*	Replace hardcoded localhost with js magicv0.20.0	yum	2024-07-12
\| \| \| \| \| \|	Use some js magic to deduce the hostname instead of hardcoding localhost. If you used the browser source under 127.0.0.1, then you'd get XSS blocked from making the ajax calls. This fixes that.
*	Fix spacing in browser source	yum	2024-07-12
\|
*	Reimplement BrowserSource as a StreamingPlugin	yum	2023-09-18
\| \| \| \| \| \| \| \| \|	BrowserSource now fades text out continuously over time. TODO * Delete C++ webserver, browsersource, transcript code * Add UI for text age fading
*	General cleanupv0.15.3	yum	2023-09-13
\| \| \| \|	Remove unused proxy code, curl, and images.
*	Browser source now shows preview text as slightly transparent	yum	2023-09-09
\| \| \| \|	Improves viewer experience.
*	Begin work on proxy server	yum	2023-07-03
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Create a simple server with 3 endpoints: * /create_session: Create a session and return its identifier. * /set_transcript: Update a session's transcript. * /get_transcript: Fetch a session's transcript. Right now the session ID provides authentication and authorization. There is no public/private ID so you have to trust whoever you share your ID with. IDs are long and generated by the server, so it should be somewhat secure against low-effort hacking. Other updates: * Drop whisper_requirements.txt - no longer needed. * Vendor curl to make it easier to interact with the server. TODO: * Fuzz test the server.
*	Add visual commit indicator to OBS browser source	yum	2023-06-30
\| \| \| \| \| \| \| \|	Circle goes red when speaking, grey when done. Ideally it would be in the top right portion of the browser source, but this is a good start. Also, hard-cap transcripts to 4096 chars. This prevents the STT from lagging during long sessions.
*	Add grey background to browser src	yum	2023-06-27
\| \| \| \| \| \|	Should improve legibility. * Update README
*	Add browser source, hardcoded to port 8097	yum	2023-06-26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Transcription output now streams to localhost:8097. In OBS: * Create a browser source. * url: localhost:8097 * width: 2200 * height: 400 TODO: * Put behind toggle. * Create input field for port. Misc cleanup: * transcribe.py: Drop frames from audio capture thread instead of the transcription thread. Doing it the other way would result in occasional data loss.
*	Begin integrating faster-whisperv0.11.0	yum	2023-04-23
\| \| \| \| \| \|	This is a much faster, lower-VRAM reimplementation of Whisper in Python. Early testing is extremely promising: fast transcription speed, extremely low resource usage (CPU/RAM/VRAM), high accuracy.
*	Complete OBS browser source	yum	2023-02-25
\| \| \| \| \|	* Implement HTTPMapper classes * Browser source respects user-configured source port
*	Finish browser source proof-of-concept	yum	2023-02-24
\| \| \| \| \| \|	It's a crashy mess, but it sort of works. * Add Transcript class to send transcription segments between layers
*	Add HTML for BrowserSource	yum	2023-02-24
	Browser source queries /api/transcript at 10Hz via jquery and renders the response.