diff options
| author | yum <yum.food.vr@gmail.com> | 2023-07-07 17:54:40 -0700 |
|---|---|---|
| committer | yum <yum.food.vr@gmail.com> | 2023-07-07 17:54:40 -0700 |
| commit | 5db7426bb14b7e51275c14d8173bd67e8addc4ce (patch) | |
| tree | 4b741a3c706aa8226b7d739a092d4475b4566404 /README.md | |
| parent | 7a576bcac1c37c3c5a59fadf172aa70b15ff83c8 (diff) | |
Update README
Mostly updating roadmap stuff. Non-VRC use cases are "complete" since I
was mostly targeting streaming. The ability to type into arbitrary text
fields is still somewhat nascent & could be improved.
Also update some other random stuff to be more up to date. KillFrenzy
Avatar Text is now MIT, pog!
Diffstat (limited to 'README.md')
| -rw-r--r-- | README.md | 14 |
1 files changed, 9 insertions, 5 deletions
@@ -100,7 +100,7 @@ Avatar resources used: * Tris: 4 * Material slots: 1 * Texture memory: 340 KB (English), 130 MB (international) -* Parameters: 65-217 (configurable; more bits == faster paging) +* Parameter bits: 65-217 (configurable; more bits == faster paging) * Menu slots: 1 ## Motivation @@ -119,8 +119,8 @@ reason or another: update every ~2 seconds, making it a poor choice for latency-sensitive communication. 3. [KillFrenzy's AvatarText](https://github.com/killfrenzy96/KillFrenzyAvatarText) - only supports text-to-text, and is GPL, making it legally risky for people - who want to sell closed-source software. + only supports text-to-text. It's an excellent product with high-quality + source code, but it lacks integration with a client-side STT engine. 4. [I5UCC's VRCTextboxSTT](https://github.com/I5UCC/VRCTextboxSTT) makes KillFrenzy's AvatarText and Whisper kiss. It's the closest spiritual cousin to this repository. The author has made incredible sustained progress on @@ -222,7 +222,7 @@ Completed at commit 1f15133dd985442, AKA release 0.10.0. ### Milestone 3: STT Generally performant -Status: IN PROGRESS. +Status: COMPLETE. Scope: The speech-to-text may be used on resource constrained systems. @@ -236,9 +236,11 @@ it's caused by the inference layer being unable to "second guess" itself (previous transcriptions cannot be edited in the current architecture), or something else. +Completed at commit 1f2e5c6cf16e7e7, AKA release 0.11.2. + ### Milestone 4: Enable non-VRChat use cases -Status: IN PROGRESS. +Status: COMPLETE. Scope: The speech-to-text may be used as a tool for usecases outside of VRChat. @@ -247,6 +249,8 @@ to type into arbitrary text fields (voice-driven keyboard device). MMO players could also use the voice-driven keyboard (speak -> preview -> rapid commit?) while raiding. +Completed at commit 7a576bcac1c37c3, AKA release 0.13.1. + ### Milestone 5: Integration into other tools Status: NOT STARTED. |
