diff options
| -rw-r--r-- | README.md | 14 | ||||
| -rw-r--r-- | Scripts/transcribe.py | 2 |
2 files changed, 10 insertions, 6 deletions
@@ -100,7 +100,7 @@ Avatar resources used: * Tris: 4 * Material slots: 1 * Texture memory: 340 KB (English), 130 MB (international) -* Parameters: 65-217 (configurable; more bits == faster paging) +* Parameter bits: 65-217 (configurable; more bits == faster paging) * Menu slots: 1 ## Motivation @@ -119,8 +119,8 @@ reason or another: update every ~2 seconds, making it a poor choice for latency-sensitive communication. 3. [KillFrenzy's AvatarText](https://github.com/killfrenzy96/KillFrenzyAvatarText) - only supports text-to-text, and is GPL, making it legally risky for people - who want to sell closed-source software. + only supports text-to-text. It's an excellent product with high-quality + source code, but it lacks integration with a client-side STT engine. 4. [I5UCC's VRCTextboxSTT](https://github.com/I5UCC/VRCTextboxSTT) makes KillFrenzy's AvatarText and Whisper kiss. It's the closest spiritual cousin to this repository. The author has made incredible sustained progress on @@ -222,7 +222,7 @@ Completed at commit 1f15133dd985442, AKA release 0.10.0. ### Milestone 3: STT Generally performant -Status: IN PROGRESS. +Status: COMPLETE. Scope: The speech-to-text may be used on resource constrained systems. @@ -236,9 +236,11 @@ it's caused by the inference layer being unable to "second guess" itself (previous transcriptions cannot be edited in the current architecture), or something else. +Completed at commit 1f2e5c6cf16e7e7, AKA release 0.11.2. + ### Milestone 4: Enable non-VRChat use cases -Status: IN PROGRESS. +Status: COMPLETE. Scope: The speech-to-text may be used as a tool for usecases outside of VRChat. @@ -247,6 +249,8 @@ to type into arbitrary text fields (voice-driven keyboard device). MMO players could also use the voice-driven keyboard (speak -> preview -> rapid commit?) while raiding. +Completed at commit 7a576bcac1c37c3, AKA release 0.13.1. + ### Milestone 5: Integration into other tools Status: NOT STARTED. diff --git a/Scripts/transcribe.py b/Scripts/transcribe.py index 25ce22b..a1b4e8e 100644 --- a/Scripts/transcribe.py +++ b/Scripts/transcribe.py @@ -4,7 +4,7 @@ from datetime import datetime from emotes_v2 import EmotesState from faster_whisper import WhisperModel from functools import partial -from math import floor, ceil +from math import ceil from playsound import playsound from profanity_filter import ProfanityFilter from sentence_splitter import split_text_into_sentences |
