summaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
authoryum <yum.food.vr@gmail.com>2023-06-27 01:47:53 -0700
committeryum <yum.food.vr@gmail.com>2023-06-27 01:50:40 -0700
commit8ff761153b7de23a9556f2af179ab3bf4b9849a5 (patch)
tree0c7b2a1c54e5ccab6f5b6ad45207c5071ec69afc
parent40e0202f5954c475c9c48155b95bc4dc67433242 (diff)
Add grey background to browser src
Should improve legibility. * Update README
-rw-r--r--BrowserSource/index.html1
-rw-r--r--README.md15
2 files changed, 9 insertions, 7 deletions
diff --git a/BrowserSource/index.html b/BrowserSource/index.html
index d3ab777..000f366 100644
--- a/BrowserSource/index.html
+++ b/BrowserSource/index.html
@@ -32,6 +32,7 @@
success: function(data) {
var transcript = data.transcript;
$('#transcript').html(transcript);
+ $('#transcript').css("background-color", "#00000080");
},
error: function(jqXHR, textStatus, errorThrown) {
console.error('Error getting transcript: ', textStatus, errorThrown);
diff --git a/README.md b/README.md
index 6b75998..324a88b 100644
--- a/README.md
+++ b/README.md
@@ -56,6 +56,7 @@ Basic controls:
* Custom textbox requires as few as 65 parameter bits
* Transcription doesn't affect VRChat framerate much, since VRC is heavily
CPU-bound. Performance impact when not speaking is negligible.
+* Browser source. Use with OBS!
* Multi-language support.
* Japanese, Korean, and Chinese glyphs included, among many other languages.
* Full list of Unicode blocks is defined in
@@ -286,8 +287,8 @@ features and bugfixes will likely continue in the "completed" state.
1. ~~Error detection & correction.~~ DONE
2. ~~Text-to-text interface. Type in terminal, show in game.~~ DONE
3. ~~Speech-to-text interface. Speak out loud, show in game.~~ DONE
- 4. Translation into non-English. Whisper natively supports translating N
- languages into English, but not the other way around.
+ 4. ~~Translation into non-English. Whisper natively supports translating N
+ languages into English, but not the other way around.~~ DONE
5. Display text in overlay. Enables (1) lower latency view of TaSTT's
transcription state; (2) checking transcriptions ahead of time; (3)
checking transcriptions without having to see the board in game.
@@ -300,8 +301,8 @@ features and bugfixes will likely continue in the "completed" state.
keybinds.~~ DONE
9. One-click upgrade. Fetch latest stable release, copy over virtual env and
UI configs, relaunch.
- 10. Browser source for OBS. Blocker: the transcription layer doesn't handle
- long pauses well.
+ 10. ~~Browser source for OBS. Blocker: the transcription layer doesn't handle
+ long pauses well.~~ DONE
11. Test suite. Some long representative transcripts with mechanical word
error rate (WER) calculation.
4. Optimization
@@ -326,8 +327,8 @@ features and bugfixes will likely continue in the "completed" state.
The other encoding scheme is thus ~2.5 times more efficient. This could
be used to significantly speed up sync times. (Thanks, Noppers for the
idea!)
- 6. Use Const-Me/Whisper for transcription.
- 7. Implement beam search in Const-Me/Whisper.
+ 6. ~~Use Const-Me/Whisper for transcription.~~ WON'T DO
+ 7. ~~Implement beam search in Const-Me/Whisper.~~ WON'T DO
5. Bugfixes
1. ~~The whisper STT says "Thank you." when there's no audio?~~ DONE
2. JP and CN transcription does not work in the GUI due to encoding issues.
@@ -335,5 +336,5 @@ features and bugfixes will likely continue in the "completed" state.
1. Smooth scrolling.
2. ~~Infinite scrolling.~~ DONE
3. ~~Sound indicator, maybe like animal crossing :)~~ DONE
- 4. Support texture-based PBR shading
+ 4. ~~Support texture-based PBR shading~~ DONE