<feed xmlns='http://www.w3.org/2005/Atom'>
<title>TaSTT.git/Scripts, branch v0.12.0</title>
<subtitle>Free self-hosted STT for VRChat.</subtitle>
<id>https://git.yummers.dev/TaSTT.git/atom?h=v0.12.0</id>
<link rel='self' href='https://git.yummers.dev/TaSTT.git/atom?h=v0.12.0'/>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/'/>
<updated>2023-05-31T02:13:25+00:00</updated>
<entry>
<title>Finish translation for Western European language speakers</title>
<updated>2023-05-31T02:13:25+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-05-31T02:01:56+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=0bda49279ec80187d49a922ff2a47141ffb2fd8f'/>
<id>urn:sha1:0bda49279ec80187d49a922ff2a47141ffb2fd8f</id>
<content type='text'>
NLLB needs its input to be split up into sentences. I use the
sentence_splitter Python package to do this. It supports ~20 Western
European languages, but notably, no Asian languages.

* Sort spoken language list. English is still at the top.
* Remove 'Translation source' dropdown. Infer this from the spoken
  language.
* Add lang_compat.py to map language codes between the various libraries
  (whisper, nllb, sentence_splitter).
* Fix bug where old text would appear in textbox when you first bring it
  up.
</content>
</entry>
<entry>
<title>Add ability to translate into 200 languages</title>
<updated>2023-05-26T05:00:56+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-05-26T04:45:09+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=84f09e1fdf15644d1ea5f955889581932e4f6a8e'/>
<id>urn:sha1:84f09e1fdf15644d1ea5f955889581932e4f6a8e</id>
<content type='text'>
Use Meta's No Language Left Behind (NLLB) algorithm to provide
translation capabilities into 200 languages. Obviously most are very
untested.

This requires either 4.1 or 7.1 GB of RAM and significiantly increases
transcription latency.
</content>
</entry>
<entry>
<title>Add more text filters</title>
<updated>2023-05-25T06:07:07+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-05-25T06:07:07+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=eed2e8915d83796679c0b7a3ea9121d329ceddab'/>
<id>urn:sha1:eed2e8915d83796679c0b7a3ea9121d329ceddab</id>
<content type='text'>
Add 3 filters:
* Remove trailing period
* Convert to uppercase
* Convert to lowercase

All may be composed. Upper/lower just overwrite each other so just use
one.
</content>
</entry>
<entry>
<title>Add UI toggle for uwu filter</title>
<updated>2023-05-25T05:34:55+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-05-25T05:34:55+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=ebab21e22dea7931a8a7db05ff26d60cec4f5b5b'/>
<id>urn:sha1:ebab21e22dea7931a8a7db05ff26d60cec4f5b5b</id>
<content type='text'>
UI now has a checkbox for the uwu filter. Does not materially affect
resource usage or latency when enabled.
</content>
</entry>
<entry>
<title>Begin work on uwu filter</title>
<updated>2023-05-25T01:39:27+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-05-25T01:39:27+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=156a592f35ced01008516e559b62db8829baa787'/>
<id>urn:sha1:156a592f35ced01008516e559b62db8829baa787</id>
<content type='text'>
Use UwwwuPP to translate your boring old speech into uwu-ified version.

Still need to add a UI toggle for this.
</content>
</entry>
<entry>
<title>Add ability to type using STT</title>
<updated>2023-05-23T22:15:39+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-05-23T22:12:04+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=78de7f02bc364606b0939d66903f02b2f91c141b'/>
<id>urn:sha1:78de7f02bc364606b0939d66903f02b2f91c141b</id>
<content type='text'>
To use it, do a medium hold + long hold. Keep the long hold depressed
until you're done speaking. The transcription will be typed into the
currently selected input field.

* Add more audio feedback
* Make audio feedback play asynchronously so it doesn't slow down the
  controller input state machine as much.
</content>
</entry>
<entry>
<title>Add ability to update textbox in place</title>
<updated>2023-05-22T22:50:12+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-05-22T22:35:17+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=2d302f73b94a6aa3b2127443c1ba7ddfc9c82052'/>
<id>urn:sha1:2d302f73b94a6aa3b2127443c1ba7ddfc9c82052</id>
<content type='text'>
By holding the button while talking for at least 1.5 seconds, you can
update the contents of the textbox without unlocking it from worldspace.
So now you can carefully position your textbox once, then continually
speak into it without having to reposition it every time.
</content>
</entry>
<entry>
<title>Add keyboard toggle</title>
<updated>2023-05-22T11:04:09+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-05-22T10:59:45+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=8fafea9d026b2b65599456e70d3f5aa61ef073d1'/>
<id>urn:sha1:8fafea9d026b2b65599456e70d3f5aa61ef073d1</id>
<content type='text'>
Users can now configure a keybind to start/stop/dismiss the STT when in
desktop mode. The default keybind is ctrl+x, since by default VRC
doesn't use 'x' for anything.
</content>
</entry>
<entry>
<title>Fix accidental semicolon typo</title>
<updated>2023-05-22T04:46:08+00:00</updated>
<author>
<name>faker</name>
<email>nspyia2002@gmail.com</email>
</author>
<published>2023-05-22T04:46:08+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=75bce58469fea55b2a57adb0ad4dc97c7870731d'/>
<id>urn:sha1:75bce58469fea55b2a57adb0ad4dc97c7870731d</id>
<content type='text'>
</content>
</entry>
<entry>
<title>Enable selecting specific GPU when transcribing</title>
<updated>2023-05-22T04:27:50+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-05-22T04:27:50+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=2d544407e64bed9328ee493d3daa095b20445b52'/>
<id>urn:sha1:2d544407e64bed9328ee493d3daa095b20445b52</id>
<content type='text'>
Useful on devices with multiple GPUs, such as gaming laptops.

* Update GUI/README.md.
</content>
</entry>
</feed>
