<feed xmlns='http://www.w3.org/2005/Atom'>
<title>TaSTT.git/GUI, branch v0.16.0</title>
<subtitle>Free self-hosted STT for VRChat.</subtitle>
<id>https://git.yummers.dev/TaSTT.git/atom?h=v0.16.0</id>
<link rel='self' href='https://git.yummers.dev/TaSTT.git/atom?h=v0.16.0'/>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/'/>
<updated>2023-10-06T01:28:42+00:00</updated>
<entry>
<title>Transcripts preceding long pauses now drop</title>
<updated>2023-10-06T01:28:42+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-10-06T01:22:55+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=add7bd8ef86ec21cd1327eb45bcb739aa54f7db8'/>
<id>urn:sha1:add7bd8ef86ec21cd1327eb45bcb739aa54f7db8</id>
<content type='text'>
When hot-miking into the built-in chatbox, there are sometimes long
pauses in conversation. After these pauses, it's undesirable to show the
transcript generate before the pause. This feature makes it so that
those transcripts can be dropped.

Also:

* Limit number of segments sent to browser source to 10. Allow this to
  grow up to 10 segments before dropping the first 5 segments.
* Silence warnings generated by `install_in_venv`, used by e.g.
  translation codepath.
* Enable audio normalization to improve accuracy when speaking softly,
  at the cost of some accuracy when speaking normally.

Credit: user endo0269 on Discord suggested this feature.
</content>
</entry>
<entry>
<title>Reimplement BrowserSource as a StreamingPlugin</title>
<updated>2023-09-19T04:23:14+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-09-19T04:00:56+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=c2bc70c18d2fd1c3601b32f2a93b3b4a704786a5'/>
<id>urn:sha1:c2bc70c18d2fd1c3601b32f2a93b3b4a704786a5</id>
<content type='text'>
BrowserSource now fades text out continuously over time.

TODO

* Delete C++ webserver, browsersource, transcript code
* Add UI for text age fading
</content>
</entry>
<entry>
<title>Add UI for process priority</title>
<updated>2023-09-17T20:05:29+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-09-17T20:05:29+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=b037e158065bec98d91231c0c6443b63f45ec7ea'/>
<id>urn:sha1:b037e158065bec98d91231c0c6443b63f45ec7ea</id>
<content type='text'>
Default is normal prio.
</content>
</entry>
<entry>
<title>Bugfixes</title>
<updated>2023-09-16T22:49:55+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-09-16T22:49:55+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=d4c85f4ac4cb627e2611359d18615d76eda29c90'/>
<id>urn:sha1:d4c85f4ac4cb627e2611359d18615d76eda29c90</id>
<content type='text'>
* uwu filter no longer adds extra whitespace before/after segments. This
  would defeat commit logic.
* disabling phonemes works again - path to prefab was being quoted
  twice, breaking the codepath.
</content>
</entry>
<entry>
<title>General cleanup</title>
<updated>2023-09-14T04:55:05+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-09-14T04:43:33+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=703e183e430f15aff6005d38aefcacf91e2314cd'/>
<id>urn:sha1:703e183e430f15aff6005d38aefcacf91e2314cd</id>
<content type='text'>
Remove unused proxy code, curl, and images.
</content>
</entry>
<entry>
<title>Pin huggingface_hub to 0.16.4</title>
<updated>2023-09-11T20:53:14+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-09-11T20:46:57+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=0447f37fb744a1b350f6b92e4d140dbdb1c8d3ec'/>
<id>urn:sha1:0447f37fb744a1b350f6b92e4d140dbdb1c8d3ec</id>
<content type='text'>
0.17.x are breaking faster_whisper's ability to download models.

Also:
* Start using frozen requirements.txt.
* Conditionally install torch &amp; legacy whisper only when doing
  mechanical optimization.
</content>
</entry>
<entry>
<title>Users can now choose custom chatbox texture size in UI</title>
<updated>2023-09-11T00:29:01+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-09-11T00:29:01+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=3db7723aa5c16358f73e3e8d3bb20a959ce43d5d'/>
<id>urn:sha1:3db7723aa5c16358f73e3e8d3bb20a959ce43d5d</id>
<content type='text'>
</content>
</entry>
<entry>
<title>Bugfix: only cap display of transcript at 4K chars</title>
<updated>2023-09-10T10:50:46+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-09-10T10:50:46+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=da360bcd1870650848daa034aa31b8a337caecc7'/>
<id>urn:sha1:da360bcd1870650848daa034aa31b8a337caecc7</id>
<content type='text'>
Actually retain the whole transcript to avoid breaking the OSC pager.

Also constrain the UI buffer size by characters instead of lines. Since
some lines can be massive and others short, characters are a better way
of consistently keeping the UI memory in check.
</content>
</entry>
<entry>
<title>Add UI for transcription loop delay</title>
<updated>2023-09-10T07:29:04+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-09-10T07:28:03+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=4ee396584c348c11e0272f0c2842f6a5d3175586'/>
<id>urn:sha1:4ee396584c348c11e0272f0c2842f6a5d3175586</id>
<content type='text'>
Allows users to directly modulate the performance-latency tradeoff.

Also:
* Bump up UI buffer to 1k lines.
* Fix browser source reset. It now also resets preview text.
</content>
</entry>
<entry>
<title>Browser source now shows preview text as slightly transparent</title>
<updated>2023-09-10T06:18:32+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-09-10T06:18:32+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=9924a141b0b1266671915be12e21df6c8f4c5366'/>
<id>urn:sha1:9924a141b0b1266671915be12e21df6c8f4c5366</id>
<content type='text'>
Improves viewer experience.
</content>
</entry>
</feed>
