<feed xmlns='http://www.w3.org/2005/Atom'>
<title>TaSTT.git/Scripts/transcribe.py, branch v0.1</title>
<subtitle>Free self-hosted STT for VRChat.</subtitle>
<id>https://git.yummers.dev/TaSTT.git/atom?h=v0.1</id>
<link rel='self' href='https://git.yummers.dev/TaSTT.git/atom?h=v0.1'/>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/'/>
<updated>2022-12-30T10:33:07+00:00</updated>
<entry>
<title>Bugfix: regions truncate correctly at page boundaries</title>
<updated>2022-12-30T10:33:07+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2022-12-30T10:25:50+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=d9a0224466189d0fe1d46f21d586fa8a1a58c687'/>
<id>urn:sha1:d9a0224466189d0fe1d46f21d586fa8a1a58c687</id>
<content type='text'>
Boards whose size is an exact multiple of CHARS_PER_SYNC would lose the
entire last region.

* Attempt to fix runaway memory usage of GUI text frames, but this needs
  more work
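
A minimal sketch of how a last-region bug like this can arise, not the actual
TaSTT code. It assumes regions are numbered from 0 and each holds up to
CHARS_PER_SYNC characters; the function names and the value 8 are
hypothetical. Taking the last region's length as `board_chars % chars_per_sync`
yields 0 when the board is a whole multiple of CHARS_PER_SYNC; clamping
against the remaining characters avoids that.

```python
CHARS_PER_SYNC = 8  # hypothetical value, for illustration only


def num_regions(board_chars: int, chars_per_sync: int = CHARS_PER_SYNC) -> int:
    """Number of regions needed to cover the board (ceiling division)."""
    return (board_chars + chars_per_sync - 1) // chars_per_sync


def region_length(region: int, board_chars: int,
                  chars_per_sync: int = CHARS_PER_SYNC) -> int:
    """Characters in the given region, truncated at the end of the board."""
    start = region * chars_per_sync
    remaining = board_chars - start
    # Clamp to [0, chars_per_sync] so a full last region keeps its full length.
    return max(0, min(chars_per_sync, remaining))
```

With a 16-character board, region 1 keeps all 8 characters; with a
12-character board, it is truncated to 4.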
</content>
</entry>
<entry>
<title>GUI: Expose transcription window duration</title>
<updated>2022-12-30T09:35:11+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2022-12-30T09:35:11+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=9a7190a04bbe6feba2abe4a1590eb8114c04f683'/>
<id>urn:sha1:9a7190a04bbe6feba2abe4a1590eb8114c04f683</id>
<content type='text'>
Users can pick longer transcription durations for accuracy-critical
tasks, or shorter durations for latency-critical tasks.
</content>
</entry>
<entry>
<title>Bugfix: regenerated FX layers now work on uploaded avatars</title>
<updated>2022-12-30T09:10:32+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2022-12-30T09:10:32+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=d1024fef1b216af5d3d991228c6b83311a71bb42'/>
<id>urn:sha1:d1024fef1b216af5d3d991228c6b83311a71bb42</id>
<content type='text'>
VRChat won't update the FX layer associated with an avatar unless its
GUID changes. Delete the GUID file when overwriting our generated FX
layer to work around this.

* Change paging behavior: when a region is updated, we re-page everything
  that comes after it. This fixes the issue where we go back to update
  something, then jump back to the current screen, leaving some random
  chunk of text somewhere on the board.
* Reduce the transcription window from 28s to 10s. I'm going to expose this to
  the user since there's a fundamental latency/stability tradeoff here.
</content>
</entry>
<entry>
<title>Fine-tune transcription</title>
<updated>2022-12-30T08:01:28+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2022-12-30T08:01:28+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=abdaa7ce215086bf1070d6093731cd35df866cbb'/>
<id>urn:sha1:abdaa7ce215086bf1070d6093731cd35df866cbb</id>
<content type='text'>
Bump up recording window to 28 seconds. This helps a lot with long-form
transcription tasks, such as transcribing an audiobook.

We should expose this as a parameter, since at 10s the transcription delay is
typically 300ms, while at 28s it's typically 1.1-1.2s.
</content>
</entry>
<entry>
<title>GUI: Users can now control board dimensions</title>
<updated>2022-12-30T07:01:09+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2022-12-30T07:01:09+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=12bcd1d40df21cb9bad6ae353ea4f6697e1275a6'/>
<id>urn:sha1:12bcd1d40df21cb9bad6ae353ea4f6697e1275a6</id>
<content type='text'>
Users can now control how many letters wide and tall the board is.

Tested at 4x48, 5x60, 10x120, and 20x240. At 20x240, Unity freezes and
does not make forward progress. Perhaps creating 4800 float parameters
isn't a truly scalable interface.
</content>
</entry>
<entry>
<title>Users can disable local beep</title>
<updated>2022-12-29T22:47:49+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2022-12-29T22:47:49+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=0b5c3841b5c2ed99b718c880bf0f161ac4db0788'/>
<id>urn:sha1:0b5c3841b5c2ed99b718c880bf0f161ac4db0788</id>
<content type='text'>
The transcription engine beeps when you start/stop transcribing so you know
that it's listening. Users can now disable this.

* add help text to all input fields in GUI
* make TaSTT generated file textctrls readonly, since I haven't tested
  them being reassigned
* document idea to configure Unity &amp; transcription apps with config files
* controller input thread no longer crashes if SteamVR isn't running; it just
  slowly spins and waits
* when you stop transcribing, the transcription engine re-transcribes a few
  times. I think this should improve end-of-transcription tail latencies
* transcribe.py now prints out its args
</content>
</entry>
<entry>
<title>Encapsulate paging &amp; text wrapping logic</title>
<updated>2022-12-28T00:07:10+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2022-12-28T00:07:10+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=f48ae0fffcd06f3cddd6cfc99b4c3d3a18c20038'/>
<id>urn:sha1:f48ae0fffcd06f3cddd6cfc99b4c3d3a18c20038</id>
<content type='text'>
Define proper interfaces for these things. Simplify osc_ctrl,
temporarily dropping support for emotes (they were broken anyway).

* Bugfix: Japanese no longer crashes transcribe.py, but it still doesn't
  show up in the wxTextCtrl
</content>
</entry>
<entry>
<title>Make transcription sleeps interruptible</title>
<updated>2022-12-24T20:37:22+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2022-12-24T20:37:22+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=6490a8a5db0a7eb8b3906765cd7ea26ad4b2824a'/>
<id>urn:sha1:6490a8a5db0a7eb8b3906765cd7ea26ad4b2824a</id>
<content type='text'>
This reduces the expected delay to wake up the board &amp; start
transcribing from 750 milliseconds to 2.5 milliseconds.
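
A minimal sketch of an interruptible sleep, assuming a `threading.Event` is
used as the wake-up signal. The names `wake` and `interruptible_sleep` are
hypothetical and not taken from transcribe.py.

```python
import threading

# Set by another thread (for example, the controller input thread) to wake
# the sleeper early.
wake = threading.Event()


def interruptible_sleep(seconds: float) -> bool:
    """Sleep for up to `seconds`. Return True if woken early, else False."""
    woken = wake.wait(timeout=seconds)
    # Reset the event so the next sleep blocks again.
    wake.clear()
    return woken
```

Unlike `time.sleep`, the wait returns as soon as another thread calls
`wake.set()`, so the wake-up delay no longer depends on the sleep length.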
</content>
</entry>
<entry>
<title>GUI: expose chars per sync, bytes per char</title>
<updated>2022-12-24T20:13:07+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2022-12-23T07:10:37+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=50d327b83b496085ec91e31100d12f5f60c7d4ac'/>
<id>urn:sha1:50d327b83b496085ec91e31100d12f5f60c7d4ac</id>
<content type='text'>
Users can now control how many characters they send per sync event, as
well as the number of bytes used to represent each character.

This gives them the power to pick between faster paging and fewer sync
params.

International users must use 2 bytes per char (at least for now).
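
A rough sketch of the tradeoff, assuming each byte of character data costs 8
bits of synced parameter space per sync event. The real parameter layout may
differ, and the function name is hypothetical.

```python
import math


def sync_tradeoff(rows: int, cols: int, chars_per_sync: int,
                  bytes_per_char: int) -> tuple[int, int]:
    """Return (sync events needed to fill the board, bits sent per sync)."""
    board_chars = rows * cols
    syncs_per_board = math.ceil(board_chars / chars_per_sync)
    bits_per_sync = chars_per_sync * bytes_per_char * 8
    return syncs_per_board, bits_per_sync


# Compare a few hypothetical settings on a 4x48 board, 1 byte per char.
for chars_per_sync in (8, 16, 24):
    print(chars_per_sync, sync_tradeoff(4, 48, chars_per_sync, 1))
```

Doubling chars per sync halves the number of sync events needed to fill the
board but doubles the bits sent per event; going from 1 to 2 bytes per char
doubles the bits again.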

* package.ps1: don't distribute the gigantic TTF files, just the bitmaps
</content>
</entry>
<entry>
<title>Quick hack: don't exponentially back off when unpaused</title>
<updated>2022-12-22T21:35:32+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2022-12-22T21:34:05+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=02a1cfb67bbd67bbaca53de77ce3aad97648427b'/>
<id>urn:sha1:02a1cfb67bbd67bbaca53de77ce3aad97648427b</id>
<content type='text'>
This fixed some slowness I was seeing when waking up the STT. The right
fix is to add interruptible sleeps. Let's fix this soon.
</content>
</entry>
</feed>
