<feed xmlns='http://www.w3.org/2005/Atom'>
<title>TaSTT.git/TaSTT-Whisper, branch master</title>
<subtitle>Free self-hosted STT for VRChat.</subtitle>
<id>https://git.yummers.dev/TaSTT.git/atom?h=master</id>
<link rel='self' href='https://git.yummers.dev/TaSTT.git/atom?h=master'/>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/'/>
<updated>2023-06-27T00:21:59+00:00</updated>
<entry>
<title>Scrub out old C++-based Whisper code</title>
<updated>2023-06-27T00:21:59+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-06-27T00:21:59+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=694756a96a6109cd79a77221dd4e40638ff55b82'/>
<id>urn:sha1:694756a96a6109cd79a77221dd4e40638ff55b82</id>
<content type='text'>
No longer used.
</content>
</entry>
<entry>
<title>Begin integrating faster-whisper</title>
<updated>2023-04-24T03:52:36+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-04-24T03:52:36+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=b4bb6524652e0f76834ca26a4afa232855ca1348'/>
<id>urn:sha1:b4bb6524652e0f76834ca26a4afa232855ca1348</id>
<content type='text'>
This is a much faster, lower-VRAM reimplementation of Whisper in Python.
Early testing is extremely promising: fast transcription speed,
extremely low resource usage (CPU/RAM/VRAM), high accuracy.
</content>
</entry>
<entry>
<title>Fix _socket module not found issue</title>
<updated>2023-03-21T22:02:29+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-03-21T21:28:46+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=656d7c2092545b18d981acfac000c73fb2128e4a'/>
<id>urn:sha1:656d7c2092545b18d981acfac000c73fb2128e4a</id>
<content type='text'>
The embedded Python needs python310._pth, specifically its 'import site'
line, to work with pip.
</content>
</entry>
<entry>
<title>Expose more C++ whisper parameters in GUI</title>
<updated>2023-03-08T22:59:57+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-03-08T22:59:57+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=b6dc76afb4f76a8b0eaa8f821c557cd83f69daf4'/>
<id>urn:sha1:b6dc76afb4f76a8b0eaa8f821c557cd83f69daf4</id>
<content type='text'>
Expose decode method, beam search parameters, and voice activity
detection parameters in GUI.

* Remove WhisperCPP::Init(); initialize on launch instead.
* Add float support to ConfigMarshal
</content>
</entry>
<entry>
<title>Animator generation and dumping mics no longer hang GUI</title>
<updated>2023-03-06T00:26:18+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-03-06T00:26:18+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=058cd57ed2d18d6e14408881c3229fab7edbadb1'/>
<id>urn:sha1:058cd57ed2d18d6e14408881c3229fab7edbadb1</id>
<content type='text'>
Do these in a std::future.

* SetAffinityMask() now returns a value on all control paths
</content>
</entry>
<entry>
<title>Bugfix: fix use-after-free in GetMicsImpl</title>
<updated>2023-02-28T08:34:10+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-02-28T08:34:10+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=f65b93aa6f0a5b7ddd069cd1f50f142029e4a5e5'/>
<id>urn:sha1:f65b93aa6f0a5b7ddd069cd1f50f142029e4a5e5</id>
<content type='text'>
* Plumb beam search params into whisper cpp implementation
  (currently broken)
</content>
</entry>
<entry>
<title>Improve behavior around VAD segmentation events</title>
<updated>2023-02-27T04:12:36+00:00</updated>
<author>
<name>yum</name>
<email>yum.food.vr@gmail.com</email>
</author>
<published>2023-02-27T04:12:36+00:00</published>
<link rel='alternate' type='text/html' href='https://git.yummers.dev/TaSTT.git/commit/?id=d96851f716b9e18d827ba7795a343dbf3cf529c4'/>
<id>urn:sha1:d96851f716b9e18d827ba7795a343dbf3cf529c4</id>
<content type='text'>
Use a forked Whisper implementation with tweaks that reduce dropped
words around the beginning of VAD segments.

* Retain audio after VAD segmentation events
</content>
</entry>
</feed>
