blob: 644384c06d847040c8b1ee30754e3515e54d8813 (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
|
This library implements high-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model.
The library requires a hardware GPU which supports Direct3D 11.0, a 64-bit Windows OS, only works within 64-bit processes, and requires a 64 bit CPU which supports SSE 4.1.
The main entry point of the llibrary is `Whisper.Library` static class.
Call `loadModel` function from that class to load an ML model from a binary file.
These binary files are available for free download on [Hugging Face]( https://huggingface.co/datasets/ggerganov/whisper.cpp).
I recommend `ggml-medium.bin` (1.42GB in size), because I’ve mostly tested the software with that model.
Once the model is loaded, create a context by calling `createContext` extension method,
then use that object to transcribe or translate multimedia files or realtime audio captured from microphones.
|