Worker for the SUNET transcription service
-
Create a virtual environment:
python3 -m venv venv
-
Activate the virtual environment:
source venv/bin/activate
-
Install the required packages:
pip install -r requirements.txt
-
Edit the environment settings, which should be in a file named .env. The following settings should be sufficient for most cases:
DEBUG=True
API_BACKEND_URL="http://localhost:8000"
API_VERSION="v1"
WORKERS=2
WHISPER_CPP_PATH=<Path to whisper.cpp>
FILE_STORAGE_DIR="<Your file storage directory>"
-
Build and install whisper.cpp; see https://github.com/ggml-org/whisper.cpp for details.
-
Download the needed Whisper models. From the transcriber-worker directory run:
./download_models.sh
-
Run the worker:
python3 main.py --foreground --debug
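
Before starting the worker, it can help to confirm that the .env file contains the settings listed in the steps above. The following is a minimal sketch of such a check, not part of the worker itself. The function names `parse_env` and `missing_settings` are hypothetical. Treating every setting except DEBUG as required is an assumption made for illustration, since how the worker actually reads its settings is not described here.

```python
# Hypothetical pre-flight check for the .env file; not part of the worker.
# Assumption: every setting listed above except DEBUG is treated as required.
REQUIRED_KEYS = (
    "API_BACKEND_URL",
    "API_VERSION",
    "WORKERS",
    "WHISPER_CPP_PATH",
    "FILE_STORAGE_DIR",
)


def parse_env(text):
    """Parse KEY=VALUE lines into a dict, skipping blank lines and comments."""
    settings = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        value = value.strip()
        # Drop one pair of matching surrounding quotes, if present.
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
            value = value[1:-1]
        settings[key.strip()] = value
    return settings


def missing_settings(settings):
    """Return the required keys that are absent or empty, in a fixed order."""
    return [key for key in REQUIRED_KEYS if not settings.get(key)]
```

To use it, read the file with `pathlib.Path(".env").read_text()`, pass the text to `parse_env`, and print whatever `missing_settings` returns. An empty list means all the settings assumed to be required are present.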