This Python project downloads and parses the list of speech recognition models from the official Vosk models page (https://alphacephei.com/vosk/models). It extracts information about each available model, such as name, language, size, WER, notes, download URL, and license, and saves the results to a JSON file (vosk_models.json
).
- Make sure you have Python 3 installed.
- Install the required dependencies:
pip install requests beautifulsoup4
- Run the script:
python parse_vosk_models.py
- After execution, the file
vosk_models.json
will contain the parsed models data in JSON format.
This tool is useful for anyone who wants to programmatically access and analyze the list of available Vosk speech recognition models, including their download links and metadata.