whisperd 🎙️
A simple HTTP server written in Rust for the OpenAI Whisper speech-to-text model.
Features ✨
- 🎧 Transcribe audio files
- 🔄 OpenAI API compatibility
- 🌈 Models
tiny.en, tiny, base.en, base, small.en, small, medium.en, medium, large, large-v1
- 🌎 Languages
- 🇬🇧 English (en)
- 🇨🇳 Chinese (zh)
- 🇩🇪 German (de)
- 🇪🇸 Spanish (es)
- 🇷🇺 Russian (ru)
Quickstart 🚀
- Clone this repository:
- Navigate to the repository and build:
- Run the server:
Now, the server is running at http://localhost:8000 and ready to transcribe!
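As a rough illustration of what "OpenAI API compatibility" could look like from a client, the sketch below posts an audio file to the running server using only the Rust standard library. It assumes the server mirrors OpenAI's transcription route (`/v1/audio/transcriptions`) and its `file` and `model` multipart form fields; this README does not confirm the route, so treat the path, the `tiny.en` model choice, and the helper names (`build_multipart_body`, `build_request`, `transcribe`) as hypothetical.

```rust
use std::io::{Read, Write};
use std::net::TcpStream;

/// Build a multipart/form-data body with a `model` text field and a `file` part.
/// The field names follow OpenAI's transcription API (assumed, not confirmed here).
fn build_multipart_body(boundary: &str, model: &str, filename: &str, audio: &[u8]) -> Vec<u8> {
    let mut body = Vec::new();
    // Text field carrying the model name.
    body.extend_from_slice(
        format!("--{boundary}\r\nContent-Disposition: form-data; name=\"model\"\r\n\r\n{model}\r\n")
            .as_bytes(),
    );
    // File field header, followed by the raw audio bytes.
    body.extend_from_slice(
        format!(
            "--{boundary}\r\nContent-Disposition: form-data; name=\"file\"; filename=\"{filename}\"\r\nContent-Type: application/octet-stream\r\n\r\n"
        )
        .as_bytes(),
    );
    body.extend_from_slice(audio);
    // Closing boundary.
    body.extend_from_slice(format!("\r\n--{boundary}--\r\n").as_bytes());
    body
}

/// Wrap the body in a plain HTTP/1.1 POST request.
fn build_request(host: &str, path: &str, boundary: &str, body: &[u8]) -> Vec<u8> {
    let mut request = format!(
        "POST {path} HTTP/1.1\r\nHost: {host}\r\nContent-Type: multipart/form-data; boundary={boundary}\r\nContent-Length: {}\r\nConnection: close\r\n\r\n",
        body.len()
    )
    .into_bytes();
    request.extend_from_slice(body);
    request
}

/// Send the request and return the raw HTTP response (status line, headers, body).
fn transcribe(addr: &str, path: &str, model: &str, filename: &str, audio: &[u8]) -> std::io::Result<String> {
    let boundary = "whisperd-example-boundary";
    let body = build_multipart_body(boundary, model, filename, audio);
    let request = build_request(addr, path, boundary, &body);
    let mut stream = TcpStream::connect(addr)?;
    stream.write_all(&request)?;
    let mut response = Vec::new();
    stream.read_to_end(&mut response)?;
    Ok(String::from_utf8_lossy(&response).into_owned())
}

fn main() {
    // Usage: pass the path to an audio file as the first argument.
    let Some(audio_path) = std::env::args().nth(1) else {
        eprintln!("usage: <program> <audio-file>");
        return;
    };
    let audio = match std::fs::read(&audio_path) {
        Ok(bytes) => bytes,
        Err(err) => {
            eprintln!("could not read {audio_path}: {err}");
            return;
        }
    };
    // Assumed endpoint path; adjust if the server exposes a different route.
    match transcribe("localhost:8000", "/v1/audio/transcriptions", "tiny.en", &audio_path, &audio) {
        Ok(response) => println!("{response}"),
        Err(err) => eprintln!("request failed: {err}"),
    }
}
```

The request is built by hand to keep the sketch dependency-free; in a real client an HTTP crate or an existing OpenAI client pointed at `http://localhost:8000` would likely be more convenient.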
Usage 🛠️
CLI Commands
- Start the transcription server:
- Transcribe a given audio file (the model is downloaded automatically from Hugging Face):
For more advanced options, use:
Contribution 🤝
Pull requests and issues are welcome!
License 📜
This project is licensed under the MIT License - see the LICENSE file for details.