Transcribe audio files to text using OpenAI Whisper
npx skill4agent add trpc-group/trpc-agent-go whisper
python3 scripts/transcribe.py <audio_file> <output_file>
# Specify model size (default: base)
python3 scripts/transcribe.py audio.mp3 transcript.txt --model medium
# Specify language (improves accuracy)
python3 scripts/transcribe.py audio.mp3 transcript.txt --language zh
# Include timestamps
python3 scripts/transcribe.py audio.mp3 transcript.txt --timestamps
# JSON output with metadata
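python3 scripts/transcribe.py audio.mp3 output.json --format json

- `audio_file`: input audio file (for example, `audio.mp3`)
- `output_file`: file the transcript is written to
- `--model`: model size (default: `base`)
- `--language`: language of the audio, such as `zh`; specifying it improves accuracy
- `--timestamps`: include timestamps in the output
- `--format`: output format (for example, `json`)

| Model | Parameters | Speed | Accuracy | Memory |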
|---|---|---|---|---|
| tiny | 39M | ~32x | Good | ~1GB |
| base | 74M | ~16x | Better | ~1GB |
| small | 244M | ~6x | Great | ~2GB |
| medium | 769M | ~2x | Excellent | ~5GB |
| large | 1.5B | 1x | Best | ~10GB |
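
The table can be read as a simple lookup: pick the most accurate model whose memory requirement fits the machine. The sketch below is a hypothetical helper, not part of `scripts/transcribe.py`; the names `MODEL_MEMORY_GB` and `pick_model` are assumptions made for illustration, and the memory figures are the approximate values from the table.

```python
# Approximate memory needs in GB, copied from the table above.
MODEL_MEMORY_GB = {
    "tiny": 1,
    "base": 1,
    "small": 2,
    "medium": 5,
    "large": 10,
}

# Ordered from most to least accurate, following the table.
MODELS_BY_ACCURACY = ["large", "medium", "small", "base", "tiny"]


def pick_model(available_gb: float) -> str:
    """Return the most accurate model whose listed memory fits the budget."""
    for name in MODELS_BY_ACCURACY:
        if MODEL_MEMORY_GB[name] <= available_gb:
            return name
    raise ValueError(f"no model fits in {available_gb} GB")


if __name__ == "__main__":
    # With roughly 6 GB free, `medium` is the largest model that fits.
    print(pick_model(6))
```

The returned name can be passed to the `--model` option. Because the table's figures are approximate, it is probably wise to leave some headroom rather than choose a model that only just fits.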
pip install openai-whisper
sudo apt-get install ffmpeg # Ubuntu/Debian
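
With both dependencies in place, the core of a transcription script can be approximated with the `openai-whisper` Python API. What follows is a minimal sketch under stated assumptions, not the actual contents of `scripts/transcribe.py`: the helpers `missing_prerequisites`, `format_timestamp`, and `transcribe` are hypothetical names, and the timestamp layout is one possible choice. The calls `whisper.load_model` and `model.transcribe`, and the `text` and `segments` keys of the result, come from the `openai-whisper` package.

```python
import importlib.util
import shutil


def missing_prerequisites() -> list[str]:
    """List the dependencies from the install steps that cannot be found."""
    missing = []
    if importlib.util.find_spec("whisper") is None:
        missing.append("openai-whisper (pip install openai-whisper)")
    if shutil.which("ffmpeg") is None:
        missing.append("ffmpeg")
    return missing


def format_timestamp(seconds: float) -> str:
    """Render seconds as HH:MM:SS.mmm (an assumed layout)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def transcribe(audio_path, model_name="base", language=None, timestamps=False):
    """Transcribe one file; optionally prefix each segment with its time span."""
    import whisper  # imported lazily so the helpers above work without it

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path, language=language)
    if not timestamps:
        return result["text"].strip()
    lines = []
    for seg in result["segments"]:
        span = f"{format_timestamp(seg['start'])} --> {format_timestamp(seg['end'])}"
        lines.append(f"[{span}] {seg['text'].strip()}")
    return "\n".join(lines)


if __name__ == "__main__":
    problems = missing_prerequisites()
    if problems:
        print("Missing:", ", ".join(problems))
```

Running the prerequisite check first gives a clearer message than the error Whisper raises when `ffmpeg` is not on the `PATH`.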