Loading...
Loading...
Local speech-to-text via Handy app (push-to-talk) and NeMo CLI scripts. Parakeet V3: 25 languages, auto-detection, ~30x realtime on M4 Max, 6% WER. This skill should be used when transcribing audio files or dictating voice input.
npx skill4agent add tdimino/claude-code-minoan parakeetbrew install --cask handy~/Library/Application Support/com.pais.handy/models/| System | Speed | Engine |
|---|---|---|
| Handy (M4 Max) | ~30x realtime | transcribe-rs / ONNX int8 |
| Handy (Zen 3) | ~20x realtime | transcribe-rs / ONNX int8 |
| Handy (Skylake i5) | ~5x realtime | transcribe-rs / ONNX int8 |
| NeMo CLI (MPS) | Varies | NeMo / PyTorch |
/parakeet path/to/audio.wav
/parakeet ~/recordings/interview.mp3
/parakeet meeting.m4a.wav.mp3.m4a.flac.ogg.aac/parakeet
/parakeet dictate/parakeet checkbrew install --cask handy~/Programming/parakeet-dictate/cd ~/Programming/parakeet-dictate
uv venv && uv pip install -r requirements.txtexport PARAKEET_HOME=/path/to/parakeet-dictatecd ~/.claude/skills/parakeet/scripts && \
${PARAKEET_HOME:-~/Programming/parakeet-dictate}/.venv/bin/python transcribe.py "<filepath>"cd ~/.claude/skills/parakeet/scripts && \
${PARAKEET_HOME:-~/Programming/parakeet-dictate}/.venv/bin/python dictate.pycd ~/.claude/skills/parakeet/scripts && \
${PARAKEET_HOME:-~/Programming/parakeet-dictate}/.venv/bin/python check_setup.py| System | Cache Location | Size | Engine |
|---|---|---|---|
| Handy | | ~478MB | transcribe-rs (ONNX int8) |
| NeMo CLI | | ~1.2GB | NeMo / PyTorch |
parakeet-tdt-0.6b-v3-int8/
├── encoder-model.int8.onnx
├── decoder_joint-model.int8.onnx
├── nemo128.onnx (audio preprocessor)
└── vocab.txt| Variable | Default | Description |
|---|---|---|
| | Parakeet Dictate installation path |
brew install --cask handy$PARAKEET_HOME~/Programming/parakeet-dictate$PARAKEET_HOME/.venvnemo_toolkit[asr]>=2.0.0