chough
Original:🇺🇸 English
Translated
Fast ASR CLI tool for transcribing audio/video files. Use when user wants to transcribe audio/video, generate subtitles (VTT), convert speech to text with timestamps (JSON), or optimize transcription for low memory.
8installs
Sourcehyperpuncher/dotagents
Added on
NPX Install
npx skill4agent add hyperpuncher/dotagents choughTags
Translated version includes tags in frontmatterSKILL.md Content
View Translation Comparison →Installation
Arch Linux:
macOS:
Windows:
Source:
paru -S chough-binbrew install --cask hyperpuncher/tap/choughwinget install choughgo install github.com/hyperpuncher/chough/cmd/chough@latestRequires: for audio/video support
ffmpegQuick Reference
bash
# Basic transcription (text to stdout)
chough audio.mp3
# JSON with timestamps
chough -f json podcast.mp3 > transcript.json
# WebVTT subtitles
chough -f vtt -o subs.vtt video.mp4
# Low memory (30s chunks)
chough -c 30 audiobook.mp3Flags
| Flag | Description | Default |
|---|---|---|
| Chunk size in seconds | 60 |
| Output: text, json, vtt | text |
| Output file | stdout |
| Show version | - |
Chunk Size Guide
- 15-30s: Low memory (~500MB), higher error rate
- 60s: Balanced (default) - ~1.6GB RAM
Performance
| Duration | Time | Speed |
|---|---|---|
| 15s | 2.0s | 7.4x realtime |
| 1min | 4.3s | 14.1x realtime |
| 5min | 16.2s | 18.5x realtime |
| 30min | 90.2s | 19.9x realtime |
Troubleshooting
Out of memory: Use or
Model fails: Check internet, verify is writable
ffmpeg errors: Ensure ffmpeg is installed
-c 30-c 15$XDG_CACHE_HOMENotes
- First run downloads ~650MB model to
$XDG_CACHE_HOME/chough/models - Auto-extracts audio from video files
- Set env var to use custom model path
CHOUGH_MODEL - VTT groups tokens into subtitle cues automatically