Loading...
Loading...
Real-time streaming speech-to-text via Deepgram WebSocket API — sub-300 ms latency, Nova-2 model, speaker diarization, auto-reconnect.
npx skill4agent add framersai/agentos-skills streaming-stt-deepgramDEEPGRAM_API_KEY{
"voice": {
"stt": "deepgram"
}
}{
"voice": {
"stt": "deepgram",
"providerOptions": {
"model": "nova-2",
"diarize": true,
"keywords": ["AgentOS:2"],
"endpointing": 300
}
}
}nova-2diarize: truespeakerendpointingproviderOptions.keywords| Event | Description |
|---|---|
| Every hypothesis (interim + final) |
| Non-final hypothesis |
| Stable, final hypothesis |
| First non-empty word in an utterance |
| Deepgram |
| Unrecoverable provider error |
| Session fully terminated |
DEEPGRAM_API_KEY