Speech AI, in your terminal.
Transcribe, synthesize, and stream audio. The official CLI for Deepgram's speech platform.
Everything speech AI. One command.
Built for developers who live in the terminal and need their tools to keep up.
Speech-to-text
Transcribe files and URLs with nova-3. Speaker diarization, smart formatting, topic detection, and summaries — all from one command.
Live streaming
Stream over WebSocket from a microphone or stdin. Real-time transcripts with interim results and word-level timing.
Text-to-speech
Generate speech with Aura voices. Pipe to ffplay or save to file.
Agent-ready by default
Auto-detects Claude Code, Aider, and Codex. Disables prompts, routes status to stderr, and switches output to JSON, with no flags needed.
UNIX philosophy
Pipe-friendly. Script-ready.
Every command writes structured data to stdout and diagnostics to stderr.
Switch formats with -o json or let it auto-switch when piped. Plays nicely with every UNIX tool you already know.
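The stdout/stderr split and the auto-switch can be pictured with a short sketch. This is not how dg is implemented; it is a minimal Python illustration, assuming the format is picked by checking whether stdout is a terminal. The names `choose_format` and `emit`, and the `"text"` default, are hypothetical.

```python
import json
import sys


def choose_format(explicit, stdout_is_tty):
    """Pick an output format: an explicit choice wins, else JSON when piped."""
    if explicit is not None:
        return explicit
    return "text" if stdout_is_tty else "json"


def emit(result, explicit=None, out=sys.stdout, err=sys.stderr):
    """Write the result to `out` and a status line to `err`."""
    is_tty = hasattr(out, "isatty") and out.isatty()
    fmt = choose_format(explicit, is_tty)
    # Diagnostics go to the error stream so they never pollute piped data.
    print(f"format: {fmt}", file=err)
    if fmt == "json":
        print(json.dumps(result), file=out)
    else:
        print(result["transcript"], file=out)
```

A script built this way and run as `python script.py | jq .` would send JSON down the pipe while the status line still shows up in the terminal.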
ALL COMMANDS
Every workflow, covered.
From a quick transcription to a full production pipeline — there's a command for that.
Transcribe a local file
Smart formatting, punctuation, and speaker diarization on any audio or video file.
Fetch and transcribe a URL
Live microphone transcription
Real-time WebSocket stream with interim results as you speak.
Stream via ffmpeg
Pipe any source ffmpeg can read — HLS, RTMP, webcam, or broadcast.
Write live WebVTT captions
Pipe real-time caption output directly to a .vtt file.
Pipe transcript to an LLM
Chain with any LLM CLI for instant summarization or Q&A.
Synthesize speech to a file
Pipe TTS to your speaker
Any text in, audio out. Works with ffplay, aplay, or mpv.
Analyze a text document
Topics, sentiment, and summary — all in one pass against Deepgram's text intelligence API.
Summarize piped text
Authenticate the CLI
Walks through API key setup and saves it to your local profile.
Create a scoped API key
List all projects
View project members
Browse available models
Filter by type, language, or tier to find the right model.
Check usage and limits
View account balance
Audit recent API calls
Call any Deepgram endpoint
Escape hatch for endpoints not yet covered by a dedicated command.
Diagnose audio devices
Lists input devices and tests recording capability.
Test Deepgram connectivity
Checks DNS, TLS, and WebSocket reachability to Deepgram endpoints.
Scaffold a starter app
Clone a Deepgram starter app, pre-wired and ready to run.
Run as MCP server
Expose every CLI command to Claude, Cursor, and other MCP clients.
Regenerate AI skill files
Keep your coding agent's context current with the latest commands.
Install a community plugin
Self-update the CLI
Detects your install method and runs the right upgrade command.
Install shell tab-completion
Get started in seconds
One command. Every platform. Authenticate once, then go.
Works with your AI coding tools
dg auto-detects Claude Code, Aider, Codex and more. Detected agents get non-interactive defaults, clean JSON output, and stderr-only status, automatically.
dg listen audio.mp3 --agent-friendly
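For scripts and agents that call the CLI, a wrapper along these lines may be useful. It is a sketch under two assumptions: that the command prints JSON on stdout, as described above, and that status text arrives on stderr. The helper name `run_json` is hypothetical, and the shape of the JSON is not specified on this page, so the sketch only parses it.

```python
import json
import subprocess


def run_json(cmd):
    """Run a command; return (JSON parsed from stdout, stderr text)."""
    # check=True raises CalledProcessError on a non-zero exit code.
    proc = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return json.loads(proc.stdout), proc.stderr
```

With dg installed and authenticated, `run_json(["dg", "listen", "audio.mp3", "--agent-friendly"])` would return the parsed result and any status text as separate values, so diagnostics never have to be filtered out of the data.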