███████╗██████╗ ███████╗ █████╗ ██╗ ██╗
██╔════╝██╔══██╗██╔════╝██╔══██╗██║ ██╔╝
███████╗██████╔╝█████╗ ███████║█████╔╝
╚════██║██╔═══╝ ██╔══╝ ██╔══██║██╔═██╗
███████║██║ ███████╗██║ ██║██║ ██╗
╚══════╝╚═╝ ╚══════╝╚═╝ ╚═╝╚═╝ ╚═╝
Voice cloning. Long documents. Audiobook quality. Local & private.
speak article.md --stream → Audio starts in seconds
For AI Agents (Claude Code, Cursor, Windsurf):
npx skills add EmZod/speakCLI:
git clone https://github.com/EmZod/speak.git
cd speak && bun install
alias speak="bun run $(pwd)/src/index.ts"Requirements: macOS Apple Silicon · Bun · Python 3.10+ · sox (brew install sox)
speak "Hello, world!" --play # Generate and play
speak article.md --stream # Stream long content
speak document.md --output out.wav # Save to file
speak --clipboard --play # Read from clipboardClone any voice from a 10-30 second sample:
# Use your cloned voice
speak "Hello" --voice ~/.chatter/voices/morgan_freeman.wav --playspeak book.md --auto-chunk --output book.wav # Auto-chunk for reliability
speak --resume manifest.json # Resume interrupted generation
speak *.md --output-dir ~/Audio/ # Batch processing
speak --estimate document.md # Estimate duration firstspeak <text|file> Generate speech
speak health Check system status
speak models List available models
speak concat <files> Combine audio files
speak daemon kill Stop TTS server
--play Play after generation
--stream Stream as it generates
--output Output file or directory
--voice Custom voice file (WAV)
--auto-chunk Chunk long documents
--estimate Show duration estimate
--dry-run Preview without generating
Long documents ████████████████████ Streaming, auto-chunk
Voice cloning ████████████████████ Any voice from sample
Emotion tags ████████████████████ [laugh], [sigh], etc.
Quality ████████████████████ Audiobook grade
Need instant audio (~90ms)? Try speakturbo.
| File | Content |
|---|---|
| SKILL.md | Full usage guide for agents |
| docs/usage.md | Complete CLI reference |
| docs/troubleshooting.md | Common issues & fixes |
| AGENTS.md | Architecture & development |
MIT License · Built on Chatterbox TTS