TTS Engines

DPO Reader supports three text-to-speech backends with different trade-offs.

Bark (Default)

Neural TTS that runs locally. Produces natural-sounding speech with good intonation and emotion.

dpo-reader listen URL -e bark

Bark works without a GPU, but generation is slower on CPU. On Apple Silicon Macs, it uses Metal Performance Shaders automatically.

OpenAI

Cloud-based TTS with the best quality of the three engines. Requires an API key and is billed per character.

export OPENAI_API_KEY=sk-...
dpo-reader listen URL -e openai

Available voices: alloy, echo, fable, onyx, nova, shimmer.
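
Because the OpenAI engine depends on `OPENAI_API_KEY`, it can help to fail early when the variable is missing. The following is a minimal sketch; `require_openai_key` is a hypothetical helper name, not part of DPO Reader, and it only checks that the variable is non-empty, not that the key is valid.

```shell
# Hypothetical helper (not part of dpo-reader): refuse to continue
# when OPENAI_API_KEY is unset or empty.
require_openai_key() {
    if [ -z "${OPENAI_API_KEY:-}" ]; then
        echo "OPENAI_API_KEY is not set; export it before using -e openai" >&2
        return 1
    fi
}

# Usage (URL is a placeholder, as in the examples above):
#   require_openai_key && dpo-reader listen URL -e openai
```

With this guard, a missing key produces a clear local error message before the engine is invoked.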

Piper

Lightweight local TTS optimized for CPU. Good for batch processing or machines without GPUs.

uv pip install "dpo-reader[piper]"
dpo-reader listen URL -e piper

Piper uses ONNX runtime and works on any machine. Models download automatically on first use.
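
For the batch-processing case, one possible approach is a small loop over a file of URLs. This is a sketch under assumptions: `batch_listen` and the one-URL-per-line file format are hypothetical, only the documented `dpo-reader listen URL -e ENGINE` form is used, and whether `listen` blocks until playback finishes is not stated here.

```shell
# Hypothetical helper (not part of dpo-reader): run every URL listed in a
# file through the chosen engine, one at a time.
# Arguments: $1 = engine name (e.g. piper), $2 = file with one URL per line.
batch_listen() {
    engine=$1
    url_file=$2
    failed=0
    while IFS= read -r url || [ -n "$url" ]; do
        # Skip blank lines and comment lines starting with '#'.
        case $url in
            ''|'#'*) continue ;;
        esac
        # Redirect stdin so the command cannot consume the URL list.
        if ! dpo-reader listen "$url" -e "$engine" </dev/null; then
            echo "failed: $url" >&2
            failed=$((failed + 1))
        fi
    done < "$url_file"
    # Succeed only if every URL was processed without error.
    [ "$failed" -eq 0 ]
}

# Usage (urls.txt is a placeholder file name):
#   batch_listen piper urls.txt
```

The loop keeps going after a failure and reports the failed URLs on stderr, so one bad page does not stop the rest of the batch.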

Comparison

Engine	Quality	Speed	Requirements
OpenAI	Best	Fast	API key ($)
Bark	Excellent	Slow	GPU helps
Piper	Good	Fast	CPU only
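
The table can be turned into a rough selection heuristic. The sketch below is one way to do it, not DPO Reader's own logic: `pick_engine` is a hypothetical helper, and treating the presence of `nvidia-smi` as a sign of a usable GPU is an assumption.

```shell
# Hypothetical helper (not part of dpo-reader): print an engine name
# based on the trade-offs in the comparison table.
pick_engine() {
    if [ -n "${OPENAI_API_KEY:-}" ]; then
        # An API key is available: use the highest-quality engine.
        echo openai
    elif command -v nvidia-smi >/dev/null 2>&1; then
        # Assumption: nvidia-smi on PATH means an NVIDIA GPU can be used.
        echo bark
    elif [ "$(uname -s)" = "Darwin" ] && [ "$(uname -m)" = "arm64" ]; then
        # Apple Silicon: Bark uses Metal Performance Shaders.
        echo bark
    else
        # CPU-only machine: prefer the lightweight engine.
        echo piper
    fi
}

# Usage (URL is a placeholder, as in the examples above):
#   dpo-reader listen URL -e "$(pick_engine)"
```

The order of the checks encodes a preference for quality over cost; anyone who wants to avoid per-character charges could move the API key check last or drop it.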