uv
as the package manager which is also open-source and free to use. To get started run:
Plugin Name | Description | Docs Link |
---|---|---|
Cartesia | TTS plugin for realistic voice synthesis in real-time voice applications | Cartesia |
Deepgram | STT plugin for fast, accurate real-time transcription with speaker diarization | Deepgram |
ElevenLabs | TTS plugin with highly realistic and expressive voices for conversational agents | ElevenLabs |
Kokoro | Local TTS engine for offline voice synthesis with low latency | Kokoro |
Moonshine | Local STT engine optimized for edge deployments and low-resource environments | Moonshine |
OpenAI | Realtime API for building conversational agents with out of the box support for real-time video directly over WebRTC | OpenAI |
Gemini | Realtime API for building conversational agents with support for both voice and video | Gemini |
Silero | VAD plugin for detecting human speech activity in real-time audio streams | Silero |
Wizper | STT plugin with real-time translation capabilities powered by Whisper v3 | Wizper |