Vision Agents ships with 30+ plugins that connect AI providers to your real-time voice and video applications. Each plugin wraps a provider’s API with a consistent interface — swap providers without rewriting your agent logic.
Vision Agents requires a Stream account for real-time transport. Most providers offer free tiers to get started.

Which plugin do I need?

Pick based on what your agent needs to do:
| I want to… | Start here | What you get |
| --- | --- | --- |
| Handle calls and respond naturally by voice | Realtime | End-to-end voice agent with multimodal support, unified under one plugin and model |
| Connect to my own tools, APIs, or knowledge base | Language Models | Function calling, RAG, and full control over STT/TTS choices |
| Transcribe what users say in real time | Speech-to-Text | Streaming transcription, some with built-in turn detection |
| Give my agent a distinct, natural voice | Text-to-Speech | Cloud and local options, from expressive to ultra-low latency |
| See and understand what’s on camera | Vision & Video | Object detection, video analysis, and style transfer |
| Put a face on my agent | Avatars | Real-time lip-synced visual characters |
| Make conversations feel natural, not robotic | Turn Detection | Smart interruption handling and silence detection |
| Run open-source models on my own infrastructure | Infrastructure | Self-hosted inference, model routing, and vector search |
| Deploy agents over Tencent’s network in China | Edge Transport | Alternative transport layer with low latency in mainland China |

Installation

Plugins install as extras. Add only the ones you need:
```bash
uv add "vision-agents[gemini,deepgram,elevenlabs]"
```
See the Installation guide for the full list of available extras.

Browse by Category

Language Models

Text generation with function calling. Requires separate STT/TTS plugins.
| Provider | Notes |
| --- | --- |
| Gemini | Built-in tools: search, code execution, RAG |
| OpenAI | Responses API (GPT-5+) and ChatCompletions |
| xAI (Grok) | Advanced reasoning, function calling |
| OpenRouter | Unified API for Claude, Gemini, GPT, and more |
| Kimi AI | OpenAI-compatible via ChatCompletions |
| Qwen | DashScope API via ChatCompletions |

Realtime

End-to-end speech-to-speech with built-in STT/TTS. Lowest latency, simplest setup.
| Provider | Notes |
| --- | --- |
| Gemini Realtime | WebSocket, optional video, built-in VAD |
| OpenAI Realtime | WebRTC, built-in STT/TTS |
| Qwen Realtime | Native audio I/O, video support |
| xAI Realtime | WebSocket, server VAD, web + X search |
| AWS Bedrock | Amazon Nova models, auto session management |

Speech-to-Text

Real-time transcription. Some include built-in turn detection.
| Provider | Notes |
| --- | --- |
| Deepgram | Nova-3, built-in turn detection |
| ElevenLabs | Scribe v2, ~150ms latency, built-in VAD |
| AssemblyAI | Punctuation-based turn detection |
| Fish Audio | Auto language detection |
| Mistral Voxtral | WebSocket streaming, requires separate turn detection |
| Fast-Whisper | Local, CPU/GPU accelerated |
| Wizper | Whisper v3, on-the-fly translation |

Text-to-Speech

Voice synthesis for agent responses.
| Provider | Notes |
| --- | --- |
| ElevenLabs | Highly realistic, multilingual |
| Cartesia | Low-latency Sonic model |
| Deepgram | Aura-2, low-latency |
| OpenAI | gpt-4o-mini-tts, streaming |
| Fish Audio | Prosody control, voice cloning |
| Inworld | Expressive game character voices |
| Kokoro | Local, runs on CPU, no API key |
| Pocket TTS | Local, ~200ms latency, voice cloning |
| xAI | Five expressive voices, speech tags |
| AWS Polly | Standard and neural engines |

Vision & Video

Video understanding, object detection, and video transformation.
| Provider | Notes |
| --- | --- |
| Moondream | Zero-shot detection, VQA, cloud or local |
| NVIDIA | Cosmos Reason2, real-time video understanding |
| Roboflow | Pre-trained and custom detection models |
| Ultralytics YOLO | Pose estimation, object detection |
| Decart | Real-time AI video style transfer |

Avatars

Visual AI characters with synchronized lip-sync.
| Provider | Notes |
| --- | --- |
| HeyGen | Realistic AI avatars, automatic lip-sync |
| LemonSlice | Real-time interactive avatars |

Turn Detection

Controls when the agent should start and stop speaking.
| Provider | Notes |
| --- | --- |
| Smart Turn | Silero VAD + Whisper features |
| Vogent | Neural turn completion prediction |
Deepgram and ElevenLabs STT include built-in turn detection — no separate plugin needed.
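To illustrate the simplest idea in this space, silence detection can be sketched as an energy threshold over audio frames. This is a toy example, not how Smart Turn or Vogent actually work (both use neural models on top of VAD):

```python
# Toy end-of-turn detector: the speaker is considered finished after
# N consecutive low-energy frames. Real turn-detection plugins combine
# VAD with acoustic and linguistic features instead of raw energy.

def detect_turn_end(frames, energy_threshold=0.01, silence_frames=5):
    """Return the index of the frame where the turn ends, or None."""
    silent = 0
    for i, frame in enumerate(frames):
        energy = sum(s * s for s in frame) / len(frame)
        silent = silent + 1 if energy < energy_threshold else 0
        if silent >= silence_frames:
            return i
    return None

# Ten frames of speech (high energy) followed by six frames of silence:
speech = [[0.5, -0.4, 0.3]] * 10
silence = [[0.001, -0.001, 0.0]] * 6
print(detect_turn_end(speech + silence))  # → 14
```

The weakness of this naive approach, and the reason dedicated plugins exist, is that a thoughtful pause looks identical to an end of turn.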

Infrastructure

Inference platforms and data services for running models on your own terms.
| Provider | Notes |
| --- | --- |
| Baseten | OpenAI-compatible endpoints for open-source models |
| HuggingFace Inference | Unified API routing to Together, Groq, Cerebras, and more |
| TurboPuffer | Vector database for RAG with hybrid search |

Edge Transport

Alternative real-time transport layers for deploying agents in specific regions.
| Provider | Notes |
| --- | --- |
| Tencent RTC | Low-latency in China, frontend SDKs (early access) |

Consistent Interface

Plugins of the same type share a common interface — swap providers in one line:
```python
# Any STT plugin works the same way
stt = deepgram.STT()
stt = elevenlabs.STT()
stt = fish.STT()

# Any TTS plugin works the same way
tts = elevenlabs.TTS()
tts = cartesia.TTS()
tts = kokoro.TTS()

# Any LLM plugin works the same way
llm = gemini.LLM("gemini-3-flash-preview")
llm = openai.LLM(model="gpt-5.4")
llm = openrouter.LLM(model="anthropic/claude-sonnet-4")
```
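This interchangeability works because plugins of a given type share a base interface, so agent logic never touches provider-specific code. A self-contained sketch of the idea using a Python Protocol — the class and method names here are hypothetical, not the actual vision-agents base classes:

```python
from typing import Protocol

class TTSProvider(Protocol):
    """Anything with a synthesize() method qualifies as a TTS plugin."""
    def synthesize(self, text: str) -> bytes: ...

class FakeElevenLabsTTS:
    def synthesize(self, text: str) -> bytes:
        return f"[elevenlabs audio for: {text}]".encode()

class FakeCartesiaTTS:
    def synthesize(self, text: str) -> bytes:
        return f"[cartesia audio for: {text}]".encode()

def speak(tts: TTSProvider, text: str) -> bytes:
    # Agent logic depends only on the interface, never the provider.
    return tts.synthesize(text)

# Swapping providers is a one-line change at the call site:
print(speak(FakeElevenLabsTTS(), "hello"))
print(speak(FakeCartesiaTTS(), "hello"))
```

Because the Protocol is structural, any provider exposing the right method works without inheriting from a common base class.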

Creating Custom Plugins

Don’t see your provider? Build your own plugin to connect additional services. See the Create Your Own Plugin guide.
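The guide covers the actual base classes and registration details; as a rough illustration of the pattern, a custom plugin is typically a subclass that implements the type's core hook. Everything below (`BaseSTT`, `MyProviderSTT`) is hypothetical, not part of the real SDK:

```python
# Rough shape of a custom STT plugin: subclass a base class and
# implement the transcription hook. Names are illustrative only;
# the real base API is documented in the Create Your Own Plugin guide.
import abc

class BaseSTT(abc.ABC):
    @abc.abstractmethod
    def transcribe(self, audio: bytes) -> str:
        """Convert an audio chunk to text."""

class MyProviderSTT(BaseSTT):
    def __init__(self, api_key: str):
        self.api_key = api_key

    def transcribe(self, audio: bytes) -> str:
        # Here you would call your provider's transcription API.
        return f"<{len(audio)} bytes transcribed>"

stt = MyProviderSTT(api_key="sk-...")
print(stt.transcribe(b"\x00" * 320))  # → <320 bytes transcribed>
```

Implementing the shared interface is what lets a custom plugin drop into the same one-line-swap pattern as the built-in providers.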