Vision Agents requires a Stream account
for real-time transport. Most providers offer free tiers to get started.
Installation
Quick Start
Parameters
| Name | Type | Default | Description |
|---|---|---|---|
avatar_id | str | "default" | HeyGen avatar ID (from dashboard) |
quality | VideoQuality | HIGH | Quality (LOW, MEDIUM, HIGH) |
resolution | Tuple[int, int] | (1920, 1080) | Output resolution |
api_key | str | None | API key (defaults to HEYGEN_API_KEY env var) |
How It Works
The avatar works differently depending on your LLM type: With Streaming LLMs (Lower Latency)- LLM generates text → Text sent to HeyGen for lip-sync → HeyGen generates avatar video + audio
- Realtime LLM generates audio → Audio transcribed → Text sent to HeyGen for lip-sync → HeyGen generates video only (audio from LLM)
Next Steps
Build a Voice Agent
Get started with voice
Build a Video Agent
Add video processing

