Anam provides real-time interactive avatar video with automatic lip-sync. Add a video avatar to your agent that speaks with natural movements synchronized to your agent’s voice output.
Installation
Quick Start
Parameters
| Name | Type | Default | Description |
|---|---|---|---|
| avatar_id | str | None | Anam avatar ID (defaults to ANAM_AVATAR_ID env var) |
| api_key | str | None | API key (defaults to ANAM_API_KEY env var) |
| client_options | ClientOptions | None | Advanced Anam client configuration |
| connect_timeout | float | None | Seconds to wait for connection (None = wait indefinitely) |
| session_ready_timeout | float | None | Seconds to wait for session ready (None = wait indefinitely) |
| width | int | 1920 | Output video width in pixels |
| height | int | 1080 | Output video height in pixels |
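
The defaults in the table can be illustrated with a small sketch. `AvatarSettings` below is a hypothetical container written for this example, not a class from the plugin. It mirrors the documented names and defaults and shows one plausible way the `ANAM_AVATAR_ID` and `ANAM_API_KEY` fallbacks could be resolved. Raising an error when a credential is missing is an assumption of this sketch, not documented plugin behavior.

```python
import os
from dataclasses import dataclass
from typing import Optional


@dataclass
class AvatarSettings:
    """Hypothetical mirror of the parameter table; not the plugin's own class."""

    avatar_id: Optional[str] = None
    api_key: Optional[str] = None
    connect_timeout: Optional[float] = None        # None = wait indefinitely
    session_ready_timeout: Optional[float] = None  # None = wait indefinitely
    width: int = 1920
    height: int = 1080

    def resolved(self) -> "AvatarSettings":
        """Return a copy with environment-variable fallbacks applied."""
        avatar_id = self.avatar_id or os.environ.get("ANAM_AVATAR_ID")
        api_key = self.api_key or os.environ.get("ANAM_API_KEY")
        if not avatar_id or not api_key:
            # Assumption of this sketch: fail early when a credential is missing.
            raise ValueError(
                "Set ANAM_AVATAR_ID and ANAM_API_KEY, or pass both explicitly"
            )
        return AvatarSettings(
            avatar_id=avatar_id,
            api_key=api_key,
            connect_timeout=self.connect_timeout,
            session_ready_timeout=self.session_ready_timeout,
            width=self.width,
            height=self.height,
        )
```

Explicit arguments take precedence over the environment in this sketch, so a value passed in code overrides whatever is set in the shell.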
How It Works
- Agent TTS audio is resampled to 24 kHz mono and streamed to Anam
- Anam generates lip-synced avatar video and audio from the input
- Avatar video and audio frames are streamed back to call participants via Stream Edge
- When a user starts speaking, the avatar is automatically interrupted
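
The first step, converting TTS audio to 24 kHz mono, can be sketched in plain Python. This is an illustration only and not the plugin's resampler: the helper names `to_mono` and `resample_linear` are hypothetical, the input is assumed to be interleaved 16-bit PCM, and linear interpolation is used for brevity where a production pipeline would normally apply a properly filtered resampler.

```python
from array import array


def to_mono(samples: array, channels: int) -> array:
    """Average interleaved int16 channels into a single mono channel."""
    if channels == 1:
        return array("h", samples)
    frames = len(samples) // channels
    return array(
        "h",
        (
            sum(samples[i * channels:(i + 1) * channels]) // channels
            for i in range(frames)
        ),
    )


def resample_linear(samples: array, src_rate: int, dst_rate: int = 24_000) -> array:
    """Resample mono int16 PCM by linear interpolation (no anti-alias filter)."""
    if src_rate == dst_rate or len(samples) == 0:
        return array("h", samples)
    out_len = len(samples) * dst_rate // src_rate
    last = len(samples) - 1
    out = array("h")
    for n in range(out_len):
        pos = n * src_rate / dst_rate   # fractional position in the source signal
        i = int(pos)
        frac = pos - i
        a = samples[min(i, last)]
        b = samples[min(i + 1, last)]   # clamp at the final sample
        out.append(int(round(a + (b - a) * frac)))
    return out
```

For example, a 48 kHz stereo buffer would pass through `to_mono(buffer, 2)` and then `resample_linear(mono, 48_000)` before being streamed onward.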
Next Steps
- Build a Voice Agent: Get started with voice
- Build a Video Agent: Add video processing

