Vision Agents requires a Stream account
for real-time transport. Most providers offer free tiers to get started.
Installation
Quick start
Parameters
| Name | Type | Default | Description |
|---|---|---|---|
agent_id | str | None | LemonSlice agent ID (from dashboard) |
agent_image_url | str | None | Custom avatar image URL (368x560px) - alternative to agent_id |
agent_prompt | str | None | Prompt to influence avatar expressions and movements |
idle_timeout | int | None | Seconds before an idle session is closed |
api_key | str | None | API key (defaults to LEMONSLICE_API_KEY env var) |
livekit_url | str | None | LiveKit server URL (defaults to LIVEKIT_URL env var) |
livekit_api_key | str | None | LiveKit API key (defaults to LIVEKIT_API_KEY env var) |
livekit_api_secret | str | None | LiveKit API secret (defaults to LIVEKIT_API_SECRET env var) |
width | int | 1920 | Output video width in pixels |
height | int | 1080 | Output video height in pixels |
How it works
The avatar works differently depending on your LLM type: With standard LLMs- LLM generates text → TTS converts to audio → Audio sent to LemonSlice → LemonSlice generates synchronized avatar video and audio
- Realtime LLM generates audio → Audio sent to LemonSlice → LemonSlice generates video only (audio from LLM)

