Vision Agents requires a Stream account
for real-time transport. Most providers offer free tiers to get started.
Installation
Quick Start
Parameters
| Name | Type | Default | Description |
|---|---|---|---|
model | str | "gpt-4o-mini-tts" | TTS model |
voice | str | "alloy" | Voice (“alloy”, “echo”, “fable”, “onyx”, “nova”, “shimmer”) |
Next Steps
OpenAI LLM
Responses API and ChatCompletions
OpenAI Realtime
Speech-to-speech over WebRTC

