OpenAI provides industry-leading language models. The plugin supports the Responses API via openai.LLM (for GPT-5 and later) and any OpenAI-compatible API via openai.ChatCompletionsLLM. Both require separate STT and TTS components.
Vision Agents requires a Stream account for real-time transport. Most providers offer free tiers to get started.
OpenAI also provides Realtime speech-to-speech and text-to-speech.

Installation

uv add "vision-agents[openai]"

LLM (Responses API)

Uses the Responses API, the default for GPT-5 and later models.

from vision_agents.core import Agent, User
from vision_agents.plugins import openai, deepgram, getstream

agent = Agent(
    edge=getstream.Edge(),
    agent_user=User(name="Assistant", id="agent"),
    instructions="You are a helpful assistant.",
    llm=openai.LLM(model="gpt-5.4"),
    stt=deepgram.STT(),
    tts=openai.TTS(),
)
Name      Type  Default  Description
model     str            Model name (e.g., "gpt-5.4")
api_key   str   None     API key (defaults to the OPENAI_API_KEY env var)
base_url  str   None     Custom API endpoint

ChatCompletionsLLM

Works with any OpenAI-compatible API (Together AI, Fireworks, DeepSeek, etc.).

from vision_agents.plugins import openai

llm = openai.ChatCompletionsLLM(
    model="deepseek-chat",
    base_url="https://api.deepseek.com",
    api_key="your_api_key"
)

Function Calling

@agent.llm.register_function(description="Get weather for a location")
async def get_weather(location: str) -> dict:
    # Replace with a real weather lookup for `location`
    return {"temperature": "72°F", "condition": "Sunny"}

See the Function Calling guide for details.
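register_function needs enough signature information to advertise the tool to the model. As a rough illustration of what such a decorator can derive from type hints (not the plugin's actual implementation; `build_tool_schema` is a hypothetical helper), here is a tool schema in the OpenAI function-calling format:

```python
from typing import get_type_hints

# Maps Python annotations to JSON Schema types
_JSON_TYPES = {str: "string", int: "integer", float: "number", bool: "boolean", dict: "object"}

def build_tool_schema(func, description: str) -> dict:
    """Derive an OpenAI-style tool schema from a function's type hints."""
    hints = get_type_hints(func)
    hints.pop("return", None)  # the return annotation is not a parameter
    properties = {name: {"type": _JSON_TYPES.get(hint, "string")} for name, hint in hints.items()}
    return {
        "type": "function",
        "name": func.__name__,
        "description": description,
        "parameters": {
            "type": "object",
            "properties": properties,
            "required": list(properties),
        },
    }

async def get_weather(location: str) -> dict:
    return {"temperature": "72°F", "condition": "Sunny"}

schema = build_tool_schema(get_weather, "Get weather for a location")
print(schema["name"])  # get_weather
```

The schema is what the LLM sees when deciding whether and how to call your function, which is why accurate type hints and a clear description matter.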

Events

The OpenAI plugin emits a low-level event carrying raw stream data. Most applications should subscribe to the core events (LLMResponseCompletedEvent, RealtimeUserSpeechTranscriptionEvent, etc.) instead.

from vision_agents.plugins.openai.events import OpenAIStreamEvent

@agent.events.subscribe
async def on_openai_stream(event: OpenAIStreamEvent):
    # Access raw OpenAI stream data
    print(f"Raw event: {event.event_type}, {event.event_data}")
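The subscribe decorator above routes events by the handler's parameter annotation. A minimal, self-contained sketch of that dispatch pattern (illustrative only, not the Vision Agents implementation) looks like this:

```python
import asyncio
import inspect
from collections import defaultdict
from dataclasses import dataclass

@dataclass
class OpenAIStreamEvent:
    event_type: str
    event_data: dict

class EventBus:
    """Sketch of annotation-based subscription, similar in spirit to agent.events."""

    def __init__(self):
        self._handlers = defaultdict(list)

    def subscribe(self, handler):
        # The event type is read from the handler's first parameter annotation
        params = list(inspect.signature(handler).parameters.values())
        self._handlers[params[0].annotation].append(handler)
        return handler

    async def emit(self, event):
        for handler in self._handlers[type(event)]:
            await handler(event)

bus = EventBus()
received = []

@bus.subscribe
async def on_openai_stream(event: OpenAIStreamEvent):
    received.append(event.event_type)

asyncio.run(bus.emit(OpenAIStreamEvent("response.delta", {"text": "hi"})))
print(received)  # ['response.delta']
```

Because dispatch keys on the annotated type, one bus can carry both low-level plugin events and core events without handlers interfering with each other.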

Next Steps

OpenAI Realtime: speech-to-speech over WebRTC
OpenAI TTS: text-to-speech synthesis