Vision Agents requires a Stream account for real-time transport. Most providers offer free tiers to get started.
Installation
Quick Start
Built-in Tools
Gemini provides built-in tools you can enable:

| Tool | Description |
|---|---|
| GoogleSearch | Ground responses with web data |
| CodeExecution | Run Python code |
| FileSearch | RAG over your documents |
| URLContext | Read specific web pages |
File Search (RAG)
Managed RAG with automatic chunking and retrieval.
Function Calling
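Function calling generally means exposing ordinary functions to the model together with a description and a parameter schema, then executing whichever call the model requests. The sketch below illustrates that flow in plain Python. The `register_function` decorator, the `REGISTRY` dict, and the `dispatch` helper are hypothetical stand-ins written for this example, not the plugin's actual API, and the JSON call format is an assumption.

```python
import inspect
import json
from typing import Any, Callable, Dict

# Hypothetical in-memory registry: function name -> metadata and callable.
REGISTRY: Dict[str, Dict[str, Any]] = {}


def register_function(description: str) -> Callable:
    """Hypothetical decorator that records a function and a simple schema."""
    def decorator(fn: Callable) -> Callable:
        # Derive a minimal parameter schema from the type annotations.
        params = {
            name: (p.annotation.__name__
                   if p.annotation is not inspect.Parameter.empty else "any")
            for name, p in inspect.signature(fn).parameters.items()
        }
        REGISTRY[fn.__name__] = {
            "fn": fn,
            "description": description,
            "parameters": params,
        }
        return fn
    return decorator


@register_function(description="Get the current weather for a city")
def get_weather(city: str) -> str:
    # Placeholder result; a real tool would query a weather service.
    return f"Sunny in {city}"


def dispatch(call_json: str) -> Any:
    """Run the function named in a model-issued call (assumed JSON shape)."""
    call = json.loads(call_json)
    entry = REGISTRY[call["name"]]
    return entry["fn"](**call["args"])


# Simulate the model asking for a tool call.
result = dispatch('{"name": "get_weather", "args": {"city": "Boulder"}}')
```

In a real agent, the description and parameter schema are what the model sees when deciding whether to call a function, so clear descriptions and type annotations matter more than the function body.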
Events
The Gemini plugin emits events for connection state and responses. Most developers should use the core events (LLMResponseCompletedEvent, etc.) for provider-agnostic code.

| Event | Description |
|---|---|
| GeminiConnectedEvent | Realtime connection established |
| GeminiErrorEvent | Error occurred |
| GeminiAudioEvent | Audio output received |
| GeminiTextEvent | Text output received |
| GeminiResponseEvent | Response chunk received |
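To illustrate the difference between handling a core event and a Gemini-specific one, here is a minimal, self-contained sketch of type-based event dispatch. The `EventBus` class and the fields on the event dataclasses are assumptions made for this example. The real event classes ship with Vision Agents and may carry different fields and use a different subscription mechanism.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, DefaultDict, List


# Stand-in event types for illustration only; the fields are assumed.
@dataclass
class LLMResponseCompletedEvent:
    text: str


@dataclass
class GeminiErrorEvent:
    error: str


class EventBus:
    """Hypothetical minimal dispatcher keyed by event type."""

    def __init__(self) -> None:
        self._handlers: DefaultDict[type, List[Callable]] = defaultdict(list)

    def subscribe(self, event_type: type) -> Callable:
        def decorator(fn: Callable) -> Callable:
            self._handlers[event_type].append(fn)
            return fn
        return decorator

    def emit(self, event: object) -> None:
        # Call every handler registered for this exact event type.
        for fn in self._handlers[type(event)]:
            fn(event)


bus = EventBus()
log: List[str] = []


@bus.subscribe(LLMResponseCompletedEvent)
def on_response(event: LLMResponseCompletedEvent) -> None:
    # Provider-agnostic: keeps working if the LLM provider changes.
    log.append(f"response: {event.text}")


@bus.subscribe(GeminiErrorEvent)
def on_error(event: GeminiErrorEvent) -> None:
    # Provider-specific: only meaningful when the Gemini plugin is in use.
    log.append(f"gemini error: {event.error}")


bus.emit(LLMResponseCompletedEvent(text="hello"))
bus.emit(GeminiErrorEvent(error="connection dropped"))
```

Keeping most handlers on core events limits the code that has to change when swapping providers; Gemini-specific events are best reserved for cases such as connection diagnostics.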
Next Steps
Gemini Realtime: Speech-to-speech with optional video
Build a Voice Agent: Get started with voice

