> ## Documentation Index
> Fetch the complete documentation index at: https://visionagents.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Overview

The SDK is built around a modular architecture where each component can be independently configured and extended. At its heart is the `Agent` class, which orchestrates all other components to create intelligent, interactive experiences. The Agent class is transport agnostic, meaning developers can use Stream's low-latency Video transport by default or bring custom transports.

<img src="https://mintcdn.com/stream-52f5fdce/Y-GlyMOvSEtBGg9k/_assets/agent_overview.jpg?fit=max&auto=format&n=Y-GlyMOvSEtBGg9k&q=85&s=45e74048b0a4b286c921a1072ca70e4d" alt="Framework Overview" width="3840" height="2160" data-path="_assets/agent_overview.jpg" />

### Design Philosophy

* **Modularity**: Each component (LLM, STT, TTS, VAD, Processors) can be independently configured
* **Provider Agnostic**: Support for multiple AI providers through standardised interfaces
* **Event-Driven**: Built on an event system for loose coupling and extensibility
* **Low Latency**: Optimized for real-time interactions with sub-50ms latency
* **Type Safety**: Rich typing throughout for better developer experience