Core Architecture
The SDK is built around a modular architecture where each component can be independently configured and extended. At its heart is theAgent
class, which orchestrates all other components to create intelligent, interactive experiences. The Agent class is transport agnostic, meaning developers can use Stream’s low-latency Video transport by default or bring custom transports.

Design Philosophy
- Modularity: Each component (LLM, STT, TTS, VAD, Processors) can be independently configured
- Provider Agnostic: Support for multiple AI providers through standardised interfaces
- Event-Driven: Built on an event system for loose coupling and extensibility
- Low Latency: Optimized for real-time interactions with sub-50ms latency
- Type Safety: Rich typing throughout for better developer experience