Vision Agents requires a Stream account for real-time transport. Get your Sarvam API key from the Sarvam dashboard.
Installation
Quick start
Parameters
| Name | Type | Default | Description |
|---|---|---|---|
| `model` | `str` | `"saaras:v3"` | Streaming model (`saaras:v3`, `saarika:v2.5`, `saaras:v2.5`) |
| `language` | `str` | `None` | Language code (e.g. `hi-IN`, `en-IN`). `None` for auto-detect |
| `mode` | `str` | `None` | `transcribe`, `translate`, `verbatim`, `translit`, or `codemix` |
| `sample_rate` | `int` | `16000` | Input sample rate in Hz (8000 or 16000) |
| `high_vad_sensitivity` | `bool` | `False` | Increase VAD sensitivity for noisy input |
| `vad_signals` | `bool` | `True` | Emit speech start/end events for turn detection |
| `prompt` | `str` | `None` | Optional biasing prompt sent after connect |
| `api_key` | `str` | `None` | API key (defaults to the `SARVAM_API_KEY` env var) |
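To catch bad values before opening a connection, the constraints in the table can be checked up front. The sketch below is only an illustration: `SarvamSTTConfig` is a hypothetical dataclass, not part of Vision Agents or the Sarvam SDK, and it assumes the allowed values are exactly those listed above. How the plugin itself validates its arguments may differ.

```python
import os
from dataclasses import dataclass
from typing import Optional

# Allowed values copied from the parameter table above.
ALLOWED_MODELS = {"saaras:v3", "saarika:v2.5", "saaras:v2.5"}
ALLOWED_MODES = {"transcribe", "translate", "verbatim", "translit", "codemix"}
ALLOWED_SAMPLE_RATES = {8000, 16000}


@dataclass
class SarvamSTTConfig:
    """Hypothetical settings holder mirroring the documented defaults."""

    model: str = "saaras:v3"
    language: Optional[str] = None   # None means auto-detect
    mode: Optional[str] = None
    sample_rate: int = 16000
    high_vad_sensitivity: bool = False
    vad_signals: bool = True
    prompt: Optional[str] = None
    api_key: Optional[str] = None

    def __post_init__(self) -> None:
        if self.model not in ALLOWED_MODELS:
            raise ValueError(f"unsupported model: {self.model!r}")
        if self.mode is not None and self.mode not in ALLOWED_MODES:
            raise ValueError(f"unsupported mode: {self.mode!r}")
        if self.sample_rate not in ALLOWED_SAMPLE_RATES:
            raise ValueError("sample_rate must be 8000 or 16000")
        # Fall back to the environment variable named in the table.
        if self.api_key is None:
            self.api_key = os.environ.get("SARVAM_API_KEY")
        if not self.api_key:
            raise ValueError("set api_key or the SARVAM_API_KEY env var")


# Example: Hindi transcription with an explicit (placeholder) key.
config = SarvamSTTConfig(language="hi-IN", mode="transcribe", api_key="test-key")
```

The validated fields could then be passed as keyword arguments to whatever STT constructor your installed plugin version exposes, assuming its argument names match the table.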
Next steps
Sarvam TTS: Streaming text-to-speech for Indian languages
Sarvam LLM: Chat completions with Sarvam models

