> ## Documentation Index
> Fetch the complete documentation index at: https://visionagents.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Mistral Voxtral

[Mistral Voxtral](https://docs.mistral.ai/capabilities/audio_transcription#realtime-transcription) provides real-time speech-to-text via WebSocket streaming with automatic language detection and low-latency transcription.

<Info>
  Vision Agents requires a [Stream](https://getstream.io/try-for-free/) account
  for real-time transport. Most providers offer free tiers to get started.
</Info>

## Installation

```sh theme={null}
uv add "vision-agents[mistral]"
```

## Quick start

```python theme={null}
from vision_agents.core import Agent, User
from vision_agents.plugins import mistral, gemini, deepgram, getstream

agent = Agent(
    edge=getstream.Edge(),
    agent_user=User(name="Assistant", id="agent"),
    instructions="You are a helpful assistant.",
    llm=gemini.LLM("gemini-3-flash-preview"),
    stt=mistral.STT(),
    tts=deepgram.TTS(),
)
```

<Warning>
  Set `MISTRAL_API_KEY` in your environment or pass `api_key` directly.
</Warning>

## Parameters

| Name          | Type  | Default                                   | Description                                                |
| ------------- | ----- | ----------------------------------------- | ---------------------------------------------------------- |
| `api_key`     | `str` | `None`                                    | API key (defaults to `MISTRAL_API_KEY` env var)            |
| `model`       | `str` | `"voxtral-mini-transcribe-realtime-2602"` | Model identifier                                           |
| `sample_rate` | `int` | `16000`                                   | Audio sample rate in Hz (8000, 16000, 22050, 44100, 48000) |

## Turn detection

Mistral Voxtral STT does not include built-in turn detection. Pair it with an external turn detection plugin like [Smart Turn](/integrations/turn-detection/smart-turn) or [Vogent](/integrations/turn-detection/vogent).

```python theme={null}
from vision_agents.plugins import mistral, smart_turn

agent = Agent(
    stt=mistral.STT(),
    turn_detection=smart_turn.TurnDetection(),
    # ... other config
)
```

## Next steps

<CardGroup cols={2}>
  <Card title="Build a Voice Agent" icon="microphone" href="/introduction/voice-agents">
    Get started with voice
  </Card>

  <Card title="Build a Video Agent" icon="video" href="/introduction/video-agents">
    Add video processing
  </Card>
</CardGroup>
