> ## Documentation Index
> Fetch the complete documentation index at: https://visionagents.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Fish Audio STT

[Fish Audio](https://fish.audio) provides speech-to-text with automatic language detection. Buffers audio per participant (minimum 1 second) before sending to the API for accurate transcription.

<Info>
  Vision Agents requires a [Stream](https://getstream.io/try-for-free/) account
  for real-time transport. Most providers offer free tiers to get started.
</Info>

<Tip>
  Fish Audio also provides high-quality [text-to-speech](/integrations/tts/fish) with prosody control and voice cloning. You can use both in the same agent.
</Tip>

## Installation

```sh theme={null}
uv add "vision-agents[fish]"
```

## Quick Start

```python theme={null}
from vision_agents.core import Agent, User
from vision_agents.plugins import fish, gemini, getstream

agent = Agent(
    edge=getstream.Edge(),
    agent_user=User(name="Assistant", id="agent"),
    instructions="You are a helpful assistant.",
    llm=gemini.LLM("gemini-3-flash-preview"),
    stt=fish.STT(),
    tts=fish.TTS(),
)
```

<Warning>
  Set `FISH_API_KEY` in your environment or pass `api_key` directly.
</Warning>

## Parameters

```python theme={null}
stt = fish.STT(language="en")  # Or None for auto-detection
```

| Name       | Type  | Default | Description                                                    |
| ---------- | ----- | ------- | -------------------------------------------------------------- |
| `language` | `str` | `None`  | Language code (`"en"`, `"zh"`, etc.) or `None` for auto-detect |
| `api_key`  | `str` | `None`  | API key (defaults to `FISH_API_KEY` env var)                   |

## Next Steps

<CardGroup cols={2}>
  <Card title="Fish Audio TTS" icon="volume-high" href="/integrations/tts/fish">
    Text-to-speech with prosody control
  </Card>

  <Card title="Build a Voice Agent" icon="microphone" href="/introduction/voice-agents">
    Get started with voice
  </Card>
</CardGroup>
