# NVIDIA

[NVIDIA](https://www.nvidia.com/) provides vision-language models (VLMs) through its NIM platform. The plugin enables real-time video understanding with models such as Cosmos Reason2, and handles frame buffering and NVCF asset management automatically.

<Info>
  Vision Agents requires a [Stream](https://getstream.io/try-for-free/) account
  for real-time transport. Most providers offer free tiers to get started.
</Info>

## Installation

```sh
uv add "vision-agents[nvidia]"
```

## Quick Start

```python
from vision_agents.core import Agent, User
from vision_agents.plugins import nvidia, getstream, deepgram, elevenlabs

agent = Agent(
    edge=getstream.Edge(),
    agent_user=User(name="Assistant", id="agent"),
    instructions="Analyze the video and answer questions.",
    llm=nvidia.VLM(
        model="nvidia/cosmos-reason2-8b",
        fps=1,  # buffer one video frame per second
        frame_buffer_seconds=10,  # buffer 10 seconds of video
    ),
    stt=deepgram.STT(),
    tts=elevenlabs.TTS(),
)
```

<Warning>
  Set `NVIDIA_API_KEY` in your environment or pass `api_key` directly to `nvidia.VLM`.
</Warning>
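
The key lookup described above can be pictured roughly as follows. This is a minimal sketch, not the plugin's code: `resolve_api_key` is a hypothetical helper, and the assumption that an explicitly passed key takes precedence over the environment variable is ours.

```python
import os
from typing import Optional


def resolve_api_key(api_key: Optional[str] = None) -> str:
    """Hypothetical helper: use the explicit key if given, else NVIDIA_API_KEY."""
    # Assumption: an explicitly passed key wins over the environment variable.
    key = api_key or os.environ.get("NVIDIA_API_KEY")
    if not key:
        raise ValueError("Set NVIDIA_API_KEY or pass api_key explicitly.")
    return key
```

Failing early with a clear error, as this sketch does, tends to be easier to debug than an authentication failure on the first model request.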

## Parameters

| Name                   | Type  | Default                      | Description                                    |
| ---------------------- | ----- | ---------------------------- | ---------------------------------------------- |
| `model`                | `str` | `"nvidia/cosmos-reason2-8b"` | NVIDIA model ID                                |
| `fps`                  | `int` | `1`                          | Video frames per second to buffer              |
| `frame_buffer_seconds` | `int` | `10`                         | Seconds of video to buffer                     |
| `frame_width`          | `int` | `800`                        | Frame width in pixels                          |
| `frame_height`         | `int` | `600`                        | Frame height in pixels                         |
| `api_key`              | `str` | `None`                       | API key (defaults to `NVIDIA_API_KEY` env var) |
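
To get a feel for how `fps` and `frame_buffer_seconds` interact, the sketch below models a rolling buffer whose capacity is `fps * frame_buffer_seconds`, which is 10 frames with the defaults. `FrameBuffer` is a hypothetical class written for illustration only; the plugin's actual buffering logic is not shown in this page and may differ.

```python
from collections import deque


class FrameBuffer:
    """Hypothetical rolling frame buffer; not the plugin's implementation."""

    def __init__(self, fps: int = 1, frame_buffer_seconds: int = 10):
        # Capacity is the number of frames sampled over the buffer window.
        self.capacity = fps * frame_buffer_seconds
        self._frames = deque(maxlen=self.capacity)

    def add(self, frame):
        # Once the deque is full, appending drops the oldest frame.
        self._frames.append(frame)

    def frames(self):
        # Return buffered frames, oldest first.
        return list(self._frames)


buffer = FrameBuffer(fps=1, frame_buffer_seconds=10)
for i in range(15):  # simulate 15 seconds of video at 1 fps
    buffer.add(f"frame-{i}")

print(len(buffer.frames()))  # 10
print(buffer.frames()[0])    # frame-5
```

Under this model, raising `fps` or `frame_buffer_seconds` grows the number of frames held per request proportionally, so both values are worth tuning together.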

## Next Steps

<CardGroup cols={2}>
  <Card title="Build a Voice Agent" icon="microphone" href="/introduction/voice-agents">
    Get started with voice
  </Card>

  <Card title="Build a Video Agent" icon="video" href="/introduction/video-agents">
    Add video processing
  </Card>
</CardGroup>
