- Video understanding: Automatically buffers and forwards video frames to NVIDIA VLM models
- Streaming responses: Real-time text responses with chunk events
- Asset management: Automatic upload and cleanup of frame assets via NVCF
Installation
Install the NVIDIA plugin with:Configuration
Set your NVIDIA API key:Usage
Parameters
| Name | Type | Default | Description |
|---|---|---|---|
model | str | "nvidia/cosmos-reason2-8b" | NVIDIA model ID to use. |
api_key | Optional[str] | None | NVIDIA API token. If not provided, reads from NVIDIA_API_KEY environment variable. |
fps | int | 1 | Number of video frames per second to buffer. |
frame_buffer_seconds | int | 10 | Number of seconds of video to buffer for the model’s input. |
frame_width | int | 800 | Width of video frames to send. |
frame_height | int | 600 | Height of video frames to send. |
max_tokens | int | 1024 | Maximum response tokens. |
temperature | float | 0.2 | Temperature for sampling. |
top_p | float | 0.7 | Top-p sampling parameter. |
frames_per_second | int | 8 | Frames per second sent to video models. |

