TurboPuffer is a high-performance vector database with native hybrid search (vector + BM25). The plugin provides RAG with precise control over chunking, embeddings, and search strategies.Documentation Index
Fetch the complete documentation index at: https://visionagents.ai/llms.txt
Use this file to discover all available pages before exploring further.
Vision Agents requires a Stream account
for real-time transport. Most providers offer free tiers to get started.
Installation
Quick Start
Parameters
| Name | Type | Default | Description |
|---|---|---|---|
namespace | str | Required | TurboPuffer namespace |
embedding_model | str | "models/gemini-embedding-001" | Embedding model |
chunk_size | int | 10000 | Text chunk size |
chunk_overlap | int | 200 | Overlap between chunks |
Search Modes
How Hybrid Search Works
Hybrid search combines vector and BM25 using Reciprocal Rank Fusion (RRF):- Vector search catches semantic meaning even when exact words differ
- BM25 catches exact matches (product names, SKUs, technical terms)
- RRF balances both without requiring tuning
With Function Calling
Cache Warming
For low-latency queries, TurboPuffer supports cache warming (called automatically afteradd_directory()):
Next Steps
Build a Voice Agent
Get started with voice
Build a Video Agent
Add video processing

