Vision Agents requires a Stream account for real-time transport.
Options
| Option | Best For | Complexity |
|---|---|---|
| Gemini File Search | Quick setup, automatic chunking/embedding | Simple |
| TurboPuffer | Full control, hybrid search, production | More setup |
Gemini File Search
Gemini’s File Search handles chunking, embedding, and retrieval automatically.
- Store reuse (finds existing stores by name)
- Content deduplication via SHA-256 hash
- Concurrent batch uploads
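As an illustration of the content deduplication idea, here is a minimal sketch that hashes each document's bytes with SHA-256 and skips anything already seen. The helper names `content_hash` and `select_new_documents` are hypothetical and are not part of the Vision Agents or Gemini APIs; the actual integration may track hashes differently.

```python
import hashlib


def content_hash(data: bytes) -> str:
    """Return the hex SHA-256 digest of a document's raw bytes."""
    return hashlib.sha256(data).hexdigest()


def select_new_documents(documents: dict[str, bytes], seen_hashes: set[str]) -> list[str]:
    """Return the names of documents whose content has not been seen before.

    Hypothetical helper: `seen_hashes` is updated in place, so a later call
    with the same set skips everything uploaded earlier.
    """
    to_upload = []
    for name, data in documents.items():
        digest = content_hash(data)
        if digest in seen_hashes:
            continue  # identical content already handled, even under another name
        seen_hashes.add(digest)
        to_upload.append(name)
    return to_upload
```

Hashing the content rather than the filename means a renamed copy of the same file is still detected as a duplicate.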
TurboPuffer
TurboPuffer provides hybrid search combining vector (semantic) and BM25 (keyword) search with Reciprocal Rank Fusion.
RAG Pipeline Overview
For custom implementations, a typical RAG pipeline involves:
- Document gathering: URLs, folders, PDFs, external APIs
- Parsing: Convert to text (markdownify, BeautifulSoup, OCR)
- Chunking: Split into retrievable pieces (fixed size, semantic, recursive)
- Embedding: Convert text to vectors (MTEB leaderboard)
- Vector storage: Store embeddings for similarity search
- Hybrid search: Combine vector + full-text search (TurboPuffer guide)
- Reranking: Score and filter results before passing to LLM
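Two of the steps above, fixed-size chunking and hybrid-search fusion, can be sketched in plain Python. This is a rough illustration under stated assumptions, not how TurboPuffer or Vision Agents implement them: `chunk_text` and `reciprocal_rank_fusion` are hypothetical helpers, chunk sizes are counted in characters, and `k=60` is the constant commonly used with Reciprocal Rank Fusion rather than a value taken from this page.

```python
from collections import defaultdict


def chunk_text(text: str, size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character chunks that overlap by `overlap`."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")
    step = size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + size])
        if start + size >= len(text):
            break  # the last chunk already reaches the end of the text
        start += step
    return chunks


def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse several ranked lists of document IDs into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. Ties are broken by document ID.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Example: fuse a vector-search ranking with a BM25 ranking.
vector_hits = ["a", "b", "c"]
bm25_hits = ["b", "c", "d"]
fused = reciprocal_rank_fusion([vector_hits, bm25_hits])
```

Documents that appear in both lists accumulate two score terms, so they tend to outrank documents found by only one search method. The overlap between chunks reduces the chance that a sentence relevant to a query is cut in half at a chunk boundary.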
Comparison
| Feature | Gemini File Search | TurboPuffer |
|---|---|---|
| Setup | Simple | More setup |
| Chunking | Automatic | Configurable |
| Search | Managed | Hybrid (vector + BM25) |
| Control | Less | Full control |
| Cost | Included with Gemini | Separate service |
| Best for | Prototypes | Production with custom needs |

