View Sales Assistant Example on GitHub
Check out the complete Sales Assistant example in our GitHub repository
Vision Agents requires a Stream account
for real-time transport. Most providers offer free tiers to get started.
What You Will Build
- Silently listen to meeting audio (microphone + system audio) in real time
- Transcribe conversations with speaker diarization using AssemblyAI
- Generate contextual coaching suggestions with Gemini
- Display suggestions on a translucent macOS overlay via Stream Chat
- Run the agent as an HTTP server that the companion app connects to
Next Steps
HTTP Server Guide
Learn how the agent server handles session management
AssemblyAI Integration
Explore AssemblyAI’s STT with speaker diarization

