Skip to main content

View Sales Assistant Example on GitHub

Check out the complete Sales Assistant example in our GitHub repository
In this example, we build a real-time AI meeting copilot that silently listens to your microphone and system audio during sales calls, interviews, or meetings. It transcribes the conversation with speaker diarization using AssemblyAI, analyzes the dialogue with Gemini, and surfaces coaching suggestions on a translucent macOS overlay — invisible to other participants. The agent can be extended with RAG and custom knowledge bases to tailor suggestions to your product, company playbook, or deal context. The project has two components: a Python agent backend built with Vision Agents, and a companion macOS overlay app built with Flutter that captures audio and displays suggestions.
Vision Agents requires a Stream account for real-time transport. Most providers offer free tiers to get started.

What You Will Build

  • Silently listen to meeting audio (microphone + system audio) in real time
  • Transcribe conversations with speaker diarization using AssemblyAI
  • Generate contextual coaching suggestions with Gemini
  • Display suggestions on a translucent macOS overlay via Stream Chat
  • Run the agent as an HTTP server that the companion app connects to

Next Steps

HTTP Server Guide

Learn how the agent server handles session management

AssemblyAI Integration

Explore AssemblyAI’s STT with speaker diarization