Smart Turn uses neural models (Silero VAD + Whisper features) to detect when a speaker has completed their turn. Provides natural conversation flow without relying solely on silence detection.
Vision Agents requires a Stream account
for real-time transport. Most providers offer free tiers to get started.
from vision_agents.core.turn_detection import TurnStarted, TurnEndedfrom vision_agents.plugins import smart_turnturn_detection = smart_turn.TurnDetection()async for signal in turn_detection.output: if isinstance(signal, TurnStarted): print(f"User started speaking: {signal.participant.user_id}") elif isinstance(signal, TurnEnded): print(f"User finished speaking: confidence={signal.confidence}")