Vogent uses neural models to predict when a speaker has completed their conversational turn. Provides intelligent turn-taking for natural conversation flow.
Vision Agents requires a Stream account
for real-time transport. Most providers offer free tiers to get started.
from vision_agents.core.turn_detection import TurnStarted, TurnEndedfrom vision_agents.plugins import vogentturn_detection = vogent.TurnDetection()async for signal in turn_detection.output: if isinstance(signal, TurnStarted): print(f"User started speaking: {signal.participant.user_id}") elif isinstance(signal, TurnEnded): print(f"User finished speaking: confidence={signal.confidence}")