Skip to main content

View Security Camera Example on GitHub

Check out the complete Security Camera example in our GitHub repository
In this example, we build a real-time security monitoring system using face_recognition for identifying visitors, YOLOv11 for detecting packages, and Gemini for conversational AI — all running on Stream’s edge network. The agent tracks visitors, monitors packages, generates “wanted posters” when packages disappear, and answers natural language questions like “How many people visited?” or “What happened while I was away?”
Vision Agents requires a Stream account for real-time transport. Most providers offer free tiers to get started.

What You Will Build

  • Detect and recognize faces in real time with face_recognition, including named face registration
  • Monitor packages using a custom YOLOv11 detection model
  • Detect package theft and automatically generate wanted posters
  • Post alerts to X (Twitter) with suspect photos
  • Ask the AI questions about security activity via voice (“Who visited?”, “Any packages delivered?”)
  • Run on Stream’s low-latency edge network

Next Steps

Ultralytics Integration

Explore YOLO model options for object detection

Docker Deployment

Docker setup and environment configuration