VigiLens is a comprehensive, AI-powered surveillance system designed to detect suspicious shoplifting behaviors for both real-time monitoring and post-event security investigations. The system leverages state-of-the-art pose estimation to analyze video feeds, flag anomalies, and present findings in a centralized, user-friendly web interface.
This project was developed as a Capstone for the Bachelor of Science in Computer Science program at Adamson University.
VigiLens combines cutting-edge AI technology with a modern web interface to provide intelligent surveillance capabilities. The system can simultaneously monitor multiple live camera feeds and analyze uploaded video footage, making it suitable for both real-time security monitoring and forensic analysis.
- Live Camera Feeds: Connect to and process multiple live camera feeds (webcams, IP cameras, RTSP streams) simultaneously
- Real-Time Pose Detection: Continuous human pose estimation using YOLOv11 for immediate anomaly detection
- Live Streaming Dashboard: View annotated live feeds directly in the web browser via MJPEG streams
- Multi-Camera Support: Monitor multiple cameras with dedicated AI workers for each feed
- Video Upload & Processing: Upload recorded surveillance footage for automated background analysis
- Asynchronous Processing: Non-blocking video analysis with real-time progress tracking
- Incident Logging: Automatic detection and logging of suspicious activities with timestamps
- Advanced Pose Estimation: Utilizes YOLOv11 model for accurate human pose detection and tracking
- Anomaly Detection Model: Custom transformer-based model for identifying suspicious behaviors
- Continuous Learning: System designed to improve detection accuracy over time
- Centralized Dashboard: Clean, modern React-based interface for all system operations
- Real-Time Statistics: Live overview of total incidents, active cameras, and system status
- Incident Management: Detailed incident logs with video evidence and thumbnails
- Camera Management: Dynamic addition/removal of camera feeds without system restart
- Video Evidence: View original and annotated video clips for each detected incident
- Scalable Multi-Process Design: Independent services ensure responsive UI during intensive processing
- Production-Ready: Uses Waitress WSGI server for robust performance
- Database Integration: SQLite database for reliable incident storage and retrieval
- Modular Structure: Clean separation of concerns with dedicated modules for different functionalities
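The incident store is a plain SQLite file, so it can be inspected with any SQLite client. As a minimal sketch of what incident logging looks like at the database level (the actual schema is defined by the backend's SQLAlchemy models, so the column names here are illustrative):

```python
import sqlite3

# Hypothetical incident table for illustration; the real schema lives in
# the backend's SQLAlchemy models.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE incidents (
        id INTEGER PRIMARY KEY AUTOINCREMENT,
        camera_id TEXT NOT NULL,
        detected_at TEXT NOT NULL,
        clip_path TEXT,
        thumbnail_path TEXT
    )
    """
)
conn.execute(
    "INSERT INTO incidents (camera_id, detected_at, clip_path) VALUES (?, ?, ?)",
    ("ENTRANCE-CAM", "2025-01-15T10:30:00", "processed_data/clip_001.mp4"),
)
conn.commit()
rows = conn.execute("SELECT camera_id, clip_path FROM incidents").fetchall()
print(rows)  # [('ENTRANCE-CAM', 'processed_data/clip_001.mp4')]
```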
VigiLens uses a sophisticated multi-process architecture designed for scalability and reliability:
Main Web Server (`app.py`)
- Technology: Flask with Waitress WSGI server
- Purpose: Serves the React frontend and provides REST API endpoints
- Features:
- Incident management and retrieval
- Video file serving for playback
- Database operations
- Static file serving for the compiled React app
- Port: 5000
Live Stream Server (`stream_server.py`)
- Technology: Lightweight Flask application
- Purpose: Handles real-time video streaming
- Features:
- Receives annotated frames from AI workers
- Serves live MJPEG streams to web browsers
- Manages frame buffers for multiple cameras
- Port: 8080
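MJPEG streaming works by pushing a sequence of JPEG images over a single `multipart/x-mixed-replace` HTTP response, which is why it plays in a plain `<img>` tag with no JavaScript. A minimal sketch of the framing, with stub bytes standing in for `cv2.imencode(".jpg", frame)` output:

```python
# Sketch of MJPEG framing over HTTP. Boundary name and headers follow the
# common multipart/x-mixed-replace convention; the stub bytes stand in for
# real JPEG data.
BOUNDARY = b"frame"

def mjpeg_part(jpeg_bytes: bytes) -> bytes:
    """Wrap one JPEG image in a multipart part."""
    return (
        b"--" + BOUNDARY + b"\r\n"
        b"Content-Type: image/jpeg\r\n"
        b"Content-Length: " + str(len(jpeg_bytes)).encode() + b"\r\n\r\n"
        + jpeg_bytes + b"\r\n"
    )

def mjpeg_stream(frames):
    """Generator suitable for a Flask Response with
    mimetype='multipart/x-mixed-replace; boundary=frame'."""
    for jpeg in frames:
        yield mjpeg_part(jpeg)

part = mjpeg_part(b"\xff\xd8fakejpeg\xff\xd9")
print(part.startswith(b"--frame\r\nContent-Type: image/jpeg"))  # True
```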
AI Worker Processes (`worker.py`)
- Technology: OpenCV + Ultralytics YOLOv11
- Purpose: Dedicated per-camera AI processing
- Features:
- Continuous video feed processing
- Real-time pose estimation
- Anomaly detection
- Frame annotation and streaming
- Incident clip generation
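At its core, the per-camera worker is a read–infer–publish loop. The skeleton below shows that control flow with the heavy pieces (capture, pose model, anomaly detector, stream publisher) injected as callables so it runs without OpenCV or YOLO installed; all function names are illustrative, not the actual `worker.py` API:

```python
# Illustrative skeleton of a per-camera worker loop. The real worker.py
# wires in cv2.VideoCapture and the Ultralytics YOLOv11 model instead of
# the injected stubs used here.
def run_worker(read_frame, estimate_pose, detect_anomaly, publish, max_frames=None):
    processed = 0
    incidents = 0
    while max_frames is None or processed < max_frames:
        frame = read_frame()
        if frame is None:          # feed ended or dropped
            break
        poses = estimate_pose(frame)
        if detect_anomaly(poses):
            incidents += 1         # real code would trigger clip saving here
        publish(frame, poses)      # push annotated frame to the stream server
        processed += 1
    return processed, incidents

# Stub run: three frames, anomaly flagged on the last one.
frames = iter(["f1", "f2", "f3", None])
stats = run_worker(
    read_frame=lambda: next(frames),
    estimate_pose=lambda f: {"frame": f},
    detect_anomaly=lambda p: p["frame"] == "f3",
    publish=lambda f, p: None,
)
print(stats)  # (3, 1)
```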
Clip Saver Process (`save_clip.py`)
- Technology: FFmpeg integration
- Purpose: Background video processing
- Features:
- Non-blocking incident recording
- Video compression and optimization
- Thumbnail generation
- Database logging
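Clip extraction with FFmpeg typically amounts to building a command line and running it in a background subprocess, which keeps the worker's capture loop non-blocking. A sketch of such a command (flags and paths are illustrative; `save_clip.py` may use different settings):

```python
# Sketch of building an FFmpeg clip-extraction command as a subprocess
# argument list. Encoder flags and paths are illustrative assumptions.
def build_clip_command(src, dst, start_sec, duration_sec):
    return [
        "ffmpeg",
        "-ss", str(start_sec),       # seek to the incident start
        "-i", src,
        "-t", str(duration_sec),     # clip length in seconds
        "-c:v", "libx264",           # re-encode for broad browser support
        "-preset", "veryfast",
        "-movflags", "+faststart",   # metadata up front for progressive playback
        "-y", dst,                   # overwrite output if it exists
    ]

cmd = build_clip_command("uploads/cam1.mp4", "processed_data/clip.mp4", 42, 10)
print(cmd[:5])
# A real call would then be: subprocess.run(cmd, check=True)
```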
```
Camera Feed → AI Worker → Pose Estimation → Anomaly Detection
     ↓            ↓              ↓                 ↓
Stream Server ← Annotated Frames       Database ← Incident Log
     ↓
Web Browser ← Live Stream
```
The `run_all.py` master script coordinates all services using Python multiprocessing:
- Automatic service startup and coordination
- Graceful shutdown handling
- Process isolation for stability
- Centralized logging and monitoring
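The orchestration pattern can be sketched with the standard library alone. Service names below are illustrative, and the sleep is a stand-in for each service's main loop:

```python
import multiprocessing as mp
import time

# Minimal sketch of the run_all.py pattern: one process per service,
# joined on normal exit and terminated on shutdown.
def service(name, run_for):
    time.sleep(run_for)   # stand-in for the service's main loop

def main():
    jobs = [
        mp.Process(target=service, args=("web_server", 0.1), name="web_server"),
        mp.Process(target=service, args=("stream_server", 0.1), name="stream_server"),
        mp.Process(target=service, args=("worker:CAM-1", 0.1), name="worker:CAM-1"),
    ]
    for p in jobs:
        p.start()
    try:
        for p in jobs:
            p.join()
    finally:
        for p in jobs:        # graceful shutdown: stop anything still alive
            if p.is_alive():
                p.terminate()
    return [p.exitcode for p in jobs]

if __name__ == "__main__":
    print(main())  # [0, 0, 0] once all services exit cleanly
```

Process isolation means a crash in one AI worker cannot take down the web server, at the cost of communicating between services over HTTP rather than shared memory.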
- Python 3.10+ - Core backend language
- Flask - Web framework for API and service endpoints
- Waitress - Production WSGI server for stable video streaming
- SQLAlchemy - Database ORM for incident management
- OpenCV - Computer vision and video processing
- Ultralytics YOLOv11 - State-of-the-art pose estimation model
- FFmpeg - Video encoding, compression, and clip generation
- NumPy - Numerical computing for AI operations
- Requests - HTTP client for inter-service communication
- React 19 - Modern UI framework
- Vite - Fast development server and build tool
- Axios - HTTP client for API communication
- React Router - Single-page application routing
- Material-UI - Component library for consistent design
- React Player - Video playback component
- React Icons - Icon library
- SQLite - Embedded database for incident logs
- Local File System - Video clips and thumbnails storage
- YOLOv11 - Real-time pose estimation
- Custom Transformer Model - Anomaly detection algorithm
- PyTorch - Deep learning framework
Before setting up VigiLens, ensure your system meets these requirements:
- Operating System: Windows 10/11, macOS 10.15+, or Linux (Ubuntu 18.04+)
- RAM: Minimum 8GB (16GB recommended for multiple cameras)
- Storage: At least 5GB free space (more for video storage)
- CPU: Multi-core processor (quad-core recommended)
- GPU: Optional but recommended for faster AI processing
- Download: python.org
- Verify installation: `python --version` (should display Python 3.10.x or higher)
- Download: nodejs.org (LTS version recommended)
- Verify installation: `node --version` and `npm --version`
- Windows: Download from ffmpeg.org and add to PATH
- macOS: Install via Homebrew: `brew install ffmpeg`
- Linux: Install via package manager: `sudo apt install ffmpeg`
- Verify installation: `ffmpeg -version` (should display version information without errors)
- Webcam: For live monitoring (built-in or USB)
- Network Access: For RTSP camera connections
- Camera Specifications: IP cameras should support standard RTSP protocols
```bash
git clone https://github.com/bear-hunter/Camonyou.git
cd Camonyou
```

Navigate to the backend directory and set up the Python environment:
```bash
cd backend

# Create virtual environment
python -m venv venv

# Activate virtual environment
# On Windows:
venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate

# Verify activation (should show (venv) in the terminal prompt)

# Install all required packages
pip install -r requirements.txt

# Verify key packages are installed
pip list | grep -E "(flask|opencv|ultralytics|torch)"
```

The system requires AI models for pose estimation and anomaly detection:
```bash
# Create models directory if it doesn't exist
mkdir -p models

# Place your trained models in the models directory:
# - yolov11s-pose.pt (YOLOv11 pose estimation model)
# - transformer_anomaly_detector.pt (Custom anomaly detection model)
# - shopformer_v2.pth (Transformer model weights)
# - gcae_tokenizer_v2.pth (Tokenizer for the model)
```

Note: The YOLOv11 model will be automatically downloaded on first run if not present. Custom models should be trained separately or obtained from the project maintainers.
```bash
# The database will be automatically created on first run
# Optional: Seed with sample data
python app.py seed
```

Open a new terminal and navigate to the frontend directory:
```bash
cd frontend

# Install all frontend dependencies
npm install

# Install additional required packages if not already included
npm install axios react-router-dom @mui/material @emotion/react @emotion/styled react-player

# Verify installation
npm list --depth=0
```

For development this step is optional, as Vite serves files directly. For production deployment:

```bash
npm run build
```

Configure the cameras you want to monitor by editing the configuration file:
```bash
# Navigate to backend directory
cd backend

# Edit the camera configuration
# Use your preferred text editor to modify cameras.json
```

Edit `backend/cameras.json` to define your camera sources:
```json
[
  {
    "id": "LAPTOP-WEBCAM",
    "rtsp_url": "0"
  },
  {
    "id": "OFFICE-CAMERA-1",
    "rtsp_url": "rtsp://username:[email protected]:554/stream1"
  },
  {
    "id": "PARKING-CAMERA",
    "rtsp_url": "rtsp://admin:[email protected]/live/main"
  }
]
```

| Source Type | Configuration | Example |
|---|---|---|
| Built-in Webcam | `"rtsp_url": "0"` | Laptop camera |
| USB Camera | `"rtsp_url": "1"` | External USB camera |
| IP Camera (RTSP) | `"rtsp_url": "rtsp://user:pass@ip:port/path"` | Network security camera |
| HTTP Stream | `"rtsp_url": "http://ip:port/stream"` | Web-based camera |
- Use descriptive, unique identifiers
- Avoid spaces and special characters
- Examples: `ENTRANCE-CAM`, `CASHIER-1`, `WAREHOUSE-NORTH`
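One detail worth noting: OpenCV expects an integer device index for local webcams but a string URL for network streams, which is why `"rtsp_url": "0"` selects the laptop camera. A sketch of how such a config might be interpreted (the backend's actual parsing may differ, and the URL below is a placeholder):

```python
import json

# Purely numeric source strings select a local device index for
# cv2.VideoCapture; anything else is treated as a stream URL.
CAMERAS_JSON = """
[
  {"id": "LAPTOP-WEBCAM", "rtsp_url": "0"},
  {"id": "OFFICE-CAMERA-1", "rtsp_url": "rtsp://user:pass@camera-host:554/stream1"}
]
"""

def resolve_source(rtsp_url: str):
    """Return an int device index, or the URL string unchanged."""
    return int(rtsp_url) if rtsp_url.isdigit() else rtsp_url

cameras = json.loads(CAMERAS_JSON)
sources = {c["id"]: resolve_source(c["rtsp_url"]) for c in cameras}
print(sources["LAPTOP-WEBCAM"])   # 0 (an int, not the string "0")
```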
Ensure your project structure matches this layout:
```
Camonyou/
├── backend/
│   ├── models/            # AI model files
│   ├── uploads/           # Uploaded video storage
│   ├── processed_data/    # Processed clips and thumbnails
│   ├── instance/          # Database files
│   ├── vigilens_core/     # Core application modules
│   ├── cameras.json       # Camera configuration
│   ├── requirements.txt   # Python dependencies
│   └── run_all.py         # Main startup script
├── frontend/
│   ├── src/               # React source code
│   ├── public/            # Static assets
│   ├── package.json       # Node.js dependencies
│   └── dist/              # Built frontend (created after build)
└── README.md
```
The easiest way to start VigiLens is using two terminals:
```bash
# Navigate to backend directory
cd backend

# Ensure virtual environment is active
source venv/bin/activate   # On macOS/Linux
# OR
venv\Scripts\activate      # On Windows

# Start all backend services
python run_all.py
```

This single command starts:
- ✅ Main Web Server (Port 5000)
- ✅ Live Stream Server (Port 8080)
- ✅ AI Workers (one per camera)
```bash
# Navigate to frontend directory (in a new terminal)
cd frontend

# Start the development server
npm run dev
```

Once both terminals show successful startup messages:
- Open your web browser
- Navigate to: `http://localhost:5173`
- You should see the VigiLens dashboard

- Main API: `http://localhost:5000/api/dashboard/stats`
- Live Stream: `http://localhost:8080/stream/CAMERA-ID` (replace with the actual camera ID)
- Development Server: `http://localhost:5173` (should display the React-based VigiLens interface)
For production deployment:

```bash
# Build frontend
cd frontend
npm run build

# Start backend only (serves built frontend)
cd ../backend
source venv/bin/activate
python run_all.py

# Access at: http://localhost:5000
```

- Total Incidents: View cumulative count of detected anomalies
- Active Cameras: Monitor currently connected camera feeds
- System Status: Real-time status of all services
- Top Cameras: Cameras with most incident detections
- Navigate to Camera View: Access live feeds from all configured cameras
- Real-Time Annotations: See pose estimation overlays in real-time
- Incident Alerts: Automatic notifications when anomalies are detected
- Multi-Camera Grid: Monitor multiple feeds simultaneously
- Upload Videos: Drag and drop or select video files for analysis
- Background Processing: Videos are processed asynchronously
- Progress Tracking: Monitor analysis progress in real-time
- Results Review: View detected incidents with timestamps
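On the client side, progress tracking reduces to polling a status endpoint until the job reports completion. A sketch with the HTTP call injected so the loop runs standalone (the response shape is an assumption, not the actual API contract):

```python
# Sketch of progress polling for an uploaded video. The status-fetching
# call is injected; a real client would issue it via axios or requests.
def poll_until_done(fetch_status, max_polls=100):
    """fetch_status() -> dict like {"progress": 0-100, "done": bool}."""
    for _ in range(max_polls):
        status = fetch_status()
        if status.get("done"):
            return status
    raise TimeoutError("analysis did not finish")

# Stub server: reports 40% once, then completion.
responses = iter([{"progress": 40, "done": False}, {"progress": 100, "done": True}])
final = poll_until_done(lambda: next(responses))
print(final["progress"])  # 100
```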
- Incident List: Browse all detected anomalies chronologically
- Video Playback: Watch original and annotated video clips
- Thumbnail Preview: Quick visual reference for each incident
- Export Capabilities: Download video evidence for reporting
- Add Cameras: Configure new camera sources through the UI
- Remove Cameras: Deactivate camera feeds as needed
- Live Configuration: Changes take effect after service restart
- Camera Testing: Verify camera connectivity before deployment
Problem: `ModuleNotFoundError` when starting backend

```bash
# Solution: Ensure virtual environment is activated
source venv/bin/activate   # macOS/Linux
venv\Scripts\activate      # Windows

# Reinstall dependencies if needed
pip install -r requirements.txt
```

Problem: Camera connection fails
```bash
# Check camera configuration in cameras.json
# Verify RTSP URL format: rtsp://username:password@ip:port/path
# Test with VLC or a similar player first
```

Problem: FFmpeg errors during video processing
```bash
# Verify FFmpeg installation
ffmpeg -version

# Check that the PATH environment variable includes FFmpeg
# Reinstall FFmpeg if necessary
```

Problem: `npm run dev` fails to start
```bash
# Clear npm cache and reinstall
npm cache clean --force
rm -rf node_modules package-lock.json
npm install
```

Problem: Cannot connect to backend API
- Ensure backend is running on port 5000
- Check for CORS errors in browser console
- Verify firewall settings
Problem: High CPU usage with multiple cameras
- Solution: Reduce number of simultaneous cameras
- Alternative: Upgrade hardware or use GPU acceleration
Problem: Memory leaks during long-term operation
- Solution: Restart services periodically
- Monitor: Use system monitoring tools to track resource usage
```bash
# Run with verbose output
cd backend
python run_all.py

# Check individual service logs
python worker.py CAMERA-ID RTSP-URL
```

```bash
# Development server logs
npm run dev

# Browser console for JavaScript errors
# Open browser DevTools (F12) → Console tab
```

Insufficient Memory:
- Close unnecessary applications
- Consider reducing video resolution
- Limit number of concurrent cameras
Storage Space:
- Regularly clean the `processed_data/` folder
- Implement automatic cleanup policies
- Monitor disk usage
- RTSP Credentials: Use strong passwords for camera access
- Network Isolation: Consider separate VLAN for security cameras
- Firewall Rules: Limit access to necessary ports only
- Local Storage: All data remains on local system by default
- Access Control: Implement user authentication for production use
- Data Retention: Establish policies for video data lifecycle
- HTTPS: Enable SSL/TLS for production environments
- Authentication: Implement proper user management
- Backup: Regular backup of incident database and video files
- GPU Acceleration: Install CUDA for faster AI processing
- Storage: Use SSD for better video I/O performance
- Network: Ensure stable network for RTSP streams
- Model Selection: Use lighter models for lower-end hardware
- Frame Rate: Adjust processing frame rate based on requirements
- Resolution: Balance detection accuracy with performance
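Frame-rate throttling is the simplest of these levers: run the detector on every Nth frame only, and reuse the last result in between. A sketch (the stride value is an assumption):

```python
# Sketch of frame-rate throttling: process only every Nth frame to trade
# detection latency for CPU load.
def frames_to_process(total_frames, stride):
    """Indices of frames the detector would actually see."""
    return list(range(0, total_frames, stride))

# At 30 FPS input, a stride of 5 runs the model at roughly 6 FPS.
print(frames_to_process(30, 5))  # [0, 5, 10, 15, 20, 25]
```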
We welcome contributions to improve VigiLens! Please follow these guidelines:
- Fork the repository
- Create a feature branch
- Make your changes
- Test thoroughly
- Submit a pull request
- Follow PEP 8 for Python code
- Use ESLint for JavaScript code
- Add comments for complex logic
- Include unit tests where applicable
This project is developed as an academic capstone project. Please refer to the repository for licensing information.
For issues and questions:
- Check the troubleshooting section above
- Search existing GitHub issues
- Create a new issue with detailed information
- Include system specifications and error logs
- Adamson University - Computer Science Program
- Ultralytics - YOLOv11 implementation
- OpenCV Community - Computer vision tools
- React Team - Frontend framework
Developed by: Adamson University Computer Science Students
Project Type: Capstone Project
Academic Year: 2024-2025