AI Human & Face Finder

A modular Python-based system for real-time human detection and face recognition using computer vision and deep learning. Supports multiple camera sources including webcams, USB cameras, IP cameras, and video files.

Features

Human Detection: Real-time full-body detection using YOLOv8 (with HOG fallback)
Face Detection: Multi-face detection with MTCNN or Haar Cascades
Face Recognition: Identity matching using FaceNet embeddings
Flexible Camera Input: Webcam, USB, IP cameras (RTSP/HTTP), video files
Modular Architecture: Easy to swap models and extend functionality
GPU Acceleration: Optional CUDA support for improved performance
Privacy-First: All processing happens locally, no cloud dependency
Configurable: YAML-based configuration for all parameters

Requirements

System Requirements

OS: Windows, Linux
Python: 3.10 or higher
RAM: 4GB minimum (8GB+ recommended)
GPU: Optional (NVIDIA with CUDA for acceleration)

Python Dependencies

See requirements.txt

Installation

1. Clone the Repository

cd f:\ai-camera\Ai-human-finder

2. Create Virtual Environment (Recommended)

python -m venv venv

# Windows
venv\Scripts\activate

# Linux/Mac
source venv/bin/activate

3. Install Dependencies

pip install -r requirements.txt

4. Download Models (Optional)

YOLOv8 models will download automatically on first run. For faster startup, you can pre-download:

python -c "from ultralytics import YOLO; YOLO('yolov8n.pt')"

Configuration

Edit config.yaml to customize:

Camera Settings

camera:
  source_type: 'webcam'  # 'webcam', 'usb', 'ip', 'video'
  source: 0              # 0 for default webcam, path for video file
  resolution: [640, 480] # or null for default
  fps_limit: 30

Detection Settings

human_detection:
  enabled: true
  model: 'yolov8n'  # n=fastest, s/m/l/x=slower but more accurate
  confidence_threshold: 0.5
  use_gpu: true

face_detection:
  enabled: true
  model: 'mtcnn'  # 'mtcnn' or 'haar'
  confidence_threshold: 0.9

face_recognition:
  enabled: true
  model: 'facenet'
  recognition_threshold: 0.6  # Lower = stricter matching

Usage

Basic Usage - Detection Only

Simply run the main application:

python main.py

Controls:

Q - Quit application
F - Toggle fullscreen

Face Recognition - Adding Known Faces

Method 1: From Directory (Recommended)

Create a folder structure in data/known_faces/:

data/known_faces/
├── John_Doe/
│   ├── photo1.jpg
│   ├── photo2.jpg
│   └── photo3.jpg
├── Jane_Smith/
│   ├── photo1.jpg
│   └── photo2.jpg
└── Bob_Johnson/
    ├── photo1.jpg
    └── photo2.jpg

Run enrollment:

python enroll_faces.py --mode directory

Method 2: Live Camera Capture

# Capture 5 samples for a person
python enroll_faces.py --mode camera --name "John_Doe" --samples 5

Follow on-screen instructions:

Position face in frame
Press SPACE to capture each sample
Vary pose and expression between captures
Press Q to finish early

Running with Recognition

After enrolling faces:

python main.py

The system will now display names above recognized faces!

Project Structure

Ai-human-finder/
├── main.py                 # Main application
├── enroll_faces.py         # Face enrollment tool
├── config.yaml             # Configuration file
├── requirements.txt        # Python dependencies
├── README.md              # This file
│
├── src/                   # Source modules
│   ├── camera.py          # Camera input handler
│   ├── human_detector.py  # Human detection
│   ├── face_detector.py   # Face detection
│   └── face_recognizer.py # Face recognition
│
├── data/                  # Data directory
│   ├── known_faces/       # Known face images
│   │   └── PersonName/    # One folder per person
│   └── embeddings.pkl     # Trained embeddings
│
└── logs/                  # Application logs

Advanced Usage

Using Different Camera Sources

USB Camera

camera:
  source_type: 'usb'
  source: 1  # Camera index

IP Camera (RTSP)

camera:
  source_type: 'ip'
  source: 'rtsp://username:[email protected]:554/stream'

Video File

camera:
  source_type: 'video'
  source: 'path/to/video.mp4'

Model Selection

Human Detection Models

yolov8n - Fastest, lowest accuracy (recommended for CPU)
yolov8s - Small, balanced
yolov8m - Medium, good accuracy
yolov8l - Large, high accuracy
yolov8x - Extra large, best accuracy (GPU recommended)

Face Detection Models

mtcnn - More accurate, slower (recommended)
haar - Faster, less accurate (CPU-friendly fallback)

Adjusting Recognition Sensitivity

In config.yaml:

face_recognition:
  recognition_threshold: 0.6  # 0.4 = strict, 0.8 = lenient

Lower values = stricter matching (fewer false positives) Higher values = looser matching (more false positives)

Performance Tips

For CPU-Only Systems

human_detection:
  model: 'yolov8n'
  use_gpu: false

face_detection:
  model: 'haar'

camera:
  resolution: [640, 480]
  fps_limit: 15

For GPU Systems

human_detection:
  model: 'yolov8m'  # or yolov8l
  use_gpu: true

face_detection:
  model: 'mtcnn'

camera:
  resolution: [1280, 720]
  fps_limit: 30

Troubleshooting

Camera Not Opening

Check camera index (try 0, 1, 2)
Ensure camera isn't used by another application
On Linux, check permissions: sudo usermod -a -G video $USER

Low FPS

Reduce camera resolution
Use smaller model (yolov8n)
Enable GPU acceleration
Increase skip_frames in config

Face Recognition Not Working

Ensure embeddings.pkl exists (run enroll_faces.py)
Add more training samples per person (5-10 recommended)
Adjust recognition_threshold
Ensure good lighting during enrollment and runtime

ImportError for Ultralytics/MTCNN

pip install ultralytics mtcnn facenet-pytorch

CUDA Out of Memory

Use smaller models
Reduce camera resolution
Process fewer frames (increase skip_frames)

Performance Benchmarks

Test System: Intel i7-10700K, RTX 3070, 640x480 resolution

Configuration	FPS
YOLOv8n + Haar (CPU)	12-15
YOLOv8n + MTCNN (CPU)	8-10
YOLOv8n + MTCNN (GPU)	28-32
YOLOv8m + MTCNN (GPU)	22-25

Future Enhancements

Technical Details

Models Used

Human Detection: YOLOv8 (You Only Look Once)

Trained on COCO dataset
Real-time object detection
Detects 'person' class

Face Detection: MTCNN (Multi-task Cascaded CNN)

Three-stage cascade: P-Net, R-Net, O-Net
Fast and accurate face localization

Face Recognition: FaceNet

512-dimensional embeddings
Cosine similarity for matching
Pre-trained on VGGFace2 dataset

Recognition Pipeline

Capture frame from camera
Detect humans (YOLOv8) and faces (MTCNN)
Extract face crops from detections
Compute embeddings using FaceNet
Compare with known embeddings using cosine similarity
Identify if distance < threshold
Render results with bounding boxes and labels

Contributing

Contributions are welcome! Areas for improvement:

Additional detection models
Performance optimizations
New features from roadmap
Documentation improvements
Bug fixes

License

This project is for educational and research purposes.

Model Licenses:

YOLOv8: AGPL-3.0
MTCNN: MIT
FaceNet: Apache 2.0

Ethical Considerations

This system uses face recognition technology. Please use responsibly:

Obtain consent before capturing faces
Comply with privacy laws (GDPR, CCPA, etc.)
Secure face data appropriately
Do not use for surveillance without authorization
Be aware of algorithmic bias in face recognition
Consider ethical implications of your use case

Support

For issues, questions, or suggestions:

Check Troubleshooting section
Review config.yaml documentation
Examine logs in logs/ directory

Acknowledgments

Built with:

OpenCV - Computer vision library
PyTorch - Deep learning framework
Ultralytics YOLOv8 - Object detection
MTCNN - Face detection
FaceNet-PyTorch - Face recognition

Made for learning and experimentation

Remember: With great power comes great responsibility. Use AI ethically!

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data/known_faces		data/known_faces
src		src
.gitignore		.gitignore
DOCUMENTATION.md		DOCUMENTATION.md
README.md		README.md
config.yaml		config.yaml
enroll_faces.py		enroll_faces.py
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

AI Human & Face Finder

Features

Requirements

System Requirements

Python Dependencies

Installation

1. Clone the Repository

2. Create Virtual Environment (Recommended)

3. Install Dependencies

4. Download Models (Optional)

Configuration

Camera Settings

Detection Settings

Usage

Basic Usage - Detection Only

Face Recognition - Adding Known Faces

Method 1: From Directory (Recommended)

Method 2: Live Camera Capture

Running with Recognition

Project Structure

Advanced Usage

Using Different Camera Sources

USB Camera

IP Camera (RTSP)

Video File

Model Selection

Human Detection Models

Face Detection Models

Adjusting Recognition Sensitivity

Performance Tips

For CPU-Only Systems

For GPU Systems

Troubleshooting

Camera Not Opening

Low FPS

Face Recognition Not Working

ImportError for Ultralytics/MTCNN

CUDA Out of Memory

Performance Benchmarks

Future Enhancements

Technical Details

Models Used

Recognition Pipeline

Contributing

License

Ethical Considerations

Support

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages