Deep Learning | Transfer Learning | YOLOv8 | FastAPI
An end-to-end computer vision system that classifies aerial objects as Bird or Drone and detects them in real-world images, videos, and live camera feeds. The project combines a transfer-learning classifier with YOLOv8 object detection, served through a FastAPI backend and a lightweight web interface for interactive inference.
The system supports:
- Image classification
- Image object detection
- Video object detection
- Laptop webcam live detection
- Mobile camera real-time detection via browser
Binary classification using deep learning and transfer learning.
Models explored:
- Custom CNN baseline
- EfficientNetB0
- ResNet50V2
- MobileNetV2 (best performer)
| Model | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|
| Custom CNN | 0.69 | 0.70 | 0.71 | 0.72 |
| EfficientNetB0 | 0.972 | 0.968 | 0.968 | 0.968 |
| ResNet50V2 | 0.972 | 0.968 | 0.968 | 0.962 |
| MobileNetV2 | 0.972 | 0.968 | 0.968 | 0.968 |
📌 MobileNetV2 selected for deployment due to its speed and lightweight architecture.
The classifier outputs:
- Predicted class (Bird / Drone)
- Confidence score
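The mapping from the model's raw output to these two fields can be sketched as follows. This is a minimal illustration, assuming a single-sigmoid binary head with a 0.5 threshold and the class ordering 0 = Bird, 1 = Drone; the function name and threshold are illustrative, not the project's exact code.

```python
# Map a binary sigmoid output to the (class, confidence) pair the
# classifier returns. Threshold and class ordering are assumptions.

CLASS_NAMES = ("Bird", "Drone")

def interpret_prediction(p_drone: float, threshold: float = 0.5):
    """Return a label and a confidence score for one sigmoid output."""
    if p_drone >= threshold:
        return CLASS_NAMES[1], p_drone
    return CLASS_NAMES[0], 1.0 - p_drone

label, conf = interpret_prediction(0.027)
print(f"Prediction: {label}  Confidence: {conf:.1%}")
```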
Object detection is implemented using YOLOv8n, trained on a labeled dataset of aerial bird and drone images.
- 3,319 labeled images
- Bounding box annotations
- Bird and Drone classes
| Metric | Score |
|---|---|
| mAP50 | ~0.82 |
| mAP50–95 | ~0.53 |
| Precision | 0.82–0.85 |
| Recall | 0.77–0.79 |
The detector identifies multiple objects in a frame and outputs:
- Bounding boxes
- Class labels
- Confidence scores
Example output:
```
Bird detected at (x1, y1, x2, y2) with 0.91 confidence
Drone detected at (x1, y1, x2, y2) with 0.87 confidence
```
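The post-processing that produces output like the above can be sketched in plain Python. The detection tuples and the 0.5 confidence cutoff here are illustrative assumptions, not the project's exact pipeline.

```python
# Hedged sketch: turn raw YOLO-style detections into textual output.
# Detections below a confidence threshold are dropped.

def format_detections(detections, conf_threshold=0.5):
    """detections: iterable of (label, (x1, y1, x2, y2), confidence)."""
    lines = []
    for label, box, conf in detections:
        if conf < conf_threshold:
            continue  # drop low-confidence boxes
        lines.append(f"{label} detected at {box} with {conf:.2f} confidence")
    return lines

raw = [("Bird", (34, 50, 120, 140), 0.91),
       ("Drone", (200, 80, 310, 170), 0.87),
       ("Bird", (5, 5, 20, 20), 0.21)]
for line in format_detections(raw):
    print(line)
```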
The project includes a lightweight web interface powered by FastAPI and a simple HTML/CSS/JavaScript frontend.
The interface allows users to interact with the models without needing Python knowledge.
Users can upload an image and choose:
- Classification Mode
- YOLO Detection Mode
The system returns predictions along with bounding boxes if detection is selected.
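The mode switch behind this choice can be sketched as a small dispatcher. The two model functions are stubs standing in for MobileNetV2 and YOLOv8 inference; all names, mode strings, and return shapes here are assumptions for illustration, not the project's actual FastAPI handlers.

```python
# Sketch of the backend's mode dispatch for an uploaded image.

def classify_stub(image_bytes):
    # Stand-in for MobileNetV2 inference
    return {"label": "Bird", "confidence": 0.973}

def detect_stub(image_bytes):
    # Stand-in for YOLOv8 inference
    return {"boxes": [{"label": "Drone", "xyxy": (200, 80, 310, 170), "conf": 0.87}]}

def handle_upload(image_bytes: bytes, mode: str) -> dict:
    """Route an uploaded image to the model the user selected."""
    if mode == "classify":
        return {"mode": "classify", **classify_stub(image_bytes)}
    if mode == "detect":
        return {"mode": "detect", **detect_stub(image_bytes)}
    raise ValueError(f"unknown mode: {mode}")
```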
Users can upload a video file.
The backend processes the video frame-by-frame using YOLOv8 and returns a fully annotated video with detections.
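The control flow of that loop can be sketched with the I/O stubbed out. A real implementation would read frames with `cv2.VideoCapture` and write annotated frames back with `cv2.VideoWriter`; here both the frame source and the per-frame detector are placeholders so the loop itself is visible.

```python
# Frame-by-frame processing sketch with stubbed reader and detector.

def annotate_video(frames, detect_fn):
    """Run the detector on every frame and collect annotated frames."""
    annotated = []
    for i, frame in enumerate(frames):
        boxes = detect_fn(frame)  # per-frame YOLOv8 inference would go here
        annotated.append({"frame": i, "boxes": boxes})
    return annotated

fake_frames = ["f0", "f1", "f2"]
result = annotate_video(fake_frames, detect_fn=lambda f: [("Bird", 0.9)])
print(len(result))  # 3
```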
Live object detection can be performed using the system's webcam.
Each frame is processed by YOLOv8 and displayed in real time with bounding boxes.
The interface also supports mobile phone cameras via the browser.
Using the device camera:
- Frames are captured in the browser
- Sent to the FastAPI backend
- YOLOv8 performs inference
- Annotated results are streamed back
This allows real-time aerial object detection directly from a smartphone.
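On the backend side, the frame hand-off typically arrives as a base64-encoded image data URL captured by the browser. The following is a minimal sketch of decoding that payload; the data-URL format is standard, but its use here as the project's exact transport is an assumption.

```python
# Decode a browser-captured frame sent as a base64 image data URL.
import base64

def decode_data_url(data_url: str) -> bytes:
    """Strip the data-URL header and decode the frame payload."""
    header, _, payload = data_url.partition(",")
    assert header.startswith("data:image/"), "expected an image data URL"
    return base64.b64decode(payload)

fake_jpeg = b"\xff\xd8\xff\xe0fake-frame"
url = "data:image/jpeg;base64," + base64.b64encode(fake_jpeg).decode()
assert decode_data_url(url) == fake_jpeg
```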
For classification:
- Resize images to 224 × 224
- Normalize pixel values
- Apply data augmentation
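The normalization step above can be sketched in plain Python: scale 8-bit pixel values into [0, 1]. Actual resizing to 224 × 224 would use PIL or `tf.image` and is omitted here.

```python
# Map uint8 pixel values (0-255) to floats in [0, 1].

def normalize_pixels(pixels):
    return [p / 255.0 for p in pixels]

print(normalize_pixels([0, 255]))  # [0.0, 1.0]
```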
For object detection:
- Resize images to 640 × 640
- Convert annotations to YOLO format
- Perform augmentation
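The annotation conversion step above follows the standard YOLO label math: a corner-format box (x1, y1, x2, y2, in pixels) becomes a normalized (class_id, cx, cy, w, h) tuple. The example numbers are illustrative.

```python
# Convert a corner-format pixel box to the normalized YOLO label format.

def to_yolo(x1, y1, x2, y2, img_w, img_h, class_id):
    cx = (x1 + x2) / 2 / img_w   # box centre, normalized to [0, 1]
    cy = (y1 + y2) / 2 / img_h
    w = (x2 - x1) / img_w        # box size, normalized to [0, 1]
    h = (y2 - y1) / img_h
    return (class_id, cx, cy, w, h)

# A 320x160 box centred in a 640x640 image:
print(to_yolo(160, 240, 480, 400, 640, 640, 0))  # (0, 0.5, 0.5, 0.5, 0.25)
```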
Classification Pipeline
- Custom CNN baseline
- Transfer learning models
- Hyperparameter tuning
- Evaluation with confusion matrix
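The confusion-matrix evaluation step can be sketched in plain Python for the binary Bird/Drone case (a real run would more likely use `sklearn.metrics.confusion_matrix`; the sample labels below are illustrative).

```python
# Build a 2x2 confusion matrix: rows = true class, columns = predicted.

def confusion_matrix(y_true, y_pred, labels=("Bird", "Drone")):
    idx = {label: i for i, label in enumerate(labels)}
    m = [[0, 0], [0, 0]]
    for t, p in zip(y_true, y_pred):
        m[idx[t]][idx[p]] += 1
    return m

truth = ["Bird", "Bird", "Drone", "Drone", "Drone"]
preds = ["Bird", "Drone", "Drone", "Drone", "Bird"]
print(confusion_matrix(truth, preds))  # [[1, 1], [1, 2]]
```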
Object Detection Pipeline
- YOLOv8 training
- Bounding box learning
- mAP evaluation
- Inference testing
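The mAP evaluation above rests on intersection-over-union (IoU): at mAP50, a predicted box counts as correct when its IoU with a ground-truth box is at least 0.5, and mAP50–95 averages over thresholds from 0.5 to 0.95. The overlap measure itself is standard and can be computed directly:

```python
# Intersection-over-Union between two boxes given as (x1, y1, x2, y2).

def iou(a, b):
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)  # overlap area (0 if disjoint)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

print(iou((0, 0, 10, 10), (5, 0, 15, 10)))  # half-overlapping boxes: ~0.333
```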
For each request:
- Input received from UI
- Preprocessing applied
- Model inference executed
- Output returned
Outputs include:
- Predicted class
- Confidence score
- Bounding boxes
- Annotated images or videos
- TensorFlow / Keras
- Transfer Learning
- YOLOv8 (Ultralytics)
- OpenCV
- NumPy
- PIL
- FastAPI
- Python
- HTML
- CSS
- JavaScript
- Google Colab
- Jupyter Notebook
- Detect drones and birds near airport runways to reduce bird-strike risks.
- Monitor unauthorized drones in restricted airspace.
- Track bird activity for ecological research.
- Identify aerial objects in surveillance footage.
- Assist in drone traffic monitoring systems.
```
Prediction: Bird
Confidence: 97.3%
```
Bounding boxes with labels:
```
Bird — 0.91
Drone — 0.87
```
MIT License
Contributions are welcome.
If you'd like to improve the system:
- Fork the repository
- Create a feature branch
- Submit a pull request
Omprakash