Revolutionary Single-Class Object Detection
Ultra-fast YOLO architecture optimized exclusively for single-class detection
Quick Start • Benchmarks • Documentation • Examples
⚠️ WORK IN PROGRESS - YOLO-One is currently under active development. The architecture is functional with promising initial results. Star ⭐ this repository to stay updated on our progress!
While existing YOLO models excel at multi-class detection, most real-world applications only need to detect ONE type of object. YOLO-One is the first YOLO architecture designed from the ground up for single-class detection:
- Faster inference with an optimized single-class architecture
- 3x smaller model size (1.9MB vs 6.2MB YOLOv8n)
- Same accuracy for single-class tasks (target)
- Lower power consumption (mobile/edge optimized)
- Reduced memory usage (streamlined architecture)
- Multi-platform deployment ready
- Model Size: 2MB (vs 6.2MB YOLOv8n) - 3x smaller
- Parameters: 750K (vs ~3M YOLOv8n) - 4x fewer params
- Inference Time: 2.56ms (vs 9ms YOLOv8n) - ~3.5x faster
- Channels: 5 per scale (vs 85 for COCO) - single-class optimized
- Memory Format: Float32 (FP16/INT8 planned) - further optimization ready

| Platform | Resolution | Current | Target (Optimized) | Improvement Path |
|---|---|---|---|---|
| Development GPU | 640x640 | 140 FPS | 400+ FPS | +TensorRT +FP16 |
| Inference Time | 640x640 | 2.56ms | ~0.5ms | +Optimizations |
```text
# Performance Projection Pipeline
Current: 140 FPS   # PyTorch Float32
Step 1:  250+ FPS  # + torch.compile
Step 2:  350+ FPS  # + TensorRT
Step 3:  600+ FPS  # + FP16 precision
Mobile:  TBD       # Core ML / TFLite
```

- iOS: Core ML export (in development)
- Android: TensorFlow Lite export (in development)
- Edge Devices: ONNX export ready
- ARM Optimization: Native support planned
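FP16 precision is one of the projected optimization steps above. Here is a hedged sketch of half-precision inference; the one-layer `Conv2d` is a stand-in for the real YoloOne model, and the CPU branch only exists to keep the example runnable without a GPU:

```python
import torch
import torch.nn as nn

# Stand-in module; in practice this would be YoloOne(model_size='nano')
model = nn.Conv2d(3, 5, kernel_size=1).eval()

if torch.cuda.is_available():
    # FP16 halves weight/activation memory and enables Tensor Core math
    model = model.half().cuda()
    x = torch.randn(1, 3, 640, 640, dtype=torch.float16, device="cuda")
else:
    # CPU fallback keeps the sketch runnable without a GPU
    x = torch.randn(1, 3, 640, 640)

with torch.no_grad():
    y = model(x)
print(tuple(y.shape))  # (1, 5, 640, 640)
```

Half precision generally needs a GPU to pay off; on CPU, FP16 convolutions are often slower than FP32.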
- Windows: ✅ PyTorch ready, TensorRT planned
- Linux: ✅ PyTorch ready, CUDA optimization
- macOS: ✅ PyTorch ready, Metal planned
- Docker: Container-ready architecture
- ONNX: Export capability implemented
- TensorRT: High-priority optimization
- Serverless: Lightweight deployment ready
```shell
pip install git+https://github.com/IAtrax/YOLO-One.git

# Run the architecture test script to validate the installation
python tests/test_architecture.py
```

```python
import torch
from yolo_one.models import YoloOne

# Create model
model = YoloOne(model_size='nano')

# Test inference
input_tensor = torch.randn(1, 3, 640, 640)
predictions = model(input_tensor)
print(f"Model size: {model.count_parameters():,} parameters")
# Output: Model size: 485,312 parameters
```

```python
# Traditional YOLO (multi-class)
output_channels = 4 + 1 + num_classes  # bbox + conf + classes
# Example: 4 + 1 + 80 = 85 channels for COCO

# YOLO-One (single-class)
output_channels = 4 + 1  # bbox + fused confidence
# Always: 5 channels only!
```

- No class probability computation (always 1 class)
- Simplified NMS (no per-class separation)
- Fused confidence (objectness + class probability)
- Streamlined post-processing pipeline
- Optimized loss function for single-class
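The simplified, class-agnostic NMS from the list above can be sketched as follows. This is a NumPy illustration of the idea (no per-class loop is needed when there is only one class), not the project's actual implementation:

```python
import numpy as np

def single_class_nms(boxes, scores, iou_thresh=0.5):
    """Class-agnostic NMS: keep highest-scoring boxes, drop overlapping ones.

    boxes: (N, 4) array of [x1, y1, x2, y2]; scores: (N,) confidences.
    """
    order = scores.argsort()[::-1]  # indices sorted by descending score
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(int(i))
        if order.size == 1:
            break
        rest = order[1:]
        # Intersection of the top box with all remaining boxes
        x1 = np.maximum(boxes[i, 0], boxes[rest, 0])
        y1 = np.maximum(boxes[i, 1], boxes[rest, 1])
        x2 = np.minimum(boxes[i, 2], boxes[rest, 2])
        y2 = np.minimum(boxes[i, 3], boxes[rest, 3])
        inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
        area_i = (boxes[i, 2] - boxes[i, 0]) * (boxes[i, 3] - boxes[i, 1])
        area_r = (boxes[rest, 2] - boxes[rest, 0]) * (boxes[rest, 3] - boxes[rest, 1])
        iou = inter / (area_i + area_r - inter)
        order = rest[iou <= iou_thresh]  # suppress heavily overlapping boxes
    return keep

boxes = np.array([[0, 0, 10, 10], [1, 1, 11, 11], [50, 50, 60, 60]], dtype=float)
scores = np.array([0.9, 0.8, 0.7])
print(single_class_nms(boxes, scores))  # [0, 2]: the two overlapping boxes collapse to one
```

Because every detection shares the same class, the per-class partitioning step of standard multi-class NMS disappears entirely.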
| Metric | Current (Nano) | Target (Optimized) | YOLOv8n Baseline |
|---|---|---|---|
| Model Size | ✅ 800KB | 500KB | 6.2MB |
| Parameters | ✅ 750K | 255K | ~3M |
| Inference (GPU) | 140 FPS | 500+ FPS | ~110 FPS |
| Memory Usage | <100MB | <50MB | ~2GB |
| Accuracy (mAP) | TBD | Same | Baseline |
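Until the automated benchmark suite lands, a minimal latency measurement can be sketched like this. The `Conv2d` is a stand-in model, and the timing is illustrative only, not the source of the table's figures:

```python
import time

import torch
import torch.nn as nn

def measure_latency_ms(model: nn.Module, input_size: int = 640, runs: int = 10) -> float:
    """Average forward-pass latency in milliseconds over `runs` iterations."""
    x = torch.randn(1, 3, input_size, input_size)
    model.eval()
    with torch.no_grad():
        model(x)  # warm-up pass (kernel selection, memory allocation)
        start = time.perf_counter()
        for _ in range(runs):
            model(x)
        # On GPU, call torch.cuda.synchronize() here before reading the clock
    return (time.perf_counter() - start) / runs * 1000

# Stand-in model; a real comparison would time YoloOne against a YOLOv8n export
tiny = nn.Conv2d(3, 5, kernel_size=1)
latency = measure_latency_ms(tiny, input_size=64, runs=3)
print(latency > 0)  # True
```

For fair GPU comparisons, both models should be benchmarked at the same resolution, batch size, and precision.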
- Core Architecture: Backbone + Neck + Head
- Multi-scale Detection: P3, P4, P5 outputs
- Single-class Optimization: 5-channel output
- Model Variants: nano, small, medium, large
- Basic Testing: Architecture validation
- Multi-resolution: 320-640px support
- Training Pipeline: Loss function + trainer
- Benchmark Suite: vs YOLOv8n comparison
- Export Pipeline: ONNX, TensorRT, Core ML
- Mobile Optimization: INT8 quantization
- Documentation: Complete API docs
- Training Pipeline - Validate accuracy claims
- TensorRT Export - Achieve speed targets
- Benchmark Suite - Automated comparisons
- Mobile Deployment - iOS/Android support
```text
# Component breakdown
Backbone: ~700K params (~93%)  # Feature extraction
Neck:     ~40K params (~5%)    # Feature fusion
Head:     ~1K params (<1%)     # Detection output
Total:    ~750K params         # Ultra-lightweight
```

```text
# Multi-resolution memory usage
640x640: ~5MB GPU memory
# Scales efficiently with resolution
```

To launch the training pipeline, run the following command:
```shell
# Train with custom parameters
python train.py \
  --data path/to/dataset/directory \
  --config path/to/config/file \
  --model-size nano \
  --epochs 500 \
  --batch-size 16 \
  --lr 0.001 \
  --output-dir path/to/output \
  --device cuda
```

To launch the inference pipeline, use the following command:
```shell
python detect.py \
  --source path/to/image.jpg \
  --weights path/to/model.pt \
  --device cuda \
  --output-dir runs/detect \
  --model-size nano \
  --input-size 640 \
  --conf 0.5 \
  --iou 0.5
```

We welcome contributions! Key areas:
- Training Pipeline Development
- Mobile Optimization
- Benchmark Implementation
- Documentation & Examples
- Export Format Support
See CONTRIBUTING.md for guidelines.
This project is licensed under the MIT License - see the LICENSE file for details.
Ready to revolutionize single-class object detection?

⭐ Star this repository • Fork and contribute • Read the docs

Built with ❤️ by the IAtrax team