Human Activity Understanding - Deep Assembly Lines

A multi-camera 3D scene visualization platform for monitoring battery and screw assembly processes. Integrates YOLOv11 for segmentation, DOPE for 6D object pose estimation, and VGGT for real-time 3D scene reconstruction using synchronized video recordings.

Demo

Installation

1. Create Conda Environment

conda create -n HAUP python=3.10 -y
conda activate HAUP

2. Install PyTorch

For macOS (Apple Silicon - M1/M2/M3):

conda install pytorch::pytorch torchvision torchaudio -c pytorch -y

For NVIDIA GPU (CUDA 12.1):

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121

3. Install Dependencies

pip install -r requirements.txt

Run

python 3d_scene/3dscene.py

Open your browser at http://localhost:8085

📁 Project Structure

├── 3d_scene/                    # Main application
│   ├── 3dscene.py              # Backend server (aiohttp)
│   ├── web_interface.html      # 3D visualization frontend (Three.js)
│   ├── screw_sequence_tracker.py   # Screw sequence state machine
│   ├── sequence_from_distance_tool.py  # CLI monitoring tool
│   ├── distance_tool_screw.py  # Distance API client
│   ├── dope_inference.py       # DOPE 6D pose estimation
│   ├── yolo_inference.py       # YOLOv11 segmentation
│   ├── vggt_inference.py       # 3D point cloud reconstruction
│   ├── battery_fsm_module.py   # Battery tracking state machine (YOLO-based)
│   └── config/                 # Camera calibrations & DOPE config
│
├── data/
│   ├── recording_1-12/         # Multi-camera recordings (8 cameras each)
│   ├── scanned_objects/        # 3D models (case, e-screwdriver)
│   └── cams_calibrations.yml   # Camera calibration data
│
├── weights/                    # Model weights
│   ├── dope_tool.pth          # DOPE weights for screwdriver
│   ├── dope_case.pth          # DOPE weights for case
│   └── model.pt               # YOLOv11 finetuned weights
│
├── frameworks/                 # External frameworks
│   ├── dope/                  # DOPE implementation
│   └── vggt/                  # VGGT point cloud
│
└── yolov11_finetuned/         # YOLOv11 training & testing

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

This project was developed as part of the Practical Laboratory: Human Activity Understanding at the Technical University of Munich (TUM), Chair of Media Technology, supervised by Prof. Dr.-Ing. Eckehard Steinbach.

Huge Thanks to My Teammates

Lucía Balsa Picado (luciabalsa)
Ioannis Papadongonas (ipapadongonas)

Research Works Used

DOPE (Deep Object Pose Estimation) - 6D pose estimation for object detection https://github.com/NVlabs/Deep_Object_Pose
VGGT (Visual Geometry Grounded Transformer) - 3D scene reconstruction https://vgg-t.github.io/
YOLO (You Only Look Once, by Ultralytics) - state-of-the-art real-time object detection https://github.com/ultralytics/ultralytics

Name		Name	Last commit message	Last commit date
Latest commit History 103 Commits
3d_scene		3d_scene
battery_order		battery_order
data		data
frameworks		frameworks
weights		weights
yolov11_finetuned		yolov11_finetuned
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
PPHAU GroupD - Final.pdf		PPHAU GroupD - Final.pdf
README.md		README.md
project_image.png		project_image.png
requirements.txt		requirements.txt
x5demo.gif		x5demo.gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Human Activity Understanding - Deep Assembly Lines

Demo

Installation

1. Create Conda Environment

2. Install PyTorch

3. Install Dependencies

Run

📁 Project Structure

License

Acknowledgments

Huge Thanks to My Teammates

Research Works Used

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Human Activity Understanding - Deep Assembly Lines

Demo

Installation

1. Create Conda Environment

2. Install PyTorch

3. Install Dependencies

Run

📁 Project Structure

License

Acknowledgments

Huge Thanks to My Teammates

Research Works Used

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages