[CVPR 2026] EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents

Wenjia Wang¹,* Liang Pan¹,* Huaijin Pi¹ Yuke Lou¹ Xuqian Ren² Yifan Wu¹
Zhouyingcheng Liao¹ Lei Yang³ Rishabh Dabral⁴ Christian Theobalt⁴ Taku Komura¹

(*: Core Contributors)

¹The University of Hong Kong ²Tampere University
³The Chinese University of Hong Kong ⁴Max Planck Institute for Informatics

🗓️ News:

🎆 2026.Mar.10: The code and data are now released. Please give them a try!

🎆 2026.Feb.22: EmbodMocap has been accepted to CVPR 2026. Code and data will be released soon.

🚀 Quick Start

For new users, follow this order:

Main Pipeline - Quick downloads, preview / visualization, running the pipeline, and step-by-step workflow notes
- docs/embod_mocap.md
Installation - Set up the environment, core dependencies, and manual download references
- docs/install.md
Visualization - Generate rendered videos or inspect scenes and motions interactively with Viser
- docs/visualization.md

Notes:

Compared with the paper version, the open-source release replaces PromptDA with LingbotDepth.
The "fast" option is mainly for users who only need mesh + motion for embodied tasks.
The "standard" option is for users who also need RGBD/mask assets for training reconstruction models.
We provide an interactive visualization tool based on Viser. Give it a try!

Interactive Visualization with Viser

Our Viser-based visualization tool lets you interactively browse scenes, sequences, and SMPL motions in 3D.

Features:

Switch between multiple scenes and sequences
Interactive 3D viewing of scene mesh and SMPL motion
Real-time camera trajectory visualization
Frame-by-frame playback control

See docs/visualization.md for detailed usage.

🎓 Citation

If you find this project useful in your research, please consider citing us:

@inproceedings{wang2026embodmocap,
title = {EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents},
booktitle = {CVPR},
author = {Wang, Wenjia and Pan, Liang and Pi, Huaijin and Lou, Yuke and Ren, Xuqian and Wu, Yifan and Liao, Zhouyingcheng and Yang, Lei and Dabral, Rishabh and Theobalt, Christian and Komura, Taku},
year = {2026}
}

😁 Related Repos

We thank the authors of VGGT, TRAM, ViTPose, Lang-Segment-Anything, PromptDA, Lingbot-Depth, SAM, and COLMAP for their excellent open-source code.

📧 Contact

Feel free to contact me with any other questions or for collaboration: [email protected]
