Multi-level and Multi-modal Action Anticipation (m&m-Ant)

Paper (arXiv): https://arxiv.org/abs/2506.02382
Project Page & Demo: https://alregib.ece.gatech.edu/multi-level-and-multi-modal-action-anticipation/

TL;DR

Action anticipation predicts future actions from partially observed videos, which is difficult due to incomplete context and inherent uncertainty. This work introduces m&m-Ant, a multi-modal (video + text) and multi-level (hierarchical) framework that improves long-term action anticipation by:

combining visual cues with textual information,
explicitly modeling hierarchical semantics,
and introducing a fine-grained text generator trained with a temporal consistency loss to mitigate errors from coarse/inaccurate action labels.

Experiments on Breakfast, 50Salads, and DARai show consistent improvements over prior methods, reporting an average anticipation accuracy gain of +3.08%. :contentReference[oaicite:0]{index=0}

Run

To run the code:

python3 main.py

Citation

@inproceedings{kim2025multi,
  title={Multi-level and Multi-modal Action Anticipation},
  author={Kim, Seulgi and Kaviani, Ghazal and Prabhushankar, Mohit and AlRegib, Ghassan},
  booktitle={2025 IEEE International Conference on Image Processing (ICIP)},
  pages={265--270},
  year={2025},
  organization={IEEE}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
__pycache__		__pycache__
data		data
model		model
README.md		README.md
main.py		main.py
opts.py		opts.py
predict.py		predict.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-level and Multi-modal Action Anticipation (m&m-Ant)

TL;DR

Run

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Multi-level and Multi-modal Action Anticipation (m&m-Ant)

TL;DR

Run

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages