MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-Based Vision-and-Language Navigation (ACL 2025)

Repository for the ACL 2025 paper "MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-Based Vision-and-Language Navigation".
The code has been tested only with Python 3.8 on Ubuntu 20.04.
- Environments Setup
- Follow L3MVN to install Habitat-Lab, Habitat-Sim, RedNet, PyTorch, and the other dependencies.
- Install LLaVA.
- Dataset
- Download the Matterport3D scene dataset into your data directory.
- Path
- Update the dataset path and Habitat path in `config_utils.py`.
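A minimal sketch of how the paths in `config_utils.py` might be centralized before pointing them at your local layout; the variable names (`DATA_ROOT`, `SCENES_DIR`, `resolve`) and directory structure below are illustrative assumptions, not the repository's actual identifiers.

```python
import os

# Hypothetical path constants; edit these to match your machine.
DATA_ROOT = os.path.expanduser("~/data")  # root holding all datasets
SCENES_DIR = os.path.join(DATA_ROOT, "scene_datasets", "mp3d")  # Matterport3D scenes
HABITAT_DIR = os.path.join(DATA_ROOT, "habitat-lab")  # Habitat-Lab checkout

def resolve(fragment: str) -> str:
    """Join a dataset-relative fragment onto the configured root."""
    return os.path.join(DATA_ROOT, fragment)
```

Keeping every absolute path behind one module like this means only these lines need to change when the code moves between machines.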
You can download the Hugging Face dataset to generate your own QA pairs and train your own model with LLaVA-NeXT.
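For reference, a sketch of packing one navigation QA pair into the conversation-style JSON records that LLaVA-style trainers consume. The field values (episode id, image filename, question, answer) are illustrative placeholders, and the exact record schema used for MapNav training is an assumption based on the common LLaVA format.

```python
import json

def make_qa_record(sample_id: str, image_file: str, question: str, answer: str) -> dict:
    """Build one training record in LLaVA's conversation JSON format.

    The "<image>" token marks where the annotated semantic map image
    is spliced into the prompt by the trainer.
    """
    return {
        "id": sample_id,
        "image": image_file,
        "conversations": [
            {"from": "human", "value": f"<image>\n{question}"},
            {"from": "gpt", "value": answer},
        ],
    }

# Hypothetical example record.
record = make_qa_record(
    "ep0001_step03",
    "ep0001_step03_map.png",
    "Which direction leads toward the kitchen on the annotated map?",
    "Turn left and follow the corridor.",
)
print(json.dumps(record, indent=2))
```

A list of such records serialized to a single JSON file is the usual input format for LLaVA-NeXT fine-tuning scripts.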
```bash
CUDA_VISIBLE_DEVICES=1 python r2rnav_benchmark.py --split val1 --eval 1 \
    --auto_gpu_config 0 -n 1 --num_local_steps 10 --print_images 1 \
    --model_dir model_path --exp_name nohis_rgb --eval_episodes 1839 \
    --collect 0 --stop_th 300
```
The generation and annotation pipeline can be found in `r2rnav_benchmark.py` and `huatu3.py`.