Format Matters: Tables vs. Charts

This is the repository for the paper: Format Matters: The Robustness of Multimodal LLMs in Reviewing Evidence from Tables and Charts (AAAI 2026)

Reproduction of Results

download data.zip
download outputs.zip

Table 2:

Edit base_path in Line 49 to obtain all results in Table 2.

python3 run_eval.py

Running process

Run the Claim Label Prediction Task for Tables

python3 run_claim_table.py

Run the Claim Label Prediction Task for Charts

python3 run_claim_img.py

Combination

python3 run_claim_combine.py

Evaluation

python3 run_eval.py

Citation

Please cite our paper as follows:

@article{Ho_Wu_Kumar_Boudin_Takasu_Aizawa_2026,
  title={Format Matters: The Robustness of Multimodal LLMs in Reviewing Evidence from Tables and Charts},
  volume={40},
  url={https://ojs.aaai.org/index.php/AAAI/article/view/40361},
  DOI={10.1609/aaai.v40i37.40361},
  number={37}, journal={Proceedings of the AAAI Conference on Artificial Intelligence},
  author={Ho, Xanh and Wu, Yun-Ang and Kumar, Sunisth and Boudin, Florian and Takasu, Atsuhiro and Aizawa, Akiko},
  year={2026},
  month={Mar.},
  pages={31014-31022}
}

The structure of the code in this repository is based on: https://github.com/Alab-NII/SciTabAlign

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
evaluation		evaluation
models		models
prompts		prompts
tasks/claim_2		tasks/claim_2
.gitignore		.gitignore
LICENSE		LICENSE
ReadMe.md		ReadMe.md
run_claim_combine.py		run_claim_combine.py
run_claim_img.py		run_claim_img.py
run_claim_table.py		run_claim_table.py
run_eval.py		run_eval.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Format Matters: Tables vs. Charts

Reproduction of Results

Table 2:

Running process

Run the Claim Label Prediction Task for Tables

Run the Claim Label Prediction Task for Charts

Combination

Evaluation

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Format Matters: Tables vs. Charts

Reproduction of Results

Table 2:

Running process

Run the Claim Label Prediction Task for Tables

Run the Claim Label Prediction Task for Charts

Combination

Evaluation

Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages