Skip to content

JidongLi-hub/JudgeBeforeAnswer

Repository files navigation

Judge Before Answer:
Can MLLM Discern the False Premise in Question?


Method Overview

Static Badge Static Badge

📋 Overview

The original code of constructing JBA dataset is given in this repository, which is an evaluation set of false premise problems for MLLM.

🤗Find our dataset on Huggingface

🚀 Quick Start

Find our JBA dataset in dataset/Judge_Before_Answer.json or on Huggingface, image ids are from Visual Genome. Also, you can run main.py to constrcut your own JBA dataset.

Run test.py to generate test results for MLLM.

Run evaluate.py to evaluate the results and get metrics.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors