metrics

Metrics (evaluators)

Each file defines an evaluator for a task with a specific setting. Each evaluator takes predictions (a list of predictions) and golds (a list of the gold data item, and each gold data item is a dict) and returns a dict as the evaluation result.

You can add new evaluators here for the evaluation of a new task or an existing task with a new setting. If you use the ../third_party/ directory for the evaluator, please add them into the ../third_party directory, and specify their link in .gitsubmodule, which enables recursive cloning.

Name		Name	Last commit message	Last commit date
parent directory ..
bird		bird
compwebq		compwebq
cosql		cosql
dart		dart
e2e		e2e
fetaqa		fetaqa
feverous		feverous
finqa		finqa
grailqa		grailqa
hybridqa		hybridqa
infotabs		infotabs
kvret		kvret
kvret_glmp		kvret_glmp
logic2text		logic2text
logicnlg		logicnlg
meta_tuning		meta_tuning
mmqa		mmqa
mtop		mtop
multi_woz_22		multi_woz_22
multiwoz		multiwoz
ottqa		ottqa
russ		russ
sparc		sparc
spider		spider
sqa		sqa
sql2text		sql2text
tab_fact		tab_fact
tabmwp		tabmwp
totto		totto
unified		unified
webnlg_challenge_2017		webnlg_challenge_2017
webqsp		webqsp
wikisql_fully_supervised		wikisql_fully_supervised
wikisql_weakly_supervised		wikisql_weakly_supervised
wikitabletext		wikitabletext
wikitq_weakly_supervised		wikitq_weakly_supervised
README.md		README.md
__init__.py		__init__.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Metrics (evaluators)

FilesExpand file tree

metrics

Directory actions

More options

Directory actions

More options

Latest commit

History

metrics

Folders and files

parent directory

README.md

Metrics (evaluators)