The documentation of the evaluation mixin is missing some aspects, e.g.:
- At the beginning of the page, it is not clear what the ground truth is.
- The `metric_names` property is not documented.
- It is not documented how to implement a custom metric function.