Applying Statistics to LLM Evaluations

An overview of useful statistics for building and interpreting LLM evaluations...
READ THE LATEST

Deep (Learning) Focus