Position: Don’t use the CLT in LLM evals with fewer than a few hundred datapoints
Published in arXiv, 2025
Recommended citation: Bowyer S, Aitchison L, and Ivanova DR. (2025). "Position: Don't use the CLT in LLM evals with fewer than a few hundred datapoints." arXiv:2503.01747.