Position: Don’t use the CLT in LLM evals with fewer than a few hundred datapoints

Published in arXiv, 2024

Recommended citation: Bowyer S, Aitchison L, and Ivanova DR. (2024). "Position: Don't use the CLT in LLM evals with fewer than a few hundred datapoints." arXiv:2503.01747.
Download Paper

Categories: