LM Harness

About

`lm-evaluation-harness` (by EleutherAI) is a widely used open-source framework hosted on GitHub. It provides standardized, reproducible benchmarks for testing, scoring, and comparing large language models across a large collection of public tasks and datasets.