LM Evaluation Harness

DevOps

About
`lm-evaluation-harness`, developed by EleutherAI, is a widely used open-source framework hosted on GitHub. It provides a standardized, reproducible way to run benchmarks that test and score large language models across a large collection of public evaluation tasks and datasets.
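
As a rough illustration of what a run looks like, the sketch below assembles a typical `lm_eval` command line for a Hugging Face model. It assumes the package is installed and exposes the `lm_eval` entry point with the `--model`, `--model_args`, `--tasks`, `--device`, and `--batch_size` flags shown in the project README; flag names can differ between releases. The helper `build_lm_eval_command`, the model `EleutherAI/pythia-160m`, and the task names are illustrative choices, not part of this listing. The code only builds and prints the command; it does not execute it.

```python
import shlex


def build_lm_eval_command(pretrained, tasks, device="cpu", batch_size=8):
    """Hypothetical helper: assemble an `lm_eval` CLI invocation as an argv list.

    Assumes the flag names from the lm-evaluation-harness README; check them
    against the version you have installed before relying on this.
    """
    return [
        "lm_eval",
        "--model", "hf",                          # Hugging Face transformers backend
        "--model_args", f"pretrained={pretrained}",
        "--tasks", ",".join(tasks),               # comma-separated task names
        "--device", device,
        "--batch_size", str(batch_size),
    ]


# Example model and tasks are assumptions chosen for illustration.
command = build_lm_eval_command(
    "EleutherAI/pythia-160m",
    ["hellaswag", "lambada_openai"],
)

# Print a shell-safe string that could be pasted into a terminal.
print(shlex.join(command))
```

Building the command as a list rather than a single string keeps each argument separate, so it can later be passed to `subprocess.run(command)` without shell quoting issues.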