Actions: EleutherAI/lm-evaluation-harness
2,821 workflow runs
--examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]
Unit Tests #3990: Pull request #2520 synchronize by mirianfsilva
--examples Argument for Fine-Grained Task Evaluation in lm-evaluation-harness. This feature is the first step towards efficient multi-prompt evaluation with PromptEval [1,2]
Unit Tests #3978: Pull request #2520 synchronize by StellaAthena