Actions: EleutherAI/lm-evaluation-harness

Unit Tests

2,812 workflow runs

Wandb step handling bugfix and feature (#2580)
Unit Tests #3903: Commit b86aa21 pushed by baberabb
December 20, 2024 21:40 5m 55s main
Fix the format of mgsm zh and ja.
Unit Tests #3900: Pull request #2587 opened by timturing
December 20, 2024 04:10 5m 45s timturing:main
Wandb step handling bugfix and feature
Unit Tests #3899: Pull request #2580 synchronize by sjmielke
December 19, 2024 23:06 5m 49s sjmielke:main
add warning for truncation (#2585)
Unit Tests #3898: Commit 6ccd520 pushed by baberabb
December 19, 2024 21:58 5m 40s main
add warning for truncation
Unit Tests #3897: Pull request #2585 synchronize by baberabb
December 19, 2024 21:51 5m 54s maxlenwarning
add warning for truncation
Unit Tests #3896: Pull request #2585 opened by baberabb
December 19, 2024 21:51 6m 25s maxlenwarning
Add Global MMLU Lite (#2567)
Unit Tests #3895: Commit 2b75b11 pushed by baberabb
December 19, 2024 14:54 6m 25s main
Wandb step handling bugfix and feature
Unit Tests #3893: Pull request #2580 opened by sjmielke
December 18, 2024 16:38 6m 26s sjmielke:main
Add Global MMLU Lite
Unit Tests #3892: Pull request #2567 synchronize by shivalika-singh
December 17, 2024 17:18 5m 58s shivalika-singh:add_gmmlu_lite
fix multiple input chat tempalte
Unit Tests #3890: Pull request #2576 synchronize by baberabb
December 17, 2024 15:32 3m 15s multiple_input
fix multiple input chat tempalte
Unit Tests #3889: Pull request #2576 opened by baberabb
December 17, 2024 15:28 2m 59s multiple_input
Add Global MMLU Lite
Unit Tests #3888: Pull request #2567 synchronize by shivalika-singh
December 17, 2024 11:57 6m 33s shivalika-singh:add_gmmlu_lite
drop python 3.8 support (#2575)
Unit Tests #3884: Commit 8558b8d pushed by baberabb
December 17, 2024 11:07 6m 46s main
drop python 3.8 support
Unit Tests #3883: Pull request #2575 synchronize by baberabb
December 17, 2024 10:52 3m 3s drop
drop python 3.8 support
Unit Tests #3882: Pull request #2575 opened by baberabb
December 17, 2024 10:48 6m 51s drop
increment version (#2574)
Unit Tests #3881: Commit 4c26a9c pushed by baberabb
December 17, 2024 10:24 6m 15s main
increment version to 4.6.7
Unit Tests #3880: Pull request #2574 opened by baberabb
December 17, 2024 09:13 6m 8s version
fix DeprecationWarning: invalid escape sequence '\s' for whitespace…
Unit Tests #3879: Commit 8d2f64c pushed by baberabb
December 16, 2024 14:43 6m 38s main
batch loglikelihood_rolling across requests (#2559)
Unit Tests #3878: Commit 0bfb022 pushed by baberabb
December 16, 2024 14:34 6m 9s main
Adding new subtask to SCORE tasks: non greedy robustness (#2558)
Unit Tests #3877: Commit 976d8a0 pushed by baberabb
December 16, 2024 14:28 5m 59s main
Added caseHOLD task
Unit Tests #3876: Pull request #2570 opened by zolastro
December 16, 2024 10:26 Action required zolastro:main
mlx Model (loglikelihood & generate_until)
Unit Tests #3875: Pull request #1902 synchronize by chimezie
December 15, 2024 13:26 Action required chimezie:mlx