Actions: EleutherAI/lm-evaluation-harness

Tasks Modified

2,812 workflow runs

Wandb step handling bugfix and feature (#2580)
Tasks Modified #3931: Commit b86aa21 pushed by baberabb
December 20, 2024 21:40 11s main
Fix the format of mgsm zh and ja.
Tasks Modified #3928: Pull request #2587 opened by timturing
December 20, 2024 04:10 1m 45s timturing:main
Wandb step handling bugfix and feature
Tasks Modified #3927: Pull request #2580 synchronize by sjmielke
December 19, 2024 23:06 18s sjmielke:main
add warning for truncation (#2585)
Tasks Modified #3926: Commit 6ccd520 pushed by baberabb
December 19, 2024 21:58 11s main
add warning for truncation
Tasks Modified #3925: Pull request #2585 synchronize by baberabb
December 19, 2024 21:51 12s maxlenwarning
add warning for truncation
Tasks Modified #3924: Pull request #2585 opened by baberabb
December 19, 2024 21:51 12s maxlenwarning
Add Global MMLU Lite (#2567)
Tasks Modified #3923: Commit 2b75b11 pushed by baberabb
December 19, 2024 14:54 1m 56s main
Wandb step handling bugfix and feature
Tasks Modified #3921: Pull request #2580 opened by sjmielke
December 18, 2024 16:38 12s sjmielke:main
Add Global MMLU Lite
Tasks Modified #3920: Pull request #2567 synchronize by shivalika-singh
December 17, 2024 17:18 2m 54s shivalika-singh:add_gmmlu_lite
fix multiple input chat template
Tasks Modified #3918: Pull request #2576 synchronize by baberabb
December 17, 2024 15:32 1m 40s multiple_input
fix multiple input chat template
Tasks Modified #3917: Pull request #2576 opened by baberabb
December 17, 2024 15:28 2m 7s multiple_input
Add Global MMLU Lite
Tasks Modified #3916: Pull request #2567 synchronize by shivalika-singh
December 17, 2024 11:57 2m 8s shivalika-singh:add_gmmlu_lite
drop python 3.8 support (#2575)
Tasks Modified #3912: Commit 8558b8d pushed by baberabb
December 17, 2024 11:07 11s main
drop python 3.8 support
Tasks Modified #3911: Pull request #2575 synchronize by baberabb
December 17, 2024 10:52 17s drop
drop python 3.8 support
Tasks Modified #3910: Pull request #2575 opened by baberabb
December 17, 2024 10:48 15s drop
increment version (#2574)
Tasks Modified #3909: Commit 4c26a9c pushed by baberabb
December 17, 2024 10:24 17s main
increment version to 4.6.7
Tasks Modified #3908: Pull request #2574 opened by baberabb
December 17, 2024 09:13 18s version
fix DeprecationWarning: invalid escape sequence '\s' for whitespace…
Tasks Modified #3907: Commit 8d2f64c pushed by baberabb
December 16, 2024 14:43 12s main
batch loglikelihood_rolling across requests (#2559)
Tasks Modified #3906: Commit 0bfb022 pushed by baberabb
December 16, 2024 14:34 16s main
Adding new subtask to SCORE tasks: non greedy robustness (#2558)
Tasks Modified #3905: Commit 976d8a0 pushed by baberabb
December 16, 2024 14:28 2m 56s main
Added caseHOLD task
Tasks Modified #3904: Pull request #2570 opened by zolastro
December 16, 2024 10:26 Action required zolastro:main
mlx Model (loglikelihood & generate_until)
Tasks Modified #3903: Pull request #1902 synchronize by chimezie
December 15, 2024 13:26 Action required chimezie:mlx