Skip to content

Actions: EleutherAI/lm-evaluation-harness

Unit Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,812 workflow runs
2,812 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

batch loglikelihood_rolling across requests
Unit Tests #3846: Pull request #2559 opened by baberabb
December 11, 2024 14:22 6m 47s rolling
December 11, 2024 14:22 6m 47s
mlx Model (loglikelihood & generate_until)
Unit Tests #3844: Pull request #1902 synchronize by chimezie
December 10, 2024 18:14 Action required chimezie:mlx
December 10, 2024 18:14 Action required
add llama3 tasks
Unit Tests #3843: Pull request #2556 synchronize by baberabb
December 10, 2024 18:01 6m 27s llama
December 10, 2024 18:01 6m 27s
add llama3 tasks
Unit Tests #3842: Pull request #2556 synchronize by baberabb
December 10, 2024 17:08 7m 6s llama
December 10, 2024 17:08 7m 6s
add llama3 tasks
Unit Tests #3841: Pull request #2556 synchronize by baberabb
December 10, 2024 16:25 5m 55s llama
December 10, 2024 16:25 5m 55s
add llama3 tasks
Unit Tests #3840: Pull request #2556 synchronize by baberabb
December 10, 2024 16:23 5m 32s llama
December 10, 2024 16:23 5m 32s
add llama3 tasks
Unit Tests #3839: Pull request #2556 opened by baberabb
December 10, 2024 16:12 5m 53s llama
December 10, 2024 16:12 5m 53s
Add GigaChat API
Unit Tests #3838: Pull request #2495 synchronize by seldereyy
December 10, 2024 15:13 Action required seldereyy:models/gigachat_llms
December 10, 2024 15:13 Action required
Update Lightning import (#2549)
Unit Tests #3837: Commit 0b99443 pushed by baberabb
December 9, 2024 21:38 6m 6s main
December 9, 2024 21:38 6m 6s
Update Lightning import
Unit Tests #3836: Pull request #2549 synchronize by maanug-nv
December 9, 2024 20:42 6m 6s maanug-nv:maanug/pl-to-l
December 9, 2024 20:42 6m 6s
[API] left truncate for generate_until (#2554)
Unit Tests #3834: Commit 2d11f2e pushed by baberabb
December 9, 2024 19:59 6m 22s main
December 9, 2024 19:59 6m 22s
[API] left truncate for generate_until
Unit Tests #3833: Pull request #2554 synchronize by baberabb
December 9, 2024 16:35 6m 29s max_
December 9, 2024 16:35 6m 29s
[API] left truncate for generate_until
Unit Tests #3832: Pull request #2554 opened by baberabb
December 9, 2024 16:32 6m 44s max_
December 9, 2024 16:32 6m 44s
Update Lightning import
Unit Tests #3831: Pull request #2549 opened by maanug-nv
December 6, 2024 22:27 6m 21s maanug-nv:maanug/pl-to-l
December 6, 2024 22:27 6m 21s
Update README.md (#2546)
Unit Tests #3825: Commit bcb4cbf pushed by baberabb
December 5, 2024 14:52 7m 22s main
December 5, 2024 14:52 7m 22s
add option to add an assistant_prefix
Unit Tests #3823: Pull request #2545 opened by baberabb
December 5, 2024 12:08 5m 7s prefix
December 5, 2024 12:08 5m 7s
[MM] Chartqa
Unit Tests #3822: Pull request #2544 opened by baberabb
December 5, 2024 12:06 5m 26s chartqa
December 5, 2024 12:06 5m 26s
[MM] Ai2d
Unit Tests #3821: Pull request #2542 opened by baberabb
December 5, 2024 12:02 4m 39s ai2d
December 5, 2024 12:02 4m 39s
Update KorMedMCQA: ver 2.0
Unit Tests #3819: Pull request #2540 opened by GyoukChu
December 5, 2024 03:54 6m 43s GyoukChu:main
December 5, 2024 03:54 6m 43s
fixed mmlu generative response extraction
Unit Tests #3814: Pull request #2503 synchronize by RawthiL
December 4, 2024 18:40 8m 15s RawthiL:mmlu_generative_fix
December 4, 2024 18:40 8m 15s
fixed mmlu generative response extraction
Unit Tests #3813: Pull request #2503 synchronize by RawthiL
December 4, 2024 18:08 6m 6s RawthiL:mmlu_generative_fix
December 4, 2024 18:08 6m 6s
add Russian mmlu
Unit Tests #3812: Pull request #2378 synchronize by tatiana-iazykova
December 4, 2024 16:44 6m 33s tatiana-iazykova:main
December 4, 2024 16:44 6m 33s
Support pipeline parallel with OpenVINO models (#2349)
Unit Tests #3810: Commit 1f9bc88 pushed by baberabb
December 4, 2024 16:18 6m 11s main
December 4, 2024 16:18 6m 11s
add better testing when both doc_to_text ends in and target_delimiter…
Unit Tests #3808: Commit 6824d39 pushed by baberabb
December 4, 2024 14:55 7m 3s main
December 4, 2024 14:55 7m 3s