Skip to content

Actions: EleutherAI/lm-evaluation-harness

Unit Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,821 workflow runs
2,821 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add HumanEval
Unit Tests #3991: Pull request #1992 synchronize by baberabb
January 15, 2025 18:23 6m 9s hjlee1371:humaneval
January 15, 2025 18:23 6m 9s
Add HumanEval
Unit Tests #3989: Pull request #1992 synchronize by baberabb
January 15, 2025 18:13 6m 29s hjlee1371:humaneval
January 15, 2025 18:13 6m 29s
Add HumanEval
Unit Tests #3988: Pull request #1992 synchronize by baberabb
January 15, 2025 18:09 5m 6s hjlee1371:humaneval
January 15, 2025 18:09 5m 6s
Add MLQA
Unit Tests #3987: Pull request #2622 synchronize by KahnSvaer
January 15, 2025 17:13 6m 16s KahnSvaer:mlqa
January 15, 2025 17:13 6m 16s
Add MLQA
Unit Tests #3984: Pull request #2622 synchronize by KahnSvaer
January 15, 2025 16:16 6m 17s KahnSvaer:mlqa
January 15, 2025 16:16 6m 17s
Add MLQA
Unit Tests #3983: Pull request #2622 synchronize by KahnSvaer
January 15, 2025 06:24 6m 2s KahnSvaer:mlqa
January 15, 2025 06:24 6m 2s
add hrm8k benchmark for both Korean and English
Unit Tests #3982: Pull request #2627 synchronize by bzantium
January 15, 2025 04:27 6m 7s feature/#2623
January 15, 2025 04:27 6m 7s
add hrm8k benchmark for both Korean and English
Unit Tests #3981: Pull request #2627 opened by bzantium
January 15, 2025 04:24 6m 28s feature/#2623
January 15, 2025 04:24 6m 28s
assistant prefill
Unit Tests #3980: Pull request #2615 synchronize by baberabb
January 14, 2025 23:05 6m 30s prefix
January 14, 2025 23:05 6m 30s
Add MLQA
Unit Tests #3979: Pull request #2622 synchronize by KahnSvaer
January 14, 2025 17:02 5m 51s KahnSvaer:mlqa
January 14, 2025 17:02 5m 51s
add hrm8k benchmark
Unit Tests #3977: Pull request #2624 opened by bzantium
January 14, 2025 12:35 6m 28s feature/#2623
January 14, 2025 12:35 6m 28s
Add MLQA
Unit Tests #3976: Pull request #2622 opened by KahnSvaer
January 13, 2025 22:37 6m 30s KahnSvaer:mlqa
January 13, 2025 22:37 6m 30s
Added EU20 task suite
Unit Tests #3974: Pull request #2620 opened by KlaudiaTH
January 10, 2025 13:04 6m 45s OpenGPTX:eu20_tasks
January 10, 2025 13:04 6m 45s
assistant prefill
Unit Tests #3973: Pull request #2615 synchronize by baberabb
January 9, 2025 18:45 5m 44s prefix
January 9, 2025 18:45 5m 44s
assistant prefill
Unit Tests #3972: Pull request #2615 synchronize by baberabb
January 8, 2025 18:44 6m 31s prefix
January 8, 2025 18:44 6m 31s
assistant prefill
Unit Tests #3971: Pull request #2615 synchronize by baberabb
January 8, 2025 17:22 6m 23s prefix
January 8, 2025 17:22 6m 23s
assistant prefill
Unit Tests #3970: Pull request #2615 synchronize by baberabb
January 8, 2025 17:21 5m 59s prefix
January 8, 2025 17:21 5m 59s
assistant prefill
Unit Tests #3969: Pull request #2615 synchronize by baberabb
January 8, 2025 16:00 5m 16s prefix
January 8, 2025 16:00 5m 16s
assistant prefill
Unit Tests #3968: Pull request #2615 synchronize by baberabb
January 8, 2025 15:59 5m 45s prefix
January 8, 2025 15:59 5m 45s
assistant prefill
Unit Tests #3967: Pull request #2615 synchronize by baberabb
January 8, 2025 15:09 5m 58s prefix
January 8, 2025 15:09 5m 58s
assistant prefill
Unit Tests #3966: Pull request #2615 synchronize by baberabb
January 8, 2025 15:04 6m 24s prefix
January 8, 2025 15:04 6m 24s
Unit Tests
Unit Tests #3965: by baberabb
January 7, 2025 15:42 6m 4s main
January 7, 2025 15:42 6m 4s