fail to evaluate piqa #2597

vejaxu · 2024-12-25T14:50:11Z

Hello,
I am trying to evaluate llama-2-13b-hf with v0.4.7 on the piqa dataset, but I get this error:

  File "/home/xwj/llm/lm-evaluation-harness/lm_eval/api/task.py", line 819, in __init__
    self.download(self.config.dataset_kwargs)
  File "/home/xwj/llm/lm-evaluation-harness/lm_eval/api/task.py", line 926, in download
    self.dataset = datasets.load_dataset(
  File "/usr/local/anaconda3/envs/xwj_transformers/lib/python3.12/site-packages/datasets/load.py", line 2556, in load_dataset
    builder_instance = load_dataset_builder(
  File "/usr/local/anaconda3/envs/xwj_transformers/lib/python3.12/site-packages/datasets/load.py", line 2265, in load_dataset_builder
    builder_instance: DatasetBuilder = builder_cls(
TypeError: 'NoneType' object is not callable

The command is:
lm_eval --model hf --model_args pretrained="/home/llama-2-13b-hf" --tasks piqa --device cuda:0 --batch_size 8

Can anyone help, please?

vejaxu · 2024-12-25T15:04:47Z

I seem to have solved this problem by changing
dataset_path: piqa to dataset_path: nthngdy/piqa
in tasks/piqa.yaml
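
For anyone who wants to script the edit described above, here is a minimal sketch. It rewrites the top-level dataset_path line in the text of a task YAML file. The helper name replace_dataset_path and the sample YAML are hypothetical stand-ins, not part of lm-evaluation-harness. A regex is used instead of a YAML parser so that the rest of the file is left byte-for-byte unchanged.

```python
import re


def replace_dataset_path(yaml_text: str, new_path: str) -> str:
    """Return yaml_text with the first top-level dataset_path value set to new_path.

    Hypothetical helper for illustration; it only touches a line that
    starts at column 0 with "dataset_path:".
    """
    pattern = re.compile(r"^dataset_path:[ \t]*\S.*$", flags=re.MULTILINE)
    # A callable replacement avoids backslash escapes in new_path being interpreted.
    updated, count = pattern.subn(
        lambda _match: f"dataset_path: {new_path}", yaml_text, count=1
    )
    if count == 0:
        raise ValueError("no top-level dataset_path key found")
    return updated


# Simplified stand-in for the task file contents (assumption, not the real file).
sample = "task: piqa\ndataset_path: piqa\ndataset_name: null\n"
patched = replace_dataset_path(sample, "nthngdy/piqa")
print(patched)
```

To apply it to a real file, read the file text, pass it through the function, and write the result back. Keep a copy of the original file first, since the replacement repository is a third-party mirror.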

vejaxu · 2024-12-25T15:05:32Z

But I don't know whether this is correct.

SHUMKASHUN · 2024-12-25T23:51:38Z

Same problem here... It affects hellaswag, piqa, social_iqa...

LanDisen · 2024-12-26T02:43:32Z

Same problem☹️

baberabb · 2024-12-26T07:14:02Z

Hi! I can't reproduce this. If it's a network issue then some people have had success with #1634 (comment),
might be other mirrors as well.

vejaxu · 2024-12-26T09:46:21Z

Thanks for your comment!
Maybe one solution is to replace the dataset_path with a repository on Hugging Face that hosts the data files rather than a .py loading script.
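
A rough way to tell the two kinds of repository apart is to look at the file names they contain. The sketch below is only a heuristic under stated assumptions: the helper name summarize_repo_files and the suffix list are made up for illustration, and the file lists are invented examples. A real list could be obtained with huggingface_hub.list_repo_files(repo_id, repo_type="dataset"), which needs network access.

```python
# Suffixes treated as "plain data files" in this sketch (an assumption,
# not an exhaustive list of formats the datasets library can read).
DATA_SUFFIXES = (".parquet", ".json", ".jsonl", ".csv", ".arrow")


def summarize_repo_files(filenames):
    """Return (has_script, has_data) for a list of repository file names.

    has_script: at least one .py file is present (possible loading script).
    has_data:   at least one file with a known data suffix is present.
    """
    lowered = [name.lower() for name in filenames]
    has_script = any(name.endswith(".py") for name in lowered)
    has_data = any(name.endswith(DATA_SUFFIXES) for name in lowered)
    return has_script, has_data


# Invented file lists for illustration only.
script_only = ["README.md", "piqa.py", "dataset_infos.json"]
data_only = ["README.md", "data/train-00000-of-00001.parquet"]

print(summarize_repo_files(script_only))
print(summarize_repo_files(data_only))
```

Note that metadata files such as dataset_infos.json also match the .json suffix, so has_data can be true for a script-based repository. Treat the result as a hint and check the repository page before switching dataset_path.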

baberabb added the "asking questions" label (for asking for clarification / support on library usage) Jan 2, 2025

vejaxu closed this as completed Jan 2, 2025
