Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fail to evaluate piqa #2597

Closed
vejaxu opened this issue Dec 25, 2024 · 6 comments
Closed

fail to evaluate piqa #2597

vejaxu opened this issue Dec 25, 2024 · 6 comments
Labels
asking questions For asking for clarification / support on library usage.

Comments

@vejaxu
Copy link

vejaxu commented Dec 25, 2024

hello
I am trying to evaluate llama-2-13b-hf with v0.4.7 on dataset piqa but get error

File "/home/xwj/llm/lm-evaluation-harness/lm_eval/api/task.py", line 819, in init
self.download(self.config.dataset_kwargs)
File "/home/xwj/llm/lm-evaluation-harness/lm_eval/api/task.py", line 926, in download
self.dataset = datasets.load_dataset(
File "/usr/local/anaconda3/envs/xwj_transformers/lib/python3.12/site-packages/datasets/load.py", line 2556, in load_dataset
builder_instance = load_dataset_builder(
File "/usr/local/anaconda3/envs/xwj_transformers/lib/python3.12/site-packages/datasets/load.py", line 2265, in load_dataset_builder
builder_instance: DatasetBuilder = builder_cls(
TypeError: 'NoneType' object is not callable

and the command is
lm_eval --model hf --model_args pretrained="/home/llama-2-13b-hf" --tasks piqa --device cuda:0 --batch_size 8

can anyone help please?

@vejaxu
Copy link
Author

vejaxu commented Dec 25, 2024

I seem to solve this problem by replacing
dataset_path: piqa to dataset_path: nthngdy/piqa
in tasks/piqa.yaml

@vejaxu
Copy link
Author

vejaxu commented Dec 25, 2024

but I don't know whether it is correct

@SHUMKASHUN
Copy link

Same problem here... This problem affects on hellaswag, piqa, social_iqa...

@LanDisen
Copy link

Same problem☹️

@baberabb
Copy link
Contributor

Hi! I can't reproduce this. If it's a network issue then some people have had success with #1634 (comment),
might be other mirrors as well.

@vejaxu
Copy link
Author

vejaxu commented Dec 26, 2024

thanks for your comment!
and maybe one solution is replacing the dataset_path which has data rather than .py file on hug.

@baberabb baberabb added the asking questions For asking for clarification / support on library usage. label Jan 2, 2025
@vejaxu vejaxu closed this as completed Jan 2, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
asking questions For asking for clarification / support on library usage.
Projects
None yet
Development

No branches or pull requests

4 participants