I was looking at:
https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
It's a valuable data table, and I wanted to download it in a machine-readable format. The repo does not render a static file; instead, app.py is executed (as per the README metadata) and builds a leaderboard DataFrame from a separate evaluation repo, which seems to be private.
How can one access that DataFrame from the outside?
Edit: I tried the linked docker command, which fails (somewhat expectedly):
$ docker run -it -p 7860:7860 \
--platform=linux/amd64 \
-e H4_TOKEN=hf_nottherealtokenweorpqweruuoid \
-e IS_PUBLIC=true \
registry.hf.space/huggingfaceh4-open-llm-leaderboard:latest \
python app.py
...
Cloning into '.'...
remote: Repository not found
fatal: repository 'https://huggingface.co/datasets/HuggingFaceH4/lmeh_evaluations/' not found
Error(s) during clone:
git clone failed: exit status 128
Cf. https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard/discussions/50
The Open LLM leaderboard dataset has been made public as a dataset here, and can be manipulated using the datasets library or downloaded using git.
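As a rough illustration of both routes, here is a minimal sketch. The dataset's repo id is not spelled out in this text, so `repo_id` is a placeholder you would need to replace with the id of the linked dataset. The helper names `dataset_git_url` and `load_leaderboard` are made up for this example. The URL pattern follows the one visible in the clone error in the question, and the loader assumes the `datasets` package is installed and that the dataset loads without an explicit configuration name, which may not hold.

```python
def dataset_git_url(repo_id: str) -> str:
    """Build the git clone URL for a dataset repo on the Hugging Face Hub.

    Hypothetical helper; it only formats the URL and does not check
    that the repo exists or is public.
    """
    return f"https://huggingface.co/datasets/{repo_id.strip('/')}"


def load_leaderboard(repo_id: str):
    """Load a public Hub dataset with the `datasets` library.

    Hypothetical helper. Requires `pip install datasets` and network access.
    Some datasets also need a configuration name or a split argument;
    that is not handled here.
    """
    # Imported lazily so the URL helper above works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(repo_id)


# Example usage (replace the placeholder with the real repo id):
#   repo_id = "<org>/<dataset-name>"
#   print("git clone", dataset_git_url(repo_id))
#   ds = load_leaderboard(repo_id)
```

If the dataset loads, each split can usually be converted to a pandas DataFrame with its `to_pandas()` method, which gets close to the DataFrame the question asks about.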
Have fun with the data!
(Disclaimer: I'm one of the leaderboard's programmers/researchers.)