dataframehuggingfacegradiohuggingface-hub

Download huggingface table / DataFrame?


I was looking at:

https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard

enter image description here

It's a valuable data table and I wanted to download it in a machine readable format. There is no static file that is rendered out of the repo, but app.py is executed (as per README metadata) - which creates a leaderboard DataFrame out of a separate evaluation repo which seems to be private.

How can one access that DataFrame from the outside?


Edit: I tried the the linked docker command fails (somewhat expectedly):

$ docker run -it -p 7860:7860 \
    --platform=linux/amd64 \
    -e H4_TOKEN=hf_nottherealtokenweorpqweruuoid \
    -e IS_PUBLIC=true \
    registry.hf.space/huggingfaceh4-open-llm-leaderboard:latest \
    python app.py

...

Cloning into '.'...
remote: Repository not found
fatal: repository 'https://huggingface.co/datasets/HuggingFaceH4/lmeh_evaluations/' not found
Error(s) during clone:
git clone failed: exit status 128

Cf. https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard/discussions/50


Solution

  • The Open LLM leaderboard dataset has been made public as a dataset here, and can be manipulated using the datasets library or downloaded using git.

    Have fun with the data!

    (Disclaimer: I'm one of the leaderboard's programmers/researchers.)