fix: Use namespaced HF dataset paths for huggingface_hub >= 1.16 #1971

Open
yoavkatz wants to merge 3 commits into main from fix/hf-namespaced-dataset-paths

@yoavkatz
Member

Summary

  • Update all LoadHF(path=...) calls in prepare files, tests, and catalog to use the full namespace/name format (e.g., hellaswag → Rowan/hellaswag)
  • Required because huggingface_hub >= 1.16 enforces that dataset repository IDs must follow the namespace/name format, rejecting bare names with HfUriError
  • Fixes the CI failure in the performance job where loading hellaswag triggered: Repository id must be 'namespace/name', got 'hellaswag'
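The namespace/name rule described above can be illustrated with a small sketch. The helper names `is_namespaced` and `to_namespaced` and the `NAMESPACED_PATHS` table are hypothetical and are not part of this PR or of huggingface_hub. The check only approximates the rule and is not the hub's own validation logic. The only mapping taken from the PR description is hellaswag → Rowan/hellaswag.

```python
# Hypothetical lookup table from legacy bare names to namespaced repo IDs.
# Only the hellaswag entry comes from the PR description.
NAMESPACED_PATHS = {"hellaswag": "Rowan/hellaswag"}


def is_namespaced(repo_id: str) -> bool:
    """Return True if repo_id looks like 'namespace/name' (approximate check)."""
    parts = repo_id.split("/")
    # Exactly two parts, and neither part may be empty.
    return len(parts) == 2 and all(parts)


def to_namespaced(path: str) -> str:
    """Map a bare dataset name to its namespaced form, or fail loudly."""
    if is_namespaced(path):
        return path
    try:
        return NAMESPACED_PATHS[path]
    except KeyError:
        raise ValueError(f"No namespaced replacement known for {path!r}") from None


print(to_namespaced("hellaswag"))
print(to_namespaced("Rowan/hellaswag"))
```

A check like this could be run over the `path` arguments in the prepare files to find any remaining bare names before they reach the hub client.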

Test plan

  • CI performance test passes (hellaswag loads correctly)
  • Run prepare files to regenerate catalog JSONs with updated paths
  • Verify other dataset cards still load correctly with the namespaced paths

🤖 Generated with Claude Code

yoavkatz and others added 3 commits May 26, 2026 16:41
…atibility

huggingface_hub 1.16+ enforces that dataset repository IDs must use the
'namespace/name' format. Bare dataset names (e.g., 'hellaswag') are no
longer accepted, causing HfUriError in CI.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Yoav Katz <katz@il.ibm.com>

Run all prepare/cards scripts to update the catalog JSON files with the
full namespace/name format for HuggingFace dataset paths.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Yoav Katz <katz@il.ibm.com>

The evaluate library's cached f1.py uses `float(score)` on numpy arrays,
which raises TypeError with numpy >= 2.0. Bypass the evaluate wrapper and
call sklearn's f1_score/precision_score/recall_score directly.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Yoav Katz <katz@il.ibm.com>
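
As a rough illustration of the third commit's approach, the sketch below calls scikit-learn's metric functions directly instead of going through the evaluate wrapper. The function name `classification_scores`, the choice of macro averaging, and the sample labels are assumptions made for illustration. The actual change in this PR is not shown on this page and may differ.

```python
from sklearn.metrics import f1_score, precision_score, recall_score


def classification_scores(references, predictions, average="macro"):
    """Compute F1, precision, and recall with sklearn directly (hypothetical helper)."""
    # With a string `average`, each call returns a single scalar, so no
    # float() conversion of a numpy array is involved.
    return {
        "f1": float(f1_score(references, predictions, average=average)),
        "precision": float(
            precision_score(references, predictions, average=average, zero_division=0)
        ),
        "recall": float(
            recall_score(references, predictions, average=average, zero_division=0)
        ),
    }


# Illustrative labels only; not data from the PR.
scores = classification_scores([0, 1, 1, 0], [0, 1, 0, 0])
print(scores)
```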