Add Hugging Face filesystem support to fsspec#1997
Conversation
|
Hey @lhoestq Thanks for raising this PR. I think this is super interesting! I think the PR needs a couple more things:
|
|
I updated pyproject.toml and added some docs :) PS: I also added the "hf" extra in pyproject.toml, lmk if this is fine |
kevinjqliu
left a comment
There was a problem hiding this comment.
LGTM! Thanks for adding this.
| - **hdfs**: `PyArrowFileIO` | ||
| - **abfs**, **abfss**: `FsspecFileIO` | ||
| - **oss**: `PyArrowFileIO` | ||
| - **hf**: `FsspecFileIO` |
There was a problem hiding this comment.
is there a way to allow PyArrowFileIO as well?
There was a problem hiding this comment.
There is no HF filesystem implementation in arrow C++ yet unfortunately ! But hopefully soon
just retriggered CI |
|
Also retriggered the CI 😄 |
|
all green ! thanks |
# Rationale for this change Add support for the Hugging Face filesystem in `fsspec`, which uses `hf://` paths. This allows to import [HF datasets](https://huggingface.co/datasets). Authentication is done using the `"hf.token"` property. # Are these changes tested? I tried locally but haven't added tests in test_fsspec.py (lmk if it's a requirement) # Are there any user-facing changes? No changes, it simply adds support for `hf://` URLs
# Rationale for this change Add support for the Hugging Face filesystem in `fsspec`, which uses `hf://` paths. This allows to import [HF datasets](https://huggingface.co/datasets). Authentication is done using the `"hf.token"` property. # Are these changes tested? I tried locally but haven't added tests in test_fsspec.py (lmk if it's a requirement) # Are there any user-facing changes? No changes, it simply adds support for `hf://` URLs
Rationale for this change
Add support for the Hugging Face filesystem in
fsspec, which useshf://paths.This allows to import HF datasets.
Authentication is done using the
"hf.token"property.Are these changes tested?
I tried locally but haven't added tests in test_fsspec.py (lmk if it's a requirement)
Are there any user-facing changes?
No changes, it simply adds support for
hf://URLs