feat: Support hf:// in read_(csv|ipc|ndjson) functions #17785
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #17785 +/- ##
==========================================
+ Coverage 80.47% 80.48% +0.01%
==========================================
Files 1503 1503
Lines 197115 196981 -134
Branches 2794 2795 +1
==========================================
- Hits 158628 158541 -87
+ Misses 37973 37920 -53
- Partials 514 520 +6
# Also dispatch on FORCE_ASYNC, so that this codepath gets run
# through by our test suite during CI.
or (
    os.getenv("POLARS_FORCE_ASYNC") == "1"
Magical test coverage for the hf:// dispatch 😉
LGTM. Could you create an issue about the breaking change for the storage options? Would be great if we could simplify the
Alright. Then for 2.0 we should see if we can move more from fsspec into our engine.
Thanks!
Issue created at #17815
For CSV and IPC, only hf:// paths are dispatched to their scan_* equivalents. They can't dispatch for all paths, as that would be a breaking change to storage_options: they currently use fsspec, which uses different configuration keys. For CSV there is also the issue of compressed files.
read_ndjson doesn't suffer from the above issues, so I've added the full set of parameters from scan_ndjson and set it to always dispatch to scan_ndjson.
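The dispatch rule described above can be sketched roughly as follows. This is an illustration only, not the code from this PR: the function name should_dispatch_to_scan, its file_format argument, and the env parameter are hypothetical, and the POLARS_FORCE_ASYNC branch is simplified (the diff excerpt shows it as the first condition in a parenthesized group whose remaining conditions are not visible here). It also assumes the environment override applies to both CSV and IPC.

```python
import os
from typing import Mapping, Optional


def should_dispatch_to_scan(
    source: object,
    file_format: str,
    env: Optional[Mapping[str, str]] = None,
) -> bool:
    """Decide whether a read_* call is routed to its scan_* equivalent.

    Hypothetical helper for illustration; not part of the polars API.
    """
    # Allow tests to pass a fake environment instead of the real one.
    env = os.environ if env is None else env

    if file_format == "ndjson":
        # read_ndjson always dispatches to scan_ndjson.
        return True

    if file_format in ("csv", "ipc"):
        # Only hf:// paths are dispatched, so that storage_options for
        # other paths keep going through fsspec as before.
        is_hf = isinstance(source, str) and source.startswith("hf://")
        # Simplified: the real condition has further clauses not shown
        # in the excerpt.
        force_async = env.get("POLARS_FORCE_ASYNC") == "1"
        return is_hf or force_async

    raise ValueError(f"unsupported file format: {file_format!r}")
```

Under these assumptions, a local CSV path keeps the existing fsspec-based behaviour unless POLARS_FORCE_ASYNC is set to "1", which is what lets the CI test suite exercise the new code path without hf:// URLs.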