Pull requests: uber/petastorm
Pull requests list
Add a ThreadPool which respects the order of Parquet dataset pieces.
#796
opened May 12, 2023 by
wbeardall
Update CI to use latest versions of pyarrow and numpy. Drop pyarrow 4 test config.
#778
opened Sep 15, 2022 by
selitvin
Fix type of the batch returned by make_batch_reader when TransformSpec's function returns a column with all values being None
#750
opened Apr 8, 2022 by
selitvin
Use spark_test_ctx fixture instead of constructing spark manually
#711
opened Jul 30, 2021 by
selitvin
Remove very old pickle compatibility code modifying old atg package names
#702
opened Jul 26, 2021 by
selitvin
[WIP] Auto infer schema (including fields shape) from the first row
#512
opened Mar 23, 2020 by
WeichenXu123
Add make_reader support for parquet partitioned on more than one key
#488
opened Feb 14, 2020 by
jamesprinc3