Hi @drdebian, I'm afraid we don't actually support this out of the box. We use a library called fsspec under the hood. If you look at the implementation of CSVDataSet, we do pass arguments through to it. I would try passing different arguments to fsspec, or perhaps implement your own custom CSVDataSet by inheriting from ours and overriding it.
-
I'm trying to pull in data from a remote SFTP directory into an IncrementalDataSet. There are groups of files like car1*.csv, car2*.csv, ..., somethingelse*.csv in the same directory.
I found no obvious way to use wildcards to filter those groups of files. What I would like to have is something like this:
My expectation is that all files in the remote directory /path/to/raw/data/ matching the pattern "car8*.csv" are read into my_incremental_dataset. Written as above, however, I get no files at all. If I leave out the *, I get all files lumped together into one dataset, which does not work because the file groups have different schemas.
Any help would be greatly appreciated!
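One possible workaround, sketched here under assumptions rather than as a built-in feature: load the whole directory without a wildcard and filter the result by file name inside a node. The sketch assumes the dataset hands the node a dictionary keyed by partition ID, and that the IDs still carry the ".csv" extension. The helper name `filter_partitions` and the sample file names are hypothetical.

```python
from fnmatch import fnmatchcase
from typing import Any, Dict


def filter_partitions(partitions: Dict[str, Any], pattern: str) -> Dict[str, Any]:
    """Keep only the partitions whose ID (file name) matches a glob pattern."""
    return {
        partition_id: data
        for partition_id, data in partitions.items()
        # Match on the last path component so IDs from subdirectories also work.
        if fnmatchcase(partition_id.rsplit("/", 1)[-1], pattern)
    }


# Stand-in for what the dataset would pass to a node; the real values
# would be the loaded CSV contents rather than strings.
loaded = {
    "car1_2021.csv": "car1 data",
    "car8_2021.csv": "car8 data",
    "car8_2022.csv": "more car8 data",
    "somethingelse_2021.csv": "other data",
}

car8_only = filter_partitions(loaded, "car8*.csv")
print(sorted(car8_only))
```

Filtering in the node leaves the catalog entry pointing at the plain directory, so each file group with its own schema can be selected by a separate call with a different pattern.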