pl.scan_csv().select('col1').collect() silently does not return all rows if one column contains the quote_char #21519
Labels
A-io-csv
Area: reading/writing CSV files
bug
Something isn't working
needs triage
Awaiting prioritization by a maintainer
python
Related to Python Polars
Checks
Reproducible example
Log output
Issue description
Selecting a column other than the one (i.e. col2) that contains the quote_char (default: ") somewhere other than as its first character silently returns only the rows up to and including the line with the quote_char. All subsequent rows are silently dropped. The same happens with any other character used as quote_char (e.g. q).
This also happens with POLARS_MAX_THREADS=1.
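The scenario described above can be sketched roughly as follows. This is not the original reproducible example, only an illustration under stated assumptions: a two-column file (col1, col2) in which a single col2 value carries the quote character in the middle of the field. The helper names build_csv, expected_row_count and lazy_row_count are hypothetical and introduced only for this sketch; pl.scan_csv, its quote_char parameter, select, collect and height are the regular Polars API.

```python
import os
import tempfile


def build_csv(n_rows=5, quote_row=2, quote_char='"'):
    """Build CSV text where one col2 value has quote_char mid-field (assumed layout)."""
    lines = ["col1,col2"]
    for i in range(n_rows):
        # Only the row at index quote_row contains the quote character,
        # and never as the first character of the field.
        value = f"a{quote_char}b" if i == quote_row else "ab"
        lines.append(f"{i},{value}")
    return "\n".join(lines) + "\n"


def expected_row_count(csv_text):
    """Count non-empty data lines, excluding the header line."""
    return len([line for line in csv_text.splitlines()[1:] if line])


def lazy_row_count(csv_text, column="col1", quote_char='"'):
    """Write the text to a temporary file and count rows returned by the lazy path."""
    import polars as pl  # imported here so the helpers above work without Polars

    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "repro.csv")
        with open(path, "w", encoding="utf-8", newline="") as fh:
            fh.write(csv_text)
        lazy = pl.scan_csv(path, quote_char=quote_char)
        return lazy.select(column).collect().height
```

Comparing lazy_row_count(build_csv()) with expected_row_count(build_csv()) shows whether rows were lost. According to the report, affected versions return fewer rows when column is "col1" and the full set when column is "col2"; whether this exact input triggers the behavior on a given version is an assumption to verify.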
If we select the column with the quote_char, the CSV is parsed correctly.
Expected behavior
All rows from the CSV should be returned by scan_csv().select().collect().
Alternatively, an exception should be raised instead of silently dropping rows.
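Until the behavior is fixed, a caller could approximate the "raise instead of silently dropping rows" alternative on their own side. The following is a minimal sketch, not part of Polars: count_data_rows, require_all_rows and collect_checked are hypothetical helper names, and the line-count check assumes a header row and no newlines embedded inside quoted fields.

```python
def count_data_rows(path, has_header=True):
    """Count non-blank physical lines in the file, minus the header if present.

    Assumption: no field contains an embedded newline, so one line is one row.
    """
    with open(path, newline="", encoding="utf-8") as fh:
        n = sum(1 for line in fh if line.strip())
    return n - 1 if has_header else n


def require_all_rows(returned, expected):
    """Raise if the number of returned rows differs from the expected count."""
    if returned != expected:
        raise ValueError(
            f"CSV scan returned {returned} rows, expected {expected}"
        )
    return returned


def collect_checked(path, column):
    """Collect one column lazily and fail loudly if rows are missing."""
    import polars as pl  # imported here so the helpers above work without Polars

    df = pl.scan_csv(path).select(column).collect()
    require_all_rows(df.height, count_data_rows(path))
    return df
```

The check costs an extra pass over the file, so it is better suited to tests or small inputs than to large production scans.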
Installed versions
I tested this with polars 1.0.0, 1.22.0, 1.23.0. All versions are affected by this bug.