-
-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: Fix corrupted reads for hive parts from cloud and projection pushdown failure on hive parts #17152
Conversation
4642ef1
to
20602f0
Compare
)); | ||
}, | ||
}; | ||
let is_cloud = is_cloud_url(self.paths.first().unwrap()); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed the optimizer to replace a scan with DataFrameScan
if the predicate filtered out all files (see below)
@@ -378,7 +378,7 @@ impl<'a> PredicatePushDown<'a> { | |||
} | |||
scan_type.remove_metadata(); | |||
} | |||
if paths.is_empty() { | |||
if new_paths.is_empty() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I discovered there was other code in place to ensure the invariant that a reader always has at least one path here, it would replace the scan node with a DataFrameScan
but it wasn't being hit because it was incorrectly checking the old paths
instead of new_paths
. This changes the fix at #12575
Thanks for the quick fix! |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #17152 +/- ##
==========================================
+ Coverage 80.89% 80.93% +0.03%
==========================================
Files 1456 1456
Lines 191355 191372 +17
Branches 2742 2742
==========================================
+ Hits 154801 154890 +89
+ Misses 36045 35973 -72
Partials 509 509 ☔ View full report in Codecov by Sentry. |
Fixes #17104, #15823 (projection pd)
Fixes #17155 (corrupted reads)