forked from apache/spark
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[SPARK-50080][SQL][TESTS] Add benchmark cases for parquet adaptive bloom filter in BloomFilterBenchmark

### What changes were proposed in this pull request?

Parquet's AdaptiveBlockSplitBloomFilter generates a bloom filter with the optimal bit size for the number of distinct values actually present in the data. This is not free: it maintains multiple BloomFilter candidates at runtime, which could increase CPU usage or elapsed time. This pull request adds benchmark cases that compare it with the cases that use the default BloomFilter size.

### Why are the changes needed?

Improve benchmark coverage for common user-oriented features of the Parquet data source.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Benchmark golden files are attached.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes apache#48609 from yaooqinn/SPARK-50080.

Authored-by: Kent Yao <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
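To make the trade-off described in the commit message concrete, here is a minimal Python sketch of the general idea behind an adaptive bloom filter: keep several candidate sizes, discard the ones the data has outgrown, and keep the smallest survivor. It is a simplified model and not Parquet's implementation. The function names, the halving ladder of candidates, and the use of the classic sizing formula m = -n ln(p) / (ln 2)^2 are all assumptions made for illustration.

```python
import math


def optimal_bits(ndv: int, fpp: float) -> int:
    """Classic Bloom filter sizing: m = -n * ln(p) / (ln 2)^2, rounded up."""
    return math.ceil(-ndv * math.log(fpp) / (math.log(2) ** 2))


def pick_candidate(values, max_bits: int, num_candidates: int, fpp: float = 0.01):
    """Return (expected_ndv, bits) for the smallest candidate that fits the data.

    Hypothetical helper: the candidate ladder and the fallback rule are
    illustrative assumptions, not the behavior of any real library.
    """
    # Largest number of distinct values a filter of max_bits holds at fpp.
    max_ndv = int(max_bits * (math.log(2) ** 2) / -math.log(fpp))
    # Assumed ladder: each candidate expects half as many distinct values.
    candidates = [max_ndv >> i for i in range(num_candidates) if max_ndv >> i > 0]
    # Stand-in for the running distinct count kept while writing data.
    distinct = len(set(values))
    # A candidate survives only if the data has not outgrown it.
    surviving = [c for c in candidates if c >= distinct]
    # If the data overflows every candidate, fall back to the largest one.
    chosen = min(surviving) if surviving else max(candidates)
    return chosen, optimal_bits(chosen, fpp)


if __name__ == "__main__":
    one_mib_in_bits = 1024 * 1024 * 8
    for n in (1_000, 200_000):
        ndv, bits = pick_candidate(range(n), one_mib_in_bits, num_candidates=5)
        print(f"{n} distinct values -> candidate ndv={ndv}, bits={bits}")
```

In this model the extra cost comes from tracking every candidate while data is written, which is the kind of overhead the new benchmark cases are meant to measure against a fixed default size.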
1 parent 51e915a, commit 2cb7a16

Showing 3 changed files with 118 additions and 100 deletions.