Predicate Pushdown not Working #2215

Lukas012 · 2022-01-28T11:34:38Z

Hi all,

Environment: Spark 3.0.2, Koalas: 1.8.2, Delta Lake 0.7

I've a Delta-Table partioned by column "PARTITION". Koalas doesn't seem to execute predicate pushdown.

Using Spark:

my_kdf = ks.read_delta(f"...")
my_df = my_kdf.to_spark()
result_df = my_df.filter((col("PARTITION") == 15) & (col("ID") == 1))
result_df.to_koalas().toPandas()

Takes: 20 seconds

Same with koalas:

result_kdf = ks.read_delta(f"...")
result_kdf = result_kdf [(result_kdf ["PARTITION"] == 15) & (result_kdf ["ID"] == 1)]
result_kdf.toPandas()

Takes 130 seconds (seems that it doesnt execute predicate pushdown)

Other try with koalas:

my_kdf = ks.read_delta(f"...")
result_kdf = my_kdf [(my_kdf ["PARTITION"] == 15)]
result_kdf = result_kdf [(result_kdf ["ID"] == 1)]
result_kdf.toPandas()

Takes: 20 seconds.

Why takes 2. so long?

Thanks!
Best

The text was updated successfully, but these errors were encountered:

HyukjinKwon · 2022-02-03T00:34:20Z

@Lukas012 do you mind reporting a issue in https://issues.apache.org/jira/projects/SPARK?

Lukas012 · 2022-02-07T14:42:49Z

Why? This problem only occurs in koalas.

itholic · 2022-02-07T14:58:09Z

@Lukas012 Koalas is ported into PySpark under the name "pandas API on Spark", and this repository is only in maintenance mode. You can get faster feedback in Apache Spark community.

FYI: and also you can use Koalas code as is in the Apache Spark as below:

# import databricks.koalas as ks
import pyspark.pandas as ks

... (existing Koalas codes)

Lukas012 changed the title ~~Koalas Partition Pruning not Working~~ Predicate Pushdown not Working Jan 28, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Predicate Pushdown not Working #2215

Predicate Pushdown not Working #2215

Lukas012 commented Jan 28, 2022 •

edited

Loading

HyukjinKwon commented Feb 3, 2022

Lukas012 commented Feb 7, 2022

itholic commented Feb 7, 2022 •

edited

Loading

Predicate Pushdown not Working #2215

Predicate Pushdown not Working #2215

Comments

Lukas012 commented Jan 28, 2022 • edited Loading

HyukjinKwon commented Feb 3, 2022

Lukas012 commented Feb 7, 2022

itholic commented Feb 7, 2022 • edited Loading

Lukas012 commented Jan 28, 2022 •

edited

Loading

itholic commented Feb 7, 2022 •

edited

Loading