Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[SPARK-40154][PYTHON][DOCS] Correct storage level in Dataframe.cache …
…docstring ### What changes were proposed in this pull request? Corrects the docstring `DataFrame.cache` to give the correct storage level after it changed with Spark 3.0. It seems that the docstring of `DataFrame.persist` was updated, but `cache` was forgotten. ### Why are the changes needed? The doctoring claims that `cache` uses serialised storage, but it actually uses deserialised storage. I confirmed that this is still the case with Spark 3.5.0 using the example code from the Jira ticket. ### Does this PR introduce _any_ user-facing change? Yes, the docstring changes. ### How was this patch tested? The Github actions workflow succeeded. ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#43229 from paulstaab/SPARK-40154. Authored-by: Paul Staab <[email protected]> Signed-off-by: Sean Owen <[email protected]> (cherry picked from commit 94607dd) Signed-off-by: Sean Owen <[email protected]>
- Loading branch information