-
Notifications
You must be signed in to change notification settings - Fork 437
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[VL] Track Jvm based libhdfs performance gap issue #7746
Comments
FYI, static means link at compile time. Dynamic here means loading so library at runtime. Ratio above means the performance boost compared to Vanilla Spark. |
Didn't think that performance diff so fast. |
Looks the indirect call to jvm is not efficient. |
I thought IO is the big head, indirect call is roughly the virtual table cost. |
All above perf test is on single node with enough SSDs. Not sure if the short circuit is enabled or not in JVM test, but even not the perf shouldn't drop so much. |
More clear. @JkSelf Can you also test the missing config?
|
@JkSelf Can you update above table with current data? |
So you guys turn to static linking? |
No, latest perf test shows libhdfs3 has the same perf. If libhdfs jvm also has the same perf, we can just use the runtime loading way.
|
@FelixYBW @zhanglistar
After addressing the above three issues, we re-tested the performance of dynamic and static linking of libhdfs and libhdfs3. The conclusions are as follows:
|
Thank you Ke. Can you add a readme and describe how to config libhdfs3 and hadoop libhdfs? Is changing executorEnv.LD_LIBRARY_PATH enough? Going fwd, we will only support run time load of libhdfs library. |
Description
Currently, we supported JVM-based libhdfs in both the Velox and Gluten communities. However, the performance in Jenkins tests has not met our expectations.
Static linking with libhdfs3.so: 340%
Dynamic linking with JVM: 214.75% (requires further investigation)
Dynamic linking with libhdfs3: 323.85% (only replacing libhdfs.so with libhdfs3.so)
Expected performance for static linking in PR#7697: 340%
The text was updated successfully, but these errors were encountered: