[AMDGPU] Do not rewrite or approximate math functions on ROCm #19970

bjacob · 2025-02-12T03:16:16Z

On ROCm, we want to use the device library for all math functions.

This expands on #19969, which only concerned math.erf.

We only leave one category of rewrites enabled: the operand casts to f32. The ROCm device library internally performs the same for many math functions, but we leave that unchanged here for incrementality. We might get to that in a follow-up PR.

MaheshRavishankar · 2025-02-12T03:23:50Z

If CI passes that's a good indication for e2e correctness for now I think

MaheshRavishankar · 2025-02-12T04:37:40Z

Interesting. It has compilation failures

lialan · 2025-02-13T23:46:58Z

compiler/src/iree/compiler/Codegen/Common/test/math_transform.mlir

+  // CHECK:         math.exp2
+  // CHECK:         math.expm1
+  // CHECK:         math.cbrt
+  // CHECK:         math.erf


have we figured out the numerical issue with math.erf library function?

We have (99%) figured that there was no issue with it, and the issues we ran into were caused by PolynomialApproximationPass being too coarse-grained and too convoluted, so that when we thought earlier that we were enabling/disabling math.erf approximation, we were also enabling/disabling a number of other things, unintentionally. This is what #19922 solved. Now in the present PR we are finally at a place where we have some fine-grained, well-defined levers to play with.

The remaining issue has been diagnosed in #20074 (comment) as an issue with overly strict tests requiring agreement with the less accurate polynomial approximations in f16, as opposed to the ROCm device lib performing the approximation after upcasting to f32.

bjacob · 2025-02-21T21:44:28Z

For a moment I felt that I was very close to resolving this with llvm/llvm-project#128203. That PR does solve the CI issues observed here, except that I have to add all the remaining math ops to scalarization, which is not desirable.

If and when we revive this, we have good reasons at this point to handle the necessary vector-flattening downstream.

Signed-off-by: Benoit Jacob <[email protected]>

bjacob · 2025-02-27T21:02:20Z

Good news, this should be unblocked by llvm/llvm-project#128915. Retrying now.

MaheshRavishankar

This is fine, but we maybe need some end-to-end tests to make sure it compiles as a whole. Do we have unit tests for these already in tests/e2e

bjacob requested review from lialan and MaheshRavishankar February 12, 2025 03:17

bjacob force-pushed the no-approx-at-all-on-rocm branch from 4ae6121 to 35e4441 Compare February 12, 2025 03:18

MaheshRavishankar approved these changes Feb 12, 2025

View reviewed changes

lialan reviewed Feb 13, 2025

View reviewed changes

bjacob force-pushed the no-approx-at-all-on-rocm branch from 35e4441 to 86c508b Compare February 20, 2025 15:25

no-approx-on-rocm

2d14e5e

Signed-off-by: Benoit Jacob <[email protected]>

bjacob force-pushed the no-approx-at-all-on-rocm branch from 86c508b to 2d14e5e Compare February 27, 2025 21:00

bjacob marked this pull request as ready for review February 27, 2025 21:05

bjacob requested a review from hanhanW as a code owner February 27, 2025 21:05

MaheshRavishankar approved these changes Feb 27, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMDGPU] Do not rewrite or approximate math functions on ROCm #19970

[AMDGPU] Do not rewrite or approximate math functions on ROCm #19970

bjacob commented Feb 12, 2025 •

edited

Loading

MaheshRavishankar commented Feb 12, 2025

MaheshRavishankar commented Feb 12, 2025

lialan Feb 13, 2025

bjacob Feb 14, 2025

bjacob Feb 27, 2025

bjacob commented Feb 21, 2025

bjacob commented Feb 27, 2025

MaheshRavishankar left a comment

[AMDGPU] Do not rewrite or approximate math functions on ROCm #19970

Are you sure you want to change the base?

[AMDGPU] Do not rewrite or approximate math functions on ROCm #19970

Conversation

bjacob commented Feb 12, 2025 • edited Loading

MaheshRavishankar commented Feb 12, 2025

MaheshRavishankar commented Feb 12, 2025

lialan Feb 13, 2025

Choose a reason for hiding this comment

bjacob Feb 14, 2025

Choose a reason for hiding this comment

bjacob Feb 27, 2025

Choose a reason for hiding this comment

bjacob commented Feb 21, 2025

bjacob commented Feb 27, 2025

MaheshRavishankar left a comment

Choose a reason for hiding this comment

bjacob commented Feb 12, 2025 •

edited

Loading