[AMDGPU] Add Wave Reduce Intrinsics for i32 type #111342

easyonaadit · 2024-10-07T06:14:32Z

Currently, wave wide reduction is supported for umin and umax operations only.
This patch extends the support for:
uadd, add, usub, sub, min, max, and, or, xor ops for i32 type.

github-actions · 2024-10-07T06:14:50Z

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write permissions for the repository. In which case you can instead tag reviewers by name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review by "ping"ing the PR by adding a comment “Ping”. The common courtesy "ping" rate is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.

…, AND, OR, XOR

pravinjagtap · 2024-10-28T14:07:05Z

llvm/include/llvm/IR/IntrinsicsAMDGPU.td

@@ -2119,8 +2119,14 @@ class AMDGPUWaveReduce<LLVMType data_ty = llvm_anyint_ty> : Intrinsic<
    ],
    [IntrNoMem, IntrConvergent, IntrWillReturn, IntrNoCallback, IntrNoFree, ImmArg<ArgIndex<1>>]>;

-def int_amdgcn_wave_reduce_umin : AMDGPUWaveReduce;
-def int_amdgcn_wave_reduce_umax : AMDGPUWaveReduce;
+multiclass AMDGPUWaveReduceGenerator<list<string> Operations> {


Rename Operations with WaveReduceOps ?

pravinjagtap · 2024-10-28T14:14:02Z

llvm/lib/Target/AMDGPU/SIISelLowering.cpp

+      case AMDGPU::S_SUB_I32:
+      case AMDGPU::S_OR_B32:
+      case AMDGPU::S_XOR_B32:
+        return 0x00000000;


Literal values are not consistent form here (0, 0x00000000, 0xFFFFFFFF). Can we not query this using std::numeric_limits here also?

pravinjagtap · 2024-10-28T14:20:36Z

llvm/lib/Target/AMDGPU/SIISelLowering.cpp

+        const TargetRegisterClass *WaveMaskRegClass = TRI->getWaveMaskRegClass();
+        const TargetRegisterClass *DstRegClass = MRI.getRegClass(DstReg);
+        Register ExecMask = MRI.createVirtualRegister(WaveMaskRegClass);
+        Register CountOfActiveLanesReg = MRI.createVirtualRegister(DstRegClass);


Just ActiveLanes ? In general, you are mangling type in your variable names everywhere. we can avoid that.

easyonaadit changed the title ~~Wave reduce add intrinsic~~ [AMDGPU] Extend Wave Reduce Intrinsics for add, sub, and, or, xor, min, max (Integer type) Oct 8, 2024

easyonaadit force-pushed the wave-reduce-add-intrinsic branch 3 times, most recently from 16e26e7 to 724879b Compare October 9, 2024 11:48

easyonaadit changed the title ~~[AMDGPU] Extend Wave Reduce Intrinsics for add, sub, and, or, xor, min, max (Integer type)~~ [AMDGPU] Extend Wave Reduce Intrinsics for i32 Oct 9, 2024

easyonaadit changed the title ~~[AMDGPU] Extend Wave Reduce Intrinsics for i32~~ [AMDGPU] Add Wave Reduce Intrinsics for i32 Oct 9, 2024

easyonaadit force-pushed the wave-reduce-add-intrinsic branch from 0a2d3aa to e58b310 Compare October 9, 2024 12:12

easyonaadit changed the title ~~[AMDGPU] Add Wave Reduce Intrinsics for i32~~ [AMDGPU] Add Wave Reduce Intrinsics for i32 type Oct 9, 2024

easyonaadit force-pushed the wave-reduce-add-intrinsic branch 4 times, most recently from 490e21e to aa481b8 Compare October 21, 2024 05:52

Wave Reduce Intrinsics for i32 type -> Operations: Add, Sub, Min, Max…

45fc9f9

…, AND, OR, XOR

easyonaadit force-pushed the wave-reduce-add-intrinsic branch from aa481b8 to 45fc9f9 Compare October 21, 2024 08:44

pravinjagtap reviewed Oct 28, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMDGPU] Add Wave Reduce Intrinsics for i32 type #111342

[AMDGPU] Add Wave Reduce Intrinsics for i32 type #111342

easyonaadit commented Oct 7, 2024 •

edited

Loading

github-actions bot commented Oct 7, 2024

pravinjagtap Oct 28, 2024

pravinjagtap Oct 28, 2024

pravinjagtap Oct 28, 2024

[AMDGPU] Add Wave Reduce Intrinsics for i32 type #111342

Are you sure you want to change the base?

[AMDGPU] Add Wave Reduce Intrinsics for i32 type #111342

Conversation

easyonaadit commented Oct 7, 2024 • edited Loading

github-actions bot commented Oct 7, 2024

pravinjagtap Oct 28, 2024

Choose a reason for hiding this comment

pravinjagtap Oct 28, 2024

Choose a reason for hiding this comment

pravinjagtap Oct 28, 2024

Choose a reason for hiding this comment

easyonaadit commented Oct 7, 2024 •

edited

Loading