You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm a newbie to CUTLASS. I'm trying to use CUTLASS to develop multiple GEMM kernel templates and then tune them on different shapes to select the best one.
However, I'm confused with the APIs CUTLASS provided:
Which version of API should I use? I noticed that there're 2.x and 3.x API, is there any standard to choose the version?
Is the API dependent on Hardware architectures? I only saw Hopper examples with 3.x API, is it only applicable on Hopper?
The APIs from different layers, e.g. device and kernel, shares many similarities in terms of warp shape, instruction shape and etc. What's the difference between them?
Look forward to any reply, Thank you very much!
The text was updated successfully, but these errors were encountered:
Hi CUTLASS Community,
I'm a newbie to CUTLASS. I'm trying to use CUTLASS to develop multiple GEMM kernel templates and then tune them on different shapes to select the best one.
However, I'm confused with the APIs CUTLASS provided:
Look forward to any reply, Thank you very much!
The text was updated successfully, but these errors were encountered: