Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[QST] What is the general process of developing and tuning GEMM kernels with CUTLASS? #2050

Open
tiandi111 opened this issue Jan 21, 2025 · 0 comments

Comments

@tiandi111
Copy link

tiandi111 commented Jan 21, 2025

Hi CUTLASS Community,

I'm a newbie to CUTLASS. I'm trying to use CUTLASS to develop multiple GEMM kernel templates and then tune them on different shapes to select the best one.

However, I'm confused with the APIs CUTLASS provided:

  1. Which version of API should I use? I noticed that there're 2.x and 3.x API, is there any standard to choose the version?
  2. Is the API dependent on Hardware architectures? I only saw Hopper examples with 3.x API, is it only applicable on Hopper?
  3. The APIs from different layers, e.g. device and kernel, shares many similarities in terms of warp shape, instruction shape and etc. What's the difference between them?

Look forward to any reply, Thank you very much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant