differentiate reduction function with CUDA atomicAdd #1288
Comments
What version/commit are you on? We should handle that, so I'm a bit surprised, but if you get that on the main branch we will definitely fix it!
Hi @wsmoses, thanks for your reply. Indeed, I was on an outdated version, and I am now trying to build Enzyme on the latest master. However, I ran into this issue:
Ah sorry, I just meant the latest Enzyme, not the latest LLVM. Probably the latest stable release, so 16? We generally try to keep up with LLVM head (and it compiled as of a few days ago), but someone could push a temporary breaking change that takes us time to learn about and adapt to.
Hi @wsmoses. Thanks for the suggestion. I have successfully built LLVM (16.0.6) and Enzyme (latest main), but ran into the issue below:
The compile command line was working fine on Enzyme commit
For newer LLVM versions, the way to load the pass is -fpass-plugin=/path/to/ClangEnzyme-XX.so. This is technically documented here (https://enzyme.mit.edu/getting_started/Faq/#opt-cant-find--enzyme-option), but it is very hard to find. Any PRs to our docs (github.com/EnzymeAD/www) are much appreciated!
Thanks for your reply. With the newer LLVM, I ran into some "Unknown command line argument" errors. The affected arguments include
Since @tgymnich recently commented in a different GH issue on how to use command-line flags on newer LLVM versions, I'll copy the link here: #1295 (comment). Separately, I presume you copied those flags from the SC21 artifact repo; in most cases you shouldn't need to add them. They were used for an ablation study on the impact of those optimizations. The default values should be reasonably performant, and if not, we should fix them.
Thanks for the reply!
Hi, I ran into the issue shown below when differentiating a reduction function that uses atomicAdd. It seems that CUDA atomicAdd is not supported in Enzyme AD. Could anyone give any suggestions? Thanks!
The device function to differentiate:
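As a generic illustration of the pattern in question (not the function from this report), here is a minimal host-side C++ sketch of a sum-of-squares reduction and a hand-written reverse pass. The names atomic_add, sum_squares, and sum_squares_grad are hypothetical. A std::atomic<float> with a compare-exchange loop stands in for CUDA's atomicAdd, and each loop iteration plays the role of one GPU thread. The sketch only shows which derivative an AD tool would be expected to produce for such a reduction; it does not use Enzyme.

```cpp
#include <atomic>
#include <cstddef>
#include <vector>

// Hypothetical CPU stand-in for CUDA's atomicAdd(float*, float):
// retry a compare-exchange until the addition is applied.
inline void atomic_add(std::atomic<float>& target, float value) {
    float old = target.load(std::memory_order_relaxed);
    while (!target.compare_exchange_weak(old, old + value,
                                         std::memory_order_relaxed)) {
        // On failure, `old` now holds the current value; loop and retry.
    }
}

// Forward pass (assumed example): out += sum_i x[i]^2.
// Each iteration mimics one GPU thread adding its contribution atomically.
inline void sum_squares(const std::vector<float>& x, std::atomic<float>& out) {
    for (std::size_t i = 0; i < x.size(); ++i) {
        atomic_add(out, x[i] * x[i]);
    }
}

// Hand-written reverse pass: d(out)/d(x[i]) = 2 * x[i],
// so each element accumulates dx[i] += 2 * x[i] * dout.
inline void sum_squares_grad(const std::vector<float>& x,
                             std::vector<float>& dx, float dout) {
    for (std::size_t i = 0; i < x.size(); ++i) {
        dx[i] += 2.0f * x[i] * dout;
    }
}
```

With x = {1, 2, 3} and out starting at 0, sum_squares leaves out at 14, and sum_squares_grad with dout = 1 accumulates {2, 4, 6} into dx. The atomic accumulation in the forward pass becomes a plain read of dout in the reverse pass.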