Remove rv_all from gpu generated code #5

Open: wants to merge 1 commit into base: artic

Conversation

PearCoding

GPU kernels get polluted by the rv_all instruction. The instruction is filtered out only if Thorin is compiled with RV, and we should not expect RV to be available when Rodent is used only for GPU.
The "fix" is quite simple. It would be great to get rid of the code duplication, but that is beyond the scope of this PR.

@Hugobros3

We should probably have a set of generic, portable SIMD intrinsics that work regardless of the platform (RV, CUDA, AMDHSA, OpenCL, Shady...)
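As a rough illustration of that idea (a sketch, not code from Thorin or Rodent), a portable "all lanes agree" vote could look like this in C++. The names `portable_all` and `PORTABLE_FN` are hypothetical. `__all_sync` is the CUDA warp-vote intrinsic, and the full-warp mask assumes all 32 lanes are active.

```cpp
// Mark the function as callable from both host and device when compiled by nvcc.
#if defined(__CUDACC__)
#define PORTABLE_FN __host__ __device__
#else
#define PORTABLE_FN
#endif

// Hypothetical portable vote: true only if the predicate holds on every lane.
PORTABLE_FN inline bool portable_all(bool pred) {
#if defined(__CUDA_ARCH__)
    // Device code: warp-wide vote (assumes a fully active 32-lane warp).
    return __all_sync(0xffffffffu, pred) != 0;
#else
    // Scalar fallback: there is a single lane, so the vote is the predicate itself.
    return pred;
#endif
}
```

With a fallback like this, generated code never contains a vote instruction that the target cannot lower, whether or not a vectorizer such as RV is present.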

@PearCoding
Author

Yes, I agree. We should also add the "fma" instruction to the math builtins (maybe with a fallback for non-LLVM backends, e.g., OpenCL). I think there are more general-purpose intrinsics that would be handy on all systems, provided a well-behaved fallback can be defined.
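A minimal C++ sketch of what such a builtin with a fallback might look like. The name `portable_fma` and the `HAS_NATIVE_FMA` switch are hypothetical, and a real backend would choose the path per target. `std::fma` is the standard fused multiply-add from `<cmath>`.

```cpp
#include <cmath>

// Hypothetical build-time switch; assumed to be set per backend.
#ifndef HAS_NATIVE_FMA
#define HAS_NATIVE_FMA 1
#endif

// Computes a * b + c.
inline float portable_fma(float a, float b, float c) {
#if HAS_NATIVE_FMA
    // Fused path: the product and sum are rounded once.
    return std::fma(a, b, c);
#else
    // Fallback: separate multiply and add, rounded twice unless the
    // compiler contracts the expression into a fused operation.
    return a * b + c;
#endif
}
```

The unfused fallback can differ from the fused result in the last bits, so whether it counts as well-behaved depends on how much the caller relies on single rounding.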


2 participants