Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable gt half precision types for HIP #295

Open
wants to merge 4 commits into
base: main
Choose a base branch
from

Conversation

cmpfeil
Copy link
Contributor

@cmpfeil cmpfeil commented Feb 12, 2025

Uses 16 bit FP types from HIP headers (<hip/hip_fp16.h>, <hip/hip_bf16.h>) when CUDA headers not available.

BF16 tests pass on AMD MI300A when built with module rocm/6.3 loaded via

cmake -S . -B build-hip -DCMAKE_INSTALL_PREFIX=build-hip -DGTENSOR_DEVICE=hip -DBUILD_TESTING=ON -DGTENSOR_ENABLE_BF16=ON -DCMAKE_CXX_COMPILER=$(which hipcc)
cmake --build build-hip --target install

(FP16 tests pass analogously, when built with -DGTENSOR_ENABLE_FP16=ON)

@cmpfeil cmpfeil marked this pull request as ready for review February 12, 2025 15:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant