First-order CUDA follow-up fix: do not use NVTX #162

griwodz · 2024-08-12T07:20:20Z

Description

Remove use of NVTX from the develop branch. This is a profiling feature that is not needed for everybody out there. People who know about it can add the relevant lines themselves.

It is currently very troublesome because the transition from the NVTX library to header-only NVTX3 is complete on Windows, where NVTX has been removed, while NVTX3 does not even exist on Tegra, Jetson and maybe other Arm-based platforms. The removal is OK because NVTX is only relevant for very fine-grained debugging of the interplay between CPU and GPU. Its timing features can also be achieved by cudaEvents.

griwodz · 2024-08-12T15:03:49Z

Seems to be confirmed in #161 that the PR fixes the problem. #161 is also proposing a few updates in the vcpkg portfile.

This make trouble for continuous integration and is apparently not supported on all platforms. Since it is a debug function, it's just as well to remove it from the mainstream tree.

griwodz · 2024-08-15T08:38:46Z

CUDA is always able to provide the timing of CUDA kernels running on the GPU to tools that observe this, either debuggers or performance analyzers. It is usually not able to register the timing and occupancy of threads on the CPU. NVTX is a mechanism that allows CPU code to add timestamps of its running threads into the same system.

That makes it possible to discover whether the CPU is working on something while the GPU is working on something else. It is not foolproof because NVTX knows nothing about the CPU's "occupancy" (whether it actually does something or is sleeping). But it can help to understand the communication between CPU and GPU better.

NVidia promises that it costs "nearly nothing", and I don't know how it actually compares to something like prof. It does create output for very nice performance analysers.

But it is most certainly not necessary to have it in the develop branch for everybody.

So while the transition from NVTX (version 2) to NVTX3 is ongoing and leads to cross-platform problems, I'd prefer to remove it entirely from the develop branch. There's probably nobody else than I who uses it anyway.

griwodz added type:bug in progress cuda issues related to cuda versions bugfix labels Aug 12, 2024

griwodz self-assigned this Aug 12, 2024

This was linked to issues Aug 12, 2024

[bug] Cannot built in vcpkg - MSVS2022 + Cuda 12.6 #161

Open

runtime error: cudaMemcpyToSymbol failed for Gauss kernel initialization #160

Closed

griwodz removed a link to an issue Aug 12, 2024

runtime error: cudaMemcpyToSymbol failed for Gauss kernel initialization #160

Closed

griwodz changed the title ~~First-order CUDA follow-up fixes~~ First-order CUDA follow-up fix: troublesome transition to nvtx3 Aug 12, 2024

griwodz force-pushed the dev/cmake-lang-cuda branch 3 times, most recently from ac217ae to 5c37b81 Compare August 12, 2024 10:08

griwodz added ready and removed in progress labels Aug 12, 2024

griwodz requested review from simogasp and fabiencastan August 12, 2024 14:03

griwodz changed the title ~~First-order CUDA follow-up fix: troublesome transition to nvtx3~~ First-order CUDA follow-up fix: do not use NVXT Aug 14, 2024

griwodz changed the title ~~First-order CUDA follow-up fix: do not use NVXT~~ First-order CUDA follow-up fix: do not use NVTX Aug 14, 2024

Remove profiling nvtx from develop branch.

abef1d4

This make trouble for continuous integration and is apparently not supported on all platforms. Since it is a debug function, it's just as well to remove it from the mainstream tree.

griwodz force-pushed the dev/cmake-lang-cuda branch from 7435c0f to abef1d4 Compare August 15, 2024 06:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

First-order CUDA follow-up fix: do not use NVTX #162

First-order CUDA follow-up fix: do not use NVTX #162

griwodz commented Aug 12, 2024 •

edited

Loading

griwodz commented Aug 12, 2024

griwodz commented Aug 15, 2024 •

edited

Loading

First-order CUDA follow-up fix: do not use NVTX #162

Are you sure you want to change the base?

First-order CUDA follow-up fix: do not use NVTX #162

Conversation

griwodz commented Aug 12, 2024 • edited Loading

Description

griwodz commented Aug 12, 2024

griwodz commented Aug 15, 2024 • edited Loading

griwodz commented Aug 12, 2024 •

edited

Loading

griwodz commented Aug 15, 2024 •

edited

Loading