Tweak type 3 setpts #609

mreineck · 2025-01-27T10:42:49Z

This removes the temporary phihatk arrays in the setpts method for type 3 transforms. It also reduces the number of large-array accesses in this function.

src/finufft_core.cpp

DiamonDinoia · 2025-01-27T16:24:44Z

This is a good idea. I thought of doing it at some point. Do you have an estimate of the improvement? I don't remember this taking too long in my profiling.

src/finufft_core.cpp

mreineck · 2025-01-27T18:41:18Z

> This is a good idea. I thought of doing it at some point. Do you have an estimate of the improvement? I don't remember this taking too long in my profiling.

I didn't notice any considerable improvement, but memory reduction is always nice, and I expect that scaling behavior should be better with fewer memory accesses.

The overhead for this part of the code can be large, but of course it is only part of the planning phase and therefore not a major concern. It is possible to speed up the correction factor computation considerably by allowing the compiler to vectorize the many cos calls, but that requires -ffast-math or similar, which could be problematic.

DiamonDinoia · 2025-01-27T19:34:25Z

I like it as is, and it can be merged. For vectorization we could open an issue where I vectorize it with xsimd, since it has a cos implementation. Alternatively, we can use pragmas/attributes around the function to enable fast-math and have the compiler do so for us. Otherwise, we can move everything that cannot rely on fast-math to another file and enable it on this file.
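To make the per-function fast-math option concrete, here is a minimal sketch, not code from this PR. It assumes GCC, whose `optimize` function attribute accepts `"fast-math"`; on other compilers the macro expands to nothing and the function is built with the normal flags. The names `SKETCH_FAST_MATH` and `phihat_batch` are hypothetical, and the loop only approximates the shape of the correction-factor computation. Whether the cos calls are actually vectorized depends on the compiler version, optimization level, and math library, and the GCC documentation cautions that the `optimize` attribute is intended mainly for debugging.

```cpp
#include <cmath>
#include <cstddef>

// GCC-only: request fast-math semantics for a single function.
// On other compilers the macro is empty and nothing changes.
#if defined(__GNUC__) && !defined(__clang__)
#define SKETCH_FAST_MATH __attribute__((optimize("fast-math")))
#else
#define SKETCH_FAST_MATH
#endif

// Hypothetical kernel: out[j] = sum_n w[n] * 2 * cos(k[j] * z[n]).
// Raw pointers keep the body free of library calls other than cos.
SKETCH_FAST_MATH void phihat_batch(const double *k, std::size_t nk, const double *z,
                                   const double *w, std::size_t nz, double *out) {
  for (std::size_t j = 0; j < nk; ++j) {
    double acc = 0.0;
    for (std::size_t n = 0; n < nz; ++n) {
      acc += w[n] * 2.0 * std::cos(k[j] * z[n]);
    }
    out[j] = acc;
  }
}
```

Results from such a function can differ in the last few bits from a strictly IEEE-conforming build, so comparisons against a reference should use a tolerance.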

ahbarnett · 2025-01-28T18:17:19Z

Thanks for the effort, but I perhaps would have liked to approve this before merging, since it rewrites some of my code, and I am back from vacation. Next time :)

The "operator () (T k)" stuff I guess is just a way to get on-the-fly phihat[k] for each dimension without storing to a previous array. But, surely all of this abstraction into this mini-class will have negligible performance change? (computing the cosines dominates the DRAM movement, right? Or am I wrong?) A speed comparison would be good.

DiamonDinoia · 2025-01-28T18:27:12Z

> Thanks for the effort, but I perhaps would have liked to approve this before merging, since it rewrites some of my code, and I am back from vacation. Next time :)

Apologies.

> The "operator () (T k)" stuff I guess is just a way to get on-the-fly phihat[k] for each dimension without storing to a previous array. But, surely all of this abstraction into this mini-class will have negligible performance change? (computing the cosines dominates the DRAM movement, right? Or am I wrong?) A speed comparison would be good.

Classes go away when compiling (in theory), so this is likely to have no impact on performance. The opposite would surprise me.

DRAM movement is the same as in the previous version. The only difference is that now things are allocated on the heap instead of the stack. The heap can be slower than the stack, and we pay for the allocation, but I do not think it makes a measurable difference. On the other hand, it reduces memory consumption.
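For readers following the "operator () (T k)" discussion, the pattern can be sketched as below. This is an illustration under stated assumptions, not the code merged in this PR: the struct `PhiHat`, its members `z` and `w`, and the two `recip_*` functions are hypothetical names, and the cosine sum only mimics the general form of a kernel Fourier transform evaluated by quadrature. The first function stores every phihat value in a temporary array before using it; the second evaluates the functor at the point of use, so the temporary disappears.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical per-dimension functor: phihat(k) = sum_n w[n] * 2 * cos(k * z[n]).
template<typename T> struct PhiHat {
  std::vector<T> z; // quadrature nodes (assumed)
  std::vector<T> w; // quadrature weights (assumed)
  T operator()(T k) const {
    T acc = 0;
    for (std::size_t n = 0; n < z.size(); ++n) acc += w[n] * 2 * std::cos(k * z[n]);
    return acc;
  }
};

// Variant with a temporary: fill phihatk first, then read it back in a second pass.
template<typename T>
std::vector<T> recip_with_temp(const PhiHat<T> &phi, const std::vector<T> &k) {
  std::vector<T> phihatk(k.size());
  for (std::size_t j = 0; j < k.size(); ++j) phihatk[j] = phi(k[j]);
  std::vector<T> out(k.size());
  for (std::size_t j = 0; j < k.size(); ++j) out[j] = T(1) / phihatk[j];
  return out;
}

// Variant without a temporary: evaluate phihat on the fly where it is consumed.
template<typename T>
std::vector<T> recip_on_the_fly(const PhiHat<T> &phi, const std::vector<T> &k) {
  std::vector<T> out(k.size());
  for (std::size_t j = 0; j < k.size(); ++j) out[j] = T(1) / phi(k[j]);
  return out;
}
```

Both variants perform the same number of cosine evaluations; the second saves one array of size k.size() per dimension and one pass over it, which matches the memory argument made above.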

mreineck added 3 commits January 27, 2025 11:35

reduce memory overhead in Type 3 setpts by eliminating temporary phih…

f2339d4

…atk arrays; reduce large array reads/writes

dummy commit to trigger formatter

0fd0142

fix typo

a55354d

mreineck requested a review from ahbarnett January 27, 2025 11:02

DiamonDinoia reviewed Jan 27, 2025

DiamonDinoia reviewed Jan 27, 2025

use FINUFFT_ALWAYS_INLNE

8d36187

DiamonDinoia merged commit 8baf39d into master Jan 28, 2025
332 checks passed
