Query register capacity for segred and segscan codegen. #2057

athas · 2023-12-06T22:07:05Z

This is very tedious code, and required adding the notion of "kernel constant expressions", as we have some expressions that must be constant at kernel compilation time (which is at program runtime). We actually had this notion in the ImpCode representation, but now ImpGen provides some manual control as well.

athas · 2023-12-06T22:07:20Z

As I told you, it's not very interesting.

This is very tedious code, and required adding the notion of "kernel constant expressions", as we have some expressions that _must_ be constant at kernel compilation time (which is at program runtime). We actually had this notion in the ImpCode representation, but now ImpGen provides some manual control as well.

athas · 2023-12-07T16:06:17Z

Looks ilke the accurate querying is actually detrimental to performance, compared to the old hardcoded values.

athas · 2023-12-07T16:13:49Z

The old numbers correspond to pretending we have twice as much local memory available as is actually the case. Should we just multiply CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK by two? What is the logic here? @coancea, do you remember?

athas requested a review from sortraev December 6, 2023 22:07

athas force-pushed the query-registers branch 2 times, most recently from 2066150 to 5c6af29 Compare December 7, 2023 11:41

athas added the run-benchmarks Makes GA run the benchmark suite. label Dec 7, 2023

athas self-assigned this Dec 7, 2023

athas force-pushed the query-registers branch from 5c6af29 to ae3b94f Compare December 7, 2023 14:20

athas merged commit 8a1502c into master Dec 7, 2023
24 checks passed

athas deleted the query-registers branch December 7, 2023 15:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Query register capacity for segred and segscan codegen. #2057

Query register capacity for segred and segscan codegen. #2057

athas commented Dec 6, 2023

athas commented Dec 6, 2023

athas commented Dec 7, 2023

athas commented Dec 7, 2023

Query register capacity for segred and segscan codegen. #2057

Query register capacity for segred and segscan codegen. #2057

Conversation

athas commented Dec 6, 2023

athas commented Dec 6, 2023

athas commented Dec 7, 2023

athas commented Dec 7, 2023