JIT: Re-enable acceleration of Vector512<long>.op_Multiply #111832

saucecontrol · 2025-01-25T21:35:18Z

This was a regression in 9.0, from #103555

https://godbolt.org/z/11hs3Kqdd

dotnet-policy-service · 2025-01-25T21:35:50Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

EgorBo

Thanks!

EgorBo · 2025-01-25T21:58:37Z

src/coreclr/jit/hwintrinsicxarch.cpp

@@ -3335,7 +3335,7 @@ GenTree* Compiler::impSpecialIntrinsic(NamedIntrinsic        intrinsic,
                {
                    // Emulate NI_AVX512DQ_VL_MultiplyLow with SSE41 for SIMD16
                }
-                else
+                else if (simdSize != 64)


actually, I think this needs Avx512DQ ISA check

If we're treating any of the Vector512 methods other than IsSupported as intrinsic, that implies we have the full baseline AVX-512 set (F,DQ,BW,CD,VL). It's a bit confusing because some of the import paths assert or check that, but most don't. I'm actually cleaning up some of those redundant asserts in a different branch now.

Ah ok, I thought that Vector512.IsHardwareAccelerated only relies on AVX512F, but looks like DOTNET_EnableAVX512DQ=0 turns it off so it's ok.

Right, IsHardwareAccelerated 😄

The rule is Vector512.IsHardwareAccelerated will return false unless all of the following are satisfied:

Baseline AVX-512 support (F,CD,DQ,BW,VL)

No throttling: either the CPUID check does not flag the CPU (Skylake-X and others with severe downclocking for 512-bit vector instructions), or DOTNET_PreferredVectorBitWidth is set to >= 512

DOTNET_PreferredVectorBitWidth is not set to a value < 512

The rule for whether Vector512 methods actually import as intrinsic is only that we have the baseline AVX-512 set, meaning IsHardwareAccelerated may return false, but all methods may actually be accelerated anyway.

So the fact that we're importing the methods for Vector512 as intrinsic in the first place means the ISA requirements have already been met.
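To make the difference between the two rules concrete, here is a minimal sketch in C++. It is a model of the conditions as described in this thread, not JIT code: `Avx512Config`, `vector512ImportsAsIntrinsic`, and `vector512IsHardwareAccelerated` are hypothetical names, and treating a `DOTNET_PreferredVectorBitWidth` value of 0 as "not set" is an assumption.

```cpp
// Hypothetical model of the inputs discussed above; none of these names exist in the JIT.
struct Avx512Config
{
    bool hasBaselineAvx512;       // F, CD, DQ, BW, VL are all available
    bool cpuThrottles512;         // CPUID check flags severe 512-bit downclocking
    int  preferredVectorBitWidth; // DOTNET_PreferredVectorBitWidth; 0 assumed to mean "not set"
};

// Import rule: Vector512 methods are imported as intrinsics whenever
// the baseline AVX-512 set is present.
bool vector512ImportsAsIntrinsic(const Avx512Config& cfg)
{
    return cfg.hasBaselineAvx512;
}

// IsHardwareAccelerated rule: baseline support, plus no throttling
// (unless overridden with a width >= 512), plus no explicit width below 512.
bool vector512IsHardwareAccelerated(const Avx512Config& cfg)
{
    if (!cfg.hasBaselineAvx512)
    {
        return false;
    }

    const bool widthIsSet = cfg.preferredVectorBitWidth != 0;
    if (widthIsSet && (cfg.preferredVectorBitWidth < 512))
    {
        return false; // user asked for narrower vectors
    }

    if (cfg.cpuThrottles512 && (cfg.preferredVectorBitWidth < 512))
    {
        return false; // throttled CPU and no >= 512 override
    }

    return true;
}
```

With this model, a throttling CPU with no width override gives `vector512IsHardwareAccelerated` false while `vector512ImportsAsIntrinsic` stays true, which is the "may return false, but methods may be accelerated anyway" case.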

Vector128 and Vector256 are a bit different, because the baseline ISA requirement may not be enough to accelerate all methods.

Vector256.IsHardwareAccelerated returns true only if AVX2 is supported, but we attempt to import methods as intrinsic as long as AVX is supported. Since many of the methods require AVX2 for acceleration, they have an extra check for AVX2 and then fall back to managed if it's not available. Hence all the (simdSize != 32) || compOpportunisticallyDependsOn(InstructionSet_AVX2) checks.

Similar checks are not included for Vector128, because the base requirement is SSE2, so almost all methods can be accelerated, minus a few that require SSE4.1 and check for it explicitly.
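The per-method gating for Vector128 and Vector256 can be sketched the same way. This is an illustrative model only: `IsaSupport`, `MethodNeeds`, `ImportResult`, and `importMethod` are hypothetical, and the plain `isa.avx2` flag stands in for the JIT's `compOpportunisticallyDependsOn(InstructionSet_AVX2)` query.

```cpp
// Hypothetical stand-ins for the JIT's ISA queries.
struct IsaSupport
{
    bool sse41;
    bool avx2;
};

// Hypothetical description of what a given method needs beyond the baseline ISA.
struct MethodNeeds
{
    bool avx2ForSimd32;  // 32-byte form needs AVX2, not just AVX
    bool sse41ForSimd16; // 16-byte form needs SSE4.1, not just SSE2
};

enum class ImportResult
{
    Intrinsic,
    ManagedFallback
};

ImportResult importMethod(unsigned simdSize, const MethodNeeds& needs, const IsaSupport& isa)
{
    // Same shape as `(simdSize != 32) || compOpportunisticallyDependsOn(InstructionSet_AVX2)`:
    // only the 32-byte case is gated on AVX2.
    if (needs.avx2ForSimd32 && !((simdSize != 32) || isa.avx2))
    {
        return ImportResult::ManagedFallback;
    }

    // The few Vector128 methods that need SSE4.1 check for it explicitly.
    if (needs.sse41ForSimd16 && (simdSize == 16) && !isa.sse41)
    {
        return ImportResult::ManagedFallback;
    }

    // simdSize == 64 needs no extra check here: reaching the import path
    // for Vector512 already implies the baseline AVX-512 set.
    return ImportResult::Intrinsic;
}
```

In this sketch, a 32-byte method that needs AVX2 falls back to managed code on an AVX-only machine, while the 64-byte form of the same method is always imported as an intrinsic.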

Clear as mud, I know...

re-enable acceleration of Vector512<long>.op_Multiply

d20eb9f

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Jan 25, 2025

dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label Jan 25, 2025

EgorBo approved these changes Jan 25, 2025

EgorBo reviewed Jan 25, 2025

build-analysis bot mentioned this pull request Jan 26, 2025

Test failure: baseservices/exceptions/stackoverflow/stackoverflowtester/stackoverflowtester.cmd #110173

Open

EgorBo merged commit d5c8265 into dotnet:main Jan 26, 2025
117 of 119 checks passed
