A commit causes cl-denorms-are-zero is added to char_rtn_float test where gForceFTZ is false. #2066

haonanya · 2024-09-03T07:28:14Z

The change https://github.com/KhronosGroup/OpenCL-CTS/blob/main/test_conformance/conversions/basic_test_conversions.cpp#L1543 is from b6941b6.
Without the commit cl-denorms-are-zero is not added for char_rtn_float test. With the commit for char_rtn_float test gForceFTZ is false and gForceHalfFTZ is true and then cl-denorms-are-zero is added.

opencl test is like:
convert_char_rtn(-0x1.fffffcp-127)

adding the cl-denorms-are-zero option causes output is 0, while host expects -1. opencl spec doesn't explicitly mention flush to zero for input or output or both. There are no cl-denorms-are-zero for cc1 options since llvm/llvm-project@a4451d88ee456304c. Currently intel opencl-clang treats cl-denorms-are-zero as https://github.com/intel/opencl-clang/blob/main/options_compile.cpp#L106 which means flush to zero for input and output https://github.com/llvm/llvm-project/blob/main/llvm/docs/LangRef.rst#L2424.
opencl spec mention that This is intended to be a performance hint and the OpenCL compiler can choose not to flush denorms to zero if the device supports single precision (or double precision) denormalized numbers. But this is not very same as denormal-fp-math which say If the input mode is "preserve-sign", or "positive-zero", a floating-point operation must treat any input denormal value as zero.
@bashbaug , @svenvh , do you have any comments for this? Thanks very much.

The text was updated successfully, but these errors were encountered:

svenvh · 2024-09-03T09:50:29Z

Without the commit cl-denorms-are-zero is not added for char_rtn_float test. With the commit for char_rtn_float test gForceFTZ is false and gForceHalfFTZ is true and then cl-denorms-are-zero is added.

That seems like a bug in the test. Addition of the fp16 tests should not have caused the testing behaviour for fp32 to change. @haonanya I have opened #2067, could you please confirm if that fixes the issue for char_rtn_float on your side?

opencl spec mention that This is intended to be a performance hint and the OpenCL compiler can choose not to flush denorms to zero if the device supports single precision (or double precision) denormalized numbers. But this is not very same as denormal-fp-math which say If the input mode is "preserve-sign", or "positive-zero", a floating-point operation must treat any input denormal value as zero.

Setting -cl-denorms-are-zero predates the fp16 additions, but I agree something looks odd there too. The specification states that "This option is ignored for single precision numbers if the device does not support single precision denormalized numbers i.e. CL_FP_DENORM bit is not set in CL_DEVICE_SINGLE_FP_CONFIG". But the conversions test seems to do exactly that: add the option when CL_FP_DENORM is not set.

haonanya · 2024-09-04T03:18:53Z

@svenvh , thanks, the test passes with your fix.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A commit causes cl-denorms-are-zero is added to char_rtn_float test where gForceFTZ is false. #2066

A commit causes cl-denorms-are-zero is added to char_rtn_float test where gForceFTZ is false. #2066

haonanya commented Sep 3, 2024 •

edited

Loading

svenvh commented Sep 3, 2024

haonanya commented Sep 4, 2024

A commit causes cl-denorms-are-zero is added to char_rtn_float test where gForceFTZ is false. #2066

A commit causes cl-denorms-are-zero is added to char_rtn_float test where gForceFTZ is false. #2066

Comments

haonanya commented Sep 3, 2024 • edited Loading

svenvh commented Sep 3, 2024

haonanya commented Sep 4, 2024

haonanya commented Sep 3, 2024 •

edited

Loading