
benchdnn: graph: enhance input displace and shape rewrite for linked attribute and shapes #2354

Open · wants to merge 4 commits into main from zhitao/enhance-shape-rewrite

Conversation

@wzt1997 (Contributor) commented Jan 8, 2025

Description

This PR enhances the input displacement and shape-rewrite functionality in benchdnn graph in the following ways:

  1. Support mb rewriting on SRC1 of MatMul and on the scale and zero-point inputs of DynamicDequantize, which enables rewriting SDPA patterns.
  2. Support shape rewriting for linked attributes and shapes, such as the group_shape attribute and the scale/zp inputs of DynamicDequantize. If the user provides shapes for either the attribute or the inputs, benchdnn graph updates the other accordingly after performing consistency checks (see the sketch after the examples below).
  3. Fix the data type setting for input displacement when the primitive cannot be created. This resolves two specific cases: f8_e4m3 cases, where primitive creation may fail because f8_e4m3:f8_e4m3:f8_e5m2 is not supported, and bf16:int4:bf16 MatMul cases, where f32:int4:bf16 MatMul is not supported.

For example:

# Rewrite input shape only
0:PASSED __REPRO: --graph --in-shapes=7:1x16x128x1+8:1x16x128x1 --case=/home/wangzhitao/oneDNN-src/tests/benchdnn/inputs/graph/complex_fusion/mha/sdpa-compressed-v-int8-gs32.json
# Rewrite group-shape only
1:PASSED __REPRO: --graph --op-attrs=34107656704:group_shape:1x1x128x1 --case=/home/wangzhitao/oneDNN-src/tests/benchdnn/inputs/graph/complex_fusion/mha/sdpa-compressed-k-int8-gs32.json
# Rewrite mb size
2:PASSED __REPRO: --graph --mb=10 --case=/home/wangzhitao/oneDNN-src/tests/benchdnn/inputs/graph/complex_fusion/mha/sdpa-compressed-v-int8-gs32.json
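
As a rough illustration of the linked check behind item 2, below is a minimal sketch of how the source shape, the group_shape attribute, and the scale/zp shapes must relate. The helper name and the exact rule (scale dims equal src dims divided by group dims, following oneDNN's grouped quantization semantics) are assumptions for illustration, not benchdnn code.

#include <cstddef>
#include <cstdint>
#include <vector>

using dims_t = std::vector<int64_t>;

// Hypothetical helper: returns true when group_shape evenly tiles src_shape
// and scale_shape carries exactly one element per group along each dimension.
bool group_shapes_consistent(const dims_t &src_shape,
        const dims_t &group_shape, const dims_t &scale_shape) {
    // All three tensors must share the same rank (cf. the ndims check
    // discussed in one of the review comments below).
    if (src_shape.size() != group_shape.size()
            || src_shape.size() != scale_shape.size())
        return false;
    for (std::size_t d = 0; d < src_shape.size(); ++d) {
        // Each group must divide its source dimension evenly...
        if (group_shape[d] <= 0 || src_shape[d] % group_shape[d] != 0)
            return false;
        // ...and scale/zp hold one value per group.
        if (scale_shape[d] != src_shape[d] / group_shape[d]) return false;
    }
    return true;
}

With a relation like this in place, rewriting either side determines the other: new scale/zp input shapes fix the group_shape by the division, and a new group_shape likewise fixes the scale/zp shapes.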

@wzt1997 wzt1997 self-assigned this Jan 8, 2025
@github-actions bot added the component:graph-api (Codeowner: @oneapi-src/onednn-graph) and component:tests (Codeowner: @oneapi-src/onednn-arch) labels on Jan 8, 2025
@wzt1997 wzt1997 force-pushed the zhitao/enhance-shape-rewrite branch from 4af44a1 to d65e846 on January 8, 2025 06:09
@wzt1997 wzt1997 changed the title from "[WIP]benchdnn: graph: enhance mb and shape rewrite" to "[WIP]benchdnn: graph: enhance input displace and shape rewrite for linked attribute and shapes" on Jan 8, 2025
@wzt1997 wzt1997 force-pushed the zhitao/enhance-shape-rewrite branch from 50714b2 to c82a899 on January 8, 2025 08:02
@wzt1997 wzt1997 force-pushed the zhitao/enhance-shape-rewrite branch from c82a899 to b6a63b5 on January 10, 2025 05:56
@wzt1997 wzt1997 changed the title from "[WIP]benchdnn: graph: enhance input displace and shape rewrite for linked attribute and shapes" to "benchdnn: graph: enhance input displace and shape rewrite for linked attribute and shapes" on Jan 10, 2025
@wzt1997 wzt1997 marked this pull request as ready for review January 10, 2025 06:50
@wzt1997 wzt1997 requested review from a team as code owners January 10, 2025 06:50
@wzt1997 (Contributor, Author) commented Jan 10, 2025

make test
enable benchdnn_nightly
disable benchdnn_all
enable benchdnn_graph


bool ret = true;
Contributor:

I wonder if it makes sense to set this to false in the first place, to avoid multiple re-initializations of this value across the logic.
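
A tiny self-contained sketch of the suggested pattern (the surrounding context is hypothetical, not the actual benchdnn logic):

#include <vector>

// Starting from false, only the success path assigns the flag, so the
// failure branches never have to re-initialize it.
bool any_rewrite_applied(const std::vector<bool> &step_results) {
    bool ret = false;
    for (bool ok : step_results) {
        if (!ok) continue; // failure paths leave ret untouched
        ret = true; // set once per successful step
    }
    return ret;
}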

@@ -32,7 +32,7 @@
# Re-written graphs
--reset --dt=f32,bf16,f16 --in-shapes=4:4x16x32x256+5:4x16x256x33+0:4x16x33x256+1:4x1x1x33+3:4x1x32x33 --case=complex_fusion/mha/MHA-GPT-inf-fp32-bs1.json
--reset --expected-n-partitions=0 --dt=f32,bf16,f16 --in-shapes=3:4x32x32x128+4:4x32x128x33+0:4x32x33x128+1:4x1x32x33 --case=complex_fusion/mha/MHA-LLaMa-inf-fp32-bs1.json
--reset --dt=f32,bf16,f16 --in-shapes=3:20x16x384x64+4:20x16x64x384+0:20x16x384x64+1:20x1x1x384 --case=complex_fusion/mha/MHA-bert_large-inf-fp32-bs1.json
--reset --dt=f32,bf16,f16 --mb=10,20 --case=complex_fusion/mha/MHA-bert_large-inf-fp32-bs1.json
--reset --dt=f32,bf16,f16 --in-shapes=3:10x16x384x64+4:10x1x64x384+0:10x1x384x64+1:10x1x1x384 --case=complex_fusion/mha/MHA-bert_large-inf-fp32-bs1.json
Contributor:

Should this be removed in favor of mb=10 right above?

if (attr.find("qtype") == attr.end()
|| attr["qtype"].str_value_ != "per_group")
continue;
if (attr.find("group_shape") == attr.end()) {
Contributor:

It would be nice to dump the group_size attribute in the graph dump under -v7 so that it's visible to the user.

zp_lt.shape_, dgraph.lt_2_mtag_[zp_lt.id_]);
}
} else if (input_shape_rewrite && !group_shape_rewrite) {
// if user only rewrite input shapes, upadte the group-shape
Contributor:

Suggested change
// if user only rewrite input shapes, upadte the group-shape
// if user only rewrites input shapes, update the group-shape

});
bool group_shape_rewrite = op_attrs_.count(aop.id_)
&& parse_attrs(op_attrs_.at(aop.id_)).count("group_shape");

Contributor:

I see some checks are duplicated; I guess reorganizing the code a little bit can help eliminate those duplicates (pseudo-code):

if (!input_shape_rewrite && !group_shape_rewrite) continue;

if (input_shape_rewrite) {
    checks_for_src_and_scale
}
if (group_shape_rewrite) {
    checks_for_src_and_group
}

And then the action bodies will just do what they need to do.
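
Fleshed out a little as a self-contained sketch (all types and helpers here are stand-ins echoing the pseudo-code above, not benchdnn identifiers):

#include <cstddef>
#include <vector>

struct op_t { std::size_t id_; }; // stand-in for the graph op type

// Stand-ins for the real predicates and checks named in the pseudo-code.
bool has_input_shape_rewrite(const op_t &) { return true; }
bool has_group_shape_rewrite(const op_t &) { return false; }
void checks_for_src_and_scale(const op_t &) {}
void checks_for_src_and_group(const op_t &) {}

void rewrite_linked_shapes(std::vector<op_t> &ops) {
    for (auto &aop : ops) {
        const bool input_shape_rewrite = has_input_shape_rewrite(aop);
        const bool group_shape_rewrite = has_group_shape_rewrite(aop);
        // One shared early-out replaces the duplicated per-branch checks.
        if (!input_shape_rewrite && !group_shape_rewrite) continue;
        if (input_shape_rewrite) checks_for_src_and_scale(aop);
        if (group_shape_rewrite) checks_for_src_and_group(aop);
        // The action bodies that follow then only perform the rewrite.
    }
}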

"Error: the ndims of scale tensor should align "
"with the ndims of zero-point tensor for op "
"with "
"id=\'%zu\'\n",
Contributor:

Clang-format (the version we are using) is bad at formatting strings. The best way is to update the string to a single line and then let clang-format break it.
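
Concretely, the workflow would look like this (a sketch: the diff only shows the literal, so the printf-style call site and op_id are assumed):

#include <cstdio>

// Step 1: collapse the message into a single over-long line, as below.
// Step 2: run clang-format and accept whatever line breaks it picks.
void report_ndims_mismatch(size_t op_id) {
    fprintf(stderr, "Error: the ndims of scale tensor should align with the ndims of zero-point tensor for op with id=\'%zu\'\n", op_id);
}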
