Introduce matmul-based IP for training #2377

densamoilov · 2025-01-11T07:48:14Z

This PR introduces matmul-based IP implementation for training.

When IP is created for forward_training plain layouts are always used to be consistent with the corresponding formats used for backward_data and backward_weights
To support bias gradients for backward_weights case the matmul primitive has been extended with an internal feature that enables mamtul to reduce matrix A. The reduction is fused.
Verbose was not extended to cover the new internal feature intentionally because having it in verbose doesn't provide any information that would be helpful, information about bias for IP is already enough.
This matmul-based implementation for training is meant to be a full replacement for the existing brgemm-based one however, the latter one will stay available through the primitive descriptor iterator for some time but no further development is planned for it and it will eventually be removed.

TODO: post performance data.

densamoilov · 2025-01-11T07:48:41Z

make test
enable device_cpu
disable device_gpu

densamoilov added 3 commits January 10, 2025 23:30

cpu: x64: use plain layout for matmul-based IP for forward_training

7a552c9

cpu: x64: enable matmul-based IP for bwd_d

96beb93

cpu: x64: enable matmul-based IP for bwd_wb

07f27a7

densamoilov requested review from a team as code owners January 11, 2025 07:48

github-actions bot added platform:cpu-x64 Intel64/AMD64 processors. Codeowner: @oneapi-src/onednn-cpu-x64 component:api Codeowner: @oneapi-src/onednn-arch labels Jan 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce matmul-based IP for training #2377

Introduce matmul-based IP for training #2377

densamoilov commented Jan 11, 2025

densamoilov commented Jan 11, 2025

Introduce matmul-based IP for training #2377

Are you sure you want to change the base?

Introduce matmul-based IP for training #2377

Conversation

densamoilov commented Jan 11, 2025

densamoilov commented Jan 11, 2025