Fix the use_fused_attention filtering #81

wangye805 · 2024-10-17T21:06:18Z

Use fp8_dpa to filter fused attention, rather than fp8

Fixes #72
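
For reviewers, a minimal sketch of the intended filtering logic. The actual diff is not reproduced here, so everything below is illustrative: `Recipe` and `filter_fused_attention` are hypothetical names, and the sketch assumes that `fp8_dpa` is a boolean on the FP8 recipe and that the fused backend only needs to be excluded when attention itself runs in FP8.

```python
from dataclasses import dataclass


@dataclass
class Recipe:
    """Hypothetical stand-in for an FP8 recipe; only the flag discussed here."""

    fp8_dpa: bool = False  # True when dot-product attention itself runs in FP8


def filter_fused_attention(use_fused_attention: bool, fp8: bool, recipe: Recipe) -> bool:
    """Return whether the fused attention backend stays eligible.

    Assumption: the fused backend is dropped only when FP8 is enabled AND
    attention is requested in FP8 (fp8_dpa), not whenever fp8 is enabled.
    """
    fp8_attention = fp8 and recipe.fp8_dpa
    return use_fused_attention and not fp8_attention


# FP8 is on for the linear layers, but attention stays in higher precision:
# the fused backend remains eligible.
print(filter_fused_attention(True, fp8=True, recipe=Recipe(fp8_dpa=False)))

# FP8 is on and attention is requested in FP8: the fused backend is filtered out.
print(filter_fused_attention(True, fp8=True, recipe=Recipe(fp8_dpa=True)))
```

Under these assumptions, a filter keyed on `fp8` alone would also disable fused attention in the first case, where attention never runs in FP8.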

Type of change

Documentation change (change only to the documentation, either a fix or a new content)
Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Infra/Build change
Code refactor

fix the fused attention filtering

Use fp8_dpa to filter fused attention, rather than fp8

Fix the use_fused_attention filtering

eec5c20

Use fp8_dpa to filter fused attention, rather than fp8

wangye805 requested review from ipanfilo and wenchenvincent October 17, 2024 21:06

wenchenvincent approved these changes Oct 17, 2024

wangye805 merged commit 2e46618 into dev Oct 17, 2024

wangye805 deleted the fix_fp8_fused_attention_filtering branch October 17, 2024 21:33