support grouped query attention (MQA & GQA) for flash_attn #22

Merged
iclementine merged 2 commits into FlagOpen:main from iclementine:mqa on May 27, 2024