PR: Refine ggml-qnn backend (QNN, Qualcomm Neural Network, aka Qualcomm AI Engine Direct) for latest ggml, whisper.cpp, llama.cpp #12049
Closed
zhouwg wants to merge 37 commits into ggml-org:master from kantv-ai:build_fix
+4,590 -1
Commits
Commits on Feb 16, 2025
Commits on Feb 17, 2025
Commits on Feb 20, 2025
Commits on Feb 21, 2025
ggml-qnn: merge QNN RPC feature from https://github.com/zhouwg/kantv/blob/ggml-qnn-quantize/core/ggml/llamacpp/ggml-qnn.cpp
ggml-qnn: a concise approach to offload mulmat to QNN backend (sync from branch kantvai-ggmlqnn-npurpc, https://github.com/kantv-ai/llama.cpp/wiki/offloading-mulmat-to-QNN-backend)