You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
So far, I have tested kilt_nq dataset in RAG category. I saw accuracy numbers are close to the results published in paper when input length = 8k, but 64k results have big differences when using vllm endpoint. However, using transformer pipeline gets similar results to the paper at both 8k and 64k length. Debugging WIP.
So far, I have tested kilt_nq dataset in RAG category. I saw accuracy numbers are close to the results published in paper when input length = 8k, but 64k results have big differences when using vllm endpoint. However, using transformer pipeline gets similar results to the paper at both 8k and 64k length. Debugging WIP.
HELMET supports OPEA LLM endpoint.
The text was updated successfully, but these errors were encountered: