The first error occurs when max-context-len >= 10000.
The second error occurs when max-prompt-len >= 6000.
Device: NUC 14 Pro
CPU: Ultra 5 125H
Memory: 96 GB total
NPU driver: 32.0.100.3104
Others:
Hi @dockerg, max-context-len >= 10000 is not supported yet. Maybe you could try 2K.
How about the "DDR" memory allocation issue?
Does this issue also occur with a smaller value of max-prompt-len (such as 1024)? Very long prompts are not supported yet either.
It occurred when max-prompt-len > 4000. What limits the prompt length: the hardware, the NPU driver, or missing support in IPEX-LLM?
The issue relates to the hardware and the NPU driver.
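For anyone hitting the same errors, a pre-flight check along these lines may help catch the settings before the model is loaded. This is only a sketch: the thresholds are the values reported in this thread for one device and driver version, not documented limits, and the function name check_npu_lengths is hypothetical (it is not part of IPEX-LLM).

```python
from typing import List

# Values observed in this thread only (NUC 14 Pro, NPU driver 32.0.100.3104).
# They are assumptions, not documented limits, and may differ on other setups.
OBSERVED_CONTEXT_FAILURE = 10000  # first error reported at max-context-len >= 10000
OBSERVED_PROMPT_FAILURE = 4000    # second error reported at max-prompt-len > 4000


def check_npu_lengths(max_context_len: int, max_prompt_len: int) -> List[str]:
    """Return a warning for each setting that failed in this thread.

    Hypothetical helper for illustration; an empty list means neither
    value reached a threshold reported here.
    """
    warnings = []
    if max_context_len >= OBSERVED_CONTEXT_FAILURE:
        warnings.append(
            f"max-context-len={max_context_len} is in the range reported "
            "as unsupported; a value around 2K was suggested instead."
        )
    if max_prompt_len > OBSERVED_PROMPT_FAILURE:
        warnings.append(
            f"max-prompt-len={max_prompt_len} is in the range where the "
            "second error was reported; 1024 was suggested as a test value."
        )
    return warnings


if __name__ == "__main__":
    for message in check_npu_lengths(max_context_len=10000, max_prompt_len=6000):
        print("WARNING:", message)
```

Passing the suggested values (2048 and 1024) returns an empty list. Since the limit comes from the hardware and the NPU driver, the thresholds would likely need adjusting after a driver update.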