-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Please release a cuda build for v0.3.5 #1925
Comments
Agree. I've manually built a CUDA version, but an official prebuilt release should be convenient for most users. |
how? is there reference for manual build ? |
The latest version is v0.3.7. You can follow the steps in the CI workflow. For Windows users, here is my two cents:
If you need to build it with CUDA<12.4, use MSVC 2019:
|
+1 for pre-built whl's |
@ZiyaCu @ParisNeo @la1ty , Check out this repo: textgen-webui release includes llama-cpp-python CUDA wheels. The only downside is that these wheels can't be imported using import llama_cpp. Instead, you should use import llama_cpp_cuda or import llama_cpp_cuda_tensorcore, depending on the wheel you installed. You can find the wheels in the requirements file: Or check the full release here: |
@Amrabdelhamed611 thanks alot. |
PS E:\llama-cpp-python> conda activate CUDA125-py312
-- Configuring incomplete, errors occurred! *** CMake configuration failed ERROR Backend subprocess exited when trying to invoke build_wheel |
@dw5189 There are two possible causes I guess:
|
Hi there. I see there is a metal build for v0.3.5. Would you please releasze a cuda version?
Best regards
The text was updated successfully, but these errors were encountered: