State of GPU support #133
Comments
Hi @ViktorooReps, thanks for reaching out. We are planning some performance improvements in the coming days/weeks. If you are willing to contribute, it will certainly be appreciated. Performance will be addressed in three ways
The initial thoughts about 3 are
We have run a RAG pipeline multiple times with the included examples here with Milvus ... even scaling on NVIDIA L4 GPU machines, and it worked very well. It was a bit challenging to build the Docker image for that, but it seems to perform better. We did not collect proper performance metrics, but from our observations it is at least 2x faster. Looking forward to the v2 implementation; thank you guys for the great job. FYI, this is the start of the Dockerfile for getting things running in the Docker image. Ensure that the drivers are properly exposed to Docker with GPU capabilities, so that the GPU is really used:
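As a quick sanity check for that last point (this is not the Dockerfile mentioned above), something like the following could be run inside the container to see whether a GPU is visible at all. It is only a minimal sketch: it assumes the pipeline runs on PyTorch, and `gpu_visibility_report` is a hypothetical helper name, not part of any library discussed in this thread.

```python
import importlib.util
import shutil


def gpu_visibility_report():
    """Collect basic hints about whether a GPU is reachable from this process.

    Assumption: the workload uses PyTorch. If torch is not installed,
    only the nvidia-smi lookup is meaningful.
    """
    report = {
        # nvidia-smi is normally present when the NVIDIA driver is exposed.
        "nvidia_smi_on_path": shutil.which("nvidia-smi") is not None,
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "cuda_available": False,
        "device_names": [],
    }
    if report["torch_installed"]:
        try:
            import torch
        except (ImportError, OSError):
            # A broken install is treated the same as "no usable torch".
            return report
        report["cuda_available"] = bool(torch.cuda.is_available())
        if report["cuda_available"]:
            report["device_names"] = [
                torch.cuda.get_device_name(i)
                for i in range(torch.cuda.device_count())
            ]
    return report


if __name__ == "__main__":
    for key, value in gpu_visibility_report().items():
        print(f"{key}: {value}")
```

If `cuda_available` comes back `False` inside the container but `True` on the host, the container was probably started without GPU access (for example without `docker run --gpus all`, which needs the NVIDIA Container Toolkit on the host).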
@leviataniac thanks for sharing this!
Hello Deep Search Team!
Thank you for this contribution to open source!
We are considering using your library to parse PDF files for LLM training, so we will potentially need to scale things up. Do you have any updates on GPU/multi-GPU support? Maybe some directions on where to start if we were to work on GPU support ourselves?