Hi everyone,

I'm trying to deploy a Haystack RAG pipeline to EBS or EC2 on one of the lower-spec machines (2 CPUs / 2 GB RAM and 2 CPUs / 4 GB RAM). While building the container, I used `pip install 'farm-haystack[inference]'`, which installs torch, sentence-transformers, sentencepiece, and huggingface-hub. Since I only intend to run inference on the CPU, why does it have to install torch and the other heavier libraries? Is there a lighter install? Otherwise, is there a cheaper deployment pipeline for a RAG solution?

Replies: 2 comments
- Hey @gooseillo, don't use the `inference` extra, only `preprocessing`. Have a look at how we build our lightweight Docker image.
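  For example, the lighter install would swap the `inference` extra for `preprocessing` (a sketch; whether `preprocessing` alone is sufficient depends on which pipeline components you actually use):

  ```shell
  # Sketch: install only the preprocessing extra instead of the full
  # inference stack; this skips torch and the other heavy model libraries.
  # Generation would then be delegated to a remote/hosted model endpoint.
  pip install 'farm-haystack[preprocessing]'
  ```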
- Perhaps it would be useful to see how we make Docker images at https://github.com/deepset-ai/haystack/tree/main/docker and to pay attention to the `cpu-remote-inference` target in our bake file.
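  A hedged sketch of building that image locally (this assumes the bake file in that directory uses the default `docker-bake.hcl` name, so `buildx bake` picks it up automatically, and that `cpu-remote-inference` is a target defined in it, as mentioned above):

  ```shell
  # Sketch: build the CPU-only image that delegates model inference to a
  # remote endpoint instead of bundling torch. Assumes the bake file sits
  # in haystack/docker and defines a cpu-remote-inference target.
  git clone https://github.com/deepset-ai/haystack.git
  cd haystack/docker
  docker buildx bake cpu-remote-inference
  ```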