How to Run Llama-7B 4-bit Model on CPU #12

janpashashaik123 · 2023-05-22T14:38:05Z

Hi,

I am trying to run (inference) llama-7b 4-bit Model in my local Ubuntu system in CPU without GPU. But facing an error with quant_cuda (NameError: name 'quant_cuda' is not defined). Can the llama-7b 4-bit model be run on the CPU?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to Run Llama-7B 4-bit Model on CPU #12

How to Run Llama-7B 4-bit Model on CPU #12

janpashashaik123 commented May 22, 2023

How to Run Llama-7B 4-bit Model on CPU #12

How to Run Llama-7B 4-bit Model on CPU #12

Comments

janpashashaik123 commented May 22, 2023