The vLLM Spyre plugin (vllm-spyre
) is a dedicated backend extension that enables seamless integration of IBM Spyre Accelerator with vLLM. It follows the architecture describes in vLLM's Plugin System, making it easy to integrate IBM's advanced AI acceleration into existing vLLM workflows.
First, download vllm-spyre
git clone https://github.com/IBM/vllm-spyre
cd vllm-spyre
Build image from source
docker build . -f Dockerfile.spyre -t vllm-spyre
docker run -it --rm vllm-spyre bash
# Install vllm
pip install vllm==0.7.3
# Install vllm-spyre
cd ..
git clone https://github.com/IBM/vllm-spyre.git
cd vllm-spyre
pip install -v -e .