ChatQnA - add files for deploy ChatQnA application on AMD ROCm with vLLM service #1181

chyundunovDatamonsters · 2024-11-21T17:27:24Z

Description

Add files for deploy ChatQnA application on AMD ROCm with vLLM service:

ChatQnA/docker_compose/amd/gpu/rocm-vllm/compose_vllm.yaml - Docker Compose file
ChatQnA/docker_compose/amd/gpu/rocm-vllm/set_env_vllm.sh - set envs script
ChatQnA/docker_compose/amd/gpu/rocm-vllm/Dockerfile-vllm - Dockerfile for build vLLM Docker image for service on ROCm
ChatQnA/docker_compose/amd/gpu/rocm-vllm/README.md - README file

Issues

It was required to be able to deploy the ChatQnA application on AMD hardware using the vLLM Service

Type of change

[* ] New feature (non-breaking change which adds new functionality)

Dependencies

Tests

Testing is performed by a script GenAIExamples/ChatQnA/tests/test_compose_on_rocm_vllm.sh

…LLM service: 1. ChatQnA/docker_compose/amd/gpu/rocm-vllm/compose_vllm.yaml 2. ChatQnA/docker_compose/amd/gpu/rocm-vllm/set_env_vllm.sh 3. ChatQnA/docker_compose/amd/gpu/rocm-vllm/Dockerfile-vllm 4. ChatQnA/docker_compose/amd/gpu/rocm-vllm/REAMDE.md. 5. ChatQnA/tests/test_compose_on_rocm_vllm.sh Fix build.yaml and playwright.config.ts Signed-off-by: Chingis Yundunov <[email protected]>

Signed-off-by: Chingis Yundunov <[email protected]>

for more information, see https://pre-commit.ci

chensuyue · 2024-11-27T05:37:26Z

ChatQnA/docker_compose/amd/gpu/rocm-vllm/Dockerfile-vllm

@@ -0,0 +1,18 @@
+FROM rocm/vllm:rocm6.2_mi300_ubuntu20.04_py3.9_vllm_0.6.4


let's put the dockerfile under GenAIExamples/ChatQnA/, it can be named as Dockerfile.vllm_rocm.

Good. I will place the file according to the proposed path and adapt the scripts

@chyundunovDatamonsters will you continue working on this PR?

chensuyue · 2024-11-27T05:40:25Z

Let use one folder GenAIExamples/ChatQnA/docker_compose/amd/gpu/rocm/ to contain all the implementation on rocm, like this one https://github.com/opea-project/GenAIExamples/tree/60871f2622001a42de050f5606de22072d905fa6/ChatQnA/docker_compose/intel/hpu/gaudi.

chensuyue · 2024-11-27T05:41:37Z

ChatQnA/tests/test_compose_on_rocm_vllm.sh

@@ -0,0 +1,265 @@
+#!/bin/bash


ChatQnA/tests/test_compose_on_rocm_vllm.sh --> ChatQnA/tests/test_compose_vllm_on_rocm.sh
The naming format will impact CI trigger.

For this file name, rocm_vllm will be recognized as a hardware label, but we just need rocm. So it need to be update like what I commented.

Accepted. Let's put everything in one place for the Arm architecture.

…oy app on AMD GPU Signed-off-by: Chingis Yundunov <[email protected]>

Signed-off-by: Chingis Yundunov <[email protected]>

Chingis Yundunov added 2 commits November 22, 2024 00:11

ChatQnA - fix README.md for deploy on AMD GPU with vLLM service

6b54854

Signed-off-by: Chingis Yundunov <[email protected]>

chyundunovDatamonsters requested a review from lvliang-intel as a code owner November 21, 2024 17:27

pre-commit-ci bot and others added 2 commits November 21, 2024 17:29

[pre-commit.ci] auto fixes from pre-commit.com hooks

2634171

for more information, see https://pre-commit.ci

Merge branch 'main' into feature/GenAIExample_ChatQnA_AMD_vLLM

60871f2

joshuayao mentioned this pull request Nov 22, 2024

[Feature] Examples on AMD ROCm for OPEA v1.2 #1178

Closed

5 tasks

joshuayao linked an issue Nov 22, 2024 that may be closed by this pull request

[Feature] Examples on AMD ROCm for OPEA v1.2 #1178

Closed

5 tasks

chensuyue reviewed Nov 27, 2024

View reviewed changes

Chingis Yundunov added 7 commits December 6, 2024 18:25

ProductivitySuite - fix Docker compose file, set envs script for depl…

58f3fa3

…oy app on AMD GPU Signed-off-by: Chingis Yundunov <[email protected]>

ProductivitySuite - fix Docker compose file, set envs script for depl…

c185a8d

…oy app on AMD GPU Signed-off-by: Chingis Yundunov <[email protected]>

ChatQnA - add files deploy for benchmark

5b4a3c6

Signed-off-by: Chingis Yundunov <[email protected]>

ChatQnA - add files deploy for benchmark

5c5eacd

Signed-off-by: Chingis Yundunov <[email protected]>

ChatQnA - add files deploy for benchmark

ac1765b

Signed-off-by: Chingis Yundunov <[email protected]>

ChatQnA - add files deploy for benchmark

021f2c2

Signed-off-by: Chingis Yundunov <[email protected]>

ChatQnA - add files deploy for benchmark

751d7e1

Signed-off-by: Chingis Yundunov <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ChatQnA - add files for deploy ChatQnA application on AMD ROCm with vLLM service #1181

ChatQnA - add files for deploy ChatQnA application on AMD ROCm with vLLM service #1181

chyundunovDatamonsters commented Nov 21, 2024

chensuyue Nov 27, 2024

decartoff Nov 29, 2024

chensuyue Dec 17, 2024

chensuyue commented Nov 27, 2024 •

edited

Loading

chensuyue Nov 27, 2024

chensuyue Nov 27, 2024

decartoff Nov 29, 2024

		@@ -0,0 +1,18 @@
		FROM rocm/vllm:rocm6.2_mi300_ubuntu20.04_py3.9_vllm_0.6.4

ChatQnA - add files for deploy ChatQnA application on AMD ROCm with vLLM service #1181

Are you sure you want to change the base?

ChatQnA - add files for deploy ChatQnA application on AMD ROCm with vLLM service #1181

Conversation

chyundunovDatamonsters commented Nov 21, 2024

Description

Issues

Type of change

Dependencies

Tests

chensuyue Nov 27, 2024

Choose a reason for hiding this comment

decartoff Nov 29, 2024

Choose a reason for hiding this comment

chensuyue Dec 17, 2024

Choose a reason for hiding this comment

chensuyue commented Nov 27, 2024 • edited Loading

chensuyue Nov 27, 2024

Choose a reason for hiding this comment

chensuyue Nov 27, 2024

Choose a reason for hiding this comment

decartoff Nov 29, 2024

Choose a reason for hiding this comment

chensuyue commented Nov 27, 2024 •

edited

Loading