Commit 7731ec3
Merge remote-tracking branch 'origin/main' into gcp-test-runners
andy-neuma committed Jun 4, 2024
2 parents: 95d0a28 + 0257d9d
Showing 2 changed files with 2 additions and 4 deletions.

.github/workflows/publish.yml (1 addition, 3 deletions)
@@ -4,9 +4,7 @@
 name: Create Release
 
 on:
-  # push:
-  #   tags:
-  #     - v*
+  workflow_dispatch:
 
 # Needed to create release and upload assets
 permissions:
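With the commented-out push-tag trigger deleted, the Create Release workflow now runs only when dispatched manually. As a rough sketch of how such a workflow is started (assuming an authenticated GitHub CLI and this workflow file name; the command is not part of this commit):

```bash
# Manually dispatch the "Create Release" workflow on the main branch.
# Assumes `gh` is authenticated and run from a clone of the repository.
gh workflow run publish.yml --ref main
```

Unlike a push trigger, a workflow_dispatch run can only be started by someone with write access to the repository.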

README.md (1 addition, 1 deletion)
@@ -8,7 +8,7 @@
 [vLLM](https://github.com/vllm-project/vllm) is a fast and easy-to-use library for LLM inference that Neural Magic regularly contributes upstream improvements to. This fork, `nm-vllm` is our opinionated focus on incorporating the latest LLM optimizations like quantization and sparsity for enhanced performance.
 
 ## Installation
-The [nm-vllm PyPi package](https://pypi.org/project/nm-vllm/) includes pre-compiled binaries for CUDA (version 12.1) kernels, streamlining the setup process. For other PyTorch or CUDA versions, please compile the package from source.
+The [nm-vllm PyPi package](https://pypi.neuralmagic.com/simple/nm-vllm/index.html) includes pre-compiled binaries for CUDA (version 12.1) kernels, streamlining the setup process. For other PyTorch or CUDA versions, please compile the package from source.
 
 Install it using pip:
 ```bash
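The updated link points at Neural Magic's own package index rather than pypi.org. A minimal install sketch, assuming the URL is a PEP 503 "simple" index as its path suggests (inferred, not confirmed by this diff; the actual command sits in the truncated block above):

```bash
# Hypothetical: install nm-vllm, letting pip also consult Neural Magic's
# simple index (index root inferred from the README link; verify before use).
pip install nm-vllm --extra-index-url https://pypi.neuralmagic.com/simple/
```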
