Commits on May 6, 2024
-
Commit 7b11874
-
Commit 28590fc
-
[ROCm][Hardware][AMD][Doc] Documentation update for ROCm (vllm-project#4376)
Co-authored-by: WoosukKwon <[email protected]>
Commit 7873343
-
Commit 5f32d89
-
Commit c20ff92
-
[CI] Disable non-lazy string operation on logging (vllm-project#4326)
Co-authored-by: Danny Guinther <[email protected]>
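Note: the title does not say which logging calls were affected. The following is a minimal sketch, assuming it refers to the common pattern of building log messages eagerly (for example with f-strings) rather than passing format arguments to the logger. The `Expensive` class and the logger name are hypothetical and exist only to make the difference observable.

```python
import logging

logger = logging.getLogger("lazy_logging_demo")  # hypothetical logger name


class Expensive:
    """Counts how many times it is rendered to a string."""

    def __init__(self):
        self.renders = 0

    def __str__(self):
        self.renders += 1
        return "expensive-value"


def log_eager(value):
    # Non-lazy: the f-string is built before logger.debug() is called,
    # so the formatting cost is paid even when DEBUG is disabled.
    logger.debug(f"value={value}")


def log_lazy(value):
    # Lazy: the logging module formats the message only if the record
    # is actually going to be emitted.
    logger.debug("value=%s", value)


logger.setLevel(logging.INFO)  # DEBUG records are dropped
eager, lazy = Expensive(), Expensive()
log_eager(eager)
log_lazy(lazy)
print(eager.renders, lazy.renders)
```

With the level set to INFO, only the eager variant renders its argument, which is why linters commonly flag f-strings inside logging calls.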
Commit ec4050a
-
Commit ee654c9
-
Commit 4f5d020
-
[Misc] add RFC issue template (vllm-project#4401)
Co-authored-by: Simon Mo <[email protected]>
Commit dc47676
-
Commit 192c704
-
[Kernel] Optimize FP8 support for MoE kernel / Mixtral via static scales (vllm-project#4343)
Co-authored-by: Woosuk Kwon <[email protected]>
Commit 1e88172
-
Commit b9e05fa
-
Commit 5395fa3
-
Commit cc7a791
-
[Kernel] Full Tensor Parallelism for LoRA Layers (vllm-project#3524)
Co-authored-by: Antoni Baum <[email protected]>
Commit 77c1eb1
-
Commit 287d987
-
Commit b3759af
-
[Bugfix] Abort requests when the connection to /v1/completions is interrupted (vllm-project#4363)
Commit 6a44e8e
-
[BugFix] Fix min_tokens when eos_token_id is None (vllm-project#4389)
Co-authored-by: DefTruth <[email protected]>
Commit 821a91a
-
[Core] Support offline use of local cache for models (vllm-project#4374)
Signed-off-by: Prashant Gupta <[email protected]> Co-authored-by: Travis Johnson <[email protected]>
Commit 5a4c41b
-
Commit 593db14
-
Commit 1f87fe1
-
Commit b24aae6
-
Add more Prometheus metrics (vllm-project#2764)
Co-authored-by: Robert Shaw <[email protected]> Co-authored-by: Robert Shaw <[email protected]>
Commit 6a8a97b
-
Commit 8ab0de8
-
Commit 7f5a450
-
[Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (vllm-project#3922)
Co-authored-by: alexm <[email protected]> Co-authored-by: mgoin <[email protected]>
Commit 1e75df8
-
Commit 19187df
-
Commit 43add77
-
Commit 768facf
-
Commit 10b984a
-
Commit 42929fe
-
[BugFix] fix num_lookahead_slots missing in async executor (vllm-project#4165)
Co-authored-by: Lei Wen <[email protected]>
Commit da4215e
-
[Doc] add visualization for multi-stage dockerfile (vllm-project#4456)
Signed-off-by: Prashant Gupta <[email protected]> Co-authored-by: Roger Wang <[email protected]>
Commit 40b286f
-
[Kernel] Support Fp8 Checkpoints (Dynamic + Static) (vllm-project#4332)
Co-authored-by: Philipp Moritz <[email protected]> Co-authored-by: Woosuk Kwon <[email protected]> Co-authored-by: mgoin <[email protected]> Co-authored-by: Tyler Michael Smith <[email protected]> Co-authored-by: Cody Yu <[email protected]>
Commit faed3eb
-
[Frontend] Support complex message content for chat completions endpoint (vllm-project#3467)
Co-authored-by: Lily Liu <[email protected]> Co-authored-by: Cyrus Leung <[email protected]>
Commit 8b9d685
-
Commit 9ad9b65
-
Commit 195439e
-
Commit 7cff2a5
-
Unable to find Punica extension issue during source code installation (vllm-project#4494)
Co-authored-by: Simon Mo <[email protected]>
Commit 666ccdb
-
Commit e1fc3da
-
Commit 2ef0a89
-
Commit bd7f454
-
Commit 66d2c00
-
Commit dc2970e
-
Commit b496ac2
-
[Bugfix] Fix the fp8 kv_cache check error that occurs when failing to obtain the CUDA version. (vllm-project#4173)
Signed-off-by: AnyISalIn <[email protected]>
Commit c1e7a79
-
Commit d05b702
-
Commit 75c6ebf
-
Commit 21bc3bf
-
[CI/Build][Bugfix] VLLM_USE_PRECOMPILED should skip compilation (vllm-project#4534)
Signed-off-by: Travis Johnson <[email protected]>
Commit 752043f
-
[Speculative decoding] Add ngram prompt lookup decoding (vllm-project#4237)
Co-authored-by: Lei Wen <[email protected]>
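Note: the commit message gives no detail on the lookup itself. As a rough sketch of the general prompt-lookup idea (not the code from this PR), a proposer can search the existing token sequence for an earlier occurrence of the last few tokens and propose whatever followed it. The function name and parameters below are hypothetical.

```python
def propose_by_ngram_lookup(token_ids, ngram_size, num_speculative):
    """Propose draft tokens by matching the trailing n-gram earlier in the sequence.

    Returns up to `num_speculative` tokens that followed the most recent
    earlier occurrence of the last `ngram_size` tokens, or [] if none exists.
    """
    if len(token_ids) <= ngram_size:
        return []
    suffix = token_ids[-ngram_size:]
    # Scan earlier start positions from most recent to oldest,
    # skipping the position of the suffix itself.
    for start in range(len(token_ids) - ngram_size - 1, -1, -1):
        if token_ids[start:start + ngram_size] == suffix:
            follow = start + ngram_size
            return token_ids[follow:follow + num_speculative]
    return []


draft = propose_by_ngram_lookup([1, 2, 3, 4, 5, 1, 2, 3], ngram_size=3, num_speculative=2)
print(draft)
```

The proposed tokens are only a guess; in speculative decoding they would still have to be verified by the target model before being accepted.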
Commit 862330a
-
[Core] Enable prefix caching with block manager v2 enabled (vllm-project#4142)
Co-authored-by: Lei Wen <[email protected]> Co-authored-by: Sage Moore <[email protected]>
Commit 3d32972
-
Commit 56d2002
-
[Kernel] Update fused_moe tuning script for FP8 (vllm-project#4457)
This PR updates the tuning script for the fused_moe kernel to support FP8 and also adds configurations for TP4. Note that for the configuration I removed num_warps and num_stages for small batch sizes since that improved performance and brought the benchmarks on par with the numbers before in that regime to make sure this is a strict improvement over the status quo.

All the numbers below are for mistralai/Mixtral-8x7B-Instruct-v0.1, 1000 input and 50 output tokens.

Before this PR (with static activation scaling):
  qps = 1: 9.8 ms ITL, 0.49s e2e latency
  qps = 2: 9.7 ms ITL, 0.49s e2e latency
  qps = 4: 10.1 ms ITL, 0.52s e2e latency
  qps = 6: 11.9 ms ITL, 0.59s e2e latency
  qps = 8: 14.0 ms ITL, 0.70s e2e latency
  qps = 10: 15.7 ms ITL, 0.79s e2e latency

After this PR (with static activation scaling):
  qps = 1: 9.8 ms ITL, 0.49s e2e latency
  qps = 2: 9.7 ms ITL, 0.49s e2e latency
  qps = 4: 10.2 ms ITL, 0.53s e2e latency
  qps = 6: 11.9 ms ITL, 0.59s e2e latency
  qps = 8: 11.9 ms ITL, 0.59s e2e latency
  qps = 10: 12.1 ms ITL, 0.61s e2e latency
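The size of the change at each load level can be read off the figures above by computing the relative difference. The snippet below is only an arithmetic sketch over the numbers quoted in this commit message; the dictionary layout and the `pct_change` helper are illustrative and not part of the PR.

```python
# (ITL in ms, end-to-end latency in s) per qps, copied from the commit message.
before = {1: (9.8, 0.49), 2: (9.7, 0.49), 4: (10.1, 0.52),
          6: (11.9, 0.59), 8: (14.0, 0.70), 10: (15.7, 0.79)}
after = {1: (9.8, 0.49), 2: (9.7, 0.49), 4: (10.2, 0.53),
         6: (11.9, 0.59), 8: (11.9, 0.59), 10: (12.1, 0.61)}


def pct_change(old, new):
    """Relative change in percent; negative means lower (better) latency."""
    return 100.0 * (new - old) / old


for qps in sorted(before):
    itl_delta = pct_change(before[qps][0], after[qps][0])
    e2e_delta = pct_change(before[qps][1], after[qps][1])
    print(f"qps={qps:>2}: ITL {itl_delta:+.1f}%, e2e {e2e_delta:+.1f}%")
```

On these figures the gains are concentrated at qps = 8 and qps = 10, while the lower load levels are essentially unchanged.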
Commit 7c04a00
-
Commit 0533a6b
-
Commit 224ecd7
-
Commit 5b174c4
-
[Misc] Remove Mixtral device="cuda" declarations (vllm-project#4543)
Remove the device="cuda" declarations in mixtral as promised in vllm-project#4343
Commit 4be23dd
-
Commit de3262f
-
[MISC] Rework logger to enable pythonic custom logging configuration to be provided (vllm-project#4273)
Commit b85188d
-
[Bug fix][Core] assert num_new_tokens == 1 fails when SamplingParams.n is not 1 and max_tokens is large & Add tests for preemption (vllm-project#4451)
Commit b259286
-
Commit 91f8b48
-
[mypy][6/N] Fix all the core subdirectory typing (vllm-project#4450)
Co-authored-by: Cade Daniel <[email protected]>
Commit 2017aaf
-
[Core][Distributed] enable multiple tp group (vllm-project#4512)
Co-authored-by: Zhuohan Li <[email protected]>
Commit 27f0c2b
-
Commit 2078207
-
Commit ed6d376
-
Commit 87d793d
-
Commit 4dc269d
-
Commit f7d8e46
-
[kernel] fix sliding window in prefix prefill Triton kernel (vllm-project#4405)
Co-authored-by: SangBin Cho <[email protected]>
Commit 673e4eb
-
[CI/Build] AMD CI pipeline with extended set of tests. (vllm-project#4267)
Co-authored-by: simon-mo <[email protected]>
Commit 2ff2756
-
Commit 3d453d0
-
Commit 2a0fb55
-
Commit 82bbb3d
-
Commit 44f6086
-
Commit f62ba17
-
Commit fc4f08f
-
[Bugfix] Allow "None" or "" to be passed to CLI for string args that default to None (vllm-project#4586)
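Note: one common way to get this behavior with argparse is a custom `type` callable that maps the empty string and the literal "None" to Python `None`. The sketch below illustrates that approach under stated assumptions; the helper name and the `--example-arg` flag are hypothetical and not taken from the PR.

```python
import argparse


def nullable_str(value):
    """Hypothetical argparse type: map "" and "None" to Python None."""
    if not value or value == "None":
        return None
    return value


parser = argparse.ArgumentParser()
# Hypothetical flag: a string option whose default is None.
parser.add_argument("--example-arg", type=nullable_str, default=None)

for argv in ([], ["--example-arg", "None"], ["--example-arg", ""], ["--example-arg", "foo"]):
    print(argv, "->", repr(parser.parse_args(argv).example_arg))
```

Without such a converter, passing "None" on the command line would yield the four-character string rather than the `None` default.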
Commit f10844f
-
Commit e132240
-
[Kernel] Use flashinfer for decoding (vllm-project#4353)
Co-authored-by: LiuXiaoxuanPKU <[email protected]>
Commit 4b0f703
-
Commit 6dd96ce
-
Commit 19ae179
-
Commit 12c155b
-
Commit 5d65e2f
-
[Kernel] Support MoE Fp8 Checkpoints for Mixtral (Static Weights with Dynamic/Static Activations) (vllm-project#4527)
Follow on to vllm-project#4332 to enable FP8 checkpoint loading for Mixtral and supersedes vllm-project#4436.

This PR enables the following checkpoint loading features for Mixtral:
  - Supports loading fp8 checkpoints for Mixtral, such as this "nm-testing/Mixtral-8x7B-Instruct-v0.1-FP8" test model
  - Supports static or dynamic activation quantization with static weight quantization (all per tensor)
  - Supports different scales for each expert weight
  - Supports Fp8 in QKV layer

Notes:
  - The Expert Gate/Router always runs at half / full precision for now.
  - If there are different weight scales between QKV layer (for separate QKV weights), they are re-quantized using layer.weight_scale.max() so we can have a single gemm for performance.
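The re-quantization mentioned in the last note can be sketched as follows. This is a simplified pure-Python illustration, not the PR's code: it assumes per-tensor scales and the E4M3 FP8 format (largest finite value 448), it omits rounding to the FP8 grid, and the function name is hypothetical.

```python
FP8_E4M3_MAX = 448.0  # largest finite value, assuming the E4M3 format


def requantize_to_shared_scale(shards, scales):
    """Re-express separately quantized shards (e.g. Q, K, V) under one scale.

    shards: list of lists of quantized values, one list per shard.
    scales: per-shard scale, so that real_value = quantized_value * scale.
    Returns the merged quantized values and the shared (maximum) scale.
    """
    max_scale = max(scales)
    merged = []
    for q_values, scale in zip(shards, scales):
        for q in q_values:
            real = q * scale            # dequantize with the shard's own scale
            requant = real / max_scale  # requantize with the shared scale
            # Clamp to the representable range (rounding to FP8 is omitted here).
            merged.append(max(-FP8_E4M3_MAX, min(FP8_E4M3_MAX, requant)))
    return merged, max_scale


merged, shared = requantize_to_shared_scale(
    [[100.0, -200.0], [50.0], [448.0]], [0.01, 0.02, 0.04]
)
print(merged, shared)
```

Choosing the maximum scale means every re-quantized magnitude is no larger than the original one, so values cannot overflow the FP8 range; the cost is some lost resolution for the shards that originally had smaller scales.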
Commit 55dd119
-
Commit c152bd7
-
Commit f8fb8c1
-
Commit 2d96b61
-
Commit 9f817f0
-
Commit f57a219
-
Commit 6b2c4c1
-
Commit 18a6e93
-
Commit bcf686d
-
Commit 8423620
Commits on May 7, 2024
-
Commit 50c1029
Commits on May 8, 2024
-
Robert Shaw committed May 8, 2024
Commit a55fb2b
-
Commit b091999
-
Commit 4c04122
Commits on May 9, 2024
-
Robert Shaw committed May 9, 2024
Commit 0300194
Commits on May 11, 2024
-
Commit 5dc0afe
-
Commit 94878e5
-
Commit 81f5e29
-
Commit e04b743
-
Commit 774df9d
-
Commit 02b7775
-
Commit fe43f6b
Commits on May 12, 2024
-
Commit 9ba99bd
-
Commit 8e49ada
-
Commit e61507e
-
Commit 2b2f301
-
Commit 304a5f9
Commits on May 13, 2024
-
Commit 2f6849f
-
Commit e257749
-
Commit 6a22a11
-
Commit 6207d84
-
Commit c8450a7
-
Commit 2df6bda
-
Commit cb216b6
-
Commit a767eb8
-
Commit e7dd38e
-
Commit 30d202d
-
Update test_logprobs.py (#236)
Commit 5096714
-
Commit 635fc10
-
Commit 4477333
-
Commit 9a9c899
-
Commit 1c359ae