Pull requests: HabanaAI/vllm-fork

Pull requests list

Bump jinja2 from 3.1.4 to 3.1.5 (label: dependencies)
#679 opened Jan 12, 2025 by dependabot bot
Jan 10 rebase
#677 opened Jan 10, 2025 by kzawora-intel
add renormalize param for FusedMOE
#671 opened Jan 9, 2025 by tangleintel
Added lora manager tests
#670 opened Jan 8, 2025 by rsshaik1 Draft
Draft: Delayed prompts
#659 opened Dec 20, 2024 by kamil-kaczor Draft
Chunked Prefill
#656 opened Dec 20, 2024 by hlahkar Draft
Fix: selecting correct backend for MultiHeadAttention (label: habana)
#645 opened Dec 18, 2024 by adobrzyniewicz-habana
Fix model OOM issue in llama-405 and mixtral - 2nd attempt (label: habana)
#644 opened Dec 18, 2024 by afierka-intel
Selective merged prefill
#643 opened Dec 18, 2024 by xuechendi
Multimodality fix for llava (label: habana)
#641 opened Dec 17, 2024 by adobrzyniewicz-habana
Add inc fp8 quantization documentation
#635 opened Dec 16, 2024 by nirda7
[WIP] Add HPU support to vLLM v1 - cont.
#609 opened Dec 10, 2024 by kzawora-intel
21 of 23 tasks
Add in Dockerfile.hpu.ubi
#602 opened Dec 9, 2024 by Xaenalt
Add real BS & seq_len to profiling
#601 opened Dec 9, 2024 by kamil-kaczor
Multi models support for upstream
#590 opened Dec 4, 2024 by xuechendi
Remove assert for alibi in case of FusedSDPA.
#587 opened Dec 4, 2024 by itaraban
Update documentation
#555 opened Nov 26, 2024 by michalkuligowski Draft
Bump aiohttp from 3.10.10 to 3.10.11 (label: dependencies)
#536 opened Nov 21, 2024 by dependabot bot
Clean-up LoRA flow
#518 opened Nov 18, 2024 by SanjuCSudhakaran Draft
1.19 documentation update
#507 opened Nov 15, 2024 by kzawora-intel Draft