Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[V1][Bugfix] Fix assertion when mm hashing is turned off ready ONLY add when PR is ready to merge/full CI is needed
#12439 opened Jan 26, 2025 by ywang96 Loading…
Revert "[Misc] Add FA2 support to ViT MHA layer (#12355)" ready ONLY add when PR is ready to merge/full CI is needed
#12433 opened Jan 26, 2025 by WoosukKwon Loading…
[Platform] add pre_register_and_update function
#12432 opened Jan 26, 2025 by wangxiyuan Loading…
add support for AMD MI25/50/60
#12431 opened Jan 26, 2025 by Said-Akbar Loading…
[Build/CI] Fix libcuda.so linkage ci/build ready ONLY add when PR is ready to merge/full CI is needed
#12424 opened Jan 25, 2025 by tlrmchlsmth Loading…
[ROCm][AMD][Model] llama 3.2 support upstreaming
#12421 opened Jan 24, 2025 by maleksan85 Loading…
Fix the pydantic logging validator frontend
#12420 opened Jan 24, 2025 by maxdebayser Loading…
[Bugfix] Disable w16a16 2of4 sparse CompressedTensors24 ready ONLY add when PR is ready to merge/full CI is needed
#12417 opened Jan 24, 2025 by tlrmchlsmth Loading…
[V1][Metrics] Add initial Prometheus logger ready ONLY add when PR is ready to merge/full CI is needed
#12416 opened Jan 24, 2025 by markmc Draft
[V1] Revert uncache_blocks and support recaching full blocks ready ONLY add when PR is ready to merge/full CI is needed
#12415 opened Jan 24, 2025 by comaniac Loading…
[Usage] Add pipeline parallelism for usage stats
#12414 opened Jan 24, 2025 by simon-mo Loading…
[Frontend] Support override generation config in args ready ONLY add when PR is ready to merge/full CI is needed
#12409 opened Jan 24, 2025 by liuyanyi Loading…
[ci/build] detect and auto use cxx abi ci/build
#12403 opened Jan 24, 2025 by youkaichao Loading…
[MISC] add arg pad_for_invariant_seq_len
#12397 opened Jan 24, 2025 by MengqingCao Loading…
[Bugfix] Fix output_tokens is 0 if using tgi backend
#12394 opened Jan 24, 2025 by sywangyi Loading…
[Hardware][Intel GPU] add XPU bf16 support documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#12392 opened Jan 24, 2025 by jikunshang Loading…
[Misc] Add BNB quantization for Whisper
#12381 opened Jan 24, 2025 by jeejeelee Loading…
[Frontend] Rerank API (Jina- and Cohere-compatible API) documentation Improvements or additions to documentation frontend
#12376 opened Jan 24, 2025 by K-Mistele Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.