Pull requests: vllm-project/vllm
[V1][Bugfix] Fix assertion when mm hashing is turned off
ready
ONLY add when PR is ready to merge/full CI is needed
#12439
opened Jan 26, 2025 by
ywang96
[Bugfix][Kernel] Fix perf regression caused by PR #12405
ci/build
#12434
opened Jan 26, 2025 by
LucasWilkinson
Revert "[Misc] Add FA2 support to ViT MHA layer (#12355)"
ready
#12433
opened Jan 26, 2025 by
WoosukKwon
[Frontend] Support scores endpoint in run_batch
frontend
#12430
opened Jan 25, 2025 by
pooyadavoodi
[Bugfix] Fix tqdm progress bar when SamplingParams.n > 1
frontend
#12428
opened Jan 25, 2025 by
yanyc428
[Build/CI] Fix libcuda.so linkage
ci/build
ready
#12424
opened Jan 25, 2025 by
tlrmchlsmth
[Misc] Add offline test for disaggregated prefill
#12418
opened Jan 24, 2025 by
Shaoting-Feng
[Bugfix] Disable w16a16 2of4 sparse CompressedTensors24
ready
#12417
opened Jan 24, 2025 by
tlrmchlsmth
[V1][Metrics] Add initial Prometheus logger
ready
[V1] Revert uncache_blocks and support recaching full blocks
ready
#12415
opened Jan 24, 2025 by
comaniac
[Frontend] Support override generation config in args
ready
#12409
opened Jan 24, 2025 by
liuyanyi
[Bugfix] Fix benchmark script bug: inaccurate stats for vllm backend when max_model_len < input_len + output_len
#12407
opened Jan 24, 2025 by
WangErXiao
[Bugfix] Fix output_tokens is 0 if using tgi backend
#12394
opened Jan 24, 2025 by
sywangyi
[torch.compile] PyTorch 2.6 and nightly compatibility
#12393
opened Jan 24, 2025 by
youkaichao
[Hardware][Intel GPU] add XPU bf16 support
documentation
Improvements or additions to documentation
ready
#12392
opened Jan 24, 2025 by
jikunshang
[Frontend] Rerank API (Jina- and Cohere-compatible API)
documentation
frontend
#12376
opened Jan 24, 2025 by
K-Mistele