Pull requests: vllm-project/vllm
Pull requests list
[Model] Add HunyuanOCR support
documentation
Improvements or additions to documentation
new-model
Requests to new models
v1
#29327
opened Nov 24, 2025 by
Isotr0py
5 tasks
Scheduled removal of guided_* config fields
documentation
Improvements or additions to documentation
frontend
structured-output
v1
#29326
opened Nov 24, 2025 by
hmellor
Scheduled removal of ParallelConfig's direct child EPLB fields
#29324
opened Nov 24, 2025 by
hmellor
Updated AMD-CI mirror (2025-11-24)
ci/build
rocm
Related to AMD ROCm
#29321
opened Nov 24, 2025 by
Alexei-V-Ivanov-AMD
Validating Runai Model Streamer Integration with S3 Object Storage
ci/build
rocm
Related to AMD ROCm
#29320
opened Nov 24, 2025 by
noa-neria
lora cuda multistream
ci/build
nvidia
v1
#29316
opened Nov 24, 2025 by
hai-meh-cs
3 of 5 tasks
Add support to load int4 weights for CPU
gpt-oss
Related to GPT-OSS models
#29315
opened Nov 24, 2025 by
isharif168
5 tasks
Fix: bad_words filtering ineffective when n > 1
v1
#29313
opened Nov 24, 2025 by
GOavi101
5 tasks
[Bugfix] Make deprecated --task embedding consistent with `--runner…
ready
ONLY add when PR is ready to merge/full CI is needed
#29312
opened Nov 24, 2025 by
maryamtahhan
3 of 5 tasks
[Perf] use cpu all reduce to avoid sync when async_scheduling & dp > 1
ready
ONLY add when PR is ready to merge/full CI is needed
#29311
opened Nov 24, 2025 by
izhuhaoran
[CI] Add batched audios Whisper test
ready
ONLY add when PR is ready to merge/full CI is needed
#29308
opened Nov 24, 2025 by
NickLucche
[XPU] upgrade torch & ipex 2.9 on XPU platform
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#29307
opened Nov 24, 2025 by
jikunshang
5 tasks
[ROCm][PD] add moriio kv connector.
documentation
Improvements or additions to documentation
frontend
kv-connector
rocm
Related to AMD ROCm
[Hybrid Allocator] Better layer padding strategy for gpt-oss eagle
gpt-oss
Related to GPT-OSS models
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#29303
opened Nov 24, 2025 by
heheda12345
5 tasks
[CUDA] cutlass_moe_mm: proper sm version check
nvidia
#29302
opened Nov 24, 2025 by
Aidyn-A
Add TP CLI argument to multimodal inference examples
documentation
Improvements or additions to documentation
#29301
opened Nov 24, 2025 by
faaany
3 of 5 tasks
[DRAFT] Add RoaringBitmap-based sparse attention mask implementation
performance
Performance-related issues
#29296
opened Nov 24, 2025 by
anandheritage
5 tasks
Bump actions/checkout from 4 to 6
ci/build
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#29293
opened Nov 24, 2025 by
dependabot[bot]
[Misc] Suppress log outputs when constructing the default vllm config.
ready
ONLY add when PR is ready to merge/full CI is needed
#29291
opened Nov 24, 2025 by
noooop
5 tasks
[Perf] Enable environment cache in EngineCore to enable the feature for UniProcExecutor as well
v1
#29289
opened Nov 24, 2025 by
Jialin
3 of 5 tasks