
Pull requests: vllm-project/vllm


Pull requests list

[Misc] fix this log format not space
Labels: v1
#32112 opened Jan 11, 2026 by lengrongfu
5 tasks
[CI/Build] Separate out flaky responses API tests
Labels: ci/build, ready, rocm
#32110 opened Jan 11, 2026 by DarkLight1337
5 tasks
Add tensor IPC transfer mechanism for multimodal data
Labels: frontend, multi-modality, v1
#32104 opened Jan 11, 2026 by brandonpelfrey
1 task
[ROCm][Bugfix] Fix Mamba batched decode producing incorrect output
Labels: rocm
#32099 opened Jan 10, 2026 by AndreasKaratzas
fix(examples): replace unsafe eval() with safe math evaluator in xLAM tool examples
Labels: documentation
#32098 opened Jan 10, 2026 by deosha
[Bugfix] Fix GLM-4.7 tool parser for tool call without arguments
#32097 opened Jan 10, 2026 by steinfurt
3 of 5 tasks
[cpu][bench] Add Fused MoE Micro Benchmark for CPU Backend
Labels: cpu, performance
#32092 opened Jan 10, 2026 by andikarachman
3 of 5 tasks
fix offline inference chat response prompt
Labels: documentation, speculative-decoding
#32088 opened Jan 10, 2026 by andyxning
5 tasks
refactor: refactor_repeated_interfaces
Labels: deepseek
#32087 opened Jan 10, 2026 by tom-zju
5 tasks
[Model] Improve multimodal pooling examples
Labels: documentation
#32085 opened Jan 10, 2026 by noooop (Draft)
5 tasks
[ROCm][Bugfix] Fix AITER speculative decoding accuracy issue
Labels: rocm, v1
#32084 opened Jan 10, 2026 by c0de128
3 tasks
[Models] Add SharedFusedMoE support to Qwen3MoE
Labels: qwen
#32082 opened Jan 10, 2026 by Isotr0py (Draft)
1 of 5 tasks
[Bugfix] Fix ModelOpt Llama-4 slow loading via tensor contiguity
Labels: llama
#32081 opened Jan 10, 2026 by ishrith-gowda
[EPLB] Replace async handshake flags with TransferPhase state machine
#32078 opened Jan 10, 2026 by Anri-Lombard
2 tasks done
[Cleanup] Removed unused KVConnectorModelRunnerMixin methods
Labels: kv-connector, ready, v1
#32077 opened Jan 10, 2026 by njhill
[RFC] Improve environment variable declaration and handling (#31249)
Labels: documentation
#32070 opened Jan 10, 2026 by nliu365 (Draft)
10 tasks