Pull requests: ROCm/vllm

Pull requests list

Merge upstream→gfx11
#1041 opened Jun 30, 2026 by eble-amd
455 wip rebased
#1039 opened Jun 30, 2026 by danichan-mkm
gpt-oss gfx1250 ATOM-parity perf patches
#1035 opened Jun 29, 2026 by dllehr-amd Collaborator
[CI] Re-enable performance test job
#1033 opened Jun 29, 2026 by marcusr-amd Draft
5 tasks
tune Qwen3-VL-4B prefill unified-attention on gfx1150
#1024 opened Jun 26, 2026 by qingxuamd
Fix GLM 5.2 mxfp4 MTP loading issue
#1022 opened Jun 25, 2026 by amd-xiaoyu12
optimize TTFT qwen3-vl
#1006 opened Jun 15, 2026 by qingxuamd
455 war room findings
#1001 opened Jun 12, 2026 by jpvillam-amd
Hybrid
#918 opened May 4, 2026 by liangliangchang Draft
5 tasks
[ROCm] support topk_softplus for all number of experts
#899 opened Apr 25, 2026 by tjtanaa
5 tasks
Tune hybrid_triton_w4a16 prefill kernel for gfx1151
#879 opened Apr 15, 2026 by mgehre-amd Draft
3 tasks done