-
Notifications
You must be signed in to change notification settings - Fork 128
Pull requests: alibaba/rtp-llm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: support python-xqa with CUDA 12.9 and compatible with CUDA 12.6
#444
opened Dec 11, 2025 by
qqbbiu
Loading…
feat: add process-isolated logging with rank_id and server_id
#436
opened Dec 5, 2025 by
sunmiaozju
Loading…
fix: modify pre_decoder_residual under multimodalEmbedding input
#433
opened Dec 5, 2025 by
junna2016
Loading…
fix: wrong residual when pre decoder layernorm + mm embedding + quant
#426
opened Dec 4, 2025 by
LLLLKKKK
Loading…
fix - backend server not shutdown graceful in mulit rank case
#424
opened Dec 3, 2025 by
jianglan89
Loading…
feat: support return raw output and output ids in debug info
#421
opened Dec 2, 2025 by
soaringk
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.