Pull requests: jd-opensource/xllm
Pull requests list
bugfix: add the virtual to the destructor of the shared memory base c…
#529
opened Dec 12, 2025 by
Clement-Wang26
refactor: refactor runtime and distributed_runtime to break the circular dependency.
#522
opened Dec 11, 2025 by
yq33victor
refactor: add common attention metadata for all cuda-like devices.
#520
opened Dec 11, 2025 by
RobbieLeung
refactor: remove empty_kv_cache and global_empty_kv_cache.
#514
opened Dec 10, 2025 by
RobbieLeung
bugfix: fix the issue of ineffective input embedding transmission.
#490
opened Dec 5, 2025 by
magicheng0816
refactor: optimize unique token count preparation of batch input builder.
#449
opened Nov 27, 2025 by
RobbieLeung
feat: support Qwen2-VL & GME-Qwen2-VL model on npu device.
#399
opened Nov 18, 2025 by
xanecdotex
feat: enable torch_npu graph mode for Qwen-3 dense with TP support.
#325
opened Nov 6, 2025 by
yingxudeng