Pull requests: jd-opensource/xllm
Pull requests list
feat: optimize cal_seq_exchange_index using unordered_map.
#1041
opened Mar 11, 2026 by
Dragonliu2018
bugfix: add synchronize when running multi-modal embedding model.
#1040
opened Mar 11, 2026 by
wxh571001500
feat: add embedding manager to reduce total size of embedding cache.
#1039
opened Mar 11, 2026 by
RobbieLeung
feat: add unified request stats logs for prefill and service.
#1038
opened Mar 11, 2026 by
DongheJin
feat: lm head uses row parallel to support more sizes of vocab.
#1037
opened Mar 11, 2026 by
RobbieLeung
bugfix: adjust acl graph sequence capacity for speculative modes.
#1030
opened Mar 10, 2026 by
DongheJin
feat: support QwenImageEditPlus pipeline with embedding infer.
#1028
opened Mar 10, 2026 by
shan-chen-feng
feat: add mlu sequence parallel prerequisites part 2.
#1026
opened Mar 10, 2026 by
phantomlei3
perf: optimize FP8 GEMM performance via tile strategy tuning[3/N].
#1025
opened Mar 9, 2026 by
yingxudeng
refactor: extract multi-modal input processors to processors dir.
#1022
opened Mar 9, 2026 by
wly-115
github: only build on PR approval, skip on raw push/open.
#1012
opened Mar 6, 2026 by
Clement-Wang26
bugfix: improve qwen25 tool-call parsing robustness.
#1010
opened Mar 6, 2026 by
yingxudeng
refactor: unify rec multi round decode mode with one-stage flag.
#1000
opened Mar 5, 2026 by
LMX-xin
bugfix: add group_id when creating the HccLProcessGroup to support multiple communication process_group.
#999
opened Mar 5, 2026 by
shan-chen-feng
feat: add qwen35 and qwen35-thinking reasoning parser detectors.
#985
opened Mar 3, 2026 by
yingxudeng
feat: support qwen3.5 tool-call parser with qwen3_coder detector.
#982
opened Mar 3, 2026 by
yingxudeng