Pull requests: lightseekorg/tokenspeed
Pull requests list
Perf[Qwen3.5]: some kernel fuse optimizations.
#228
opened May 23, 2026 by
tuanzhangCS
Contributor
deps: bump tokenspeed-trtllm-kernel to 1.3.0rc15.post20260522+full
#227
opened May 23, 2026 by
aaronliuls
Contributor
3 tasks
[WIP] perf(eagle3): skip dead-position compute in draft catch-up step
#217
opened May 22, 2026 by
rjzhb
1 task done
feat(trtllm-MHA): support mixed prefill/decode batches
#176
opened May 18, 2026 by
rjzhb
4 tasks done
feat: support post-norm EAGLE + add speculative decoding docs
high priority
#174
opened May 17, 2026 by
Dogacel
perf(moe): triton biased grouped topk for deepseek-v3 routing
#171
opened May 17, 2026 by
roycho96
Contributor
feat(kvstore): support mamba l2 cache transfers
high priority
#162
opened May 15, 2026 by
XucSh
Contributor
perf: chunked-prefill prefix cache update for non-hybrid models
#22
opened May 7, 2026 by
LorrinWWW
Contributor
fix: wait per-layer on drafter KV pool during cpu cache loadback
#6
opened May 6, 2026 by
LorrinWWW
Contributor