Skip to content

Pull requests: lightseekorg/tokenspeed

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix(deepseek-v4): corrected profiling to estimate cache capacity.
#173 opened May 17, 2026 by SimonCqk Contributor Loading…
perf(moe): triton biased grouped topk for deepseek-v3 routing
#171 opened May 17, 2026 by roycho96 Contributor Loading…
wip: EAGLE post-norm
#170 opened May 17, 2026 by Dogacel Draft
[WIP] perf: add gluon fp16 prefill kernel
#165 opened May 16, 2026 by borontion Contributor Loading…
feat(kvstore): support mamba l2 cache transfers
#162 opened May 15, 2026 by XucSh Contributor Loading…
[WIP] feat(deepseek-v4): support prefix cache snapshots
#146 opened May 14, 2026 by SimonCqk Contributor Loading…
[Draft]feat(deepseek-v4): support MTP speculative decoding
#123 opened May 13, 2026 by dongjiyingdjy Contributor Loading…
[WIP] feat(lora): LoRA adapter serving
#83 opened May 11, 2026 by qywu Collaborator Draft
1 of 7 tasks
fix: retraction load back race condition.
#74 opened May 11, 2026 by LorrinWWW Contributor Loading…
fix: wait per-layer on drafter KV pool during cpu cache loadback
#6 opened May 6, 2026 by LorrinWWW Contributor Loading…
ProTip! Follow long discussions with comments:>50.