-
Notifications
You must be signed in to change notification settings - Fork 200
Pull requests: Luce-Org/lucebox-hub
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(gemma4): feature-complete backend with DFlash + MTP + sparse-FA decode (supersedes PR #175 skeleton)
#193
opened May 14, 2026 by
dusterbloom
Contributor
•
Draft
feat(gemma4): target-graph MTP integration (h_prev capture + asymmetric KV)
#183
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
feat(gemma4): add mtp loader and step graph
#182
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
feat(gemma4): add draft loader and quantization support
#180
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
fix(gemma4): add long-context KV correctness
#177
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
docs(dflash): document small-vram cuda vmm guidance
#174
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
feat(dflash): add csv token output utility
#173
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
feat(dflash): linear native MTP integrated decode CLI (stacked on #153)
#154
opened May 11, 2026 by
javierpazo
Contributor
Loading…
feat(dflash): native Qwen3.6 MTP (NextN) runtime + contract test
#153
opened May 11, 2026 by
javierpazo
Contributor
Loading…
feat(dflash): accept FP16 safetensors drafter alongside BF16
#142
opened May 9, 2026 by
javierpazo
Contributor
Loading…
chore(dflash): enforce sm_89 user override and keep BSA enabled
#137
opened May 9, 2026 by
javierpazo
Contributor
Loading…
feat(dflash): native multi-request scheduler with batched target step
#135
opened May 9, 2026 by
javierpazo
Contributor
Loading…
Gemma4 support: pFlash + DFlash + chunked prefill, daemon mode, server routing
#131
opened May 8, 2026 by
dusterbloom
Contributor
Loading…
5 of 6 tasks
feat(dflash): support Qwen3.6-27B-DFlash draft (SWA layers) — 106 t/s on RTX 4090
#94
opened May 4, 2026 by
Quitetall
Contributor
Loading…
perf(pflash): add SM75 target-resident TTFT path
#72
opened May 1, 2026 by
weicj
Contributor
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.