-
Notifications
You must be signed in to change notification settings - Fork 3.3k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[NVIDIA] Add cutedsl e2e test to GB200 CI
run-ci
#12672
opened Nov 5, 2025 by
kaixih
Loading…
4 tasks
[router][grpc] Implement tool_choice support for Responses API
enhancement
New feature or request
high priority
router
run-ci
#12668
opened Nov 5, 2025 by
CatherineSue
Loading…
1 of 4 tasks
[sgl-kernel][5/N]Support Expert Specialization Grouped GEMM
run-ci
#12666
opened Nov 5, 2025 by
HydraQYH
Loading…
2 of 4 tasks
[VLM] Support Encode/Language Model Dissaggregation for Qwen
#12665
opened Nov 5, 2025 by
ZhengWG
Loading…
4 tasks
overlap shared + routed expert computation in kimi linear
run-ci
#12660
opened Nov 5, 2025 by
b8zhong
Loading…
[Dockerfile] Speedup image building by setting local artifact repo
run-ci
#12655
opened Nov 5, 2025 by
Kangyan-Zhou
Loading…
4 tasks
Optimize EAGLE select_top_k_tokens: use logprobs.
#12637
opened Nov 4, 2025 by
w32zhong
Loading…
2 of 4 tasks
[router] fix: validate HTTP status codes in health check
#12631
opened Nov 4, 2025 by
wyx-0203
Loading…
4 tasks
feat(SpecEagleV2): add standalone_worker_v2(WIP)
#12625
opened Nov 4, 2025 by
attack204
Loading…
1 of 7 tasks
Enhance dumper comparator with tensor unifier and location finder
run-ci
#12623
opened Nov 4, 2025 by
fzyzcjy
Loading…
4 tasks
Tiny enhance dumper with ctx and enable flags
run-ci
#12622
opened Nov 4, 2025 by
fzyzcjy
Loading…
4 tasks
[Ascend] ascend supports ds-ocr model
run-ci
#12619
opened Nov 4, 2025 by
ping1jing2
•
Draft
4 tasks
[Ascend]adapt enable-profile-cuda-graph for NPU
run-ci
#12617
opened Nov 4, 2025 by
ping1jing2
•
Draft
4 tasks
feat: Add FP4 (E2M1) KV Cache Support for MHA
#12612
opened Nov 4, 2025 by
JackChuang
Loading…
3 of 4 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.