-
Notifications
You must be signed in to change notification settings - Fork 3.8k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[model-gateway]: Implement hierarchical multi-tenant and model-based rate limiting
model-gateway
#15517
opened Dec 20, 2025 by
Ratish1
Loading…
4 of 6 tasks
[model-gateway] Support MCP Namespaces
model-gateway
run-ci
#15516
opened Dec 20, 2025 by
xuwenyihust
Loading…
2 of 6 tasks
[model-gateway] Optimize WASM Runtime with Instance Pooling and Component Caching
model-gateway
#15515
opened Dec 20, 2025 by
ppraneth
Loading…
6 tasks
Add Flashinfer DeepGEMM SM90 for SwapAB Optimization
#15514
opened Dec 20, 2025 by
b8zhong
Loading…
6 tasks done
feature: support unicorn access log filter(disable logging /metrics)
#15513
opened Dec 20, 2025 by
alphabetc1
Loading…
6 tasks
[Feature] overlap LoRA weight loading with compute
#15512
opened Dec 20, 2025 by
glenliu21
Loading…
4 of 6 tasks
[Diffusion] Wan video model support zero-cost weight offload and overlap with compute
diffusion
SGLang Diffusion
#15511
opened Dec 20, 2025 by
BBuf
Loading…
6 tasks
fix(qwen_vl): add semaphore to serialize video decoding for thread safety
#15506
opened Dec 20, 2025 by
ShuhangGe
Loading…
chore: bump sgl-kernel version to 0.3.20
amd
dependencies
Pull requests that update a dependency file
run-ci
sgl-kernel
#15501
opened Dec 19, 2025 by
sglang-bot
Loading…
[AMD] Add nightly performance benchmark tests
amd
deepseek
#15500
opened Dec 19, 2025 by
michael-amd
•
Draft
6 tasks
feat: only add input vision tokens in
bench_serving result if vision dataset is used
#15492
opened Dec 19, 2025 by
raayandhar
Loading…
6 tasks done
[MiMoV2Flash] fix: respect --swa-full-tokens-ratio arg
run-ci
#15488
opened Dec 19, 2025 by
acelyc111
Loading…
6 tasks
[diffusion] kernel: support qk rotary_embedding in one triton kernel
diffusion
SGLang Diffusion
#15480
opened Dec 19, 2025 by
triple-Mu
Loading…
6 tasks
[diffusion] refactor: support scheduling logic for reqs inside scheduler
diffusion
SGLang Diffusion
#15479
opened Dec 19, 2025 by
mickqian
Loading…
6 tasks
tiny fix: fix glm46v launch and transformers issue
#15476
opened Dec 19, 2025 by
yhyang201
Loading…
6 tasks
[WIP][EPD][VLM] support video input(qwen-series)
#15475
opened Dec 19, 2025 by
ZhengWG
Loading…
6 tasks
[sgl-kernel][6/7]Support Expert Specialization Grouped GEMM
sgl-kernel
#15471
opened Dec 19, 2025 by
HydraQYH
Loading…
1 of 6 tasks
Fix 'Scheduler' object has no attribute 'port_args' error
#15470
opened Dec 19, 2025 by
lambert0312
Loading…
1 of 6 tasks
Fix Illegal Memory Access when fa3 + spec + topk + page_size > 1
#15469
opened Dec 19, 2025 by
yubofredwang
Loading…
6 tasks
[Quantization]feat: Add Nvidia ModelOpt HF FP8 support for fp8_pc_pt and fp8_pb_wo
documentation
Improvements or additions to documentation
quant
LLM Quantization
#15468
opened Dec 19, 2025 by
CedricHwong
Loading…
4 of 6 tasks
Add similar to vllm wait --ready-check-timeout-sec parameter for benchmark script
#15466
opened Dec 19, 2025 by
almaslof
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.