sgl-project / sglang Public

Fork 3.8k
Star 21.8k

Code
Issues 666
Pull requests 1.1k
Discussions
Actions
Security
Insights

Pull requests: sgl-project/sglang

Labels 64 Milestones 1

1,102 Open 9,862 Closed

Pull requests list

[model-gateway]: Implement hierarchical multi-tenant and model-based rate limiting

Labels: model-gateway

#15517 opened Dec 20, 2025 by Ratish1

4 of 6 tasks

[model-gateway] Support MCP Namespaces

Labels: model-gateway, run-ci

#15516 opened Dec 20, 2025 by xuwenyihust

2 of 6 tasks

[model-gateway] Optimize WASM Runtime with Instance Pooling and Component Caching

Labels: model-gateway

#15515 opened Dec 20, 2025 by ppraneth

6 tasks

Add Flashinfer DeepGEMM SM90 for SwapAB Optimization

#15514 opened Dec 20, 2025 by b8zhong

6 tasks done

feature: support unicorn access log filter(disable logging /metrics)

#15513 opened Dec 20, 2025 by alphabetc1

6 tasks

[Feature] overlap LoRA weight loading with compute

#15512 opened Dec 20, 2025 by glenliu21

4 of 6 tasks

[Diffusion] Wan video model support zero-cost weight offload and overlap with compute

Labels: diffusion

#15511 opened Dec 20, 2025 by BBuf

6 tasks

Clean up logprob utils

Labels: run-ci

#15509 opened Dec 20, 2025 by hnyls2002

fix(qwen_vl): add semaphore to serialize video decoding for thread safety

#15506 opened Dec 20, 2025 by ShuhangGe

chore: bump sgl-kernel version to 0.3.20

Labels: amd, dependencies, run-ci, sgl-kernel

#15501 opened Dec 19, 2025 by sglang-bot

[AMD] Add nightly performance benchmark tests

Labels: amd, deepseek

#15500 opened Dec 19, 2025 by michael-amd • Draft

6 tasks

Draft: MLA Eagle3

Labels: deepseek

#15499 opened Dec 19, 2025 by IzzyPutterman

6 tasks

feat: only add input vision tokens in bench_serving result if vision dataset is used

#15492 opened Dec 19, 2025 by raayandhar

6 tasks done

[MiMoV2Flash] fix: respect --swa-full-tokens-ratio arg

Labels: run-ci

#15488 opened Dec 19, 2025 by acelyc111

6 tasks

[WIP] MLP weight prefetching unification

#15482 opened Dec 19, 2025 by terfendail • Draft

6 tasks

[diffusion] kernel: support qk rotary_embedding in one triton kernel

Labels: diffusion

#15480 opened Dec 19, 2025 by triple-Mu

6 tasks

[diffusion] refactor: support scheduling logic for reqs inside scheduler

Labels: diffusion

#15479 opened Dec 19, 2025 by mickqian

6 tasks

tiny fix: fix glm46v launch and transformers issue

#15476 opened Dec 19, 2025 by yhyang201

6 tasks

[WIP][EPD][VLM] support video input(qwen-series)

#15475 opened Dec 19, 2025 by ZhengWG

6 tasks

Fix reasoning parser in non-stream

#15472 opened Dec 19, 2025 by mingMelody

6 tasks

[sgl-kernel][6/7]Support Expert Specialization Grouped GEMM

Labels: sgl-kernel

#15471 opened Dec 19, 2025 by HydraQYH

1 of 6 tasks

Fix 'Scheduler' object has no attribute 'port_args' error

#15470 opened Dec 19, 2025 by lambert0312

1 of 6 tasks

Fix Illegal Memory Access when fa3 + spec + topk + page_size > 1

#15469 opened Dec 19, 2025 by yubofredwang

6 tasks

[Quantization]feat: Add Nvidia ModelOpt HF FP8 support for fp8_pc_pt and fp8_pb_wo

Labels: documentation, quant

#15468 opened Dec 19, 2025 by CedricHwong

4 of 6 tasks

#15466 opened Dec 19, 2025 by almaslof
