[DRAFT]: Update GB200 and GB300 SGLANG FP8 DSR1 Disaggregated inference configs (STP only) #635
@claude Review.
Claude finished @jgangani's task in 1m 17s. PR Review Complete
Summary: This PR updates the GB200 and GB300 SGLANG FP8 DSR1 disaggregated inference configs for STP mode, including a new image version, updated parallelism settings, and CONFIG_FILE-based configuration.
🔴 Critical Issues
1. Missing
28e4ba5 to f3ab616
Hi @jgangani, thanks for the PR. I will be removing sweep-enable and cancelling this sweep since it was already failing. Feel free to re-add it once the patch has been fixed: https://github.com/InferenceMAX/InferenceMAX/actions/runs/21701039822
No description provided.