Litellm hotfix azure ai gpt 5.4 main by shivamrawat1 · Pull Request #28104 · BerriAI/litellm

shivamrawat1 · 2026-05-17T01:49:40Z

gpt-5.4 azure_ai for foundry

codspeed-hq · 2026-05-17T01:51:20Z

Merging this PR will not alter performance

✅ 16 untouched benchmarks

Comparing litellm_hotfix_azure_ai_gpt-5.4_main (907d533) with main (e58a561)

greptile-apps · 2026-05-17T01:52:06Z

Greptile Summary

This PR adds eight new azure_ai model entries to both the primary and backup model pricing JSON files: gpt-5.4, gpt-5.4-pro, gpt-5.4-mini, and gpt-5.4-nano, each accompanied by a dated snapshot alias (e.g. gpt-5.4-2026-03-05).

Pricing arithmetic is internally consistent across all tiers: the cache-read rate is uniformly 10% of the corresponding input cost, the above-272k tier doubles the base rate, and the priority tier also doubles the base rate.
azure_ai/gpt-5.4-pro (and its alias) sets "mode": "responses" and omits /v1/chat/completions from supported_endpoints, making it a responses-API-only model. This is a notable divergence from the other three family members, which all support chat completions.
Both JSON files are kept in sync with identical additions.
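
The three ratios described above can be checked mechanically. The following is a minimal sketch, not part of the PR: the key names (`input_cost_per_token`, `cache_read_input_token_cost`, `input_cost_per_token_above_272k_tokens`, `input_cost_per_token_priority`) are assumed to follow the catalog's usual naming, and the rates in `sample_entry` are placeholders chosen for easy arithmetic, not Azure Foundry prices.

```python
import math

# Placeholder entry: key names are assumptions, rates are NOT real Azure prices.
sample_entry = {
    "input_cost_per_token": 2.0e-06,
    "cache_read_input_token_cost": 2.0e-07,             # 10% of base
    "input_cost_per_token_above_272k_tokens": 4.0e-06,  # 2x base
    "input_cost_per_token_priority": 4.0e-06,           # 2x base
}


def check_pricing_ratios(entry, rel_tol=1e-9):
    """Return a list of violations of the ratios stated in the review summary."""
    base = entry["input_cost_per_token"]
    expected = {
        "cache_read_input_token_cost": 0.1 * base,
        "input_cost_per_token_above_272k_tokens": 2 * base,
        "input_cost_per_token_priority": 2 * base,
    }
    problems = []
    for key, want in expected.items():
        got = entry.get(key)
        if got is None:
            continue  # this tier is not defined for the entry; nothing to check
        # Compare with a tolerance because the rates are small floats.
        if not math.isclose(got, want, rel_tol=rel_tol):
            problems.append(f"{key}: expected {want:.3e}, got {got:.3e}")
    return problems


print(check_pricing_ratios(sample_entry))
```

The same helper could be run over the output-side keys by swapping the base key, assuming those tiers follow the same multipliers.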

Confidence Score: 4/5

Safe to merge with a quick confirmation that gpt-5.4-pro is intentionally responses-API-only and that the pricing figures match the Azure Foundry catalog.

The change is purely additive JSON config — no runtime logic is touched. The pricing math checks out internally, and both files are kept in sync. The two open questions are whether the gpt-5.4-pro chat-completions omission is intentional, and whether the pricing figures have been cross-checked against the official Azure catalog, neither of which can be verified from the PR description alone.

The azure_ai/gpt-5.4-pro entry in both JSON files deserves a second look to confirm the responses-only mode and pricing are accurate per the Azure Foundry listing.

Important Files Changed

Filename	Overview
model_prices_and_context_window.json	Adds 8 new azure_ai/gpt-5.4 model variants (base, pro, mini, nano — each with a dated snapshot alias); pricing math is internally consistent; gpt-5.4-pro is responses-mode only (no /v1/chat/completions), which diverges from the other family members and may be intentional but is unverified in the PR description.
litellm/model_prices_and_context_window_backup.json	Backup file updated in lock-step with the primary; identical entries and the same internal pricing consistency; same gpt-5.4-pro responses-only concern applies.
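
The lock-step claim for the two files can be verified with a small diff over the new entries. This is a rough sketch under assumptions: `catalog_drift` and `load_catalog` are hypothetical helper names, and both files are assumed to be flat JSON objects keyed by model name. The in-memory sample reproduces the kind of mismatch the third commit describes (a `supports_web_search` flag present in one file only); its values are illustrative.

```python
import json
from pathlib import Path


def load_catalog(path):
    """Read one pricing catalog from disk (assumed: a JSON object keyed by model name)."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def catalog_drift(primary, backup, prefix="azure_ai/gpt-5.4"):
    """Map each model under `prefix` to the keys whose values differ between the files."""
    drift = {}
    names = {n for n in (*primary, *backup) if n.startswith(prefix)}
    for name in sorted(names):
        a, b = primary.get(name), backup.get(name)
        if a is None or b is None:
            drift[name] = ["<entry missing from one file>"]
            continue
        differing = sorted(k for k in a.keys() | b.keys() if a.get(k) != b.get(k))
        if differing:
            drift[name] = differing
    return drift


# Illustrative data only: the backup row lacks the web-search flag.
primary = {"azure_ai/gpt-5.4-2026-03-05": {"mode": "chat", "supports_web_search": True}}
backup = {"azure_ai/gpt-5.4-2026-03-05": {"mode": "chat"}}
print(catalog_drift(primary, backup))
```

Against a checkout, the call would look like `catalog_drift(load_catalog("model_prices_and_context_window.json"), load_catalog("litellm/model_prices_and_context_window_backup.json"))`; an empty result means the new entries match.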

Comments Outside Diff (2)

model_prices_and_context_window.json, line 488 (link)

gpt-5.4-pro missing /v1/chat/completions endpoint

azure_ai/gpt-5.4-pro (and its dated variant) set "mode": "responses" and list only /v1/batch and /v1/responses as supported_endpoints, omitting /v1/chat/completions. The other gpt-5.4 family members all include chat completions. If this model actually supports chat completions on Azure Foundry, this entry will silently mis-represent its capabilities and may confuse users who try the standard chat path. If the omission is intentional (responses-API-only model), a brief comment or source link clarifying that would help reviewers verify the pricing data.
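
To make the divergence easy to spot during review, the family can be scanned for entries that do not list the chat path. A minimal sketch, assuming the catalog is a dict keyed by model name with `mode` and `supported_endpoints` fields as described above; `models_without_chat` is a hypothetical helper, and the `"chat"` mode and endpoint list shown for the base model are illustrative assumptions rather than values copied from the diff.

```python
CHAT_ENDPOINT = "/v1/chat/completions"

# Illustrative catalog slice; the pro row mirrors the endpoints named in this comment.
sample_catalog = {
    "azure_ai/gpt-5.4": {
        "mode": "chat",  # assumed value for the non-pro family members
        "supported_endpoints": [CHAT_ENDPOINT, "/v1/batch", "/v1/responses"],
    },
    "azure_ai/gpt-5.4-pro": {
        "mode": "responses",
        "supported_endpoints": ["/v1/batch", "/v1/responses"],
    },
}


def models_without_chat(catalog, prefix="azure_ai/gpt-5.4"):
    """List (name, mode, endpoints) for models under `prefix` that omit chat completions."""
    flagged = []
    for name, entry in catalog.items():
        if not name.startswith(prefix):
            continue
        endpoints = entry.get("supported_endpoints", [])
        if CHAT_ENDPOINT not in endpoints:
            flagged.append((name, entry.get("mode"), sorted(endpoints)))
    return sorted(flagged)


print(models_without_chat(sample_catalog))
```

A flagged entry is not necessarily wrong; it only marks the rows whose endpoint list should be confirmed against the provider's listing.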

model_prices_and_context_window.json, line 394 (link)

PR description lacks evidence of what was fixed

The PR title says "hotfix" but the description contains only "gpt-5.4 azure_ai for foundry" with no reference to a GitHub issue, no prior bug report, and no test output confirming the entries are correct. Per the team's review standard, PRs claiming to fix something should include verifiable evidence (e.g., a link to the Azure pricing page, a test run, or an issue number). Without a source cross-reference it's hard to confirm that the pricing values and capability flags are accurate.

Rule Used: What: Ensure that any PR claiming to fix an issue ... (source)

Reviews (1): Last reviewed commit: "fix(model_catalog): sync web_search flag..."

shivamrawat1 and others added 3 commits May 16, 2026 18:47

feat(model_catalog): add Azure AI Foundry GPT-5.4 model metadata

c48d691

Register azure_ai GPT-5.4 variants with pricing, context limits from Foundry catalog, and capability flags for cost routing and tooling. Co-authored-by: Cursor <cursoragent@cursor.com>

fix(model_catalog): tighten Azure AI GPT-5.4 cost and capability metadata

243c1cd

Add supports_web_search for base GPT-5.4 aliases, priority-tier Pro rates, and mini/nano above-272k plus priority pricing for correct spend math. Co-authored-by: Cursor <cursoragent@cursor.com>

fix(model_catalog): sync web_search flag on Azure AI GPT-5.4 dated backup row

907d533

Mirror supports_web_search for azure_ai/gpt-5.4-2026-03-05 in the backup catalog so it matches model_prices_and_context_window.json. Co-authored-by: Cursor <cursoragent@cursor.com>

shivamrawat1 wants to merge 3 commits into main from litellm_hotfix_azure_ai_gpt-5.4_main
