Add provider_cache_read/write_input_tokens to Usage type by AntoineToussaint · Pull Request #7032 · tensorzero/tensorzero

AntoineToussaint · 2026-03-23T14:58:35Z

Summary

Adds provider_cache_read_input_tokens: Option<u32> and provider_cache_write_input_tokens: Option<u32> to the core Usage struct (crates/tensorzero-core/src/inference/types/usage.rs).

Production code changes

usage.rs: Two new fields on Usage, updated zero(), total_tokens(), and aggregation logic (lenient summation that preserves Some values)
mod.rs: aggregate_usage_across_model_inferences updated to propagate cache tokens
streams.rs: Streaming usage accumulation updated for cache tokens
Python client: Usage dataclass updated with new optional fields
TypeScript bindings: Usage.ts regenerated

Test / mechanical changes

All other files (providers, variants, endpoints, evaluations, etc.) are mechanical: adding provider_cache_read_input_tokens: None, provider_cache_write_input_tokens: None to existing Usage {} constructors so the code compiles.

What's NOT in this PR

OpenAI-compatible response format (PR Thread cache tokens through endpoints and OpenAI-compatible response #7035)
DB migrations (PR Add ClickHouse and Postgres migrations for cache token columns #7033)
Provider-specific cache parsing (PR Add cache token parsing for all providers #7034)
E2e tests and docs (PR Add e2e tests, docs, and test fixture updates for cache tokens #7036)

PR Stack

This PR — Usage type changes
Add ClickHouse and Postgres migrations for cache token columns #7033 — ClickHouse + Postgres migrations
Add cache token parsing for all providers #7034 — Provider type definitions + cache parsing
Thread cache tokens through endpoints and OpenAI-compatible response #7035 — Endpoint threading + OpenAI-compatible response format
Add e2e tests, docs, and test fixture updates for cache tokens #7036 — E2e tests, docs, test fixtures

Test plan

cargo check --all-targets --all-features
cargo clippy --all-targets --all-features -- -D warnings
cargo test-unit-fast (12 pre-existing failures unrelated)

🤖 Generated with Claude Code

Add two new Optional<u32> fields to the Usage struct for tracking provider-reported cache token counts. All existing constructors initialized with None. Aggregation helpers in mod.rs and streams.rs updated to propagate cache tokens through lenient summation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The #[serde(default, skip_serializing_if)] on the new cache fields is for backward compatibility with existing serialized data that predates these fields. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Usage is never deserialized from JSON — fields are always constructed in Rust code. Keep cache fields plain like input_tokens/output_tokens. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Per AGENTS.md convention: omit optional fields from API responses when None. Most responses won't have cache data, so avoid cluttering every response with null fields. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

AntoineToussaint requested a review from GabrielBianconi as a code owner March 23, 2026 14:58

AntoineToussaint assigned virajmehta Mar 23, 2026

AntoineToussaint and others added 2 commits March 23, 2026 11:19

Regenerate TypeScript bindings for Usage cache token fields

53e4409

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Add cache token fields to Python client Usage dataclass

f7b2e56

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

AntoineToussaint mentioned this pull request Mar 23, 2026

Add ClickHouse and Postgres migrations for cache token columns #7033

Merged

3 tasks

virajmehta approved these changes Mar 23, 2026

View reviewed changes

virajmehta previously approved these changes Mar 23, 2026

View reviewed changes

Add comment explaining serde annotations on cache token fields

a594523

The #[serde(default, skip_serializing_if)] on the new cache fields is for backward compatibility with existing serialized data that predates these fields. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

AntoineToussaint dismissed virajmehta’s stale review via a594523 March 23, 2026 15:37

Remove unnecessary serde annotations from cache token fields

35c38cf

Usage is never deserialized from JSON — fields are always constructed in Rust code. Keep cache fields plain like input_tokens/output_tokens. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

virajmehta previously approved these changes Mar 23, 2026

View reviewed changes

virajmehta assigned AntoineToussaint and unassigned virajmehta Mar 23, 2026

Add skip_serializing_if on cache token fields

80d53ab

Per AGENTS.md convention: omit optional fields from API responses when None. Most responses won't have cache data, so avoid cluttering every response with null fields. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

AntoineToussaint dismissed virajmehta’s stale review via 80d53ab March 23, 2026 16:38

Merge branch 'main' into feat/cache-tokens-1-usage-type

dce82c7

AntoineToussaint enabled auto-merge March 23, 2026 18:11

virajmehta approved these changes Mar 23, 2026

View reviewed changes

AntoineToussaint assigned GabrielBianconi Mar 23, 2026

GabrielBianconi approved these changes Mar 23, 2026

View reviewed changes

AntoineToussaint added this pull request to the merge queue Mar 23, 2026

Merged via the queue into main with commit da77058 Mar 23, 2026
197 of 203 checks passed

AntoineToussaint deleted the feat/cache-tokens-1-usage-type branch March 23, 2026 22:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add provider_cache_read/write_input_tokens to Usage type#7032

Add provider_cache_read/write_input_tokens to Usage type#7032
AntoineToussaint merged 7 commits intomainfrom
feat/cache-tokens-1-usage-type

AntoineToussaint commented Mar 23, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AntoineToussaint commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Production code changes

Test / mechanical changes

What's NOT in this PR

PR Stack

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

AntoineToussaint commented Mar 23, 2026 •

edited

Loading