Conversation

@carlos-marchal-ph
Contributor

We were counting cache write tokens incorrectly when using Anthropic with LangChain.

@carlos-marchal-ph carlos-marchal-ph requested a review from a team January 2, 2026 17:16
@carlos-marchal-ph carlos-marchal-ph self-assigned this Jan 2, 2026
@carlos-marchal-ph carlos-marchal-ph added bug Something isn't working team/llm-analytics labels Jan 2, 2026
@greptile-apps
Contributor

greptile-apps bot commented Jan 2, 2026

What Changed

The core issue was in posthog/ai/langchain/callbacks.py, where the Anthropic-specific token accounting logic subtracted only cache read tokens from input tokens, not cache write tokens. Under Anthropic's API semantics, the input_tokens field includes all tokens (cache tokens included), while PostHog's cost calculation expects them to be counted separately.
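As a minimal sketch of the corrected normalization (assuming dict-shaped usage data; the field and function names here are illustrative, not the exact ones in callbacks.py):

```python
# Hypothetical sketch of the Anthropic input-token normalization in
# posthog/ai/langchain/callbacks.py; names are illustrative.

def normalize_anthropic_input_tokens(usage: dict) -> int:
    input_tokens = usage.get("input_tokens", 0)
    cache_read = usage.get("cache_read_tokens") or 0
    cache_write = usage.get("cache_write_tokens") or 0

    # Before the fix only cache_read was subtracted, so cache write
    # tokens stayed inside input_tokens and were counted twice.
    return input_tokens - cache_read - cache_write
```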

Key changes:

  1. Token calculation fix: Modified the Anthropic token normalization to subtract both cache_read_tokens AND cache_write_tokens from input_tokens
  2. Provider detection: Uses logic consistent with the plugin-server - an exact match on the provider name OR a substring match on the model name (see the sketch after this list)
  3. Comprehensive tests: Added three new test functions verifying that cache tokens are subtracted for Anthropic while other providers are left unchanged
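A sketch of the provider check described in item 2, assuming the substring tested against the model name is a family name like "claude" (the exact strings live in the plugin-server and are an assumption here):

```python
# Illustrative provider detection; the exact provider/model strings
# checked by the plugin-server are an assumption.

def is_anthropic(provider: str | None, model: str | None) -> bool:
    if provider is not None and provider.lower() == "anthropic":
        return True  # exact match on provider name
    # Fallback: substring match on the model name.
    return model is not None and "claude" in model.lower()
```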

The fix ensures that for Anthropic providers:

normalized_input_tokens = reported_input_tokens - cache_read_tokens - cache_write_tokens

For other providers such as OpenAI, input_tokens already excludes cache tokens, so no adjustment is needed.
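A worked example with made-up numbers: if Anthropic reports 1,000 input tokens, of which 300 were cache reads and 200 were cache writes, only 500 tokens should be counted as fresh input:

```python
reported_input_tokens = 1000
cache_read_tokens = 300   # hypothetical figures
cache_write_tokens = 200

normalized_input_tokens = reported_input_tokens - cache_read_tokens - cache_write_tokens
assert normalized_input_tokens == 500  # only uncached tokens remain
```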

Testing Coverage

The PR includes thorough test coverage, with scenarios testing the following (sketched after the list):

  • Cache write token subtraction for Anthropic provider
  • Combined cache read + write token subtraction for Anthropic
  • Verification that non-Anthropic providers don't have cache tokens subtracted
  • Edge cases with missing or zero cache token values
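A pytest-style sketch of what these scenarios might look like (the helper and test names are hypothetical, not the PR's actual tests):

```python
# Hypothetical normalization helper mirroring the fix, for illustration.
def _normalize(provider: str, usage: dict) -> int:
    input_tokens = usage.get("input_tokens", 0)
    if provider == "anthropic":
        input_tokens -= usage.get("cache_read_tokens") or 0
        input_tokens -= usage.get("cache_write_tokens") or 0
    return input_tokens


def test_anthropic_subtracts_cache_write_tokens():
    assert _normalize("anthropic", {"input_tokens": 100, "cache_write_tokens": 40}) == 60


def test_anthropic_subtracts_read_and_write_tokens():
    usage = {"input_tokens": 100, "cache_read_tokens": 30, "cache_write_tokens": 40}
    assert _normalize("anthropic", usage) == 30


def test_other_providers_unchanged():
    assert _normalize("openai", {"input_tokens": 100, "cache_read_tokens": 30}) == 100


def test_missing_cache_tokens_default_to_zero():
    assert _normalize("anthropic", {"input_tokens": 100}) == 100
```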

Why This Matters

This fix ensures accurate billing calculations and usage analytics for teams using Anthropic models through LangChain. Previously, cache creation tokens were double-counted (once inside input_tokens and again as cache writes), leading to inflated metrics and incorrect cost attribution.

The change is backwards compatible and only affects the internal token accounting logic for Anthropic providers - the external API remains unchanged.
