AI | Steve-AI

What Is the Scope of Prompt Caching?

Prompt caching is driven by prefix matching, not by session identity. If the same prefix shows up again, reuse can happen across conversations and, when requests go through the same provider account, sometimes across end users.
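To make the prefix-matching idea concrete, here is a minimal toy sketch. It is not any provider's implementation: `PrefixCache`, `lookup_or_store`, and the session ids are hypothetical names used only for illustration. The point is that the cache key is derived from the prefix text alone, so the session id never enters the lookup.

```python
import hashlib


class PrefixCache:
    """Toy cache keyed only by a hash of the prompt prefix (illustrative, hypothetical)."""

    def __init__(self):
        self._seen = set()

    @staticmethod
    def _key(prefix: str) -> str:
        # The key depends on the prefix text and nothing else.
        return hashlib.sha256(prefix.encode("utf-8")).hexdigest()

    def lookup_or_store(self, session_id: str, prefix: str) -> bool:
        """Return True on a hit; on a miss, store the prefix and return False.

        session_id is accepted but deliberately ignored when building the key.
        """
        key = self._key(prefix)
        if key in self._seen:
            return True
        self._seen.add(key)
        return False


cache = PrefixCache()
system_prompt = "You are a support assistant. Answer briefly."

first = cache.lookup_or_store("session-a", system_prompt)          # miss, then stored
second = cache.lookup_or_store("session-b", system_prompt)         # hit from another session
changed = cache.lookup_or_store("session-a", "X" + system_prompt)  # different prefix, miss
```

Here the second call hits even though it comes from a different session, while a one-character change at the start of the prefix misses within the same session.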

Steve

2026/03/30

What Is Prompt Caching TTL?

TTL (time to live) is the lifetime of a prompt cache entry. Each cache hit resets the timer. An entry that goes unused for the full TTL expires.
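The refresh-on-hit behavior can be sketched in a few lines. This is a toy model under stated assumptions: `TTLCache` is a hypothetical class, the 300-second TTL is an arbitrary illustrative value, and time is passed in explicitly so the example stays deterministic.

```python
class TTLCache:
    """Toy TTL cache where every hit pushes the expiry forward (illustrative only)."""

    def __init__(self, ttl_seconds: float):
        self.ttl = ttl_seconds
        self._expires_at = {}  # key -> time at which the entry expires

    def put(self, key: str, now: float) -> None:
        self._expires_at[key] = now + self.ttl

    def hit(self, key: str, now: float) -> bool:
        """Return True and refresh the entry if it is still alive, else False."""
        expires = self._expires_at.get(key)
        if expires is None or now >= expires:
            # Missing or expired: drop it and report a miss.
            self._expires_at.pop(key, None)
            return False
        # Live entry: the hit resets the timer.
        self._expires_at[key] = now + self.ttl
        return True


cache = TTLCache(ttl_seconds=300)  # assumed value, not a provider default
cache.put("prefix", now=0)

at_200 = cache.hit("prefix", now=200)    # alive, expiry moves to 500
at_450 = cache.hit("prefix", now=450)    # alive only because of the earlier refresh
at_1100 = cache.hit("prefix", now=1100)  # unused for longer than the TTL, so expired
```

Without the refresh at 200 seconds, the lookup at 450 seconds would already have missed, since the original expiry was 300.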

Steve

2026/03/30

Prompt Caching vs Tool Result Replacement

Prompt caching cuts the price of repeated prefixes. Tool-result replacement shrinks the prompt itself. Both can save money, but they push on different parts of the bill.
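A small arithmetic sketch shows how the two levers differ. All numbers are made up for illustration: the token counts, the unit price of 1, and the 10% cached rate are assumptions, not any provider's pricing. The sketch also assumes the tool results sit after the cached prefix, so replacing them leaves the prefix untouched.

```python
def input_cost(total_tokens: int, cached_tokens: int, price: float, cached_percent: float) -> float:
    """Input cost in the units of `price`.

    Cached tokens are billed at cached_percent % of the normal price
    (hypothetical pricing model for illustration).
    """
    uncached = total_tokens - cached_tokens
    return uncached * price + cached_tokens * price * cached_percent / 100


PROMPT_TOKENS = 10_000     # full prompt (assumed)
PREFIX_TOKENS = 8_000      # repeated prefix that can be cached (assumed)
TOOL_RESULT_TOKENS = 1_500 # raw tool output in the uncached part (assumed)
SUMMARY_TOKENS = 100       # short replacement for that output (assumed)

shrunk = PROMPT_TOKENS - TOOL_RESULT_TOKENS + SUMMARY_TOKENS

baseline = input_cost(PROMPT_TOKENS, 0, price=1, cached_percent=10)
caching_only = input_cost(PROMPT_TOKENS, PREFIX_TOKENS, price=1, cached_percent=10)
replacement_only = input_cost(shrunk, 0, price=1, cached_percent=10)
both = input_cost(shrunk, PREFIX_TOKENS, price=1, cached_percent=10)
```

Caching lowers the rate paid on the repeated prefix, while replacement lowers the token count itself, which is why the two savings stack in this toy setup.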

Steve

2026/03/30