A practical guide to AI agent caching: what to cache, what not to cache, how to set freshness rules, and how to reduce cost and latency without making your agent confidently wrong.