Prompt Caching for Cost Savings

LunoGen caches your long system prompt, memory context, and knowledge base on Anthropic's side. Repeated calls hit the cache and cost a fraction of normal tokens — often cutting bills by 70% or more.

WhatsApp DMcachingcostoptimization

Conversation example

Show me how much prompt caching is saving us this week

http_request → Anthropic Usage API

Pulling cache hit stats from Anthropic usage logs...

Prompt Caching — week Apr 5 to Apr 11 Total requests: 18,204 Cache hits: 15,911 (87%) Cache misses: 2,293 (first calls, cache fills) Tokens billed at cache rate: 82.4M (cheap) Tokens billed at full rate: 11.8M Cost this week: $148 Cost without caching: $892 (estimated) Savings: $744 (83%) By model - claude-opus: 94% cache hit rate - claude-sonnet: 81% - claude-haiku: 78% Very healthy. Caching is paying for itself many times over.

More Anthropic use cases

Bring Your Own Anthropic Key

Connect your Anthropic API key and LunoGen uses your account for all Claude calls. You control billing, model access, and rate limits — and gain access to the latest Claude releases as soon as they ship.

Claude Opus for Complex Reasoning

Route your hardest tasks — deep research, multi-step tool use, long planning — to Claude Opus. The agent detects complexity and automatically uses Opus when the prompt needs it.

Claude Haiku for Fast, Cheap Replies

Quick user messages — greetings, simple questions, intent checks — get routed to Claude Haiku for near-instant, low-cost replies. Opus handles the thinking, Haiku handles the talking.

Messages API with Tool Use

LunoGen uses Anthropic's Messages API with tool use for structured agent actions. Claude decides which tool to call, and the agent executes HTTP requests, sends WhatsApp messages, or writes files in one loop.

Vision Tasks on WhatsApp Images

Send an image to the agent on WhatsApp and it runs Claude vision — reading receipts, describing screenshots, extracting handwritten notes, or checking a product photo for defects.

Deploy this in minutes

Create a LunoGen agent, connect Anthropic, and start running this workflow from WhatsApp today.

Back to Anthropic