Right model, every request.
Each request goes to the cheapest model that can handle it, and the hard calls escalate to the strong model. At today's prices, that cuts cost by about half. The shadow audit measures your exact number.
~50% lower cost · escalates when it matters