Calculate AI API usage cost for GPT-5, Claude 4, Gemini 3, and Llama 4 by tokens per request, daily volume, and yearly budget.
Added May 3, 2026 · Updated May 26, 2026
Input
Result
Enter a value for model to see your result.
Calculate daily, monthly, and yearly AI API costs for any model and request volume. Compare GPT-5, Claude 4, Gemini 3, and Llama 4 options for your workload.
A typical chatbot with short prompts and responses costs about $52 per month on GPT-5.4 mini at 1,000 req/day.
Inputs
Result
You pay separately for input tokens (your prompt) and output tokens (the model's response). Cost = (input_tokens / 1,000,000) × input_price + (output_tokens / 1,000,000) × output_price. Multiply by daily requests for ongoing costs.
Among current-generation listed models, GPT-5.4 mini and Claude Haiku 4.5 are lower-cost hosted options, while Llama 4 Maverick varies by provider or self-hosting setup. At high volume, model choice can change monthly spend by 5–20×.
Input tokens include your system prompt, conversation history, and the user's message. Output tokens are everything the model generates in its response. Long system prompts and history are often the biggest cost drivers.