Question 1

What is a token?

Accepted Answer

A token is the unit AI models read and bill in — roughly three-quarters of an English word, or about 4 characters. A 1,000-word document is around 1,300 tokens. Providers price per million tokens and charge separately for the tokens you send (input) and the tokens the model generates (output).

Question 2

Why are input and output tokens priced differently?

Accepted Answer

Generating text is more computationally expensive than reading it, so output tokens almost always cost more than input tokens — often three to five times as much. A model’s real cost therefore depends heavily on how much text it produces, not just how much you send it.

Question 3

What is the cheapest LLM API?

Accepted Answer

The lowest-cost options are lightweight models such as GPT-4o mini, Google Gemini Flash and Claude Haiku, which can be more than 100 times cheaper than top-tier reasoning models. The cheapest capable model for a given task — not the cheapest overall — is what actually minimises spend.

Question 4

How often do AI token prices change?

Accepted Answer

Provider list prices change infrequently — typically every few months — but cuts can be sudden and large. This page refreshes daily from public pricing, so it reflects the current rate rather than a stale snapshot.

Question 5

Are these prices in USD or GBP?

Accepted Answer

Providers publish prices in US dollars, shown in USD by default. Use the USD/GBP toggle to convert at the latest Bank of England daily reference rate.

Question 6

Do these prices include discounts or batch pricing?

Accepted Answer

No — these are standard published list prices. Many providers offer cheaper batch processing, cached-input pricing or volume discounts, so your effective cost can be lower depending on how you use the API.


Anthropic	claude-haiku-4-5	US$1.00	US$5.00
Anthropic	claude-opus-4-5	US$5.00	US$25.00
Anthropic	claude-sonnet-4-5	US$3.00	US$15.00
Cohere	command-a	US$2.50	US$10.00
Cohere	command-r	US$0.15	US$0.60
Cohere	command-r-plus	US$2.50	US$10.00
DeepSeek	deepseek-chat	US$0.28	US$0.42
DeepSeek	deepseek-reasoner	US$0.28	US$0.42
Fireworks	llama-v3p1-405b-instruct	US$3.00	US$3.00
Fireworks	llama-v3p3-70b-instruct	US$0.90	US$0.90
Fireworks	mixtral-8x22b-instruct	US$1.20	US$1.20
Google	gemini-2.0-flash	US$0.10	US$0.40
Google	gemini-2.5-flash	US$0.30	US$2.50
Google	gemini-2.5-pro	US$1.25	US$10.00
Groq	llama-3.1-8b-instant	US$0.05	US$0.08
Groq	llama-3.3-70b-versatile	US$0.59	US$0.79
Mistral	codestral	US$1.00	US$3.00
Mistral	mistral-large	US$0.50	US$1.50
Mistral	mistral-medium	US$2.70	US$8.10
Mistral	mistral-small	US$0.10	US$0.30
OpenAI	gpt-4-turbo	US$10.00	US$30.00
OpenAI	gpt-4o	US$2.50	US$10.00
OpenAI	gpt-4o-mini	US$0.15	US$0.60
OpenAI	o1	US$15.00	US$60.00
OpenAI	o3-mini	US$1.10	US$4.40
Together AI	Llama-3.3-70B-Instruct-Turbo	US$0.88	US$0.88
Together AI	Meta-Llama-3.1-405B-Instruct-Turbo	US$3.50	US$3.50
Together AI	Meta-Llama-3.1-70B-Instruct-Turbo	US$0.88	US$0.88
xAI	grok-2	US$2.00	US$10.00
xAI	grok-beta	US$5.00	US$15.00

AI Token Prices

How this data is sourced

Frequently asked questions

Further reading

Prices change under you. Know what you’re actually spending.