Free LLM & AI API pricing comparison

Compare vendor list prices side by side, pick a workflow below, then open the full table—or jump to a monthly cost estimate.

Choose what you are optimizing for

These presets only change filters and sorting—the full catalog stays available below.

Tokens are how APIs meter usage. We quote USD per 1 million tokens so you can scan prices without tiny decimals. Input is what you send; output is what the model writes back (often pricier). See column definitions or the short methodology.

Turn these rates into a monthly bill. Rough example: 100 short chats/day at ~700 output tokens each ≈ 2.1M output tokens/month; multiply by the output $/1M you see here. For blended prompts and tiers, use the calculator.
Open cost calculator
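The back-of-the-envelope math above can be sketched in a few lines. The chat volume and the $8/1M output rate are assumed figures for illustration, not quotes from the table:

```python
# Rough monthly-bill sketch (assumed figures, not vendor quotes).
chats_per_day = 100
output_tokens_per_chat = 700
days = 30
price_per_1m_output = 8.00  # hypothetical output rate, USD per 1M tokens

monthly_output_tokens = chats_per_day * output_tokens_per_chat * days  # 2,100,000
monthly_cost = monthly_output_tokens / 1_000_000 * price_per_1m_output

print(f"{monthly_output_tokens:,} output tokens ≈ ${monthly_cost:.2f}/month")
```

Swap in the output $/1M from the row you shortlisted; the calculator handles blended input/output mixes and tiered rates.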

Prefer to browse raw SKUs? Jump to the comparison table, export CSV, or open the compare drawer (up to three models).


Explore every model

Search, filter, compare up to three SKUs side by side, group by provider, or download CSV.

Quick picks

Providers

Filters and sorting

What do these columns mean?
  • Input cost — What you pay to send your prompt (questions, documents, code).
  • Cached input — Discounted rate when providers reuse repeated prompt parts.
  • Output cost — What you pay for the model’s reply (usually higher than input).
  • Context window — How much text fits in one conversation at once (bigger = longer memory).
  • Max response — Cap on how long a single reply can be, when listed.
Table columns: Compare · Provider · Model id · Display name · Input cost / 1M tokens · Cached input / 1M · Output cost / 1M tokens · Context window · Max response · Modality · Catalog

Why use this free LLM API pricing tool?

People search for OpenAI API pricing, Claude API cost, Gemini API price per million tokens, and the cheapest LLM for chat—usually before a roadmap or budget review. CloudyBot’s comparison is a no-signup, free browser tool: a live sortable table of published list rates in USD per 1M tokens, a free-tier filter, quick picks for common workflows, and a three-model compare drawer. Use it to shortlist SKUs, then confirm live pricing on each vendor’s billing console.

  • LLM price comparison across vendors — see input, cached (where listed), and output $/1M in one grid.
  • “Free” list-tier models — filter rows where the catalog shows $0 list input and output (limits may still apply).
  • Next steps — estimate rough monthly spend with the free AI cost calculator; run agents on CloudyBot plans when you want hosted automation, not just spreadsheets.

What is AI API pricing per 1M tokens?

Tokens are the billing units most LLM APIs expose in pricing consoles. Showing dollars per million tokens lets you compare SKUs without mentally juggling scientific notation from per-token rates.
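The "scientific notation" point is easy to see with a conversion. The $0.000002-per-token rate below is an illustrative figure, not a quote from any vendor:

```python
# Converting an awkward per-token rate into the USD-per-1M-tokens
# convention used throughout this table (illustrative rate).
per_token_rate = 0.000002  # $2e-6 per token: hard to scan at a glance
per_million = round(per_token_rate * 1_000_000, 6)

print(f"${per_million} / 1M tokens")  # → $2.0 / 1M tokens
```

Every row in the table has already had this normalization applied server-side.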

Figures here come from a normalized catalog snapshot (aggregated public model listings, reconciled to USD per 1M tokens on our servers). Treat them as directional list estimates—not a contractual quote—until you confirm live rates in each vendor’s billing console.

Input vs output pricing (quick example)

If input is $2 / 1M tokens and output is $8 / 1M, a prompt with 800 input tokens and 400 output tokens would cost roughly $0.0048 — always verify rounding and minimum charges on the vendor side.
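The same arithmetic as a sketch, using the example rates from the sentence above:

```python
# Worked example: $2 / 1M input, $8 / 1M output (rates from the text).
input_rate, output_rate = 2.00, 8.00   # USD per 1M tokens
input_tokens, output_tokens = 800, 400

cost = (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000
print(f"${cost:.4f}")  # → $0.0048
```

Real invoices may round differently or apply minimum charges, so treat this as a list-price approximation.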

Cheapest models by workload shape

Chat-heavy assistants usually care most about average output price. RAG or tool-heavy flows may spike input tokens. Vision SKUs add image-token lanes—filter modality to narrow candidates.

Provider pricing guides

Jump to focused tables with the same snapshot normalization:

Official vendor billing pages

For invoice-critical quotes, cross-check each vendor console:

Recent catalog updates

Published rows refresh when operators ship a validated snapshot. Large vendor repricings may be queued for review before replacing the live table, so reload after announcements if numbers look stale.

FAQ

How do I compare OpenAI, Anthropic, and Google AI API pricing in one place?

Use the sortable table: search or filter by provider, then read input and output USD per 1M tokens side by side (OpenAI-style GPT, Anthropic Claude, Google Gemini, and more). For quotes, still confirm on each vendor’s billing page.

Is this LLM pricing comparison free?

Yes. Browsing, filters, three-way compare, and CSV export do not require a CloudyBot account. Numbers are list estimates from a catalog snapshot, not your invoice.

How do I find the cheapest LLM API for chatbots?

For assistants that mostly generate replies, sort by output price per 1M on text-capable models—then validate quality and latency on your own prompts before you lock in.

Are model IDs OpenRouter-style?

Rows use normalized ids from an aggregated public catalog (often similar to OpenRouter-style model keys). Always verify the exact SKU and rate on your router or provider console.

What does USD per 1M tokens mean?

Estimated dollars for one million billing tokens at published list rates.

Input vs output?

Input covers prompts; output covers generations—often priced higher.

Cached input?

Discounted reads for repeated prefixes where vendors expose separate lanes.

Where does data come from?

A public catalog normalized server-side; confirm on vendor billing consoles.

How often is it updated?

On a scheduled cadence plus manual publishes after validation—not real-time per keystroke.

Pick the cheapest LLM API?

Sort by the price column that dominates your trace, filter by modality, then pilot for quality.

Free models?

Filtered rows show SKUs where catalog list input/output are both zero—limits may still apply.

What is context window?

Maximum tokens held in a single request window for that SKU.

What is a token?

A billing slice of your text—providers charge per token; we show dollars per million tokens so numbers stay readable.

Cheapest ChatGPT-style API?

List rates move often—sort by output cost for chat-heavy apps, confirm on the vendor console before you budget.

Free AI APIs?

Some rows are $0 list price; quotas and limits still apply—check each vendor.

Monthly spend estimates?

Use the AI cost calculator after picking rates here.

Open the AI cost calculator →

Automate after you pick a model

CloudyBot is an AI worker platform—chat, browser automation, and workflows—not just a pricing table. See how it works, compare plans, or open the dashboard.