Honest pricing. Just under official.
Roughly 15% below each provider’s official rate, per model — and top-up bonuses bring the effective discount to about 21%. Pay-as-you-go, per-token for chat and per-call for image and video. No subscription, no minimum — and you’re billed on true token counts, so what you’re charged matches what you actually used.
Pay only for what you actually use.
Pre-paid wallet, no subscription and no minimum. Add $10 to get going; your balance never expires, and every charge maps to real, audited usage.
Starter
Trying it out
- Access to every genuine model
- True per-call usage logs
- Community & email support
- No minimum, no credit card
Builder
Shipping a product
- Honest token billing on every call
- 10 isolated API keys
- Auto-recharge · IP allowlist
- Priority email support
Scale
Running production traffic
- Monitored, fail-fast routing
- Unlimited API keys
- Webhooks · monthly invoices
- Dedicated Slack/Discord support
Enterprise
High-volume scale
- Everything in Scale
- Dedicated routing capacity
- Custom rate limits & SLA
- Dedicated account manager
Every model. Every rate. In the open.
Text · Text
7 models| Model | Provider | Input price | Output price | Billing unit | vs Official |
|---|---|---|---|---|---|
Claude Opus 4.7 | Anthropic | $4.25 | $21.25 | per 1M tokens | −15% |
Claude Opus 4.6 | Anthropic | $4.25 | $21.25 | per 1M tokens | −15% |
Claude Sonnet 4.6 | Anthropic | $2.55 | $12.75 | per 1M tokens | −15% |
Claude Sonnet 4.5 | Anthropic | $2.55 | $12.75 | per 1M tokens | −15% |
Claude Haiku 4.5 | Anthropic | $0.85 | $4.25 | per 1M tokens | −15% |
Gemini 2.5 Pro | $1.0625 | $8.50 | per 1M tokens | −15% | |
Gemini 2.5 Flash | $0.255 | $2.125 | per 1M tokens | −15% |
Image · Image
6 models| Model | Provider | Flat price | Billing unit | vs Official |
|---|---|---|---|---|
Nano Banana | $0.024 | per image | −38% | |
Nano Banana 2 | $0.042+ | per image | −37% | |
Resolution1K $0.042−37%2K $0.063−38%4K $0.094−38% | ||||
Nano Banana Pro | $0.084+ | per image | −37% | |
Resolution1K / 2K $0.084−37%4K $0.15−38% | ||||
GPT-Image-2 | OpenAI | $0.079 | per image | −38% |
Resolution1K $0.079−38%2K $0.079−38%4K $0.079−38% | ||||
Nano Banana Edit | $0.0273 | per image | −30% | |
GPT-Image-2 Edit | OpenAI | $0.0886 | per image | −30% |
Resolution1K $0.0886−30%2K $0.0886−30%4K $0.0886−30% | ||||
Video · Video
3 models| Model | Provider | Flat price | Billing unit | vs Official |
|---|---|---|---|---|
Veo 3 Fast | $0.25+ | per video | −38% | |
Resolution720p $0.25−38%1080p $0.31−38%4K $0.75−38% | ||||
Veo 3 Quality | $1.20+ | per video | −38% | |
Resolution720p $1.2−38%1080p $1.2−38%4K $1.8−38% | ||||
Veo 3 Lite | $0.15+ | per video | −38% | |
Resolution720p $0.15−38%1080p $0.24−38% | ||||
Brievio vs cheap resellers vs going direct
If a gateway is 80% under list, ask where the capacity comes from. Here’s how the discounted-official tier compares.
| What matters | Brievio | Cheap / gray-market resellers | Going direct |
|---|---|---|---|
| Genuine first-party models | |||
| max_tokens, tools & caching honored | — | ||
| True token counts, never inflated | |||
| Monitored, fail-fast routing | — | ||
| Failed requests never billed | — | ||
| Stable, enterprise-grade supply | $5 | ||
| Text, image & video on one key | — | ||
| Priced just under official |
Cache reads at a fraction of the input rate.
Where the provider supports it, cache reads bill at 10% of input on Anthropic models and 20% on OpenAI and Gemini. Cache writes bill at the plain input rate — Brievio doesn’t pass through Anthropic’s upstream write surcharge. Add cache_control to your system prompt and these rates apply automatically.
| Model | Input | Cache read | Ratio |
|---|---|---|---|
Claude Opus 4.7 Anthropic | $4.25 | $0.85 | 0.20× |
Claude Opus 4.6 Anthropic | $4.25 | $0.85 | 0.20× |
Claude Sonnet 4.6 Anthropic | $2.55 | $0.51 | 0.20× |
Claude Sonnet 4.5 Anthropic | $2.55 | $0.51 | 0.20× |
Claude Haiku 4.5 Anthropic | $0.85 | $0.17 | 0.20× |
Gemini 2.5 Pro Google | $1.06 | $0.2125 | 0.20× |
Gemini 2.5 Flash Google | $0.255 | $0.051 | 0.20× |
Full guide: how to enable it, dashboard metrics, affinity routing.Read the prompt-caching guide →
Need dedicated capacity, custom SLAs or invoiced billing?
Higher-volume teams can reserve dedicated routing capacity, agree custom SLAs and rate limits, and move to monthly invoiced billing. Tell us your traffic shape and we’ll size it with you.