$ Pricing

Honest pricing. Just under official.

Roughly 15% below each provider’s official rate, per model — and top-up bonuses bring the effective discount to about 21%. Pay-as-you-go, per-token for chat and per-call for image and video. No subscription, no minimum — and you’re billed on true token counts, so what you’re charged matches what you actually used.

cat ./pricing.txt

Pay only for what you actually use.

Pre-paid wallet, no subscription and no minimum. Add $10 to get going; your balance never expires, and every charge maps to real, audited usage.

Starter

Trying it out

$10

Access to every genuine model
True per-call usage logs
Community & email support
No minimum, no credit card

Get an API key

Builder

Shipping a product

$100

Honest token billing on every call
10 isolated API keys
Auto-recharge · IP allowlist
Priority email support

Top up $100

Scale

Running production traffic

$1000

Monitored, fail-fast routing
Unlimited API keys
Webhooks · monthly invoices
Dedicated Slack/Discord support

Top up $1000

Enterprise

High-volume scale

$5000

Everything in Scale
Dedicated routing capacity
Custom rate limits & SLA
Dedicated account manager

Top up $5000

See the full pricing table

$ Per-model pricing

Every model. Every rate. In the open.

Text · Text

7 models

ls ./models/text

Model	Provider	Input price	Output price	Billing unit	vs Official
Claude Opus 4.7	Anthropic	$4.25	$21.25	per 1M tokens	−15%
Claude Opus 4.6	Anthropic	$4.25	$21.25	per 1M tokens	−15%
Claude Sonnet 4.6	Anthropic	$2.55	$12.75	per 1M tokens	−15%
Claude Sonnet 4.5	Anthropic	$2.55	$12.75	per 1M tokens	−15%
Claude Haiku 4.5	Anthropic	$0.85	$4.25	per 1M tokens	−15%
Gemini 2.5 Pro	Google	$1.0625	$8.50	per 1M tokens	−15%
Gemini 2.5 Flash	Google	$0.255	$2.125	per 1M tokens	−15%

Image · Image

6 models

ls ./models/image

Model	Provider	Flat price	Billing unit	vs Official
Nano Banana	Google	$0.024	per image	−38%
Nano Banana 2	Google	$0.042+	per image	−37%
Resolution1K $0.042−37%2K $0.063−38%4K $0.094−38%
Nano Banana Pro	Google	$0.084+	per image	−37%
Resolution1K / 2K $0.084−37%4K $0.15−38%
GPT-Image-2	OpenAI	$0.079	per image	−38%
Resolution1K $0.079−38%2K $0.079−38%4K $0.079−38%
Nano Banana Edit	Google	$0.0273	per image	−30%
GPT-Image-2 Edit	OpenAI	$0.0886	per image	−30%
Resolution1K $0.0886−30%2K $0.0886−30%4K $0.0886−30%

Video · Video

3 models

ls ./models/video

Model	Provider	Flat price	Billing unit	vs Official
Veo 3 Fast	Google	$0.25+	per video	−38%
Resolution720p $0.25−38%1080p $0.31−38%4K $0.75−38%
Veo 3 Quality	Google	$1.20+	per video	−38%
Resolution720p $1.2−38%1080p $1.2−38%4K $1.8−38%
Veo 3 Lite	Google	$0.15+	per video	−38%
Resolution720p $0.15−38%1080p $0.24−38%

$ Compare

Brievio vs cheap resellers vs going direct

If a gateway is 80% under list, ask where the capacity comes from. Here’s how the discounted-official tier compares.

diff resellers brievio

What matters	Brievio	Cheap / gray-market resellers
Genuine first-party models
max_tokens, tools & caching honored		—
True token counts, never inflated
Monitored, fail-fast routing		—
Failed requests never billed		—
Stable, enterprise-grade supply	$5
Text, image & video on one key		—
Priced just under official

$ Prompt caching

Cache reads at a fraction of the input rate.

Where the provider supports it, cache reads bill at 10% of input on Anthropic models and 20% on OpenAI and Gemini. Cache writes bill at the plain input rate — Brievio doesn’t pass through Anthropic’s upstream write surcharge. Add cache_control to your system prompt and these rates apply automatically.

cat ./cache-rates.txt

Model	Input	Cache read	Ratio
Claude Opus 4.7 Anthropic	$4.25	$0.85	0.20×
Claude Opus 4.6 Anthropic	$4.25	$0.85	0.20×
Claude Sonnet 4.6 Anthropic	$2.55	$0.51	0.20×
Claude Sonnet 4.5 Anthropic	$2.55	$0.51	0.20×
Claude Haiku 4.5 Anthropic	$0.85	$0.17	0.20×
Gemini 2.5 Pro Google	$1.06	$0.2125	0.20×
Gemini 2.5 Flash Google	$0.255	$0.051	0.20×

Full guide: how to enable it, dashboard metrics, affinity routing.Read the prompt-caching guide →

$ brievio enterprise --contact

Need dedicated capacity, custom SLAs or invoiced billing?

Higher-volume teams can reserve dedicated routing capacity, agree custom SLAs and rate limits, and move to monthly invoiced billing. Tell us your traffic shape and we’ll size it with you.

[ Talk to sales ]