Back to catalog
Google·Text15% under official

Gemini 2.5 FlashAPI

Previous-gen Gemini Flash — extreme value.

Input price

$0.255

per 1M tokens

Output price

$2.125

per 1M tokens

Prompt caching

Add cache_control to a stable prefix and cache hits bill at a fraction of input. Cache writes bill at the plain input rate.

Cache read
$0.051

Auto-derived from input × 0.2

$ Specs

Model ID
gemini-2-5-flash
Endpoint
POST /v1/chat/completions
Category
Text
Provider
Google
Capabilities
visionfunctionstreaming

Drop-in OpenAI compatibility

Point your existing OpenAI SDK at https://api.brievio.com/v1 and change the model id. No streaming changes, no SDK changes — and the model you call is the genuine article.

View docs

$ Try it

Set BRIEVIO_API_KEY from a key in /app/keys, then run any snippet below.

curl
curl https://api.brievio.com/v1/chat/completions \
  -H "Authorization: Bearer $BRIEVIO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-2-5-flash",
    "messages": [
      {"role": "user", "content": "Hello, Gemini 2.5 Flash"}
    ],
    "stream": false
  }'