Is o3 cheaper than Gemini 2.5 Flash?
No. Gemini 2.5 Flash is currently cheaper: its starting input price is $0.06/1M tokens, compared with o3's $0.40/1M tokens.
Pricing Intelligence
When comparing o3 and Gemini 2.5 Flash API pricing for 2026, Gemini 2.5 Flash is the more cost-effective option on input-token price: starting at $0.06/1M tokens, it is 85% cheaper than o3's $0.40/1M tokens. However, your choice should also factor in context window limits (200k tokens for o3 vs. about 1,049k tokens for Gemini 2.5 Flash) and supported modalities.
| Feature | o3 | Gemini 2.5 Flash |
|---|---|---|
| Starting Input Price | $0.40 / 1M tokens | $0.06 / 1M tokens |
| Model Category | GPT | Gemini |
| Max Context Window | 200k tokens | 1049k tokens |
| Supported Modalities | text, image | text, image, audio, video |
| Routed Channels Count | 6 channels | 6 channels |
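The 85% figure follows directly from the two starting input prices in the table. The sketch below reproduces that arithmetic and applies it to a monthly workload. The function names and the 50M-token workload are hypothetical, chosen only for illustration, and the calculation covers input tokens only since output prices are not listed here.

```python
# Starting input prices in USD per 1M tokens, taken from the table above.
O3_INPUT_PRICE = 0.40
GEMINI_FLASH_INPUT_PRICE = 0.06


def percent_cheaper(baseline: float, alternative: float) -> float:
    """Return how much cheaper `alternative` is than `baseline`, in percent."""
    return (baseline - alternative) / baseline * 100


def input_cost(tokens: int, price_per_million: float) -> float:
    """Return the USD cost of `tokens` input tokens at the given rate."""
    return tokens / 1_000_000 * price_per_million


savings = percent_cheaper(O3_INPUT_PRICE, GEMINI_FLASH_INPUT_PRICE)
print(f"Gemini 2.5 Flash is {savings:.0f}% cheaper on input tokens")

# Hypothetical workload (an assumption, not from the source): 50M input tokens per month.
monthly_tokens = 50_000_000
print(f"o3: ${input_cost(monthly_tokens, O3_INPUT_PRICE):.2f}")
print(f"Gemini 2.5 Flash: ${input_cost(monthly_tokens, GEMINI_FLASH_INPUT_PRICE):.2f}")
```

At these starting rates, the assumed 50M-token workload costs about $20 on o3 and about $3 on Gemini 2.5 Flash for input alone. Actual bills depend on output tokens and on the channel used.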
o3 supports a maximum input context of 200,000 tokens, while Gemini 2.5 Flash supports 1,048,576 tokens.
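To check these limits before choosing a model, a rough sketch might look like the following. The helper name `models_that_fit` is hypothetical, and the prompt's token count is assumed to come from each provider's own tokenizer, since the two models do not necessarily count tokens the same way.

```python
# Maximum input context in tokens, as stated above.
CONTEXT_LIMITS = {
    "o3": 200_000,
    "Gemini 2.5 Flash": 1_048_576,
}


def models_that_fit(prompt_tokens: int) -> list[str]:
    """Return the models whose input context can hold `prompt_tokens` tokens."""
    return [name for name, limit in CONTEXT_LIMITS.items() if prompt_tokens <= limit]


# A 500k-token prompt (hypothetical size) exceeds o3's limit
# but fits within Gemini 2.5 Flash's.
print(models_that_fit(500_000))
```

In practice, leave headroom below the limit for the system prompt and any tool or conversation history that is sent along with the request.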