Google: Gemini 2.5 Flash

Released Jan 28, 2025|1M context|$0.30/M input tokens|$2.50/M output tokens

Gemini 2.5 Flash is Google's latest multimodal model, offering exceptional performance on everyday tasks. With native support for text, audio, images, and video, plus a massive 1M token context window and adjustable thinking budget, it's perfect for a wide range of applications requiring both speed and intelligence.

Speed
Intelligence
Price (1M Tokens)$0.30
Inputs
TextImage
Outputs
Text

Pricing

Input
$0.30
Cached input
$0.07
Output
$2.50
Gemini 2.5 Flash
$0.30
GPT-4o
$2.50
GPT-4.1
$2.00

Use Case

AI Agents
State-of-the-art performance for autonomous agent applications
Advanced Coding
Advanced coding with state-of-the-art performance (72.5% SWE-bench)
Content Creation
Produce human-quality content with exceptional writing abilities

Features

Streaming
Support for streaming functionality.
Function calling
Support for function calling functionality.
Tool use
Support for tool use functionality.
JSON mode
Support for json mode functionality.
Grounding
Google Search integration
Adjustable thinking budget
Balance latency and cost
Conversation context
Support for conversation context functionality.
Native audio
24 language support