Gemini 3 Flash
Fast, cost-efficient Gemini model for high-throughput text and multimodal tasks.
Gemini 3 Flash is a cloud text model from Google. It is multimodal — it accepts text prompts along with images, video, and audio, and replies with generated text. Its context window handles up to 1M input tokens and up to 66K output tokens, and it supports adjustable reasoning effort for harder problems. Generation can be tuned with system instructions, temperature, and top-p sampling. It runs through Replicate and Runware using your own API key, from $0.50 per million input tokens.
- Pricing
- $0.50 / 1M in · $3.00 / 1M out
- Inputs
- Text, Images, Video, Audio
- Context window
- 1M in · 66K out
- Reasoning
- Adjustable effort
- Controls
- System prompt, Temperature, Top-p
Google and Google DeepMind build the Gemini family of multimodal models, the Imagen and Nano Banana image models, the Lyria music models, and the Veo video models.
deepmind.google ↗Examples
Sample outputs generated with Gemini 3 Flash will appear here.