Google · Text
Gemma 4 12B
Google's 12B multimodal Gemma 4 model with strong text and image understanding.
Modality
Text
Available on
Ollama
Model ID
ollama:gemma4:12b
Supported OS
macOS · Windows · Linux
Minimum machine configuration
16 GB RAM · 128K ctx · text+image · CPU/GPU
Download size · ~7.6 GB
About the creator
Google and Google DeepMind build the Gemini family of multimodal models, the Imagen and Nano Banana image models, the Lyria music models, and the Veo video models.
deepmind.google ↗
Samples
Examples
Sample outputs generated with Gemma 4 12B will appear here.
Sample coming soon
More from Google
Gemini 3.5 Flash
Text · Cloud
Google's latest fast Gemini 3 model: frontier-level reasoning at Flash-level latency and cost, tuned for agentic workflows and iterative coding.
Gemini 3.1 Pro
Text · Cloud
Google's most capable multimodal model with deep reasoning and 1M token context.
Gemini 3 Flash
Text · Cloud
Fast, cost-efficient Gemini model for high-throughput text and multimodal tasks.
Gemini 2.5 Flash
Text · Cloud
Balanced Gemini model with strong reasoning and a 1M token context window.
Nano Banana 2
Image · Cloud
Ultra-fast image generation model optimised for speed and creative output.
Gemini 3.1 Flash
Audio · Cloud
Fast Gemini text-to-speech with natural-sounding expressive voices.