Google · Text

Gemma 4 12B

Google's 12B multimodal Gemma 4 model with strong text and image understanding.

Modality

Text

Available on

Ollama

Model ID

ollama:gemma4:12b

Supported OS

macOS · Windows · Linux

Minimum machine configuration

16 GB RAM · 128K-token context · text + image input · CPU or GPU

Download size · ~7.6 GB
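The model ID above can be used to call the model through a locally running Ollama server. The following is a minimal sketch, not an official client. It assumes that the `ollama:` prefix is this catalog's provider label and that the remainder (`gemma4:12b`) is the tag Ollama expects, that the model has already been pulled, and that Ollama is listening on its default local port 11434. The helper names `split_model_id`, `build_request`, and `generate` are hypothetical and chosen for illustration.

```python
import json
import urllib.request
from typing import Tuple

# Ollama's default local endpoint for single-prompt generation.
OLLAMA_URL = "http://localhost:11434/api/generate"


def split_model_id(model_id: str) -> Tuple[str, str]:
    """Split a catalog ID such as 'ollama:gemma4:12b' at the first colon.

    Assumption: the text before the first colon names the provider and the
    rest is the provider's own model tag.
    """
    provider, _, tag = model_id.partition(":")
    return provider, tag


def build_request(model_id: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for Ollama's /api/generate."""
    provider, tag = split_model_id(model_id)
    if provider != "ollama":
        raise ValueError(f"expected an 'ollama:' model ID, got {model_id!r}")
    # stream=False asks for one JSON object instead of a stream of chunks.
    payload = {"model": tag, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model_id: str, prompt: str) -> str:
    """Send the request and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(model_id, prompt)) as resp:
        return json.loads(resp.read())["response"]


# Example call, assuming the server is running and the model is installed:
# print(generate("ollama:gemma4:12b", "Summarise the Gemma model family."))
```

Keeping request construction separate from the network call makes the payload easy to inspect before anything is sent to the server.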

About the creator

Google

Google and Google DeepMind build the Gemini family of multimodal models, the Imagen and Nano Banana image models, the Lyria music models, and the Veo video models.

deepmind.google ↗

Samples

Sample outputs generated with Gemma 4 12B will appear here.

More from Google

Gemini 3.5 Flash

Text · Cloud

Google's latest fast Gemini 3 model: frontier-level reasoning at Flash-level latency and cost, tuned for agentic workflows and iterative coding.

Gemini 3.1 Pro

Text · Cloud

Google's most capable multimodal model with deep reasoning and 1M token context.

Gemini 3 Flash

Text · Cloud

Fast, cost-efficient Gemini model for high-throughput text and multimodal tasks.

Gemini 2.5 Flash

Text · Cloud

Balanced Gemini model with strong reasoning and a 1M token context window.

Nano Banana 2

Image · Cloud

Ultra-fast image generation model optimised for speed and creative output.

Gemini 3.1 Flash

Audio · Cloud

Fast Gemini text-to-speech with natural-sounding expressive voices.
