Gemma 4
Four-variant model family released under Apache 2.0 (a notable license upgrade from prior Gemma releases). The lineup comprises a 31B dense flagship, a 26B-A4B MoE (26B total parameters, 4B active), and two edge variants, E4B and E2B. The larger models support a 256K-token context. All variants are multimodal across text, image, video, and audio.
The architecture features alternating attention, dual RoPE, Per-Layer Embeddings (PLE), and a shared KV cache. Gemma 4 31B scores 85.2% on MMLU Pro, 89.2% on AIME 2026, and 80.0% on LiveCodeBench v6, with a Codeforces Elo of 2,150. It is ranked the #3 open model on the Arena AI text leaderboard and crossed 2M downloads shortly after release.
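Alternating attention interleaves cheap local sliding-window layers with occasional full global-attention layers, which is what makes long contexts like 256K tractable. A minimal sketch of such a layer schedule, assuming a hypothetical 5:1 local-to-global ratio (the ratio used by earlier Gemma generations; Gemma 4's actual ratio is not stated here):

```python
# Sketch of an alternating attention schedule. The 5:1 local-to-global
# ratio is an assumption for illustration, not a published Gemma 4 spec.
def attention_schedule(num_layers: int, local_per_global: int = 5) -> list[str]:
    """Return the attention type for each layer: sliding-window 'local'
    layers interleaved with full 'global' attention layers."""
    schedule = []
    for layer in range(num_layers):
        # Every (local_per_global + 1)-th layer attends globally.
        if (layer + 1) % (local_per_global + 1) == 0:
            schedule.append("global")
        else:
            schedule.append("local")
    return schedule

print(attention_schedule(12))
# → ['local', 'local', 'local', 'local', 'local', 'global',
#    'local', 'local', 'local', 'local', 'local', 'global']
```

Because only the global layers attend over the full context, the KV cache for local layers stays bounded by the window size regardless of sequence length.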
Model Details
Architecture: Dense
Parameters: 31B
Context window: 256,000 tokens
Variants
| Name | Parameters | Notes |
|---|---|---|
| Gemma 4 31B | 31B | Dense flagship, 256K context |
| Gemma 4 26B-A4B | 26B (4B active) | MoE, 256K context |
| Gemma 4 E4B | 8B | Effective 4.5B, 128K context |
| Gemma 4 E2B | 5.1B | Effective 2.3B, 128K context |
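The active/effective parameter counts in the table are what drive per-token compute: an MoE model routes each token through only a subset of its weights. A back-of-the-envelope comparison using the common ~2 × N_active FLOPs-per-token rule of thumb for decoder forward passes (the rule is a general approximation, not a published Gemma 4 figure):

```python
# Rough per-token compute per variant, using the ~2 * N_active
# FLOPs/token rule of thumb for transformer decoding. Parameter counts
# are taken from the variants table above.
VARIANTS = {
    "Gemma 4 31B":     31.0e9,  # dense: all parameters active
    "Gemma 4 26B-A4B":  4.0e9,  # MoE: ~4B active of 26B total
    "Gemma 4 E4B":      4.5e9,  # effective parameters
    "Gemma 4 E2B":      2.3e9,  # effective parameters
}

def flops_per_token(active_params: float) -> float:
    """Approximate forward-pass FLOPs per generated token."""
    return 2 * active_params

for name, n_active in VARIANTS.items():
    print(f"{name}: ~{flops_per_token(n_active) / 1e9:.0f} GFLOPs/token")
```

By this estimate the 26B-A4B MoE decodes at roughly the cost of a 4B dense model while keeping 26B parameters of capacity, which is the usual motivation for the MoE design.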