Step-3
model paper321B parameter MoE model (38B active) designed for high-efficiency multimodal reasoning with cost-effective decoding.
Outputs 2
Step-3
model321B parameter MoE model (38B active) designed for high-efficiency multimodal reasoning.
Architecture MOE
Parameters 321B
Active params 38B
Released Jul 31, 2025.
Step-3: Large yet Affordable Model-System Co-design
paperFlagship LLM paper detailing cost-effective decoding for the Step-3 MoE model.
arXiv: 2507.19427