First audio LLM to unlock test-time compute scaling via Chain-of-Thought reasoning. 33B parameters. Surpasses Gemini 2.5 Pro on audio understanding benchmarks.

Model Details

Architecture DENSE
Parameters 33B

Variants

Name Parameters Notes
Step-Audio-R1 Released Nov 27, 2025
Step-Audio-R1.1 Released Jan 14, 2026. Dual-Brain Architecture for real-time spoken dialogue.

Paper

arXiv: 2511.15848

audioreasoningopen-weight

Notes

arXiv submission Nov 19, 2025. Model weights released Nov 27, 2025.