Significant upgrade to R1 with enhanced logic and reduced hallucinations.

Model Details

Architecture MOE
Parameters 671B
Active params 37B
Base model deepseek-r1
reasoningtrainingopen-weight