OpenAI's first reasoning model, trained with reinforcement learning to perform extended chain-of-thought reasoning before answering. Uses hidden "thinking tokens" to explore multiple approaches, verify steps, and self-correct — pioneering test-time compute scaling where inference cost trades for accuracy.

o1 achieved breakthrough results on reasoning benchmarks: 83.3% on AIME 2024 (vs 13.4% for GPT-4o), ranked in the 89th percentile on Codeforces, and exceeded human PhD-level accuracy on GPQA-Diamond (78%). 200K context (full o1, December 2024). Also released as o1-mini for cost-efficient reasoning. AA Intelligence Index: 31. Proprietary.

Model Details

Context window 200,000

Variants

Name Parameters Notes
o1-preview Initial preview, September 2024
o1-mini Cost-efficient reasoning variant
o1 Full release, December 2024
frontierreasoning

Related