Kimi k1.5
model paperReasoning-focused model using reinforcement learning, claimed to match OpenAI's o1-preview in math and coding. Technical report details scaling RL with LLMs.
Outputs 2
Kimi k1.5 Model
modelKimi k1.5 Tech Report: Scaling RL with LLMs
paperarXiv: 2501.12599