RL framework from SII's GAIR lab for GPU kernel optimization: co-trains Skill Selection, Policy, and Skill Summary agents on a single backbone, admitting new skills into the library only after execution-verified speedups. daVinci-kernel-14B reaches 37.2% / 70.6% / 32.2% on KernelBench Levels 1/2/3 under the Fast₁ threshold, outperforming Dr.Kernel-14B by up to 46% on Level 3. Ships 8B/14B SFT and RL checkpoints plus the SFT dataset. Extends SII's daVinci line into systems-level code optimization.

Outputs 2

daVinci-kernel models

model

Variants

Name Parameters Notes
daVinci-kernel-8B-SFT 8B
daVinci-kernel-8B-RL 8B
daVinci-kernel-14B-SFT 14B
daVinci-kernel-14B-RL 14B

daVinci-kernel Paper

paper
codingrlinfrastructureopen-weight

Related