ASI-Arch
paper"AlphaGo Moment for Model Architecture Discovery." 1,773 autonomous experiments over 20,000 GPU hours discovered 106 novel SOTA linear attention architectures. Top model (PathGateFusionNet) outperforms Mamba2 and Gated DeltaNet.
Paper
arXiv: 2507.18074