Hardware-aligned and natively trainable sparse attention mechanism.

Paper

Citations 2
attentionarchitectureefficiency

More Links