AI Lab Tracker
Labs
Timeline
NSA: Native Sparse Attention
paper
2025-02-16
DeepSeek
Hardware-aligned and natively trainable sparse attention mechanism.
Paper (arXiv)
Paper
arXiv:
2502.11089
attention
architecture
efficiency
More Links
LinkedIn: Insights from Core Developer