Mixture of Block Attention mechanism for efficient long-context processing.

Paper

Citations 2
scalingattentionarchitecture

More Links