A new technical paper titled “Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention” was published by researchers at DeepSeek, Peking University, and the University of Washington.