FSA: An Alternative Efficient Implementation of Native Sparse Attention Kernel
To appear in International Conference on Learning Representations (ICLR), 2026
Ran Yan*, Youhe Jiang*, Zhuoming Chen, Haohui Mai, Beidi Chen, and Binhang Yuan
To appear in Machine Learning and Systems (MLSys), 2026
Ran Yan*, Youhe Jiang*, Xiaonan Nie, Fangcheng Fu, Bin Cui, and Binhang Yuan
Preprint on arXiv, 2026
Jiarui Zhang*, Yuchen Yang*, Ran Yan*, Zhiyu Mei, Liyuan Zhang, Daifeng Li, Wei Fu, Jiaxuan Gao, Shusheng Xu, Yi Wu, and Binhang Yuan
To appear in International Conference on Learning Representations (ICLR), 2026
Guangxin He, Shen Nie, Fengqi Zhu, Yuankang Zhao, Tianyi Bai, Ran Yan, Jie Fu, Chongxuan Li, and Binhang Yuan
Preprint on arXiv, 2025
Ran Yan*, Youhe Jiang*, Tianyuan Wu*, Jiaxuan Gao, Zhiyu Mei, Wei Fu, Haohui Mai, Wei Wang, Yi Wu, and Binhang Yuan
Published in Neural Information Processing Systems (NeurIPS), 2025
Rizhen Hu, Yutong He, Ran Yan, Mou Sun, Binhang Yuan, and Kun Yuan
Published in International Conference on Learning Representations (ICLR), 2025
Youhe Jiang*, Ran Yan*, and Binhang Yuan
Published in International Conference on Machine Learning (ICML), 2024
Youhe Jiang*, Ran Yan*, Xiaozhe Yao*, Yang Zhou, Beidi Chen, and Binhang Yuan