Publications

You can also find my articles on my Google Scholar profile.

Conference Papers


FSA: An Alternative Efficient Implementation of Native Sparse Attention Kernel

To appear in International Conference on Learning Representations (ICLR), 2026

Ran Yan*, Youhe Jiang*, Zhuoming Chen, Haohui Mai, Beidi Chen, and Binhang Yuan

HexiScale: Facilitating Large Language Model Training over Heterogeneous Hardware

To appear in Machine Learning and Systems (MLSys), 2026

Ran Yan*, Youhe Jiang*, Xiaonan Nie, Fangcheng Fu, Bin Cui, and Binhang Yuan

AReaL-DTA: Dynamic Tree Attention for Efficient Reinforcement Learning of Large Language Models

Preprint on arXiv, 2026

Jiarui Zhang*, Yuchen Yang*, Ran Yan*, Zhiyu Mei, Liyuan Zhang, Daifeng Li, Wei Fu, Jiaxuan Gao, Shusheng Xu, Yi Wu, and Binhang Yuan

UltraLLaDA: Scaling the Context Length to 128K for Diffusion Large Language Models

To appear in International Conference on Learning Representations (ICLR), 2026

Guangxin He, Shen Nie, Fengqi Zhu, Yuankang Zhao, Tianyi Bai, Ran Yan, Jie Fu, Chongxuan Li, and Binhang Yuan

AReaL-Hex: Accommodating Asynchronous RL Training over Heterogeneous GPUs

Preprint on arXiv, 2025

Ran Yan*, Youhe Jiang*, Tianyuan Wu*, Jiaxuan Gao, Zhiyu Mei, Wei Fu, Haohui Mai, Wei Wang, Yi Wu, and Binhang Yuan

MeCeFO: Enhancing LLM Training Robustness via Fault-Tolerant Optimization

Published in Neural Information Processing Systems (NeurIPS), 2025

Rizhen Hu, Yutong He, Ran Yan, Mou Sun, Binghang Yuan, Kun Yuan

HexGen-2: Disaggregated Generative Inference of LLMs in Heterogeneous Environment

Published in International Conference on Learning Representations (ICLR), 2025

Youhe Jiang*, Ran Yan*, and Binhang Yuan

HexGen: Generative Inference of Large Language Model over Heterogeneous Environment

Published in International Conference on Machine Learning (ICML), 2024

Youhe Jiang*, Ran Yan*, Xiaozhe Yao*, Yang Zhou, Beidi Chen, and Binhang Yuan