Zhisheng Ye

Zhisheng Ye

Ph.D · Peking University

Latest

ICS 26 Memory Offloading for Large Language Model Inference with Latency SLO Guarantees
Chenxiang Ma, Zhisheng Ye, Hanyu Zhao, Zehua Yang, Tianhao Fu, Jiaxun Han, Jie Zhang, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Yong Li, Diyu Zhou (2026)
Euro-Par 26 FlowGPU: Transparent and Efficient GPU Checkpointing and Restore
Zehua Yang, Xiao Zheng, Yonghao Zou, Junyang Zhang, Zhisheng Ye, Feng Xie, Xiaolin Wang, Yingwei Luo, Zhenlin Wang, Diyu Zhou (2026)
ICCD 22 Tear Up the Bubble Boom: Lessons Learned From a Deep Learning Research and Development Cluster
Zehua Yang, Zhisheng Ye, Tianhao Fu, Jing Luo, Xiong Wei, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Tianwei Zhang (2022)
TPDS Astraea: A Fair Deep Learning Scheduler for Multi-Tenant GPU Clusters
Zhisheng Ye, Peng Sun, Wei Gao, Tianwei Zhang, Xiaolin Wang, Shengen Yan, Yingwei Luo (2022)