Weihao Cui
Xtra Computing Group, National University of Singapore
Currently, I am a postdoc research fellow working with Prof. Bingsheng He in National University of Singapore. I also work closely with Prof. Minyi Guo, Prof. Quan Chen and Dr. Han Zhao.
I obtained my Ph.D. degree at Department of Computer Science and Engineering (CSE), Shanghai Jiao Tong University, China, supervised by Prof. Quan Chen on AI System and Cloud Computing.
News
| Dec 10, 2025 | Two paper accepted to NSDI 2026. |
|---|---|
| Nov 08, 2025 | One paper accepted to HPCA 2026. |
| Oct 15, 2025 | Serving as the Web Chair for ICPP 2026. Submission details are available in the Call for Papers. |
| Sep 28, 2025 | PD-Multiplexing has been merged into SGLang |
Selected publications
- arXivOptimizing SLO-oriented LLM Serving with PD-MultiplexingarXiv preprint arXiv:2504.14489, 2025
- arXivEfficient Function-as-a-Service for Large Language Models with TIDALarXiv preprint arXiv:2503.06421, 2025
- NSDI ’26Flare: Anomaly diagnostics for divergent llm training in gpu clusters of thousand-plus scaleIn Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026
- OSDI ’23Optimizing dynamic neural networks with BrainstormIn 17th USENIX Symposium on Operating Systems Design and Implementation, 2023
- ATC ’22DVABatch: Diversity-aware Multi-Entry Multi-Exit batching for efficient processing of DNN services on GPUsIn 2022 USENIX Annual Technical Conference, 2022
- SC ’21Enable simultaneous DNN services based on deterministic operator overlap and precise latency predictionIn Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2021