I am an Assistant Professor in the School of Data Science, The Chinese University of Hong Kong, Shenzhen. Prior to that, I received my PhD degree from the Hong Kong University of Science and Technology, where I was advised by Prof. Wei Wang. I obtained my B.Eng. degree in Software Engineering from Nanjing University.

Research Interests

My research interests cover the broad area of cloud computing and distributed systems, with a special focus on serverless computing, big data, and machine learning systems. My current research projects include:

  • Scalable ML and LLM inference systems
  • Intelligent cluster management on heterogeneous resources
  • Usable and efficient serverless computing platforms

Prospective Students and Current Openings

I am looking for self-motivated graduate students (PhD/MPhil starting 2025) and research assistants (RAs) to work with me on the above topics. Please see here for the details.

Recent/Selected Publications

  • “$\lambda$Scale: Enabling Fast Scaling for Serverless Large Language Model Inference,” in arXiv preprint arXiv:2502.09922.
  • “Pheromone: Restructuring Serverless Computing with Data-Centric Function Orchestration,” in IEEE/ACM Transactions on Networking, 2024.
  • “Following the Data, Not the Function: Rethinking Function Orchestration in Serverless Computing,” in USENIX NSDI 2023.
  • “Gillis: Serving Large Neural Networks in Serverless Functions with Automatic Model Partitioning,” in IEEE ICDCS 2021. (Best Paper Runner-Up)
  • “MArk: Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving,” in USENIX ATC 2019.

News

  • 2024/10: One paper accepted to TON.
  • 2024/09: Serving on the PC of ICDCS’25. Please consider submitting.
  • 2024/07: Awarded the CCF-Huawei Populus Grove Fund. Thanks to CCF and Huawei!
  • 2024/04: Serving on the PC of NSDI’25. Please consider submitting.
  • 2023/12: I joined CUHK-SZ as an Assistant Professor.