About Me

I am a Postdoc at UC Berkeley, working with Ion Stoica and Joseph E. Gonzalez in the Sky Computing Lab. I completed my Ph.D. in Computer Science at UCLA in 2024, where I was advised by Harry Xu and Miryung Kim.

My research focuses on systems and machine learning. I build operating systems and runtime systems to make datacenter faster and more efficient.

I have been awarded the Amazon & UCLA Science Hub Fellowship (2021), was a finalist for the Jane Street Graduate Research Fellowship (2023), and received the Outstanding Graduate Student Research Award (2024) at UCLA.

Contact

Publications

  1. Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving

    Shan Yu, Jiarong Xing, Yifan Qiao, Mingyuan Ma, Yangmin Li, Yang Wang, Shuo Yang, Zhiqiang Xie, Shiyi Cao, Ke Bao, Ion Stoica, Harry Xu, Ying Sheng

    Arxiv 2025

  2. PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications

    Kuntai Du, Bowen Wang, Chen Zhang, Yiming Cheng, Qing Lan, Hejian Sang, Yihua Cheng, Jiayi Yao, Xiaoxuan Liu, Yifan Qiao, Ion Stoica, Junchen Jiang

    Arxiv 2025

  3. ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving

    Yifan Qiao, Shu Anzai, Shan Yu, Haoran Ma, Yang Wang, Miryung Kim, Harry Xu

    Arxiv 2024

  4. DRust: Language-Guided Distributed Shared Memory with Fine Granularity, Full Transparency, and Ultra Efficiency

    Haoran Ma, Yifan Qiao, Shi Liu, Shan Yu, Chenxi Wang, Yuanjiang Ni, Qingda Lu, Jiesheng Wu, Yiying Zhang, Miryung Kim, and Harry Xu.

    OSDI 2024

    [full version] [code]

  5. A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications

    Lei Chen*, Shi Liu*, Chenxi Wang, Haoran Ma, Yifan Qiao, Zhe Wang, Chenggang Wu, Youyou Lu, Xiaobing Feng, Huimin Cui, Shan Lu, and Harry Xu.

    OSDI 2024

    [full version] [code]

  6. Harvesting Idle Memory for Application-managed Soft State with Midas

    Yifan Qiao, Zhenyuan Ruan, Haoran Ma, Adam Belay, Miryung Kim, and Harry Xu.

    NSDI 2024

    [code] [slides]

  7. Hermit: Low-Latency, High-Throughput, and Transparent Remote Memory via Feedback-Directed Asynchrony

    Yifan Qiao, Chenxi Wang, Zhenyuan Ruan, Adam Belay, Qingda Lu, Yiying Zhang, Miryung Kim, and Guoqing Harry Xu.

    NSDI 2023

    [code] [slides]

  8. Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory

    Chenxi Wang*, Yifan Qiao*, Haoran Ma, Shi Liu, Yiying Zhang, Wenguang Chen, Ravi Netravali, Miryung Kim, Guoqing Harry Xu. (*contributed equally)

    NSDI 2023

    [code] [slides]

  9. Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs

    John Thorpe*, Pengzhan Zhao*, Jonathan Eyolfson, Yifan Qiao, Zhihao Jia, Minjia Zhang, Ravi Netravali, Guoqing Harry Xu.

    NSDI 2023

    [full version] [code]
  10. MemLiner: Lining up Tracing and Application for a Far-Memory-Friendly Runtime

    Chenxi Wang*, Haoran Ma*, Shi Liu, Yifan Qiao, Jonathan Eyolfson, Christian Navasca, Shan Lu, Guoqing Harry Xu.

    OSDI 2022 (Awarded Jay Lepreau Best Paper)

    [code]

  11. Mako: A Low-Pause, High-Throughput Evacuating Collector for Memory-Disaggregated Datacenters

    Haoran Ma, Shi Liu, Chenxi Wang, Yifan Qiao, Michael D. Bond, Stephen M. Blackburn, Miryung Kim, Guoqing Harry Xu.

    PLDI 2022

    [code]

  12. Dorylus: Affordable, Scalable, and Accurate GNN Training over Billion-Edge Graphs

    John Thorpe*, Yifan Qiao*, Jonathan Eyolfson, Shen Teng, Guanzhou Hu, Zhihao Jia, Jinliang Wei, Keval Vora, Ravi Netravali, Miryung Kim, and Guoqing Harry Xu. (*contributed equally)

    OSDI 2021

    [full version] [code]

  13. Algorithm-Directed Crash Consistence in Non-Volatile Memory for HPC

    Shuo Yang, Kai Wu, Yifan Qiao, Dong Li, Jidong Zhai.

    IEEE International Conference on Cluster Computing (CLUSTER) 2017

Experience

  1. Visiting Student at MIT PDOS Group, hosted by Adam Belay.

    Worked on an elastic LLM serving system.

    Jun. 2023 - Sept. 2023

  2. Visiting Student at MIT PDOS Group, hosted by Adam Belay.

    Worked on Midas, a new OS memory abstraction for soft state.

    Jun. 2022 - Sept. 2022

  3. Research Intern at Alibaba Bellevue, Cloud Storage Team, hosted by Qingda Lu.

    Worked on Hermit, a high-performance and transparent remote memory system.

    Jun. 2021 - Sept. 2021

Service

Awards

Teaching

UCLA


Last updated 06/2025