Selected Publications

Google Scholar
2026

Learning-Augmented Heuristics: Simple yet Smart, Robust and Interpretable Cache Eviction.
Haocheng Xia, William Nixon, Bintang Dwi Marthen, Pranav Bhandari, Juncheng Yang.
20th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2026

MorphServe: Efficient and Workload-Aware LLM Serving via Runtime Layer Swapping and KV Cache Resizing. pdf
Zhaoyuan Su, Tingfeng Lan, Zirui Wang, Juncheng Yang, Yue Cheng.
Conference on Machine Learning and Systems (MLSys), 2026

ZipLLM: Towards Efficient LLM Storage Reduction via Tensor Deduplication and Delta Compression. pdf
Zirui Wang, Tingfeng Lan, Zhaoyuan Su, Juncheng Yang, Yue Cheng.
The 23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2026

Demystifying and Improving Lazy Promotion in Cache Eviction. pdf
Qinghan Chen, Muhammad Haekal Muhyidin Al-Araby, Ziyue Qiu, Zhuofan Chen, Rashmi Vinayak, Juncheng Yang.
52nd International Conference on Very Large Databases (VLDB), 2026

Clock2Q+: A Simple and Efficient Replacement Algorithm for Metadata Cache in VMware vSAN. pdf
Yiyan Zhai, Bintang Dwi Marthen, Sarath Balivada, Vamsi Sudhakar Bojji, Eric Knauft, Jitender Rohilla, Jiaqi Zuo, Quanxing Liu, Maxime Austruy, Wenguang Wang, Juncheng Yang.
52nd International Conference on Very Large Databases (VLDB), 2026

2025

Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow. pdf
Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak.
The 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2025

2024

SIEVE: Cache eviction can be simple, effective, and scalable.
Juncheng Yang, Yazhuo Zhang, Yao Yue, Ymir Vigfusson, K. V. Rashmi.
USENIX ;login:, 2024

SIEVE is Simpler than LRU: an Efficient Turn-Key Eviction Algorithm for Web Caches. website blog pdf slides
Yazhuo Zhang*, Juncheng Yang* (corresponding author), Yao Yue, Ymir Vigfusson, K. V. Rashmi.
The 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2024
SIEVE receives the community (best paper) award.
Featured on TLDR newsletter, blog coverage: Marc from AWS, blog post in Korean.
Independent implementations: golang-fifo, use in DNSCrypt.
Open-source libraries (not an extensive list): Golang, Python, JavaScript, Rust, Java, Swift, Ruby, Nim, Zig.

2023

FIFO queues are all you need for cache eviction. website blog pdf slides video
Juncheng Yang, Yazhuo Zhang, Ziyue Qiu, Yao Yue, K. V. Rashmi.
The 29th ACM Symposium on Operating Systems Principles (SOSP), 2023
Discussed in Aleksey's Online Reading Group. Covered in blog [1], [2], [3] in Korean, [4] in Chinese, [5] in Japanese, newsletters [1], [2].
Open-source libraries (not an extensive list): Rust, Golang, JavaScript, Python, C++.

FIFO Can be Better than LRU: the Power of Lazy Promotion and Quick Demotion. pdf slides
Juncheng Yang, Ziyue Qiu, Yazhuo Zhang, Yao Yue, K. V. Rashmi.
The 19th Workshop on Hot Topics in Operating Systems (HotOS), 2023

GL-Cache: Group-level learning for efficient and high-performance caching. pdf slides
Juncheng Yang, Ziming Mao, Yao Yue, K. V. Rashmi.
The 21st USENIX Conference on File and Storage Technologies (FAST), 2023

FrozenHot Cache: Rethinking Cache Management for Modern Hardware. pdf slides
Ziyue Qiu, Juncheng Yang, Juncheng Zhang, Cheng Li, Xiaosong Ma, Qi Chen, Mao Yang, Yinlong Xu.
The European Conference on Computer Systems (EuroSys), 2023

Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding. pdf
Tianyu Zhang, Kaige Liu, Jack Kosaian, Juncheng Yang, K. V. Rashmi.
49th International Conference on Very Large Databases (VLDB), 2023

Latenseer: Causal Modeling of End-to-End Latency Distributions by Harnessing Distributed Tracing. pdf
Yazhuo Zhang, Rebecca Isaacs, Yao Yue, Juncheng Yang, Lei Zhang, Ymir Vigfusson.
ACM Symposium on Cloud Computing (SoCC), 2023

2022

C2DN: How to Harness Erasure Codes at the Edge for Efficient Content Delivery. pdf slides
Juncheng Yang, Anirudh Sabnis, Daniel S. Berger, K. V. Rashmi, Ramesh K. Sitaraman
The 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2022

2021

Kangaroo: Caching Billions of Tiny Objects on Flash. pdf
Sara McAllister*, Benjamin Berg*, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu, Daniel S. Berger, Nathan Beckmann, Gregory R. Ganger.
28th ACM Symposium on Operating Systems Principles (SOSP), 2021
Best Paper Award Invited for submission to ACM Transactions on Storage.

Segcache: a memory-efficient and scalable in-memory key-value cache for small objects. pdf slides
Juncheng Yang, Yao Yue, K. V. Rashmi.
18th USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2021
Community Award (Best Paper Award)

2020

A Large Scale Analysis of Hundreds of In-memory Cache Clusters at Twitter. pdf slides
Juncheng Yang, Yao Yue, K. V. Rashmi.
The 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2020
Recognized as one of the best storage papers and invited for submission to ACM Transactions on Storage. Discussed in Aleksey's Online Reading Group.

Pacemaker: Avoiding HeartBleed at Scale. pdf slides
Juncheng Yang, Ymir Vigfusson, Sathya Gunasekar, Dave Watson, Javier Gonzalez, Kate Iyer, K. V. Rashmi.
14th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2020

2018

Mutant: Balancing Storage Cost and Latency in LSM-Tree Data Stores. pdf
Hobin Yoon, Juncheng Yang, Sveinn F. Kristjansson, Steinn E. Sigurdarson, Ymir Vigfusson, Ada Gavrilovska.
ACM Symposium on Cloud Computing (SoCC), 2018

Skyline Diagram: Finding the Voronoi Counterpart for Skyline Queries. pdf
Jinfei Liu, Juncheng Yang, Li Xiong, Jian Pei, Jun Luo.
IEEE International Conference on Data Engineering (ICDE), 2018.
Extended version — IEEE Transactions on Knowledge and Data Engineering (TKDE), 2019.

2017

MITHRIL: Mining Sporadic Associations for Cache Prefetching. pdf
Juncheng Yang, Reza Karimi, Trausti Saemundsson, Avani Wildani, Ymir Vigfusson.
ACM Symposium on Cloud Computing (SoCC), 2017

Secure Skyline Queries on Cloud Platform. pdf
Jinfei Liu, Juncheng Yang, Li Xiong, Jian Pei.
IEEE International Conference on Data Engineering (ICDE), 2017.
Extended version — IEEE Transactions on Knowledge and Data Engineering (TKDE), 2018.

2016

Enabling Space Elasticity in Storage Systems. pdf
Helgi Sigurbjarnarson, Petur Orri Ragnarsson, Juncheng Yang, Ymir Vigfusson, Mahesh Balakrishnan.
ACM International Systems and Storage Conference (SYSTOR), 2016
Best student paper