Selected Publications [Google Scholar]

Helix: Serving large language models over heterogeneous gpus and network via max-flow. [pdf]
Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak.
The 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2025

SIEVE: Cache eviction can be simple, effective, and scalable.
Juncheng Yang, Yazhuo Zhang, Yao Yue, Ymir Vigfusson, K. V. Rashmi.
USENIX ;login:, 2024

SIEVE is Simpler than LRU: an Efficient Turn-Key Eviction Algorithm for Web Caches. [website] [blog] [pdf] [slides]
Yazhuo Zhang*, Juncheng Yang* (corresponding author), Yao Yue, Ymir Vigfusson, K. V. Rashmi.
The 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2024
SIEVE receives the community (best paper) award
Featured on TLDR newletter, blog coverage: Marc from AWS, blog post in Korean.
Independent implementations and evaluation: golang-fifo, use in DNSCrypt.
Open-source libraries (not an extensive list): Golang, Python, JavaScript, Rust, Java, Swift, Ruby, Nim, Zig.

FIFO queues are all you need for cache eviction. [website] [blog] [pdf] [slides] [video]
Juncheng Yang, Yazhuo Zhang, Ziyue Qiu, Yao Yue, K. V. Rashmi.
The 29th ACM Symposium on Operating Systems Principles (SOSP), 2023
Will be discussed in Aleksey's Online Reading Group. Covered in blog [1], [2], [3] in Korean, [4] in Chinese, [5] in Japanese, newsletters [1], [2] .
Open-source libraries (not an extensive list): Rust, Golang, JavaScript, Python, C++.

FIFO Can be Better than LRU: the Power of Lazy Promotion and Quick Demotion. [pdf] [slides]
Juncheng Yang, Ziyue Qiu, Yazhuo Zhang, Yao Yue, K. V. Rashmi.
The 19th Workshop on Hot Topics in Operating Systems (HotOS), 2023

GL-Cache: Group-level learning for efficient and high-performance caching. [pdf] [slides]
Juncheng Yang, Ziming Mao, Yao Yue, K. V. Rashmi.
The 21st USENIX Conference on File and Storage Technologies (FAST), 2023

FrozenHot Cache: Rethinking Cache Management for Modern Hardware. [pdf] [slides]
Ziyue Qiu, Juncheng Yang, Juncheng Zhang, Cheng Li, Xiaosong Ma, Qi Chen, Mao Yang, Yinlong Xu.
The European Conference on Computer Systems (EuroSys), 2023

Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding. [pdf]
Tianyu Zhang, Kaige Liu, Jack Kosaian, Juncheng Yang, K. V. Rashmi.
49th International Conference on Very Large Databases (VLDB), 2023

Latenseer: Causal Modeling of End-to-End Latency Distributions by Harnessing Distributed Tracing. [pdf]
Yazhuo Zhang, Rebecca Isaacs, Yao Yue, Juncheng Yang, Lei Zhang, Ymir Vigfusson.
ACM Symposium on Cloud Computing (SoCC), 2023

C2DN: How to Harness Erasure Codes at the Edge for Efficient Content Delivery. [pdf] [slides]
Juncheng Yang, Anirudh Sabnis, Daniel S. Berger, K. V. Rashmi, Ramesh K. Sitaraman
The 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2022

Kangaroo: Caching Billions of Tiny Objects on Flash. [pdf]
Sara McAllister*, Benjamin Berg*, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu, Daniel S. Berger, Nathan Beckmann, Gregory R. Ganger.
28th ACM Symposium on Operating Systems Principles (SOSP), 2021
Best Paper Award invited for submission to ACM Transactions on Storage

Segcache: a memory-efficient and scalable in-memory key-value cache for small objects. [pdf] [slides]
Juncheng Yang, Yao Yue, K. V. Rashmi.
18th USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2021
Community Award (Best Paper Award)

A Large Scale Analysis of Hundreds of In-memory Cache Clusters at Twitter. [pdf] [slides]
Juncheng Yang, Yao Yue, K. V. Rashmi.
The 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2020
Recognized as one of the best storage papers and invited for submission to ACM Transactions on Storage.
Discussed in Aleksey's Online Reading Group.

Pacemaker: Avoiding HeartBleed at Scale. [pdf] [slides]
Juncheng Yang, Ymir Vigfusson, Sathya Gunasekar, Dave Watson, Javier Gonzalez, Kate Iyer, K. V. Rashmi.
14th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2020

Mutant: Balancing Storage Cost and Latency in LSM-Tree Data Stores. [pdf]
Hobin Yoon, Juncheng Yang, Sveinn F. Kristjansson, Steinn E. Sigurdarson, Ymir Vigfusson, Ada Gavrilovska.
ACM Symposium on Cloud Computing (SoCC), 2018

Skyline Diagram: Finding the Voronoi Counterpart for kyline Queries [pdf]
Jinfei Liu, Juncheng Yang, Li Xiong, Jian Pei, Jun Luo.
IEEE International Conference on Data Engineering (ICDE), 2018.
Extended version - IEEE Transactions on Knowledge and Data Engineering (TKDE), 2019

MITHRIL: Mining Sporadic Associations for Cache Prefetching. [pdf]
Juncheng Yang, Reza Karimi, Trausti Saemundsson, Avani Wildani, Ymir Vigfusson.
ACM Symposium on Cloud Computing (SoCC), 2017

Secure Skyline Queries on Cloud Platform [pdf]
Jinfei Liu, Juncheng Yang, Li Xiong, Jian Pei.
IEEE International Conference on Data Engineering (ICDE), 2017.
Extended version - IEEE Transactions on Knowledge and Data Engineering (TKDE), 2018.

Enabling Space Elasticity in Storage Systems. [pdf]
Helgi Sigurbjarnarson, Petur Orri Ragnarsson, Juncheng Yang, Ymir Vigfusson, Mahesh Balakrishnan.
ACM International Systems and Storage Conference (SYSTOR), 2016
Best student paper