I am an Assistant Professor in the Computer Science Division in School of Applied Science and Engineering at Harvard University.
I am broadly interested in storage systems, data management and machine learning systems with particular interests on workload analysis, efficient storage, and sustainable system design. I like in-depth measurement and analysis to get deep understanding of systems and algorithms in the real world.
My works have received best-paper awards at NSDI'24, NSDI'21, SOSP'21, and SYSTOR'16 and have been deployed in production at Google, VMware, Twitter, Redpanda, Momento with many open-source libraries contributed by the community. My research has been sponsored by Meta, Google Cloud, and AWS. I am a 2020 Meta Fellow, a 2023 Google Cloud Research Innovator, and a 2023 Rising Star in Machine Learning and Systems.
I am looking for self-motivated Ph.D. students and Postdocs starting Fall 2025. I will also host two undergraduate students. If you are interested in working with me, please send me an email with your transcript and CV.
I have been very fortunate to work with many talented students. If you have worked with me, but not showing up on this page, please feel free to let me know.
SIEVE: Cache eviction can be simple, effective, and scalable.
Juncheng Yang, Yazhuo Zhang, Yao Yue, Ymir Vigfusson, K. V. Rashmi.
USENIX ;login: , 2024
SIEVE is Simpler than LRU: an Efficient Turn-Key Eviction Algorithm for Web Caches.
[website]
[blog]
[pdf]
[slides]
Yazhuo Zhang*, Juncheng Yang* (corresponding author), Yao Yue, Ymir Vigfusson, K. V. Rashmi.
The 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2024
SIEVE receives the community
(best paper) award
Featured on TLDR newletter,
blog coverage: Marc from
AWS,
blog post in Korean.
Independent implementations and evaluation: golang-fifo, use in DNSCrypt.
Open-source libraries (not an extensive list):
Golang,
Python,
JavaScript,
Rust,
Java,
Swift,
Ruby,
Nim,
Zig.
FIFO queues are all you need for cache eviction.
[website]
[blog]
[pdf]
[slides]
[video]
Juncheng Yang, Yazhuo Zhang, Ziyue Qiu, Yao Yue, K. V. Rashmi.
The 29th ACM Symposium on Operating Systems Principles (SOSP), 2023
Will be discussed in
Aleksey's Online Reading Group.
Covered in blog [1],
[2],
[3]
in Korean,
[4] in Chinese,
[5] in Japanese,
newsletters [1],
[2]
.
Open-source libraries (not an extensive list):
Rust,
Golang,
JavaScript,
Python,
C++.
FIFO Can be Better than LRU: the Power of Lazy Promotion and Quick Demotion.
[pdf]
[slides]
Juncheng Yang, Ziyue Qiu, Yazhuo Zhang, Yao Yue, K. V. Rashmi.
The 19th Workshop on Hot Topics in Operating Systems (HotOS), 2023
GL-Cache: Group-level learning for efficient and high-performance caching.
[pdf]
[slides]
Juncheng Yang, Ziming Mao, Yao Yue, K. V. Rashmi.
The 21st USENIX Conference on File and Storage Technologies (FAST), 2023
FrozenHot Cache: Rethinking Cache Management for Modern Hardware.
[pdf]
[slides]
Ziyue Qiu, Juncheng Yang, Juncheng Zhang, Cheng Li, Xiaosong Ma, Qi Chen, Mao Yang, Yinlong Xu.
The European Conference on Computer Systems (EuroSys), 2023
Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding.
[pdf]
Tianyu Zhang, Kaige Liu, Jack Kosaian, Juncheng Yang, K. V. Rashmi.
49th International Conference on Very Large Databases (VLDB), 2023
Latenseer: Causal Modeling of End-to-End Latency Distributions by Harnessing Distributed Tracing.
[pdf]
Yazhuo Zhang, Rebecca Isaacs, Yao Yue, Juncheng Yang, Lei Zhang, Ymir Vigfusson.
ACM Symposium on Cloud Computing (SoCC), 2023
C2DN: How to Harness Erasure Codes at the Edge for Efficient Content Delivery.
[pdf]
[slides]
Juncheng Yang, Anirudh Sabnis, Daniel S. Berger, K. V. Rashmi, Ramesh K. Sitaraman
The 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2022
Segcache: memory-efficient and high-throughput DRAM cache for small objects.
[pdf]
[slides]
Juncheng Yang, Yao Yue, K. V. Rashmi.
The 18th USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2021
Segcache receives the community award (one of the best papers)
This work has been adopted for production at Twitter. See a short summary of the work.
Open-source systems and libraries:
Pelikan,
Rust crate,
Kangaroo: Caching Billions of Tiny Objects on Flash.
[pdf]
[slides]
Sara McAllister, Benjamin Berg, Julian Tutuncu-Macias, Juncheng Yang, Sathya Gunasekar, Jimmy Lu,
Nathan Beckmann, Gregory R. Ganger.
28th ACM Symposium on Operating Systems Principles (SOSP), 2021
Extended version (invited
submission) - ACM
Transaction on storage (TOS) 2022
Kangaroo receives the best paper award
A Large Scale Analysis of Hundreds of In-memory Cache Clusters at Twitter.
[pdf]
[slides]
Juncheng Yang, Yao Yue, K. V. Rashmi.
The 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2020
Recognized as one of the best storage
papers and invited for submission to
ACM Transactions on Storage.
Discussed in Aleksey's
Online Reading Group.
PACEMAKER: Avoiding HeART Attacks in Storage Clusters with Disk-adaptive Redundancy.
[pdf][slides]
Saurabh Kadekodi, Francisco Maturana, Suhas Jayaram Subramanya, Juncheng Yang, K. V. Rashmi,
Gregory R. Ganger.
14th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2020
Mutant: Balancing Storage Cost and Latency in LSM-Tree Data Stores.
[pdf]
Hobin Yoon, Juncheng Yang, Sveinn F. Kristjansson, Steinn E. Sigurdarson, Ymir Vigfusson, Ada
Gavrilovska.
ACM Symposium on Cloud Computing (SoCC), 2018
Skyline Diagram: Finding the Voronoi Counterpart for kyline Queries
[pdf]
Jinfei Liu, Juncheng Yang, Li Xiong, Jian Pei, Jun Luo.
IEEE International Conference on Data Engineering (ICDE), 2018.
Extended version - IEEE Transactions on Knowledge and Data Engineering (TKDE), 2019
MITHRIL: Mining Sporadic Associations for Cache Prefetching.
[pdf]
Juncheng Yang, Reza Karimi, Trausti Saemundsson, Avani Wildani, Ymir Vigfusson.
ACM Symposium on Cloud Computing (SoCC), 2017
Secure Skyline Queries on Cloud Platform
[pdf]
Jinfei Liu, Juncheng Yang, Li Xiong, Jian Pei.
IEEE International Conference on Data Engineering (ICDE), 2017.
Extended version - IEEE Transactions on Knowledge and Data Engineering (TKDE), 2018.
Enabling Space Elasticity in Storage Systems.
[pdf]
Helgi Sigurbjarnarson, Petur Orri Ragnarsson, Juncheng Yang, Ymir Vigfusson, Mahesh Balakrishnan.
ACM International Systems and Storage Conference (SYSTOR), 2016
Best student paper
Juncheng Yang is an Assistant Professor in the School of Engineering and Applied Science at Harvard University. He received his Ph.D. in Computer Science from Carnegie Mellon University in 2024, His research interests broadly cover the efficiency, performance, reliability, and sustainability of large-scale data systems.
Juncheng's works have received best paper awards at NSDI'24, NSDI'21, SOSP'21, and SYSTOR'16. His OSDI'20 paper was recognized as one of the best storage papers at the conference and invited to ACM TOS'21. Juncheng received a Facebook Ph.D. Fellowship in 2020, was recognized as a Rising Star in machine learning and systems in 2023, and a Google Cloud Research Innovator in 2023.
His work, Segcache, has been adopted for production at Twitter and Momento. The two eviction algorithms he designed (S3-FIFO, SIEVE) have been adopted for production at Google, VMware, Redpanda, and several others, with over 20 open-source libraries available on GitHub. Moreover, the open-source cache simulation library he created, libCacheSim, has been used by almost 100 research institutes and companies.