| 2024 | 18th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2024, Santa Clara, CA, USA, July 10-12, 2024. Ada Gavrilovska, Douglas B. Terry |
| 2024 | A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications. Lei Chen, Shi Liu, Chenxi Wang, Haoran Ma, Yifan Qiao, Zhe Wang, Chenggang Wu, Youyou Lu, Xiaobing Feng, Huimin Cui, Shan Lu, Harry Xu |
| 2024 | ACCL+: an FPGA-Based Collective Engine for Distributed Applications. Zhenhao He, Dario Korolija, Yu Zhu, Benjamin Ramhorst, Tristan Laan, Lucian Petrica, Michaela Blott, Gustavo Alonso |
| 2024 | Anvil: Verifying Liveness of Cluster Management Controllers. Xudong Sun, Wenjie Ma, Jiawei Tyler Gu, Zicheng Ma, Tej Chajed, Jon Howell, Andrea Lattuada, Oded Padon, Lalith Suresh, Adriana Szekeres, Tianyin Xu |
| 2024 | Automatically Reasoning About How Systems Code Uses the CPU Cache. Rishabh R. Iyer, Katerina J. Argyraki, George Candea |
| 2024 | Beaver: Practical Partial Snapshots for Distributed Cloud Services. Liangcheng Yu, Xiao Zhang, Haoran Zhang, John Sonchack, Dan R. K. Ports, Vincent Liu |
| 2024 | Burstable Cloud Block Storage with Data Processing Units. Junyi Shu, Kun Qian, Ennan Zhai, Xuanzhe Liu, Xin Jin |
| 2024 | Caravan: Practical Online Learning of In-Network ML Models with Labeling Agents. Qizheng Zhang, Ali Imran, Enkeleda Bardhi, Tushar Swamy, Nathan Zhang, Muhammad Shahbaz, Kunle Olukotun |
| 2024 | ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications. Yuhan Liu, Chengcheng Wan, Kuntai Du, Henry Hoffmann, Junchen Jiang, Shan Lu, Michael Maire |
| 2024 | Chop Chop: Byzantine Atomic Broadcast to the Network Limit. Martina Camaioni, Rachid Guerraoui, Matteo Monti, Pierre-Louis Roman, Manuel Vidigueira, Gauthier Voron |
| 2024 | DRust: Language-Guided Distributed Shared Memory with Fine Granularity, Full Transparency, and Ultra Efficiency. Haoran Ma, Yifan Qiao, Shi Liu, Shan Yu, Yuanjiang Ni, Qingda Lu, Jiesheng Wu, Yiying Zhang, Miryung Kim, Harry Xu |
| 2024 | DSig: Breaking the Barrier of Signatures in Data Centers. Marcos K. Aguilera, Clément Burgelin, Rachid Guerraoui, Antoine Murat, Athanasios Xygkis, Igor Zablotchi |
| 2024 | Data-flow Availability: Achieving Timing Assurance in Autonomous Systems. Ao Li, Ning Zhang |
| 2024 | Detecting Logic Bugs in Database Engines via Equivalent Expression Transformation. Zu-Ming Jiang, Zhendong Su |
| 2024 | DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving. Yinmin Zhong, Shengyu Liu, Junda Chen, Jianbo Hu, Yibo Zhu, Xuanzhe Liu, Xin Jin, Hao Zhang |
| 2024 | Enabling Tensor Language Model to Assist in Generating High-Performance Tensor Programs for Deep Learning. Yi Zhai, Sijia Yang, Keyu Pan, Renwei Zhang, Shuo Liu, Chao Liu, Zichun Ye, Jianmin Ji, Jie Zhao, Yu Zhang, Yanyong Zhang |
| 2024 | Fairness in Serving Large Language Models. Ying Sheng, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica |
| 2024 | FairyWREN: A Sustainable Cache for Emerging Write-Read-Erase Flash Interfaces. Sara McAllister, Yucong Wang, Benjamin Berg, Daniel S. Berger, George Amvrosiadis, Nathan Beckmann, Gregory R. Ganger |
| 2024 | Fast and Scalable In-network Lock Management Using Lock Fission. Hanze Zhang, Ke Cheng, Rong Chen, Haibo Chen |
| 2024 | Flock: A Framework for Deploying On-Demand Distributed Trust. Darya Kaviani, Sijun Tan, Pravein Govindan Kannan, Raluca Ada Popa |
| 2024 | Harvesting Memory-bound CPU Stall Cycles in Software with MSH. Zhihong Luo, Sam Son, Sylvia Ratnasamy, Scott Shenker |
| 2024 | High-throughput and Flexible Host Networking for Accelerated Computing. Athinagoras Skiadopoulos, Zhiqiang Xie, Mark Zhao, Qizhe Cai, Saksham Agarwal, Jacob Adelmann, David Ahern, Carlo Contavalli, Michael D. Goldflam, Vitaly Mayatskikh, Raghu Raja, Daniel Walton, Rachit Agarwal, Shrijeet Mukherjee, Christos Kozyrakis |
| 2024 | Identifying On-/Off-CPU Bottlenecks Together with Blocked Samples. Minwoo Ahn, Jeongmin Han, Youngjin Kwon, Jinkyu Jeong |
| 2024 | Inductive Invariants That Spark Joy: Using Invariant Taxonomies to Streamline Distributed Protocol Proofs. Tony Nuda Zhang, Travis Hance, Manos Kapritsos, Tej Chajed, Bryan Parno |
| 2024 | InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management. Wonbeom Lee, Jungi Lee, Junghwan Seo, Jaewoong Sim |
| 2024 | IntOS: Persistent Embedded Operating System and Language Support for Multi-threaded Intermittent Computing. Yilun Wu, Byounguk Min, Mohannad Ismail, Wenjie Xiong, Changhee Jung, Dongyoon Lee |
| 2024 | IronSpec: Increasing the Reliability of Formal Specifications. Eli Goldweber, Weixin Yu, Seyed Armin Vakil-Ghahani, Manos Kapritsos |
| 2024 | Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation. Lei Wang, Lingxiao Ma, Shijie Cao, Quanlu Zhang, Jilong Xue, Yining Shi, Ningxin Zheng, Ziming Miao, Fan Yang, Ting Cao, Yuqing Yang, Mao Yang |
| 2024 | Llumnix: Dynamic Scheduling for Large Language Model Serving. Biao Sun, Ziming Huang, Hanyu Zhao, Wencong Xiao, Xinyi Zhang, Yong Li, Wei Lin |
| 2024 | MAST: Global Scheduling of ML Training across Geo-Distributed Datacenters at Hyperscale. Arnab Choudhury, Yang Wang, Tuomas Pelkonen, Kutta Srinivasan, Abha Jain, Shenghao Lin, Delia David, Siavash Soleimanifard, Michael Chen, Abhishek Yadav, Ritesh Tijoriwala, Denis Samoylov, Chunqiang Tang |
| 2024 | Managing Memory Tiers with CXL in Virtualized Environments. Yuhong Zhong, Daniel S. Berger, Carl A. Waldspurger, Ryan Wee, Ishwar Agarwal, Rajat Agarwal, Frank Hady, Karthik Kumar, Mark D. Hill, Mosharaf Chowdhury, Asaf Cidon |
| 2024 | Massively Parallel Multi-Versioned Transaction Processing. Shujian Qian, Ashvin Goel |
| 2024 | Microkernel Goes General: Performance and Compatibility in the HongMeng Production Microkernel. Haibo Chen, Xie Miao, Ning Jia, Nan Wang, Yu Li, Nian Liu, Yutao Liu, Fei Wang, Qiang Huang, Kun Li, Hongyang Yang, Hui Wang, Jie Yin, Yu Peng, Fengwei Xu |
| 2024 | MonoNN: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures. Donglin Zhuang, Zhen Zheng, Haojun Xia, Xiafei Qiu, Junjie Bai, Wei Lin, Shuaiwen Leon Song |
| 2024 | Motor: Enabling Multi-Versioning for Distributed Transactions on Disaggregated Memory. Ming Zhang, Yu Hua, Zhijun Yang |
| 2024 | Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration. Lingfeng Xiang, Zhen Lin, Weishu Deng, Hui Lu, Jia Rao, Yifan Yuan, Ren Wang |
| 2024 | Optimizing Resource Allocation in Hyperscale Datacenters: Scalability, Usability, and Experiences. Neeraj Kumar, Pol Mauri Ruiz, Vijay Menon, Igor Kabiljo, Mayank Pundir, Andrew Newell, Daniel Lee, Liyuan Wang, Chunqiang Tang |
| 2024 | Parrot: Efficient Serving of LLM-based Applications with Semantic Variable. Chaofan Lin, Zhenhua Han, Chengruidong Zhang, Yuqing Yang, Fan Yang, Chen Chen, Lili Qiu |
| 2024 | Performance Interfaces for Hardware Accelerators. Jiacheng Ma, Rishabh R. Iyer, Sahand Kashani, Mahyar Emami, Thomas Bourgeat, George Candea |
| 2024 | Ransom Access Memories: Achieving Practical Ransomware Protection in Cloud with DeftPunk. Zhongyu Wang, Yaheng Song, Erci Xu, Haonan Wu, Guangxun Tong, Shizhuo Sun, Haoran Li, Jincheng Liu, Lijun Ding, Rong Liu, Jiaji Zhu, Jiesheng Wu |
| 2024 | Sabre: Hardware-Accelerated Snapshot Compression for Serverless MicroVMs. Nikita Lazarev, Varun Gohil, James Tsai, Andy Anderson, Bhushan Chitlur, Zhiru Zhang, Christina Delimitrou |
| 2024 | Secret Key Recovery in a Global-Scale End-to-End Encryption System. Graeme Connell, Vivian Fang, Rolfe Schmidt, Emma Dauterman, Raluca Ada Popa |
| 2024 | ServerlessLLM: Low-Latency Serverless Inference for Large Language Models. Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii Ustiugov, Yuvraj Patel, Luo Mai |
| 2024 | ServiceLab: Preventing Tiny Performance Regressions at Hyperscale through Pre-Production Testing. Mike Chow, Yang Wang, William Wang, Ayichew Hailu, Rohan Bopardikar, Bin Zhang, Jialiang Qu, David Meisner, Santosh Sonawane, Yunqi Zhang, Rodrigo Paim, Mack Ward, Ivor Huang, Matt McNally, Daniel Hodges, Zoltan Farkas, Caner Gocmen, Elvis Huang, Chunqiang Tang |
| 2024 | SquirrelFS: using the Rust compiler to check file-system crash consistency. Hayley LeBlanc, Nathan Taylor, James Bornholt, Vijay Chidambaram |
| 2024 | Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve. Amey Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee |
| 2024 | USHER: Holistic Interference Avoidance for Resource Optimized ML Inference. Sudipta Saha Shubha, Haiying Shen, Anand P. Iyer |
| 2024 | Using Dynamically Layered Definite Releases for Verifying the RefFS File System. Mo Zou, Dong Du, Mingkai Dong, Haibo Chen |
| 2024 | Validating the eBPF Verifier via State Embedding. Hao Sun, Zhendong Su |
| 2024 | VeriSMo: A Verified Security Module for Confidential VMs. Ziqiao Zhou, Anjali, Weiteng Chen, Sishuai Gong, Chris Hawblitzel, Weidong Cui |
| 2024 | When will my ML Job finish? Toward providing Completion Time Estimates through Predictability-Centric Scheduling. Abdullah Bin Faisal, Noah Martin, Hafiz Mohsin Bashir, Swaminathan Lamelas, Fahad R. Dogar |
| 2024 | dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving. Bingyang Wu, Ruidong Zhu, Zili Zhang, Peng Sun, Xuanzhe Liu, Xin Jin |
| 2024 | nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training. Zhiqi Lin, Youshan Miao, Quanlu Zhang, Fan Yang, Yi Zhu, Cheng Li, Saeed Maleki, Xu Cao, Ning Shang, Yilei Yang, Weijiang Xu, Mao Yang, Lintao Zhang, Lidong Zhou |
| 2024 | μSlope: High Compression and Fast Search on Semi-Structured Logs. Rui Wang, Devin Gibson, Kirk Rodrigues, Yu Luo, Yun Zhang, Kaibo Wang, Yupeng Fu, Ting Chen, Ding Yuan |