| 2025 | Aegaeon: Effective GPU Pooling for Concurrent LLM Serving on the Market. Yuxing Xiang, Xue Li, Kun Qian, Yufan Yang, Diwen Zhu, Wenyuan Yu, Ennan Zhai, Xuanzhe Liu, Xin Jin, Jingren Zhou |
| 2025 | Aeolia: A Fast and Secure Userspace Interrupt-Based Storage Stack. Chuandong Li, Ran Yi, Zonghao Zhang, Jing Liu, Changwoo Min, Jie Zhang, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Diyu Zhou |
| 2025 | Analyzing and Enhancing ArckFS: An Anecdotal Example of Benefits of Artifact Evaluation. Jonguk Jeon, Subeen Park, Sanidhya Kashyap, Sudarsun Kannan, Diyu Zhou, Jeehoon Kang |
| 2025 | Atmosphere: Practical Verified Kernels with Rust and Verus. Xiangdong Chen, Zhaofeng Li, Jerry Zhang, Vikram Narayanan, Anton Burtsev |
| 2025 | AutoMan: Facilitating Verified Distributed Systems Development Through Automatic Code Generation and Manual Optimizations. Zihao Zhang, Ti Zhou, Christa Jenkins, Omar Chowdhury, Shuai Mu |
| 2025 | CHERIoT RTOS: An OS for Fine-Grained Memory-Safe Compartments on Low-Cost Embedded Devices. Saar Amar, Tony Chen, David Chisnall, Nathaniel Wesley Filardo, Ben Laurie, Hugo Lefeuvre, Kunyan Liu, Simon W. Moore, Robert Norton-Wright, Margo I. Seltzer, Yucong Tao, Robert N. M. Watson, Hongyan Xia |
| 2025 | COpter: Efficient Large-Scale Resource-Allocation via Continual Optimization. Suhas Jayaram Subramanya, Don Kurian Dennis, Virginia Smith, Gregory R. Ganger |
| 2025 | Characterizing Mobile SoC for Accelerating Heterogeneous LLM Inference. Le Chen, Dahu Feng, Erhu Feng, Yingrui Wang, Rong Zhao, Yubin Xia, Pinjie Xu, Haibo Chen |
| 2025 | CortenMM: Efficient Memory Management with Strong Correctness Guarantees. Junyang Zhang, Xiangcan Xu, Yong-Hao Zou, Zhe Tang, Xinyi Wan, Kang Hu, Siyuan Wang, Wenbo Xu, Di Wang, Hao Chen, Lin Huang, Shoumeng Yan, Yuval Tamir, Yingwei Luo, Xiaolin Wang, Huashan Yu, Zhenlin Wang, Hongliang Tian, Diyu Zhou |
| 2025 | Coyote v2: Raising the Level of Abstraction for Data Center FPGAs. Benjamin Ramhorst, Dario Korolija, Maximilian Jakob Heer, Jonas Dann, Luhao Liu, Gustavo Alonso |
| 2025 | DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism. Chenyu Jiang, Zhenkun Cai, Ye Tian, Zhen Jia, Yida Wang, Chuan Wu |
| 2025 | Demeter: A Scalable and Elastic Tiered Memory Solution for Virtualized Cloud via Guest Delegation. Junliang Hu, Zhisheng Hu, Chun-Feng Wu, Ming-Chang Yang |
| 2025 | Device-Assisted Live Migration of RDMA Devices. Artem Y. Polyakov, Gal Shalom, Asaf Schwartz, Aviad Yehezkel, Omri Ben David, Omri Kahalon, Ariel Shahar, Liran Liss |
| 2025 | DiffKV: Differentiated Memory Management for Large Language Models with Parallel KV Compaction. Yanqi Zhang, Yuwei Hu, Runyuan Zhao, John C. S. Lui, Haibo Chen |
| 2025 | Fast End-to-End Performance Simulation of Accelerated Hardware-Software Stacks. Jiacheng Ma, Jonas Kaufmann, Emilien Guandalino, Rishabh R. Iyer, Thomas Bourgeat, George Candea |
| 2025 | Fawkes: Finding Data Durability Bugs in DBMSs via Recovered Data State Verification. Zhiyong Wu, Jie Liang, Jingzhou Fu, Wenqian Deng, Yu Jiang |
| 2025 | FlexGuard: Fast Mutual Exclusion Independent of Subscription. Victor Laforet, Sanidhya Kashyap, Calin Iorgulescu, Julia Lawall, Jean-Pierre Lozi |
| 2025 | Ghost in the Android Shell: Pragmatic Test-oracle Specification of a Production Hypervisor. Kayvan Memarian, Ben Simner, David Kaloper-Mersinjak, Thibaut Pérami, Peter Sewell |
| 2025 | HedraRAG: Co-Optimizing Generation and Retrieval for Heterogeneous RAG Workflows. Zhengding Hu, Vibha Murthy, Zaifeng Pan, Wanlu Li, Xiaoyi Fang, Yufei Ding, Yuke Wang |
| 2025 | How to Copy Memory? Coordinated Asynchronous Copy as a First-Class OS Service. Jingkai He, Yunpeng Dong, Dong Du, Mo Zou, Zhitai Yu, Yuxin Ren, Ning Jia, Yubin Xia, Haibo Chen |
| 2025 | IC-Cache: Efficient Large Language Model Serving via In-context Caching. Yifan Yu, Yu Gan, Nikhil Sarda, Lillian Tsai, Jiaming Shen, Yanqi Zhou, Arvind Krishnamurthy, Fan Lai, Hank Levy, David E. Culler |
| 2025 | Jenga: Effective Memory Management for Serving LLM with Heterogeneity. Chen Zhang, Kuntai Du, Shu Liu, Woosuk Kwon, Xiangxi Mo, Yufeng Wang, Xiaoxuan Liu, Kaichao You, Zhuohan Li, Mingsheng Long, Jidong Zhai, Joseph Gonzalez, Ion Stoica |
| 2025 | KNighter: Transforming Static Analysis with LLM-Synthesized Checkers. Chenyuan Yang, Zijie Zhao, Zichen Xie, Haoyu Li, Lingming Zhang |
| 2025 | KTransformers: Unleashing the Full Potential of CPU/GPU Hybrid Inference for MoE Models. Hongtao Chen, Weiyu Xie, Boxin Zhang, Jingqi Tang, Jiahao Wang, Jianwei Dong, Shaoyuan Chen, Ziwei Yuan, Chen Lin, Chengyu Qiu, Yuening Zhu, Qingliang Ou, Jiaqi Liao, Xianglin Chen, Zhiyuan Ai, Yongwei Wu, Mingxing Zhang |
| 2025 | LithOS: An Operating System for Efficient Machine Learning on GPUs. Patrick H. Coppock, Brian Zhang, Eliot H. Solomon, Vasilis Kypriotis, Leon Yang, Bikash Sharma, Dan Schatzberg, Todd C. Mowry, Dimitrios Skarlatos |
| 2025 | Loom: Efficient Capture and Querying of High-Frequency Telemetry. Franco Solleza, Shihang Li, William Sun, Richard Tang, Malte Schwarzkopf, Andrew Crotty, David Cohen, Nesime Tatbul, Stan Zdonik |
| 2025 | METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation. Siddhant Ray, Rui Pan, Zhuohan Gu, Kuntai Du, Shaoting Feng, Ganesh Ananthanarayanan, Ravi Netravali, Junchen Jiang |
| 2025 | Managing Scalable Direct Storage Accesses for GPUs with GoFS. Shaobo Li, Yirui Eric Zhou, Yuqi Xue, Yuan Xu, Jian Huang |
| 2025 | Mantle: Efficient Hierarchical Metadata Management for Cloud Object Storage Services. Jiahao Li, Biao Cao, Jielong Jian, Cheng Li, Sen Han, Yiduo Wang, Yufei Wu, Kang Chen, Zhihui Yin, Qiushi Chen, Jiwei Xiong, Jie Zhao, Fengyuan Liu, Yan Xing, Liguo Duan, Miao Yu, Ran Zheng, Feng Wu, Xianjun Meng |
| 2025 | Mercury: Unlocking Multi-GPU Operator Optimization for LLMs via Remote Memory Scheduling. Yue Guan, Xinwei Qiang, Zaifeng Pan, Daniels Johnson, Yuanwei Fang, Keren Zhou, Yuke Wang, Wanlu Li, Yufei Ding, Adnan Aziz |
| 2025 | Mitigating Application Resource Overload with Targeted Task Cancellation. Yigong Hu, Zeyin Zhang, Yicheng Liu, Yile Gu, Shuangyu Lei, Baris Kasikci, Peng Huang |
| 2025 | Moirai: Optimizing Placement of Data and Compute in Hybrid Clouds. Ziyue Qiu, Hojin Park, Jing Zhao, Yu-Kai Wang, Arnav Balyan, Gurmeet Singh, Yangjun Zhang, Suqiang (Jack) Song, Gregory R. Ganger, George Amvrosiadis |
| 2025 | Mycroft: Tracing Dependencies in Collective Communication Towards Reliable LLM Training. Yangtao Deng, Lei Zhang, Qinlong Wang, Xiaoyun Zhi, Xinlei Zhang, Zhuo Jiang, Haohan Xu, Lei Wang, Zuquan Song, Gaohong Liu, Yang Bai, Shuguang Wang, Wencong Xiao, Jianxi Ye, Minlan Yu, Hong Xu |
| 2025 | ORQ: Complex Analytics on Private Data with Strong Security Guarantees. Eli Baum, Sam Buxbaum, Nitin Mathai, Muhammad Faisal, Vasiliki Kalavri, Mayank Varia, John Liagouris |
| 2025 | Oasis: Pooling PCIe Devices Over CXL to Boost Utilization. Yuhong Zhong, Daniel S. Berger, Pantea Zardoshti, Enrique Saurez, Jacob Nelson, Dan R. K. Ports, Antonis Psistakis, Joshua Fried, Asaf Cidon |
| 2025 | Optimistic Recovery for High-Availability Software via Partial Process State Preservation. Yuzhuo Jing, Yuqi Mai, Angting Cai, Yi Chen, Wanning He, Xiaoyang Qian, Peter M. Chen, Peng Huang |
| 2025 | Orthrus: Efficient and Timely Detection of Silent User Data Corruption in the Cloud with Resource-Adaptive Computation Validation. Chenxiao Liu, Zhenting Zhu, Quanxi Li, Yanwen Xia, Yifan Qiao, Xiangyun Deng, Youyou Lu, Tao Xie, Huimin Cui, Zidong Du, Harry Xu, Chenxi Wang |
| 2025 | Pesto: Cooking up High Performance BFT Queries. Florian Suri-Payer, Neil Giridharan, Liam Arzola, Shir Cohen, Lorenzo Alvisi, Natacha Crooks |
| 2025 | PhoenixOS: Concurrent OS-level GPU Checkpoint and Restore with Validated Speculation. Xingda Wei, Zhuobin Huang, Tianle Sun, Yingyi Hao, Rong Chen, Mingcong Han, Jinyu Gu, Haibo Chen |
| 2025 | Pie: A Programmable Serving System for Emerging LLM Applications. In Gim, Zhiyao Ma, SeungSeob Lee, Lin Zhong |
| 2025 | PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications. Kuntai Du, Bowen Wang, Chen Zhang, Yiming Cheng, Qing Lan, Hejian Sang, Yihua Cheng, Jiayi Yao, Xiaoxuan Liu, Yifan Qiao, Ion Stoica, Junchen Jiang |
| 2025 | Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles, SOSP 2025, Lotte Hotel World, Seoul, Republic of Korea, October 13-16, 2025 Youjip Won, Youngjin Kwon, Ding Yuan, Rebecca Isaacs |
| 2025 | Proto: A Guided Journey through Modern OS Construction. Wonkyo Choe, Rongxiang Wang, Afsara Benazir, Felix Xiaozhu Lin |
| 2025 | Prove It to the Kernel: Precise Extension Analysis via Proof-Guided Abstraction Refinement. Hao Sun, Zhendong Su |
| 2025 | Quilt: Resource-aware Merging of Serverless Workflows. Yuxuan Zhang, Sebastian Angel |
| 2025 | Rearchitecting the Thread Model of In-Memory Key-Value Stores with μTPS. Youmin Chen, Jiwu Shu, Yanyan Shen, Linpeng Huang, Hong Mei |
| 2025 | Robust LLM Training Infrastructure at ByteDance. Borui Wan, Gaohong Liu, Zuquan Song, Jun Wang, Yun Zhang, Guangming Sheng, Shuguang Wang, Houmin Wei, Chenyuan Wang, Weiqiang Lou, Xi Yang, Mofan Zhang, Kaihua Jiang, Cheng Ren, Xiaoyun Zhi, Menghan Yu, Zhe Nan, Zhuolin Zheng, Baoquan Zhong, Qinlong Wang, Huan Yu, Jinxin Chi, Wang Zhang, Yuhan Li, Zixian Du, Sida Zhao, Yongqiang Zhang, Jingzhe Tang, Zherui Liu, Chuan Wu, Yanghua Peng, Haibin Lin, Wencong Xiao, Xin Liu, Liang Xiang |
| 2025 | Running Consistent Applications Closer to Users with Radical for Lower Latency. Nicolaas Kaashoek, Oleg Aleksandrovich Golev, Austin T. Li, Amit Levy, Wyatt Lloyd |
| 2025 | SAND: A New Programming Abstraction for Video-based Deep Learning. Juncheol Ye, Seungkook Lee, Hwijoon Lim, Jihyuk Lee, Uitaek Hong, Youngjin Kwon, Dongsu Han |
| 2025 | Sailor: Automating Distributed Training over Dynamic, Heterogeneous, and Geo-distributed Clusters. Foteini Strati, Zhendong Zhang, George Manos, Ixeia Sánchez Périz, Qinghao Hu, Tiancheng Chen, Berk Buzcu, Song Han, Pamela Delgado, Ana Klimovic |
| 2025 | Scalable Address Spaces using Concurrent Interval Skiplist. Tae Woo Kim, Youngjin Kwon, Jeehoon Kang |
| 2025 | Scalable Far Memory: Balancing Faults and Evictions. Yueyang Pan, Yash Lala, Musa Unal, Yujie Ren, SeungSeob Lee, Abhishek Bhattacharjee, Anurag Khandelwal, Sanidhya Kashyap |
| 2025 | Sleeping with One Eye Open: Fast, Sustainable Storage with Sandman. Yanbo Zhou, Erci Xu, Anisa Su, Jim Harris, Adam Manzanares, Steven Swanson |
| 2025 | Spirit: Fair Allocation of Interdependent Resources in Remote Memory Systems. SeungSeob Lee, Jachym Putta, Ziming Mao, Anurag Khandelwal |
| 2025 | TRIP: Coercion-resistant Registration for E-Voting with Verifiability and Usability in Votegral. Louis-Henri Merino, Simone Colombo, Rene Reyes, Alaleh Azhir, Shailesh Mishra, Pasindu Tennage, Mohammad Amin Raeisi, Haoqian Zhang, Jeff R. Allen, Bernhard Tellenbach, Vero Estrada-Galiñanes, Bryan Ford |
| 2025 | Tai Chi: A General High-Efficiency Scheduling Framework for SmartNICs in Hyperscale Clouds. Bang Di, Yun Xu, Kaijie Guo, Yibin Shen, Yu Li, Sanchuan Cheng, Hao Zheng, Fudong Qiu, Xiaokang Hu, Naixuan Guan, Dongdong Huang, Jinhu Li, Yi Wang, Yifang Yang, Jintao Li, Hang Yang, Chen Liang, Yilong Lv, Zikang Chen, Zhenwei Lu, Xiaohan Ma, Jiesheng Wu |
| 2025 | Tempo: Compiled Dynamic Deep Learning with Symbolic Dependence Graphs. Pedro F. Silvestre, Peter R. Pietzuch |
| 2025 | The Design and Implementation of a Virtual Firmware Monitor. Charly Castes, François Costa, Neelu S. Kalani, Timothy Roscoe, Nate Foster, Thomas Bourgeat, Edouard Bugnion |
| 2025 | TickTock: Verified Isolation in a Production Embedded OS. Vivien Rindisbacher, Evan Johnson, Nico Lehmann, Tyler Potyondy, Pat Pannuto, Stefan Savage, Deian Stefan, Ranjit Jhala |
| 2025 | Tiga: Accelerating Geo-Distributed Transactions with Synchronized Clocks. Jinkun Geng, Shuai Mu, Anirudh Sivaraman, Balaji Prabhakar |
| 2025 | Tock: From Research To Securing 10 Million Computers. Leon Schuermann, Brad Campbell, Branden Ghena, Philip Alexander Levis, Amit Levy, Pat Pannuto |
| 2025 | TrainVerify: Equivalence-Based Verification for Distributed LLM Training. Yunchi Lu, Youshan Miao, Cheng Tan, Peng Huang, Yi Zhu, Xian Zhang, Fan Yang |
| 2025 | Unlocking True Elasticity for the Cloud-Native Era with Dandelion. Tom Kuchler, Pinghe Li, Yazhuo Zhang, Lazar Cvetkovic, Boris Goranov, Tobias Stocker, Leon Thomm, Simone Kalbermatter, Tim Notter, Andrea Lattuada, Ana Klimovic |
| 2025 | WASIT: Deep and Continuous Differential Testing of WebAssembly System Interface Implementations. Yage Hu, Wen Zhang, Botang Xiao, Qingchen Kong, Boyang Yi, Suxin Ji, Songlan Wang, Wenwen Wang |
| 2025 | cache_ext: Customizing the Page Cache with eBPF. Tal Zussman, Ioannis Zarkadas, Jeremy Carin, Andrew Cheng, Hubertus Franke, Jonas Pfefferle, Asaf Cidon |
| 2025 | eBPF Misbehavior Detection: Fuzzing with a Specification-Based Oracle. Tao Lyu, Kumar Kartikeya Dwivedi, Thomas Bourgeat, Mathias Payer, Meng Xu, Sanidhya Kashyap |
| 2025 | μFork: Supporting POSIX fork Within a Single-Address-Space OS. John Alistair Kressel, Hugo Lefeuvre, Pierre Olivier |