SOSP A*

67 papers

YearTitle / Authors
2025Aegaeon: Effective GPU Pooling for Concurrent LLM Serving on the Market.
Yuxing Xiang, Xue Li, Kun Qian, Yufan Yang, Diwen Zhu, Wenyuan Yu, Ennan Zhai, Xuanzhe Liu, Xin Jin, Jingren Zhou
2025Aeolia: A Fast and Secure Userspace Interrupt-Based Storage Stack.
Chuandong Li, Ran Yi, Zonghao Zhang, Jing Liu, Changwoo Min, Jie Zhang, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Diyu Zhou
2025Analyzing and Enhancing ArckFS: An Anecdotal Example of Benefits of Artifact Evaluation.
Jonguk Jeon, Subeen Park, Sanidhya Kashyap, Sudarsun Kannan, Diyu Zhou, Jeehoon Kang
2025Atmosphere: Practical Verified Kernels with Rust and Verus.
Xiangdong Chen, Zhaofeng Li, Jerry Zhang, Vikram Narayanan, Anton Burtsev
2025AutoMan: Facilitating Verified Distributed Systems Development Through Automatic Code Generation and Manual Optimizations.
Zihao Zhang, Ti Zhou, Christa Jenkins, Omar Chowdhury, Shuai Mu
2025CHERIoT RTOS: An OS for Fine-Grained Memory-Safe Compartments on Low-Cost Embedded Devices.
Saar Amar, Tony Chen, David Chisnall, Nathaniel Wesley Filardo, Ben Laurie, Hugo Lefeuvre, Kunyan Liu, Simon W. Moore, Robert Norton-Wright, Margo I. Seltzer, Yucong Tao, Robert N. M. Watson, Hongyan Xia
2025COpter: Efficient Large-Scale Resource-Allocation via Continual Optimization.
Suhas Jayaram Subramanya, Don Kurian Dennis, Virginia Smith, Gregory R. Ganger
2025Characterizing Mobile SoC for Accelerating Heterogeneous LLM Inference.
Le Chen, Dahu Feng, Erhu Feng, Yingrui Wang, Rong Zhao, Yubin Xia, Pinjie Xu, Haibo Chen
2025CortenMM: Efficient Memory Management with Strong Correctness Guarantees.
Junyang Zhang, Xiangcan Xu, Yong-Hao Zou, Zhe Tang, Xinyi Wan, Kang Hu, Siyuan Wang, Wenbo Xu, Di Wang, Hao Chen, Lin Huang, Shoumeng Yan, Yuval Tamir, Yingwei Luo, Xiaolin Wang, Huashan Yu, Zhenlin Wang, Hongliang Tian, Diyu Zhou
2025Coyote v2: Raising the Level of Abstraction for Data Center FPGAs.
Benjamin Ramhorst, Dario Korolija, Maximilian Jakob Heer, Jonas Dann, Luhao Liu, Gustavo Alonso
2025DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism.
Chenyu Jiang, Zhenkun Cai, Ye Tian, Zhen Jia, Yida Wang, Chuan Wu
2025Demeter: A Scalable and Elastic Tiered Memory Solution for Virtualized Cloud via Guest Delegation.
Junliang Hu, Zhisheng Hu, Chun-Feng Wu, Ming-Chang Yang
2025Device-Assisted Live Migration of RDMA Devices.
Artem Y. Polyakov, Gal Shalom, Asaf Schwartz, Aviad Yehezkel, Omri Ben David, Omri Kahalon, Ariel Shahar, Liran Liss
2025DiffKV: Differentiated Memory Management for Large Language Models with Parallel KV Compaction.
Yanqi Zhang, Yuwei Hu, Runyuan Zhao, John C. S. Lui, Haibo Chen
2025Fast End-to-End Performance Simulation of Accelerated Hardware-Software Stacks.
Jiacheng Ma, Jonas Kaufmann, Emilien Guandalino, Rishabh R. Iyer, Thomas Bourgeat, George Candea
2025Fawkes: Finding Data Durability Bugs in DBMSs via Recovered Data State Verification.
Zhiyong Wu, Jie Liang, Jingzhou Fu, Wenqian Deng, Yu Jiang
2025FlexGuard: Fast Mutual Exclusion Independent of Subscription.
Victor Laforet, Sanidhya Kashyap, Calin Iorgulescu, Julia Lawall, Jean-Pierre Lozi
2025Ghost in the Android Shell: Pragmatic Test-oracle Specification of a Production Hypervisor.
Kayvan Memarian, Ben Simner, David Kaloper-Mersinjak, Thibaut Pérami, Peter Sewell
2025HedraRAG: Co-Optimizing Generation and Retrieval for Heterogeneous RAG Workflows.
Zhengding Hu, Vibha Murthy, Zaifeng Pan, Wanlu Li, Xiaoyi Fang, Yufei Ding, Yuke Wang
2025How to Copy Memory? Coordinated Asynchronous Copy as a First-Class OS Service.
Jingkai He, Yunpeng Dong, Dong Du, Mo Zou, Zhitai Yu, Yuxin Ren, Ning Jia, Yubin Xia, Haibo Chen
2025IC-Cache: Efficient Large Language Model Serving via In-context Caching.
Yifan Yu, Yu Gan, Nikhil Sarda, Lillian Tsai, Jiaming Shen, Yanqi Zhou, Arvind Krishnamurthy, Fan Lai, Hank Levy, David E. Culler
2025Jenga: Effective Memory Management for Serving LLM with Heterogeneity.
Chen Zhang, Kuntai Du, Shu Liu, Woosuk Kwon, Xiangxi Mo, Yufeng Wang, Xiaoxuan Liu, Kaichao You, Zhuohan Li, Mingsheng Long, Jidong Zhai, Joseph Gonzalez, Ion Stoica
2025KNighter: Transforming Static Analysis with LLM-Synthesized Checkers.
Chenyuan Yang, Zijie Zhao, Zichen Xie, Haoyu Li, Lingming Zhang
2025KTransformers: Unleashing the Full Potential of CPU/GPU Hybrid Inference for MoE Models.
Hongtao Chen, Weiyu Xie, Boxin Zhang, Jingqi Tang, Jiahao Wang, Jianwei Dong, Shaoyuan Chen, Ziwei Yuan, Chen Lin, Chengyu Qiu, Yuening Zhu, Qingliang Ou, Jiaqi Liao, Xianglin Chen, Zhiyuan Ai, Yongwei Wu, Mingxing Zhang
2025LithOS: An Operating System for Efficient Machine Learning on GPUs.
Patrick H. Coppock, Brian Zhang, Eliot H. Solomon, Vasilis Kypriotis, Leon Yang, Bikash Sharma, Dan Schatzberg, Todd C. Mowry, Dimitrios Skarlatos
2025Loom: Efficient Capture and Querying of High-Frequency Telemetry.
Franco Solleza, Shihang Li, William Sun, Richard Tang, Malte Schwarzkopf, Andrew Crotty, David Cohen, Nesime Tatbul, Stan Zdonik
2025METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation.
Siddhant Ray, Rui Pan, Zhuohan Gu, Kuntai Du, Shaoting Feng, Ganesh Ananthanarayanan, Ravi Netravali, Junchen Jiang
2025Managing Scalable Direct Storage Accesses for GPUs with GoFS.
Shaobo Li, Yirui Eric Zhou, Yuqi Xue, Yuan Xu, Jian Huang
2025Mantle: Efficient Hierarchical Metadata Management for Cloud Object Storage Services.
Jiahao Li, Biao Cao, Jielong Jian, Cheng Li, Sen Han, Yiduo Wang, Yufei Wu, Kang Chen, Zhihui Yin, Qiushi Chen, Jiwei Xiong, Jie Zhao, Fengyuan Liu, Yan Xing, Liguo Duan, Miao Yu, Ran Zheng, Feng Wu, Xianjun Meng
2025Mercury: Unlocking Multi-GPU Operator Optimization for LLMs via Remote Memory Scheduling.
Yue Guan, Xinwei Qiang, Zaifeng Pan, Daniels Johnson, Yuanwei Fang, Keren Zhou, Yuke Wang, Wanlu Li, Yufei Ding, Adnan Aziz
2025Mitigating Application Resource Overload with Targeted Task Cancellation.
Yigong Hu, Zeyin Zhang, Yicheng Liu, Yile Gu, Shuangyu Lei, Baris Kasikci, Peng Huang
2025Moirai: Optimizing Placement of Data and Compute in Hybrid Clouds.
Ziyue Qiu, Hojin Park, Jing Zhao, Yu-Kai Wang, Arnav Balyan, Gurmeet Singh, Yangjun Zhang, Suqiang (Jack) Song, Gregory R. Ganger, George Amvrosiadis
2025Mycroft: Tracing Dependencies in Collective Communication Towards Reliable LLM Training.
Yangtao Deng, Lei Zhang, Qinlong Wang, Xiaoyun Zhi, Xinlei Zhang, Zhuo Jiang, Haohan Xu, Lei Wang, Zuquan Song, Gaohong Liu, Yang Bai, Shuguang Wang, Wencong Xiao, Jianxi Ye, Minlan Yu, Hong Xu
2025ORQ: Complex Analytics on Private Data with Strong Security Guarantees.
Eli Baum, Sam Buxbaum, Nitin Mathai, Muhammad Faisal, Vasiliki Kalavri, Mayank Varia, John Liagouris
2025Oasis: Pooling PCIe Devices Over CXL to Boost Utilization.
Yuhong Zhong, Daniel S. Berger, Pantea Zardoshti, Enrique Saurez, Jacob Nelson, Dan R. K. Ports, Antonis Psistakis, Joshua Fried, Asaf Cidon
2025Optimistic Recovery for High-Availability Software via Partial Process State Preservation.
Yuzhuo Jing, Yuqi Mai, Angting Cai, Yi Chen, Wanning He, Xiaoyang Qian, Peter M. Chen, Peng Huang
2025Orthrus: Efficient and Timely Detection of Silent User Data Corruption in the Cloud with Resource-Adaptive Computation Validation.
Chenxiao Liu, Zhenting Zhu, Quanxi Li, Yanwen Xia, Yifan Qiao, Xiangyun Deng, Youyou Lu, Tao Xie, Huimin Cui, Zidong Du, Harry Xu, Chenxi Wang
2025Pesto: Cooking up High Performance BFT Queries.
Florian Suri-Payer, Neil Giridharan, Liam Arzola, Shir Cohen, Lorenzo Alvisi, Natacha Crooks
2025PhoenixOS: Concurrent OS-level GPU Checkpoint and Restore with Validated Speculation.
Xingda Wei, Zhuobin Huang, Tianle Sun, Yingyi Hao, Rong Chen, Mingcong Han, Jinyu Gu, Haibo Chen
2025Pie: A Programmable Serving System for Emerging LLM Applications.
In Gim, Zhiyao Ma, SeungSeob Lee, Lin Zhong
2025PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications.
Kuntai Du, Bowen Wang, Chen Zhang, Yiming Cheng, Qing Lan, Hejian Sang, Yihua Cheng, Jiayi Yao, Xiaoxuan Liu, Yifan Qiao, Ion Stoica, Junchen Jiang
2025Proceedings of the ACM SIGOPS 31st Symposium on Operating Systems Principles, SOSP 2025, Lotte Hotel World, Seoul, Republic of Korea, October 13-16, 2025
Youjip Won, Youngjin Kwon, Ding Yuan, Rebecca Isaacs
2025Proto: A Guided Journey through Modern OS Construction.
Wonkyo Choe, Rongxiang Wang, Afsara Benazir, Felix Xiaozhu Lin
2025Prove It to the Kernel: Precise Extension Analysis via Proof-Guided Abstraction Refinement.
Hao Sun, Zhendong Su
2025Quilt: Resource-aware Merging of Serverless Workflows.
Yuxuan Zhang, Sebastian Angel
2025Rearchitecting the Thread Model of In-Memory Key-Value Stores with μTPS.
Youmin Chen, Jiwu Shu, Yanyan Shen, Linpeng Huang, Hong Mei
2025Robust LLM Training Infrastructure at ByteDance.
Borui Wan, Gaohong Liu, Zuquan Song, Jun Wang, Yun Zhang, Guangming Sheng, Shuguang Wang, Houmin Wei, Chenyuan Wang, Weiqiang Lou, Xi Yang, Mofan Zhang, Kaihua Jiang, Cheng Ren, Xiaoyun Zhi, Menghan Yu, Zhe Nan, Zhuolin Zheng, Baoquan Zhong, Qinlong Wang, Huan Yu, Jinxin Chi, Wang Zhang, Yuhan Li, Zixian Du, Sida Zhao, Yongqiang Zhang, Jingzhe Tang, Zherui Liu, Chuan Wu, Yanghua Peng, Haibin Lin, Wencong Xiao, Xin Liu, Liang Xiang
2025Running Consistent Applications Closer to Users with Radical for Lower Latency.
Nicolaas Kaashoek, Oleg Aleksandrovich Golev, Austin T. Li, Amit Levy, Wyatt Lloyd
2025SAND: A New Programming Abstraction for Video-based Deep Learning.
Juncheol Ye, Seungkook Lee, Hwijoon Lim, Jihyuk Lee, Uitaek Hong, Youngjin Kwon, Dongsu Han
2025Sailor: Automating Distributed Training over Dynamic, Heterogeneous, and Geo-distributed Clusters.
Foteini Strati, Zhendong Zhang, George Manos, Ixeia Sánchez Périz, Qinghao Hu, Tiancheng Chen, Berk Buzcu, Song Han, Pamela Delgado, Ana Klimovic
2025Scalable Address Spaces using Concurrent Interval Skiplist.
Tae Woo Kim, Youngjin Kwon, Jeehoon Kang
2025Scalable Far Memory: Balancing Faults and Evictions.
Yueyang Pan, Yash Lala, Musa Unal, Yujie Ren, SeungSeob Lee, Abhishek Bhattacharjee, Anurag Khandelwal, Sanidhya Kashyap
2025Sleeping with One Eye Open: Fast, Sustainable Storage with Sandman.
Yanbo Zhou, Erci Xu, Anisa Su, Jim Harris, Adam Manzanares, Steven Swanson
2025Spirit: Fair Allocation of Interdependent Resources in Remote Memory Systems.
SeungSeob Lee, Jachym Putta, Ziming Mao, Anurag Khandelwal
2025TRIP: Coercion-resistant Registration for E-Voting with Verifiability and Usability in Votegral.
Louis-Henri Merino, Simone Colombo, Rene Reyes, Alaleh Azhir, Shailesh Mishra, Pasindu Tennage, Mohammad Amin Raeisi, Haoqian Zhang, Jeff R. Allen, Bernhard Tellenbach, Vero Estrada-Galiñanes, Bryan Ford
2025Tai Chi: A General High-Efficiency Scheduling Framework for SmartNICs in Hyperscale Clouds.
Bang Di, Yun Xu, Kaijie Guo, Yibin Shen, Yu Li, Sanchuan Cheng, Hao Zheng, Fudong Qiu, Xiaokang Hu, Naixuan Guan, Dongdong Huang, Jinhu Li, Yi Wang, Yifang Yang, Jintao Li, Hang Yang, Chen Liang, Yilong Lv, Zikang Chen, Zhenwei Lu, Xiaohan Ma, Jiesheng Wu
2025Tempo: Compiled Dynamic Deep Learning with Symbolic Dependence Graphs.
Pedro F. Silvestre, Peter R. Pietzuch
2025The Design and Implementation of a Virtual Firmware Monitor.
Charly Castes, François Costa, Neelu S. Kalani, Timothy Roscoe, Nate Foster, Thomas Bourgeat, Edouard Bugnion
2025TickTock: Verified Isolation in a Production Embedded OS.
Vivien Rindisbacher, Evan Johnson, Nico Lehmann, Tyler Potyondy, Pat Pannuto, Stefan Savage, Deian Stefan, Ranjit Jhala
2025Tiga: Accelerating Geo-Distributed Transactions with Synchronized Clocks.
Jinkun Geng, Shuai Mu, Anirudh Sivaraman, Balaji Prabhakar
2025Tock: From Research To Securing 10 Million Computers.
Leon Schuermann, Brad Campbell, Branden Ghena, Philip Alexander Levis, Amit Levy, Pat Pannuto
2025TrainVerify: Equivalence-Based Verification for Distributed LLM Training.
Yunchi Lu, Youshan Miao, Cheng Tan, Peng Huang, Yi Zhu, Xian Zhang, Fan Yang
2025Unlocking True Elasticity for the Cloud-Native Era with Dandelion.
Tom Kuchler, Pinghe Li, Yazhuo Zhang, Lazar Cvetkovic, Boris Goranov, Tobias Stocker, Leon Thomm, Simone Kalbermatter, Tim Notter, Andrea Lattuada, Ana Klimovic
2025WASIT: Deep and Continuous Differential Testing of WebAssembly System Interface Implementations.
Yage Hu, Wen Zhang, Botang Xiao, Qingchen Kong, Boyang Yi, Suxin Ji, Songlan Wang, Wenwen Wang
2025cache_ext: Customizing the Page Cache with eBPF.
Tal Zussman, Ioannis Zarkadas, Jeremy Carin, Andrew Cheng, Hubertus Franke, Jonas Pfefferle, Asaf Cidon
2025eBPF Misbehavior Detection: Fuzzing with a Specification-Based Oracle.
Tao Lyu, Kumar Kartikeya Dwivedi, Thomas Bourgeat, Mathias Payer, Meng Xu, Sanidhya Kashyap
2025μFork: Supporting POSIX fork Within a Single-Address-Space OS.
John Alistair Kressel, Hugo Lefeuvre, Pierre Olivier