OSDI A*

54 papers

YearTitle / Authors
202519th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2025, Boston, MA, USA, July 7-9, 2025.
Lidong Zhou, Yuanyuan Zhou
2025Achieving Low-Latency Graph-Based Vector Search via Aligning Best-First Search Algorithm with SSD.
Hao Guo, Youyou Lu
2025Basilisk: Using Provenance Invariants to Automate Proofs of Undecidable Protocols.
Tony Nuda Zhang, Keshav Singh, Tej Chajed, Manos Kapritsos, Bryan Parno
2025Bayesian Code Diffusion for Efficient Automatic Deep Learning Program Optimization.
Isu Jeong, Seulki Lee
2025BlitzScale: Fast and Live Large Model Autoscaling with O(1) Host Caching.
Dingyan Zhang, Haotian Wang, Yang Liu, Xingda Wei, Yizhou Shan, Rong Chen, Haibo Chen
2025Building Bridges: Safe Interactions with Foreign Languages through Omniglot.
Leon Schuermann, Jack Toubes, Tyler Potyondy, Pat Pannuto, Mae Milano, Amit Levy
2025Compass: Encrypted Semantic Search with High Accuracy.
Jinhao Zhu, Liana Patel, Matei Zaharia, Raluca Ada Popa
2025DecDEC: A Systems Approach to Advancing Low-Bit LLM Quantization.
Yeonhong Park, Jake Hyun, Hojoon Kim, Jae W. Lee
2025Decentralized, Epoch-based F2FS Journaling with Fine-grained Crash Recovery.
Yaotian Cui, Zhiqi Wang, Renhai Chen, Zili Shao
2025Decouple and Decompose: Scaling Resource Allocation with DeDe.
Zhiying Xu, Minlan Yu, Francis Y. Yan
2025Deriving Semantic Checkers from Tests to Detect Silent Failures in Production Distributed Systems.
Chang Lou, Dimas Shidqi Parikesit, Yujin Huang, Zhewen Yang, Senapati Diwangkara, Yuzhuo Jing, Achmad Imam Kistijantoro, Ding Yuan, Suman Nath, Peng Huang
2025Deterministic Client: Enforcing Determinism on Untrusted Machine Code.
Zachary Yedidia, Geoffrey Ramseyer, David Mazières
2025Disentangling the Dual Role of NIC Receive Rings.
Boris Pismenny, Adam Morrison, Dan Tsafrir
2025EMT: An OS Framework for New Memory Translation Architectures.
Siyuan Chai, Jiyuan Zhang, Jongyul Kim, Alan Wang, Fan Chung, Jovan Stojkovic, Weiwei Jia, Dimitrios Skarlatos, Josep Torrellas, Tianyin Xu
2025Enabling Efficient GPU Communication over Multiple NICs with FuseLink.
Zhenghang Ren, Yuxuan Li, Zilong Wang, Xinyang Huang, Wenxue Li, Kaiqiang Xu, Xudong Liao, Yijun Sun, Bowen Liu, Han Tian, Junxue Zhang, Mingfei Wang, Zhizhen Zhong, Guyue Liu, Ying Zhang, Kai Chen
2025Extending Applications Safely and Efficiently.
Yusheng Zheng, Tong Yu, Yiwei Yang, Yanpeng Hu, Xiaozheng Lai, Dan Williams, Andi Quinn
2025Fast and Synchronous Crash Consistency with Metadata Write-Once File System.
Yanqi Pan, Wen Xia, Yifeng Zhang, Xiangyu Zou, Hao Huang, Zhenhua Li, Chentao Wu
2025FineMem: Breaking the Allocation Overhead vs. Memory Waste Dilemma in Fine-Grained Disaggregated Memory Management.
Xiaoyang Wang, Yongkun Li, Kan Wu, Wenzhe Zhu, Yuqi Li, Yinlong Xu
2025Fork in the Road: Reflections and Optimizations for Cold Start Latency in Production Serverless Systems.
Xiaohu Chai, Tianyu Zhou, Keyang Hu, Jianfeng Tan, Tiwei Bie, Anqi Shen, Dawei Shen, Qi Xing, Shun Song, Tongkai Yang, Le Gao, Feng Yu, Zhengyu He, Dong Du, Yubin Xia, Kang Chen, Yu Chen
2025KPerfIR: Towards a Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads.
Yue Guan, Yuanwei Fang, Keren Zhou, Corbin Robeck, Manman Ren, Zhongkai Yu, Yufei Ding, Adnan Aziz
2025KRR: Efficient and Scalable Kernel Record Replay.
Tianren Zhang, Sishuai Gong, Pedro Fonseca
2025Kamino: Efficient VM Allocation at Scale with Latency-Driven Cache-Aware Scheduling.
David Domingo, Hugo Barbalho, Marco Molinaro, Kuan Liu, Abhisek Pan, David Dion, Thomas Moscibroda, Sudarsun Kannan, Ishai Menache
2025Low End-to-End Latency atop a Speculative Shared Log with Fix-Ante Ordering.
Shreesha G. Bhat, Tony Hong, Xuhao Luo, Jiyu Hu, Aishwarya Ganesan, Ramnatthan Alagappan
2025Mako: Speculative Distributed Transactions with Geo-Replication.
Weihai Shen, Yang Cui, Siddhartha Sen, Sebastian Angel, Shuai Mu
2025MettEagle: Costs and Benefits of Implementing Containers on Microkernels.
Till Miemietz, Viktor Reusch, Matthias Hille, Lars Wrenger, Jana Eisoldt, Jan Klötzke, Max Kurze, Adam Lackorzynski, Michael Roitzsch, Hermann Härtig
2025Mirage: A Multi-Level Superoptimizer for Tensor Programs.
Mengdi Wu, Xinhao Cheng, Shengyu Liu, Chunan Shi, Jianan Ji, Man Kit Ao, Praveen Velliengiri, Xupeng Miao, Oded Padon, Zhihao Jia
2025NanoFlow: Towards Optimal Large Language Model Serving Throughput.
Kan Zhu, Yufei Gao, Yilong Zhao, Liangyu Zhao, Gefei Zuo, Yile Gu, Dedong Xie, Zihao Ye, Keisuke Kamahori, Chien-Yu Lin, Ziren Wang, Stephanie Wang, Arvind Krishnamurthy, Baris Kasikci
2025Neutrino: Fine-grained GPU Kernel Profiling via Programmable Probing.
Songlin Huang, Chenshu Wu
2025OS Rendering Service Made Parallel with Out-of-Order Execution and In-Order Commit.
Yuanpei Wu, Chao Xu, Yubin Xia, Yang Yu, Ming Fu, Binyu Zang, Haibo Chen
2025Okapi: Decoupling Data Striping and Redundancy Grouping in Cluster File Systems.
Sanjith Athlur, Timothy Kim, Saurabh Kadekodi, Francisco Maturana, Xavier Ramos, Arif Merchant, K. V. Rashmi, Gregory R. Ganger
2025Paralegal: Practical Static Analysis for Privacy Bugs.
Justus Adam, Carolyn Zech, Livia Zhu, Sreshtaa Rajesh, Nathan Harbison, Mithi Jethwa, Will Crichton, Shriram Krishnamurthi, Malte Schwarzkopf
2025Picsou: Enabling Replicated State Machines to Communicate Efficiently.
Reginald Frank, Micah Murray, Chawinphat Tankuranand, Junseo Yoo, Ethan Xu, Natacha Crooks, Suyash Gupta, Manos Kapritsos
2025PipeThreader: Software-Defined Pipelining for Efficient DNN Execution.
Yu Cheng, Lei Wang, Yining Shi, Yuqing Xia, Lingxiao Ma, Jilong Xue, Yang Wang, Zhiwen Mo, Feiyang Chen, Fan Yang, Mao Yang, Zhi Yang
2025PoWER Never Corrupts: Tool-Agnostic Verification of Crash Consistency and Corruption Detection.
Hayley LeBlanc, Jacob R. Lorch, Chris Hawblitzel, Cheng Huang, Yiheng Tao, Nickolai Zeldovich, Vijay Chidambaram
2025Principles and Methodologies for Serial Performance Optimization.
Sujin Park, Mingyu Guan, Xiang Cheng, Taesoo Kim
2025QOS: Quantum Operating System.
Emmanouil Giortamis, Francisco Romão, Nathaniel Tornow, Pramod Bhatotia
2025QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach.
Shouyang Dong, Jun Bi, Di Huang, Jiaming Guo, Jianxing Xu, Ruibai Xu, Xinkai Song, Yifan Hao, Ling Li, Xuehai Zhou, Tianshi Chen, Qi Guo, Yunji Chen
2025Quake: Adaptive Indexing for Vector Search.
Jason Mohoney, Devesh Sarda, Mengze Tang, Shihabur Rahman Chowdhury, Anil Pacaci, Ihab F. Ilyas, Theodoros Rekatsinas, Shivaram Venkataraman
2025Quantum Virtual Machines.
Runzhou Tao, Hongzheng Zhu, Jason Nieh, Jianan Yao, Ronghui Gu
2025Scalio: Scaling up DPU-based JBOF Key-value Store with NVMe-oF Target Offload.
Xun Sun, Mingxing Zhang, Yingdi Shan, Kang Chen, Jinlei Jiang, Yongwei Wu
2025Skybridge: Bounded Staleness for Distributed Caches.
Robert Lyerly, Scott Pruett, Kevin Doherty, Greg Rogers, Nathan Bronson, John Hugg
2025Stripeless Data Placement for Erasure-Coded In-Memory Storage.
Jian Gao, Jiwu Shu, Bin Yan, Yuhao Zhang, Keji Huang
2025Söze: One Network Telemetry Is All You Need for Per-flow Weighted Bandwidth Allocation at Scale.
Weitao Wang, T. S. Eugene Ng
2025Tiered Memory Management Beyond Hotness.
Jinshu Liu, Hamid Hadian, Hanchen Xu, Huaicheng Li
2025Tigon: A Distributed Database for a CXL Pod.
Yibo Huang, Haowei Chen, Newton Ni, Yan Sun, Vijay Chidambaram, Dixin Tang, Emmett Witchel
2025Tintin: A Unified Hardware Performance Profiling Infrastructure to Uncover and Manage Uncertainty.
Ao Li, Marion Sudvarg, Zihan Li, Sanjoy K. Baruah, Chris Gill, Ning Zhang
2025To PRI or Not To PRI, That's the question.
Yun Wang, Liang Chen, Jie Ji, Xianting Tian, Ben Luo, Zhixiang Wei, Zhibai Huang, Kailiang Xu, Kaihuan Peng, Kaijie Guo, Ning Luo, Guangjian Wang, Shengdong Dai, Yibin Shen, Jiesheng Wu, Zhengwei Qi
2025Training with Confidence: Catching Silent Errors in Deep Learning Training with Automated Proactive Checks.
Yuxuan Jiang, Ziming Zhou, Boyu Xu, Beijie Liu, Runhui Xu, Peng Huang
2025Understanding Stragglers in Large Model Training Using What-if Analysis.
Jinkun Lin, Ziheng Jiang, Zuquan Song, Sida Zhao, Menghan Yu, Zhanghan Wang, Chenyuan Wang, Zuocheng Shi, Xiang Shi, Wei Jia, Zherui Liu, Shuguang Wang, Haibin Lin, Xin Liu, Aurojit Panda, Jinyang Li
2025WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training.
Zheng Wang, Anna Cai, Xinfeng Xie, Zaifeng Pan, Yue Guan, Weiwei Chu, Jie Wang, Shikai Li, Jianyu Huang, Chris Cai, Yuchen Hao, Yufei Ding
2025WaferLLM: Large Language Model Inference at Wafer Scale.
Congjie He, Yeqi Huang, Pei Mu, Ziming Miao, Jilong Xue, Lingxiao Ma, Fan Yang, Luo Mai
2025Weave: Efficient and Expressive Oblivious Analytics at Scale.
Mahdi Soleimani, Grace Jia, Anurag Khandelwal
2025XSched: Preemptive Scheduling for Diverse XPUs.
Weihang Shen, Mingcong Han, Jialong Liu, Rong Chen, Haibo Chen
2025ZEN: Empowering Distributed Training with Sparsity-driven Data Synchronization.
Zhuang Wang, Zhaozhuo Xu, Jingyi Xi, Yuke Wang, Anshumali Shrivastava, T. S. Eugene Ng