| 2026 | 2DIO: Configurable and Cache-Accurate Trace Generation for Storage Benchmarking. Yirong Wang, Isaac Khor, Peter Desnoyers |
| 2026 | A Case for Elastic Quantum Error Correction Decoders. Satvik Maurya, Abtin Molavi, Aws Albarghouthi, Swamit Tannu |
| 2026 | A Logically Disaggregated Cache for Replicated Storage Systems. Kiran Hombal, Henry Zhu, Shreesha Gopalakrishna Bhat, Neil Kaushikkar, Ramnatthan Alagappan, Aishwarya Ganesan |
| 2026 | AEP: Achieving Hierarchical Fault Tolerance in DSM Through Atomic Execution Protection. Zixuan Wang, Qi Wu, Hang Huang, Jia Rao, Hui Lu, Hao Fan, Zhuo Huang, Song Wu, Hai Jin |
| 2026 | AIMS: Cost-Efficient LLM-Based Agent Deployment in Hybrid Cloud-Edge Environments. Shiyi Liu, Haiying Shen, Shuai Che, Mahdi Ghandi, Mingqin Li |
| 2026 | ASIC-based Compression Accelerators for Storage Systems: Design, Placement, and Profiling Insights. Tao Lu, Jiapin Wang, Yelin Shan, Xiangping Zhang, Xiang Chen |
| 2026 | Accelerating Transactional Execution via Processing-In-Memory. André Lopes, Daniel Castro, Paolo Romano |
| 2026 | Accurate and Ultra-Fast Launch-Time Validation of Idempotency for GPU Kernels. Mingcong Han, Weihang Shen, Rong Chen, Haibo Chen |
| 2026 | AdaGen: Workload-Adaptive Cluster Scheduler for Latency-Optimal LLM Inference Serving. Sudipta Saha Shubha, Ayush Goel, Diman Zad Tootaghaj, Khaled Diab, Hardik Soni, K. K. Ramakrishnan, Puneet Sharma, Haiying Shen |
| 2026 | AdaServe: Accelerating Multi-SLO LLM Serving with SLO-Customized Speculative Decoding. Zikun Li, Zhuofu Chen, Remi Delacourt, Gabriele Oliaro, Zeyu Wang, Qinghan Chen, Shuhuai Lin, April Yang, Zhihao Zhang, Zhuoming Chen, Yi-Hsiang Lai, Xinhao Cheng, Xupeng Miao, Zhihao Jia |
| 2026 | Arena: Efficiently Training Large Models via Dynamic Scheduling and Adaptive Parallelism Co-Design. Chunyu Xue, Weihao Cui, Quan Chen, Chen Chen, Han Zhao, Shulai Zhang, Linmei Wang, Yan Li, Limin Xiao, Weifeng Zhang, Jing Yang, Bingsheng He, Minyi Guo |
| 2026 | Automated End-to-End Model Serving with Cooperative Compilation and Scheduling. Yikang Zhang, Junlong Chen, Wei Wang, Jia Liu, Nan Hu, Haipeng Dai |
| 2026 | Avicenna: Masking Slowdowns in Replicated State Machines with Counterfactual Evaluation. Christopher Hodsdon, Zijian Qin, Khiem Ngo, Siddhartha Sen, Ethan Katz-Bassett, Wyatt Lloyd |
| 2026 | BASK: Batch And SmartNIC-offloaded KSM. Chanshin Kwak, Jaehyeon Lee, Minkyu Jung, Changjun Lee, Youngjin Kwon |
| 2026 | Bridging the GPU Utilization Gap: Predictive Multi-Dimensional Resource Scheduling for AI Workloads. Yilei Lu, Dongbiao He, Teng Ma, Zhe Liu, Letian Ruan, Jinlei Jiang, Yongwei Wu |
| 2026 | CHARM: Chiplet Heterogeneity-Aware Runtime Mapping System. Alessandro Fogli, Bo Zhao, Peter R. Pietzuch, Jana Giceva |
| 2026 | CSnake: Detecting Self-Sustaining Cascading Failure via Causal Stitching of Fault Propagations. Shangshu Qian, Lin Tan, Yongle Zhang |
| 2026 | Canopy: Property-Driven Learning for Congestion Control. Chenxi Yang, Divyanshu Saxena, Rohit Dwivedula, Kshiteej Mahajan, Swarat Chaudhuri, Aditya Akella |
| 2026 | Carbon-Aware Continuous Learning for Sustainable Real-Time Machine Learning Analytics. Gwanjong Park, Osama Khan, Dongho Ha, Myeongjae Jeon, Euiseong Seo |
| 2026 | Chimera: Transparent and High-Performance ISAX Heterogeneous Computing via Binary Rewriting. Jiatai He, Qinglin Pan, Ruilin Zhao, Ji Qi, Kaiwen Liang, Jiahao Xu, Zhiyuan Li, Yuexiang Wang, Jiageng Yu, Yanjun Wu |
| 2026 | CofferOS: Hardening OS-level Virtualization with Rust. Minkyu Jung, Chanshin Kwak, Junho Ahn, Sunho Park, Changjun Lee, Jongyul Kim, Jeehoon Kang, Youngjin Kwon |
| 2026 | ColdCode: Cold Data Encoding for Enhanced Reliability and Lifetime in 3D NAND Flash. Qiao Li, Shangyu Wu, Zheng Wan, Yufei Cui, Jie Zhang, Chun Jason Xue |
| 2026 | Concord: Learning Network Configuration Contracts. Ryan Beckett, Francis Y. Yan, Raghunadha Reddy Pocha, Vineesh V. Raj, Ayyub Shaik, Siva Kesava Reddy Kakarla |
| 2026 | Crimson: Collaborative Parameter Updates for Efficient Pipeline Training of Large Language Models. Yapeng Jiang, Wuhui Chen, Ganhong Huang, Yuzhou Huang, Zicong Hong, Song Guo, Yue Yu |
| 2026 | DROPS: Managing Serverless Resource Pools in Microsoft Azure Functions. Ahmed Alquraan, Abdelrahman Baba, Rafael Mendes da Silva, Sameh Elnikety, Paul Batum, Yan Chen, Hamid Henry Safi, Seth Fine, Samer Al-Kiswany |
| 2026 | Demystifying Serverless Costs on Public Platforms: Bridging Billing, Architecture, and OS Scheduling. Changyuan Lin, Yuanzhi Ma, Mohammad Shahrad |
| 2026 | Digital Hole: Bypassing Commercial Audio DRM Solutions with DReaMcatcher. Björn Ruytenberg, Mohammad Sina Karvandi, Herbert Bos, Erik van der Kouwe, Asia Slowinska |
| 2026 | E-Cube: Event Enhanced Efficient Video Streaming for Drones. Jingao Xu, Longfei Shangguan, Danyang Li, Yunhao Liu, Zheng Yang |
| 2026 | ECCB: Boosting Block Propagation of Blockchain with Erasure-Coded Compact Block. Bingyi Cai, Shenggang Wan, Hong Jiang |
| 2026 | ECOTE: Priority-Aware Optical Restoration for WAN Traffic Engineering. Yiren Zhao, Kunling He, Zhiquan Wang, Ran Shu, Jilong Wang, Congcong Miao |
| 2026 | EMVOD: Elastic Multi-Path QUIC Scheduling for CDN Video-on-Demand Service. Ziqi Wei, Qing Li, Tianyun Zhao, Cheng Luo, Changkui Ouyang, Xiaofei Yu, Dayi Zhao, Yong Jiang |
| 2026 | Effective On-Hardware Fuzzing of Embedded Operating Systems. Yuheng Shen, Jianzhong Liu, Qiming Guo, Yifei Chu, Qiang Zhang, Heyuan Shi, Yu Jiang |
| 2026 | Efficient Data Passing for Serverless Inference Workflows: A GPU-Centric Approach. Hao Wu, Yaochen Liu, Minchen Yu, Qizhen Weng, Junxiao Deng, Yue Yu, Hao Fan, Song Wu, Wei Wang, Hai Jin |
| 2026 | Efficient ML Model Updates for Deeply Embedded Microcontrollers. Shishir G. Patil, Sam Kumar, Prabal Dutta, Joseph E. Gonzalez |
| 2026 | Efficient Multimodal Serving via Module Multiplexing. Zicong Hong, Yuyan Chen, Haoyue Zhang, Peng Li, Wuhui Chen, Song Guo, Xiaowei Shen |
| 2026 | Efficient and Adaptable Overlapping for Computation and Communication via Signaling and Reordering. Ke Hong, Xiuhong Li, Minxu Liu, Qiuli Mao, Tianqi Wu, Zixiao Huang, Lufang Chen, Zhong Wang, Yichong Zhang, Zhenhua Zhu, Guohao Dai, Yu Wang |
| 2026 | Enabling Packet Spraying over Commodity RNICs with In-Network Support. Xiangzhou Liu, Wenxue Li, Zihao Wang, Kai Chen |
| 2026 | Ethane: Debloating State Data using Compact Trie for Account-based Blockchain. Junmo Lee, Jaehun Kim, Jiyong Youn, Soo-Mook Moon |
| 2026 | Everything You Need to Know About Virtual Machine Live Migration Between Heterogeneous Processors. Kenta Ishiguro, Caleb Fonyuy-Asheri, Elouan Barraud, Renaud Lachaize, Yérom-David Bromberg, Alain Tchana |
| 2026 | FUR: Fast and Unlimited Reads on Persistent Memory Transactions. João Barreto, Daniel Castro, Paolo Romano, Alexandro Baldassin |
| 2026 | Fast and Parallelized Crash Consistency with Opportunistic Order Elimination. Jiahao Chen, Yanqi Pan, Wen Xia, Hao Huang, Peixin Zeng, Yuchen Shan |
| 2026 | Federated Fine-Tuning of Sparsely-Activated Large Language Models on Resource-Constrained Devices. Fahao Chen, Jie Wan, Peng Li, Zhou Su, Dongxiao Yu |
| 2026 | FicusDB: Scalable Multi-Versioned Authenticated Archival Storage. Hongbo Zhang, Maofan Yin, Robbert van Renesse |
| 2026 | Five Minutes of DDoS Brings down Tor: DDoS Attacks on the Tor Directory Protocol and Mitigations. Zhongtang Luo, Jianting Zhang, Akshat Neerati, Aniket Kate |
| 2026 | Fix: externalizing network I/O in serverless computing. Yuhan Deng, Akshay Srivatsan, Sebastian Ingino, Francis Chua, Yasmine Mitchell, Matthew Vilaysack, Keith Winstein |
| 2026 | FlashPS: Efficient Generative Image Editing with Mask-aware Caching and Scheduling. Xiaoxiao Jiang, Suyi Li, Lingyun Yang, Tianyu Feng, Zhipeng Di, Weiyi Lu, Guoxuan Zhu, Xiu Lin, Kan Liu, Yinghao Yu, Tao Lan, Guodong Yang, Lin Qu, Liping Zhang, Wei Wang |
| 2026 | FlexPipe: Adapting Dynamic LLM Serving Through Inflight Pipeline Refactoring in Fragmented Serverless Clusters. Yanying Lin, Shijie Peng, Chengzhi Lu, Chengzhong Xu, Kejiang Ye |
| 2026 | FlexiQ: Adaptive Mixed-Precision Quantization for Latency/Accuracy Trade-Offs in Deep Neural Networks. Jaemin Kim, Hongjun Um, Sungkyun Kim, Yongjun Park, Jiwon Seo |
| 2026 | FlexiWalker: Extensible GPU Framework for Efficient Dynamic Random Walks with Runtime Adaptation. Seongyeon Park, Jaeyong Song, Changmin Shin, Sukjin Kim, Junguk Hong, Jinho Lee |
| 2026 | From Imperative to Declarative: Towards LLM-friendly OS Interfaces for Boosted Computer-Use Agents. Yuan Wang, Mingyu Li, Haibo Chen |
| 2026 | Fuzzing Enterprise-Grade Blockchain Systems: Industrial Practice and Solutions. Fuchen Ma, Yuanliang Chen, Zhen Yan, Yuanhang Zhou, Yu Jiang, Mingchao Wan |
| 2026 | Garen: Reliable Cluster Management with Atomic State Reconciliation. Mingi Kim, Ahnjae Shin, Jaewoo Maeng, Myeongjae Jeon, Byung-Gon Chun |
| 2026 | GeDES: GPU-Driven Discrete Event Network Simulator. Qinyong Li, Zhiwei Zhao, Geyong Min, Zi Wang, Luwei Fu |
| 2026 | Gopher: Efficient Dynamic Graph Pattern Mining via DAG-Driven Execution. Yi Zhang, Yu Huang, Chaoqiang Liu, Haifeng Liu, Juntao Chen, Jingrui Yuan, Jianhui Yue, Xiaofei Liao, Hai Jin, Jingling Xue |
| 2026 | HARP: Orchestrating Automated Parallel Training on Heterogeneous GPU Clusters. Antian Liang, Zhigang Zhao, Kai Zhang, Xuri Shi, Chuantao Li, Chunxiao Wang, Zhenying He, Yinan Jing, X. Sean Wang |
| 2026 | Handling Network Faults in Distributed AI Training: Failover is Now an Option. Xin Zhe Khooi, Zhuo Jiang, Pan Xie, Zhigang Cui, Meng Wang, Yuze Jin, Pengfei Huo, Dongyang Wang, Lulu Chen, Lei Wang, Liaoyuan Feng, Xiaodong Liu, Peng Li, Qinlong Wang, Yang Bai, Yongcan Wang, Hao Jin, Jinshuai Sun, Shan Lu, Xiang Shi, Yingkai Zhao, Haiquan Chen, Yi Li, Jianxi Ye, Mun Choon Chan |
| 2026 | HetAuto: Cross-Cluster Auto-Parallelism for Heterogeneous Distributed Training. Guicheng Qi, Junwei Su, Liqi Yang, Tao Li, Tingwen Xie, Yerui Sun, Yuchen Xie, Chuan Wu |
| 2026 | High Throughput and Low Latency LLM Serving via Adaptive KV Caching. Wenyan Chen, Chengzhi Lu, Huanle Xu, Kejiang Ye, Chengzhong Xu |
| 2026 | In-Production Characterization of an Open Source Serverless Platform and New Scaling Strategies. Nima Nasiri, Nalin Munshi, Simon Daniel Moser, Marius Pirvu, Vijay Sundaresan, Daryl Maier, Thatta Premnath, Norman Böwing, Sathish Gopalakrishnan, Mohammad Shahrad |
| 2026 | KUNSERVE: Parameter-centric Memory Management for Efficient Memory Overloading Handling in LLM Serving. Rongxin Cheng, Yuxin Lai, Xingda Wei, Rong Chen, Haibo Chen |
| 2026 | LCMP: Distributed Long-Haul Cost-Aware Multi-Path Routing for Inter-Datacenter RDMA Networks. Dong-Yang Yu, Yuchao Zhang, Xiaodi Wang, Jun Wang, Wenfei Wu, Haipeng Yao, Wendong Wang, Ke Xu |
| 2026 | LLMFolder: Revisiting Constant Folding in Large Language Models. Gansen Hu, Zhaoguo Wang, Wei Huang, Jinglin Wei, Haibo Chen |
| 2026 | Laminar: A Scalable Asynchronous RL Post-Training Framework. Guangming Sheng, Yuxuan Tong, Borui Wan, Wang Zhang, Chaobo Jia, Xibin Wu, Yuqi Wu, Xiang Li, Chi Zhang, Yanghua Peng, Haibin Lin, Xin Liu, Chuan Wu |
| 2026 | Learn-to-Probe: Achieving Signal Distinguishability in Learning-based Congestion Control. Han Tian, Wenbo Li, Junxue Zhang, Xudong Liao, Decang Sun, Donghui Chen, Bin Huang, Wenxue Li, Yong Wang, Kai Chen |
| 2026 | Lessons Learned from Incorporating Formal Methods in Huawei Cloud Reliability. Claudia Cauli, Timo Lang, Shuo Chen, Sebti Mouelhi, Xin Jin, Subhajit Bandopadhyay, Xusheng Chen, Yazhi Feng, Haoze Song, Linhua Tang, Zhenli Sheng, Ananth Shrinivas Srinath |
| 2026 | LifeFuzz: Lifecycle-Guided Fuzzing for Windows Driver Cross-Handler Vulnerabilities. Chendong Yu, Yuekang Li, Yang Xiao, Jie Lu, Yeting Li, Defang Bo, Wei Huo |
| 2026 | LightDSA: Enabling Efficient DSA Through Hardware-Aware Transparent Optimization. Yuansen Wang, Teng Ma, Yuanhui Luo, Dongbiao He, Zheng Liu, Yunpeng Chai |
| 2026 | LoRAFusion: Efficient LoRA Fine-Tuning for LLMs. Zhanda Zhu, Qidong Su, Yaoyao Ding, Kevin Song, Shang Wang, Gennady Pekhimenko |
| 2026 | Low-Compilation-Cost Register Allocation in LLVM-Based Binary Translation. Xiangwei Meng, Chen Gao, Wei Li, Fengyuan Ren |
| 2026 | MFS: An Efficient Model Family Serving System for LLMs. Yunxuan Zhang, Hao Wang, Han Tian, Liu Yang, Xudong Liao, Wenxue Li, Ping Yin, Bowen Liu, Kai Chen |
| 2026 | MTTM: Dynamic Fast Memory Partitioning with Bandwidth Optimization for Multi-tenant Cloud. Changjun Lee, Sangjin Choi, Youngjin Kwon |
| 2026 | Matrix‑PIC: Harnessing Matrix Outer-product for High‑Performance Particle‑in‑Cell Simulations. Yizhuo Rao, Xingjian Cui, Jiabin Xie, Shangzhi Pang, Guangnan Feng, Jinhui Wei, Zhiguang Chen, Yutong Lu |
| 2026 | Maya: Optimizing Deep Learning Training Workloads using GPU Runtime Emulation. Srihas Yarlagadda, Amey Agrawal, Elton Pinto, Hakesh Darapaneni, Mitali Meratwal, Shivam Mittal, Pranavi Bajjuri, Srinivas Sridharan, Alexey Tumanov |
| 2026 | MegaScale-Data: Scaling DataLoader for Multisource Large Foundation Model Training. Juntao Zhao, Qi Lu, Wei Jia, Borui Wan, Lei Zuo, Junda Feng, Jianyu Jiang, Yangrui Chen, Shuaishuai Cao, Jialing He, Kaihua Jiang, Yuanzhe Hu, Shibiao Nong, Yanghua Peng, Haibin Lin, Chuan Wu |
| 2026 | MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production. Chao Jin, Ziheng Jiang, Zhihao Bai, Zheng Zhong, Juncai Liu, Xiang Li, Ningxin Zheng, Xi Wang, Cong Xie, Qi Huang, Wen Heng, Yiyuan Ma, Wenlei Bao, Size Zheng, Xuegui Zheng, Yanghua Peng, Haibin Lin, Xuanzhe Liu, Xin Jin, Xin Liu |
| 2026 | MegaScale-Omni: A Hyper-Scale, Workload-Resilient System for MultiModal LLM Training in Production. Chunyu Xue, Yangrui Chen, Jianyu Jiang, Ningxin Zheng, Junda Feng, Jingji Chen, Shixiong Zhao, Shen Yan, Yi Lin, Lei Shi, Zanbo Wang, Lishu Luo, Faming Wu, Haibin Lin, Yanghua Peng, Xin Liu, Quan Chen |
| 2026 | MesaFS: An I/O-Efficient Metadata Service for Distributed File Systems. Hao Guo, Jiwu Shu, Youyou Lu |
| 2026 | Million-Scale Text-to-Video Retrieval with Hyperdimensional Computing. Hyunsei Lee, Jaewoo Gwak, Shinhyoung Jang, Junyoung Lee, Yeseong Kim |
| 2026 | MinatoLoader: Accelerating Machine Learning Training Through Efficient Data Preprocessing. Rahma Nouaji, Stella Bitchebe, Ricardo Macedo, Oana Balmau |
| 2026 | Mitigating CDN Cache Misses with Scheduling: An Origin Shield for Billion-QPS Social Platforms. Zixuan Yang, Yimeng Xu, Jiaqi Zheng, Boxi Liu, Guihai Chen, Quan Xia, He Lin, Zhihai Huang, Shangce Yuan |
| 2026 | Multipath Collective Communication Beyond Scale-up Networks in GPU Clouds. Yuchen Xu, Jianglong Nie, Baojia Li, Mingzhuo Chen, Hao Lu, Guanyu Qu, Zhenchuan Liu, Shuangshuang Yin, Xiaojie Huang, Chunzhi He, Yinben Xia, Quan Wen, Xiang Li, Zekun He, Yachen Wang, Xianneng Zou, Congcong Miao, Wenfei Wu |
| 2026 | NecoFuzz: Effective Fuzzing of Nested Virtualization via Fuzz-Harness Virtual Machines. Reima Ishii, Takaaki Fukai, Takahiro Shinagawa |
| 2026 | Neuro-C: Neural Inference Shaped by Hardware Limits. Diletta Romano, Luca Mottola, Thiemo Voigt |
| 2026 | No More Translation at Runtime: LLM-Empowered Static Binary Translation. Zhibo Liu, Huaijin Wang, Wai Kin Wong, Daoyuan Wu, Shuai Wang |
| 2026 | Not A DPU in Name Only! Unleashing RDMA-capable DPUs in Multi-Tenant Serverless Clouds with NADINO. Shixiong Qi, Songyu Zhang, K. K. Ramakrishnan, Diman Zad Tootaghaj, Hardik Soni, Puneet Sharma |
| 2026 | NutCracker: A Compilation Framework for Hybrid DPU Architectures. Yihan Yang, Haifeng Sun, Antoine Kaufmann, Jialin Li |
| 2026 | On-device Semantic Selection Made Low Latency and Memory Efficient with Monolithic Forwarding. Jiahao Zhou, Chengliang Lin, Dingji Li, Mingkai Dong, Haibo Chen |
| 2026 | Once Rolling Hashing is Enough: Exploiting Rolling Hash Reuse in Delta Compression. Haoliang Tan, Wenhao Ou, Xiangyu Zou, Cai Deng, Yanqi Pan, Hao Huang, Zhaoquan Gu, Wen Xia |
| 2026 | OptiLog: Assigning Roles in Byzantine Consensus. Hanish Gogada, Christian Berger, Leander Jehl, Hans P. Reiser, Hein Meling |
| 2026 | PARD: Enhancing Goodput for Inference Pipeline via Proactive Request Dropping. Zhixin Zhao, Yitao Hu, Simin Chen, Mingfang Ji, Wei Yang, Yuhao Zhang, Laiping Zhao, Wenxin Li, Xiulong Liu, Wenyu Qu, Hao Wang |
| 2026 | PASS: A Power Adaptive Storage Server. Dedong Xie, Theano Stavrinos, Jonggyu Park, Simon Peter, Baris Kasikci, Thomas E. Anderson |
| 2026 | PaCaR: Improved Buffered I/O Locality on NUMA Systems with Page Cache Replication. Jérôme Coquisart, Julien Sopena, Redha Gouicem |
| 2026 | PatternSketch: General and Runtime Reconfigurable Time-series Network Traffic Pattern Detection. Yang Du, Dan Wang, He Huang, Hanwen Zhang, Jianzhi Tang, Fu Xiao, Yu-e Sun |
| 2026 | PiLLM: Resource-Efficient LLM Inference Using Workload Prediction. Yunqian Fan, Shihao Bai, Ruihao Gong, Zaijun Wang, Rui Fan |
| 2026 | PointShuffler: Accelerating Point Cloud Neural Networks on General-Purpose GPUs. Yangfan Li, Zhengjie Jin, Yue Tian, Mengquan Li, Fengxiao Tang, Ming Zhao, Cen Chen |
| 2026 | Practical and Efficient x86-64 Emulation on RISC-V. Xiongchuan Tan, Yang Liu, Sebastien Chevalier, Yangyu Chen, Xiaoyi Liu, Haohuan Fu |
| 2026 | Practical and Scalable RDMA Connection Sharing for HPC Workload. Yuejie Wang, Tuo Fang, Biyu Peng, Yang Cheng, Xin Sun, Chengchao Xu, Yuchen Tang, Yuxin Ren, Ning Jia, Xinwei Hu, Yunfei Du, Guyue (Grace) Liu |
| 2026 | Prediction-Informed Power Management for General-Purpose Compute Servers. Jonggyu Park, Simon Peter, Thomas E. Anderson |
| 2026 | Proactive Change Risk Detection in Production Cloud Systems: ByteDance's Experience. Jinyang Liu, Yichen Li, Tieying Zhang, Binbin Chen, Xiao He, Zhihan Jiang, Haipeng Zhang, Gang Wu, Yi Li |
| 2026 | Proceedings of the 21st European Conference on Computer Systems, EuroSys 2026, McEwan Hall/The University of Edinburgh, Edinburgh, Scotland, UK, April 27-30, 2026 Antonio Barbalace, Luo Mai, Roxana Geambasu, Peter R. Pietzuch |
| 2026 | Proteus: Heterogeneous FPGA Virtualization. Felix Gust, Shu Anzai, Charalampos Mainas, Atsushi Koshiba, Pramod Bhatotia |
| 2026 | Pyramid: A Secure, Resource-Efficient, and Pluggable Kubernetes for Multi-Tenancy. Xiang Li, Weijie Liu, Fabing Li, Hongliang Tian, Zheli Liu, Shoumeng Yan, Mingyu Gao |
| 2026 | REPS: Recycled Entropy Packet Spraying for Adaptive Load Balancing and Failure Mitigation. Tommaso Bonato, Abdul Kabbani, Ahmad Ghalayini, Michael Papamichael, Mohammad Dohadwala, Lukas Gianinazzi, Mikhail Khalilov, Elias Achermann, Daniele De Sensi, Torsten Hoefler |
| 2026 | RLive: Robust Delivery System for Scaling Live Streaming Services. Yu Tian, Gerui Lv, Qinghua Wu, Ruili Fang, Yajie Peng, Zhichen Xue, Rui Han, Chuanqing Lin, Xiaofei Pang, Ri Lu, Zhenyu Li |
| 2026 | RaidenSwap: A Multi-Swap Remote System for Multi-core Applications. Kefan Liu, Ke Liu, Xu Zhang, Hui Yuan, Xiaolong Zheng, Ning Liu, Sa Wang, Guanghui Zhang, Yungang Bao, Mingyu Chen, Chenxi Wang |
| 2026 | Rearchitecting Programmable Networks For In-Network Computing: From Hardware To Language. Haifeng Sun, Bing Liu, Taixu Tian, Jinbo Sun, Jintao He, Qun Huang, Luyou He, Xuan Wang, Feng Gao, Liguo Wang, Xiangcan Xu, Junyi Guo, Xiaoping Zhu, Yongqiang Yang |
| 2026 | Reducing the GPU Memory Bottleneck with Lossless Compression for ML. Aditya K. Kamath, Arvind Krishnamurthy, Marco Canini, Simon Peter |
| 2026 | RoPeerTo: A Datacenter-Scale Architecture for Peer-To-Peer DMA between GPUs and FPGAs. Marco Venere, Giuseppe Sorrentino, Benjamin Ramhorst, Maximilian Jakob Heer, Lucian Petrica, Dario Korolija, Marco D. Santambrogio, Davide Conficconi, Gustavo Alonso, Kenneth O'Brien |
| 2026 | Rose: Reproducing External-Fault-Induced Failures in Distributed Systems with Lightweight Instrumentation. Sebastião Amaro, Pedro Fonseca, Miguel Matos |
| 2026 | SAS: Sparse Attention Synthesizer for Efficient Language Model Inference. Yuan Zhou, Shaojie Xiang, Lingfan Yu, Zhenyu Song, Charith Mendis, Yida Wang |
| 2026 | SKernel: An Elastic and Efficient Secure Container System at Scale with a Split-Kernel Architecture. Xiaohu Chai, Keyang Hu, Jianfeng Tan, Tiwei Bie, Guotao Tan, Tianyu Zhou, Anqi Shen, Dawei Shen, Xinyao Yang, Xin Chen, Xu Wang, Feng Yu, Zhengyu He, Dong Du, Yubin Xia, Kang Chen, Yu Chen |
| 2026 | STAlloc: Enhancing Memory Efficiency in Large-Scale Model Training with Spatio-Temporal Planning. Zixiao Huang, Junhao Hu, Hao Lin, Chunyang Zhu, Yueran Tang, Quanlu Zhang, Zhen Guo, Zhenhua Li, Shengen Yan, Zhenhua Zhu, Guohao Dai, Yu Wang |
| 2026 | Scalable RDMA-accelerated Distributed Locks with Shared Stream Abstraction. Miao Cai, Junru Shen, Xiaojian Liao, Rong Gu, Yanchao Zhao, Hao Han, Bing Chen, Baoliu Ye |
| 2026 | Scaling LLM Test-Time Compute with Mobile NPU on Smartphones. Zixu Hao, Jianyu Wei, Tuowei Wang, Minxing Huang, Huiqiang Jiang, Shiqi Jiang, Ting Cao, Ju Ren |
| 2026 | Scheduling Cloud Block Storage Proactively and Reactively with Omar. Xinqi Chen, Weidong Zhang, Zhongyu Wang, Erci Xu, Xiaolu Zhang, Dong Wu, Junping Wu, Haonan Wu, Ruiming Lu, Yaheng Song, Chaolei Hu, Lijun Ding, Guangtao Xue, Patrick P. C. Lee |
| 2026 | Serverless Replication of Object Storage across Multi-Vendor Clouds and Regions. Junyi Shu, Xiaolong Huang, Gang Huang, Hong Mei, Xuanzhe Liu, Xin Jin |
| 2026 | SkyWalker: A Locality-Aware Cross-Region Load Balancer for LLM Inference. Tian Xia, Ziming Mao, Jamison Kerney, Ethan J. Jackson, Zhifei Li, Jiarong Xing, Scott Shenker, Ion Stoica |
| 2026 | SmartNS: Enabling Line-rate and Flexible Network Stack with SmartNIC. Xuzheng Chen, Jie Zhang, Baolin Zhu, Xueying Zhu, Zhongqing Chen, Ting Fu, Shu Ma, Lingjun Zhu, Chao Shi, Yin Zhang, Yuanchao Shu, Peng Cheng, Zeke Wang |
| 2026 | Squeezy: Rapid VM Memory Reclamation for Serverless Functions. Orestis Lagkas Nikolos, Chloe Alverti, Stratos Psomadakis, Georgios I. Goumas, Nectarios Koziris |
| 2026 | Suika: Efficient and High-quality Re-scheduling of 3D-parallelized LLM Training Jobs in Shared Clusters. Yuxuan Wang, Yanbo Wang, Chen Chen, Chunyu Xue, Qizhen Weng, Yin Chen, Zeren Li, Xuqi Zhu, Yongqiang Yang, Quan Chen, Minyi Guo |
| 2026 | SwiftFL: Enabling Speculative Training for On-Device Federated Deep Learning. Yuhui Zhang, Guang Yan, Xin Zhang, Zimu Guo, Lutan Zhao, Jiangfeng Cao, Dan Meng, Rui Hou |
| 2026 | SwitchFS: Asynchronous Metadata Updates for Distributed Filesystems with In-Network Coordination. Jingwei Xu, Mingkai Dong, Qiulin Tian, Ziyi Tian, Tong Xin, Haibo Chen |
| 2026 | TAO: Tolerance-Aware Optimistic Verification for Floating-Point Neural Networks. Jianzhu Yao, Hongxu Su, Taobo Liao, Zerui Cheng, Huan Zhang, Xuechao Wang, Pramod Viswanath |
| 2026 | TCO-driven Storage Provisioning for Exascale Data Centers. Timothy Kim, Saurabh Kadekodi, Arif Merchant, Prashant Nema, Jai Menon, K. V. Rashmi, Gregory R. Ganger |
| 2026 | TZ-LLM: Protecting On-Device Large Language Models with Arm TrustZone. Xunjie Wang, Jiacheng Shi, Zihan Zhao, Yang Yu, Zhichao Hua, Jinyu Gu |
| 2026 | TailorLLM: Collaborative End-Cloud Inference of Large and Small Language Models Based on Low-Rank Adaptation. Zian Wang, Ziyi Wang, Haonan Jin, Jie Xing, Lanshan Zhang |
| 2026 | Taming Latency-Memory Trade-Off in MoE-Based LLM Serving via Fine-Grained Expert Offloading. Hanfei Yu, Xingqi Cui, Hong Zhang, Hao Wang, Hao Wang |
| 2026 | TierScape: Harnessing Multiple Compressed Tiers to Tame Server Memory TCO. Sandeep Kumar, Aravinda Prasad, Sreenivas Subramoney |
| 2026 | TokenFlow: Responsive LLM Text Streaming Serving under Request Burst via Preemptive Scheduling. Junyi Chen, Chuheng Du, Renyuan Liu, Shuochao Yao, Dingtian Yan, Jiang Liao, Shengzhong Liu, Fan Wu, Guihai Chen |
| 2026 | Towards Improving Throughput and Scalability of DAG-based BFT SMR. Nibesh Shrestha, Aniket Kate |
| 2026 | TrustWeave: Integrity Measurement and Attestation For Multi-Cloud LLMs. Jianchang Su, Wenhui Zhang, Yifan Zhang, Kexin Chu, Hao Guo, Youyou Lu, Wei Zhang |
| 2026 | Turnstile: Hybrid Information Flow Control Framework for Managing Privacy in Internet-of-Things Applications. Kumseok Jung, Mohanna Shahrad, Gargi Mitra, Karthik Pattabiraman |
| 2026 | Untangling GPU Power Consumption: Job-Level Inference in Cloud Shared Settings. Pierre Jacquet, Maxime Agusti, Eddy Caron, Camille Coti, Marcos Dias de Assunção, Laurent Lefèvre, Anne-Cécile Orgerie |
| 2026 | Wayfinder: Automated Operating System Specialization. Alexander Jung, Cezar Craciunoiu, Nikolaos Karaolidis, Hugo Lefeuvre, Daniel Oñoro-Rubio, Felipe Huici, Charalampos Rotsos, Pierre Olivier |
| 2026 | Yield Not Thy Core. Achilles Benetopoulos, Peter Alvaro, Andi Quinn, Robert Soulé |
| 2026 | Zeppelin: Balancing Variable-length Workloads in Data Parallel Large Model Training. Chang Chen, Tiancheng Chen, Jiangfei Duan, Qianchao Zhu, Zerui Wang, Qinghao Hu, Peng Sun, Xiuhong Li, Chao Yang, Torsten Hoefler |
| 2026 | iRoute: Local Routing Table-based Workflow Management in Serverless Computing. Yiming Li, Laiping Zhao, Zhiyuan Su, Guowei Liu, Wenhao Huang, Kang Chen, Zhaolin Duan, Jingjie Zong, Wenxin Li, Deze Zeng, Dong Zhang, Wenyu Qu |
| 2026 | swKokkos: An Athread Backend for Enhanced Kokkos with the Sunway Heterogeneous Architecture. Junlin Wei, Jinrong Jiang, Wu Wang, Chen Li, Yehong Zhang, Yue Yu, Lian Zhao, Xiang Han, Zhenjia Li, Feng Zhang, Haoyuan Zhang, Yidi Bai, Maoxue Yu, Kai Xu, Hailong Liu, Xuebin Chi |
| 2026 | viNPU: Optimizing Vision Transformer Inference on Mobile NPUs. Jeho Lee, Gunjoong Kim, Chanyoung Jung, Jaehee Kim, Seonghoon Park, Hojung Cha |