ASPLOS A*

81 papers

Year Title / Authors
2022 A full-stack search technique for domain optimized deep learning accelerators.
Dan Zhang, Safeen Huda, Ebrahim M. Songhori, Kartik Prabhu, Quoc V. Le, Anna Goldie, Azalia Mirhoseini
2022 A one-for-all and
Bangyan Wang, Lei Deng, Fei Sun, Guohao Dai, Liu Liu, Yu Wang, Yuan Xie
2022 A tree clock data structure for causal orderings in concurrent executions.
Umang Mathur, Andreas Pavlogiannis, Hünkar Can Tunç, Mahesh Viswanathan
2022 ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022 - 4 March 2022
Babak Falsafi, Michael Ferdman, Shan Lu, Thomas F. Wenisch
2022 AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures.
Zhen Zheng, Xuanda Yang, Pengzhan Zhao, Guoping Long, Kai Zhu, Feiwen Zhu, Wenyi Zhao, Xiaoyong Liu, Jun Yang, Jidong Zhai, Shuaiwen Leon Song, Wei Lin
2022 Adelie: continuous address space layout re-randomization for Linux drivers.
Ruslan Nikolaev, Hassan Nadeem, Cathlyn Stone, Binoy Ravindran
2022 Astraea: towards QoS-aware and resource-efficient multi-stage GPU services.
Wei Zhang, Quan Chen, Kaihua Fu, Ningxin Zheng, Zhiyi Huang, Jingwen Leng, Minyi Guo
2022 BiSon-e: a lightweight and high-performance accelerator for narrow integer linear algebra computing on the edge.
Enrico Reggiani, Cristóbal Ramírez Lazo, Roger Figueras Bagué, Adrián Cristal, Mauro Olivieri, Osman Sabri Unsal
2022 Breaking the computation and communication abstraction barrier in distributed machine learning workloads.
Abhinav Jangda, Jun Huang, Guodong Liu, Amir Hossein Nodehi Sabet, Saeed Maleki, Youshan Miao, Madanlal Musuvathi, Todd Mytkowicz, Olli Saarikivi
2022 CARAT CAKE: replacing paging via compiler/kernel cooperation.
Brian Suchy, Souradip Ghosh, Drew Kersnar, Siyuan Chai, Zhen Huang, Aaron Nelson, Michael Cuevas, Alex Bernat, Gaurav Chaudhary, Nikos Hardavellas, Simone Campanoni, Peter A. Dinda
2022 CRISP: critical slice prefetching.
Heiner Litz, Grant Ayers, Parthasarathy Ranganathan
2022 CirFix: automatically repairing defects in hardware design code.
Hammad Ahmad, Yu Huang, Westley Weimer
2022 Client-optimized algorithms and acceleration for encrypted compute offloading.
McKenzie van der Hagen, Brandon Lucia
2022 Clio: a hardware-software co-designed disaggregated memory system.
Zhiyuan Guo, Yizhou Shan, Xuhao Luo, Yutong Huang, Yiying Zhang
2022 CoolEdge: hotspot-relievable warm water cooling for energy-efficient edge datacenters.
Qiangyu Pei, Shutong Chen, Qixia Zhang, Xinhui Zhu, Fangming Liu, Ziyang Jia, Yishuo Wang, Yongjie Yuan
2022 Creating concise and efficient dynamic analyses with ALDA.
Xiang Cheng, David Devecsery
2022 CryoWire: wire-driven microarchitecture designs for cryogenic computing.
Dongmoon Min, Yujin Chung, Ilkwon Byun, Junpyo Kim, Jangwoo Kim
2022 DAGguise: mitigating memory timing side channels.
Peter W. Deutsch, Yuheng Yang, Thomas Bourgeat, Jules Drean, Joel S. Emer, Mengjia Yan
2022 DOTA: detect and omit weak attentions for scalable transformer acceleration.
Zheng Qu, Liu Liu, Fengbin Tu, Zhaodong Chen, Yufei Ding, Yuan Xie
2022 Debugging in the brave new world of reconfigurable hardware.
Jiacheng Ma, Gefei Zuo, Kevin Loughlin, Haoyang Zhang, Andrew Quinn, Baris Kasikci
2022 Domain specific run time optimization for software data planes.
Sebastiano Miano, Alireza Sanaee, Fulvio Risso, Gábor Rétvári, Gianni Antichi
2022 EXAMINER: automatically locating inconsistent instructions between real devices and CPU emulators for ARM.
Muhui Jiang, Tianyi Xu, Yajin Zhou, Yufeng Hu, Ming Zhong, Lei Wu, Xiapu Luo, Kui Ren
2022 Eavesdropping user credentials via GPU side channels on smartphones.
Boyuan Yang, Ruirong Chen, Kai Huang, Jun Yang, Wei Gao
2022 Efficient and scalable core multiplexing with M³v.
Nils Asmussen, Sebastian Haas, Carsten Weinhold, Till Miemietz, Michael Roitzsch
2022 Efficiently detecting concurrency bugs in persistent memory programs.
Zhangyu Chen, Yu Hua, Yongle Zhang, Luochangqi Ding
2022 Enzian: an open, general, CPU/FPGA platform for systems software research.
David A. Cock, Abishek Ramdas, Daniel Schwyn, Michael Giardino, Adam Turowski, Zhenhao He, Nora Hossle, Dario Korolija, Melissa Licciardello, Kristina Martsenko, Reto Achermann, Gustavo Alonso, Timothy Roscoe
2022 Every walk's a hit: making page walks single-access cache hits.
Chang Hyun Park, Ilias Vougioukas, Andreas Sandberg, David Black-Schaffer
2022 FINGERS: exploiting fine-grained parallelism in graph mining accelerators.
Qihang Chen, Boyu Tian, Mingyu Gao
2022 FaaSFlow: enable efficient workflow execution for function-as-a-service.
Zijun Li, Yushi Liu, Linsong Guo, Quan Chen, Jiagan Cheng, Wenli Zheng, Minyi Guo
2022 Finding missed optimizations through the lens of dead code elimination.
Theodoros Theodoridis, Manuel Rigger, Zhendong Su
2022 FlexDriver: a network driver for your accelerator.
Haggai Eran, Maxim Fudim, Gabi Malka, Gal Shalom, Noam Cohen, Amit Hermony, Dotan Levi, Liran Liss, Mark Silberstein
2022 FlexOS: towards flexible OS isolation.
Hugo Lefeuvre, Vlad-Andrei Badoiu, Alexander Jung, Stefan Lucian Teodorescu, Sebastian Rauch, Felipe Huici, Costin Raiciu, Pierre Olivier
2022 GPM: leveraging persistent memory from a GPU.
Shweta Pandey, Aditya K. Kamath, Arkaprava Basu
2022 GPUReplay: a 50-KB GPU stack for client ML.
Heejin Park, Felix Xiaozhu Lin
2022 GenStore: a high-performance in-storage processing system for genome sequence analysis.
Nika Mansouri-Ghiasi, Jisung Park, Harun Mustafa, Jeremie S. Kim, Ataberk Olgun, Arvid Gollwitzer, Damla Senol Cali, Can Firtina, Haiyu Mao, Nour Almadhoun Alserr, Rachata Ausavarungnirun, Nandita Vijaykumar, Mohammed Alser, Onur Mutlu
2022 HAMMER: boosting fidelity of noisy quantum circuits by exploiting Hamming behavior of erroneous outcomes.
Swamit S. Tannu, Poulami Das, Ramin Ayanzadeh, Moinuddin K. Qureshi
2022 HeteroGen: transpiling C to heterogeneous HLS code with automated test generation and program repair.
Qian Zhang, Jiyuan Wang, Guoqing Harry Xu, Miryung Kim
2022 INFless: a native serverless system for low-latency, high-throughput inference.
Yanan Yang, Laiping Zhao, Yiming Li, Huanyu Zhang, Jie Li, Mingyang Zhao, Xingzhen Chen, Keqiu Li
2022 IOCost: block IO control for containers in datacenters.
Tejun Heo, Dan Schatzberg, Andrew Newell, Song Liu, Saravanan Dhakshinamurthy, Iyswarya Narayanan, Josef Bacik, Chris Mason, Chunqiang Tang, Dimitrios Skarlatos
2022 IceBreaker: warming serverless functions better with heterogeneity.
Rohan Basu Roy, Tirthak Patel, Devesh Tiwari
2022 Invisible bits: hiding secret messages in SRAM's analog domain.
Jubayer Mahmod, Matthew Hicks
2022 JSONSki: streaming semi-structured data with bit-parallel fast-forwarding.
Lin Jiang, Zhijia Zhao
2022 LILLIPUT: a lightweight low-latency lookup-table decoder for near-term quantum error correction.
Poulami Das, Aditya Locharla, Cody Jones
2022 Memory-harvesting VMs in cloud platforms.
Alexander Fuerst, Stanko Novakovic, Iñigo Goiri, Gohar Irfan Chaudhry, Prateek Sharma, Kapil Arya, Kevin Broas, Eugene Bak, Mehmet Iyigun, Ricardo Bianchini
2022 MineSweeper: a "clean sweep" for drop-in use-after-free prevention.
Márton Erdos, Sam Ainsworth, Timothy M. Jones
2022 NASPipe: high performance and reproducible pipeline parallel supernet training via causal synchronous parallelism.
Shixiong Zhao, Fanxin Li, Xusheng Chen, Tianxiang Shen, Li Chen, Sen Wang, Nicholas Zhang, Cheng Li, Heming Cui
2022 NVAlloc: rethinking heap metadata management in persistent memory allocators.
Zheng Dang, Shuibing He, Peiyi Hong, Zhenxin Li, Xuechen Zhang, Xian-He Sun, Gang Chen
2022 One size does not fit all: security hardening of MIPS embedded systems via static binary debloating for shared libraries.
Haotian Zhang, Mengfei Ren, Yu Lei, Jiang Ming
2022 PLD: fast FPGA compilation to make reconfigurable acceleration compatible with modern incremental refinement software development.
Yuanlong Xiao, Eric Micallef, Andrew Butt, Matthew Hofmann, Marc Alston, Matthew Goldsmith, Andrew Merczynski-Hait, André DeHon
2022 Parallel virtualized memory translation with nested elastic cuckoo page tables.
Jovan Stojkovic, Dimitrios Skarlatos, Apostolos Kokolis, Tianyin Xu, Josep Torrellas
2022 Path-sensitive and alias-aware typestate analysis for detecting OS bugs.
Tuo Li, Jia-Ju Bai, Yulei Sui, Shi-Min Hu
2022 Paulihedral: a generalized block-wise compiler optimization framework for quantum simulation kernels.
Gushu Li, Anbang Wu, Yunong Shi, Ali Javadi-Abhari, Yufei Ding, Yuan Xie
2022 Pinned loads: taming speculative loads in secure processors.
Zirui Neil Zhao, Houxiang Ji, Adam Morrison, Darko Marinov, Josep Torrellas
2022 ProSE: the architecture and design of a protein discovery engine.
Eyes Robson, Ceyu Xu, Lisa Wu Wills
2022 Protecting adaptive sampling from information leakage on low-power sensors.
Tejas Kannan, Henry Hoffmann
2022 QUEST: systematically approximating quantum circuits for higher output fidelity.
Tirthak Patel, Ed Younis, Costin Iancu, Wibe de Jong, Devesh Tiwari
2022 REVAMP: a systematic framework for heterogeneous CGRA realization.
Thilini Kaushalya Bandara, Dhananjaya Wijerathne, Tulika Mitra, Li-Shiuan Peh
2022 RSSD: defend against ransomware with hardware-isolated network-storage codesign and post-attack analysis.
Benjamin Reidys, Peng Liu, Jian Huang
2022 Randomized row-swap: mitigating Row Hammer by breaking spatial correlation between aggressor and victim rows.
Gururaj Saileshwar, Bolin Wang, Moinuddin K. Qureshi, Prashant J. Nair
2022 RecShard: statistical feature-based memory optimization for industry-scale neural recommendation.
Geet Sethi, Bilge Acun, Niket Agarwal, Christos Kozyrakis, Caroline Trippel, Carole-Jean Wu
2022 Revizor: testing black-box CPUs against speculation contracts.
Oleksii Oleksenko, Christof Fetzer, Boris Köpf, Mark Silberstein
2022 SOL: safe on-node learning in cloud platforms.
Yawen Wang, Daniel Crankshaw, Neeraja J. Yadwadkar, Daniel S. Berger, Christos Kozyrakis, Ricardo Bianchini
2022 SRAM has no chill: exploiting power domain separation to steal on-chip secrets.
Jubayer Mahmod, Matthew Hicks
2022 Serverless computing on heterogeneous computers.
Dong Du, Qingyuan Liu, Xueqiang Jiang, Yubin Xia, Binyu Zang, Haibo Chen
2022 ShEF: shielded enclaves for cloud FPGAs.
Mark Zhao, Mingyu Gao, Christos Kozyrakis
2022 Software-defined address mapping: a case on 3D memory.
Jialiang Zhang, Michael M. Swift, Jing Jane Li
2022 SparseCore: stream ISA and processor specialization for sparse computation.
Gengyu Rao, Jingji Chen, Jason Yik, Xuehai Qian
2022 Suppressing ZZ crosstalk of quantum computers through pulse and scheduling co-optimization.
Lei Xie, Jidong Zhai, Zhenxing Zhang, Jonathan Allcock, Shengyu Zhang, Yicong Zheng
2022 TMO: transparent memory offloading in datacenters.
Johannes Weiner, Niket Agarwal, Dan Schatzberg, Leon Yang, Hao Wang, Blaise Sanouillet, Bikash Sharma, Tejun Heo, Mayank Jain, Chunqiang Tang, Dimitrios Skarlatos
2022 TaskStream: accelerating task-parallel workloads by recovering program structure.
Vidushi Dadu, Tony Nowatzki
2022 Taurus: a data plane architecture for per-packet ML.
Tushar Swamy, Alexander Rucker, Muhammad Shahbaz, Ishan Gaur, Kunle Olukotun
2022 Temporal and SFQ pulse-streams encoding for area-efficient superconducting accelerators.
Patricia Gonzalez-Guerrero, Meriam Gay Bautista, Darren Lyles, George Michelogiannakis
2022 The benefits of general-purpose on-NIC memory.
Boris Pismenny, Liran Liss, Adam Morrison, Dan Tsafrir
2022 Tree traversal synthesis using domain-specific symbolic compilation.
Yanju Chen, Junrui Liu, Yu Feng, Rastislav Bodík
2022 Understanding and exploiting optimal function inlining.
Theodoros Theodoridis, Tobias Grosser, Zhendong Su
2022 VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling.
Zihan Liu, Jingwen Leng, Zhihui Zhang, Quan Chen, Chao Li, Minyi Guo
2022 ValueExpert: exploring value patterns in GPU-accelerated applications.
Keren Zhou, Yueming Hao, John M. Mellor-Crummey, Xiaozhu Meng, Xu Liu
2022 Vector instruction selection for digital signal processors using program synthesis.
Maaz Bin Safeer Ahmad, Alexander J. Root, Andrew Adams, Shoaib Kamil, Alvin Cheung
2022 ViK: practical mitigation of temporal memory safety violations through object ID inspection.
Haehyun Cho, Jinbum Park, Adam Oest, Tiffany Bao, Ruoyu Wang, Yan Shoshitaishvili, Adam Doupé, Gail-Joon Ahn
2022 Who goes first? Detecting Go concurrency bugs via message reordering.
Ziheng Liu, Shihao Xia, Yu Liang, Linhai Song, Hong Hu
2022 Yashme: detecting persistency races.
Hamed Gorjiara, Guoqing Harry Xu, Brian Demsky