| 2021 | 48th ACM/IEEE Annual International Symposium on Computer Architecture, ISCA 2021, Virtual Event / Valencia, Spain, June 14-18, 2021 |
| 2021 | A Cost-Effective Entangling Prefetcher for Instructions. Alberto Ros, Alexandra Jimborean |
| 2021 | A RISC-V in-network accelerator for flexible high-performance low-power packet processing. Salvatore Di Girolamo, Andreas Kurth, Alexandru Calotoiu, Thomas Benz, Timo Schneider, Jakub Beránek, Luca Benini, Torsten Hoefler |
| 2021 | ABC-DIMM: Alleviating the Bottleneck of Communication in DIMM-based Near-Memory Processing with Inter-DIMM Broadcast. Weiyi Sun, Zhaoshi Li, Shouyi Yin, Shaojun Wei, Leibo Liu |
| 2021 | Accelerated Seeding for Genome Sequence Alignment with Enumerated Radix Trees. Arun Subramaniyan, Jack Wadden, Kush Goliya, Nathan Ozog, Xiao Wu, Satish Narayanasamy, David T. Blaauw, Reetuparna Das |
| 2021 | Albireo: Energy-Efficient Acceleration of Convolutional Neural Networks via Silicon Photonics. Kyle Shiflett, Avinash Karanth, Razvan C. Bunescu, Ahmed Louri |
| 2021 | Aurochs: An Architecture for Dataflow Threads. Matthew Vilim, Alexander Rucker, Kunle Olukotun |
| 2021 | BOSS: Bandwidth-Optimized Search Accelerator for Storage-Class Memory. Jun Heo, Seung Yul Lee, Sunhong Min, Yeonhong Park, Sungjun Jung, Tae Jun Ham, Jae W. Lee |
| 2021 | BlockMaestro: Enabling Programmer-Transparent Task-based Execution in GPU Systems. AmirAli Abdolrashidi, Hodjat Asghari Esfeden, Ali Jahanshahi, Kaustubh Singh, Nael B. Abu-Ghazaleh, Daniel Wong |
| 2021 | CODIC: A Low-Cost Substrate for Enabling Custom In-DRAM Functionalities and Optimizations. Lois Orosa, Yaohua Wang, Mohammad Sadrosadati, Jeremie S. Kim, Minesh Patel, Ivan Puddu, Haocong Luo, Kaveh Razavi, Juan Gómez-Luna, Hasan Hassan, Nika Mansouri-Ghiasi, Saugata Ghose, Onur Mutlu |
| 2021 | Cambricon-Q: A Hybrid Architecture for Efficient Training. Yongwei Zhao, Chang Liu, Zidong Du, Qi Guo, Xing Hu, Yimin Zhuang, Zhenxing Zhang, Xinkai Song, Wei Li, Xishan Zhang, Ling Li, Zhiwei Xu, Tianshi Chen |
| 2021 | CoSA: Scheduling by Constrained Optimization for Spatial Accelerators. Qijing Huang, Aravind Kalaiah, Minwoo Kang, James Demmel, Grace Dinh, John Wawrzynek, Thomas Norell, Yakun Sophia Shao |
| 2021 | Communication Algorithm-Architecture Co-Design for Distributed Deep Learning. Jiayi Huang, Pritam Majumder, Sungkeun Kim, Abdullah Muzahid, Ki Hwan Yum, Eun Jung Kim |
| 2021 | Confidential Serverless Made Efficient with Plug-In Enclaves. Mingyu Li, Yubin Xia, Haibo Chen |
| 2021 | Cost-Efficient Overclocking in Immersion-Cooled Datacenters. Majid Jalili, Ioannis Manousakis, Iñigo Goiri, Pulkit A. Misra, Ashish Raniwala, Husam Alissa, Bharath Ramakrishnan, Phillip Tuma, Christian Belady, Marcus Fontoura, Ricardo Bianchini |
| 2021 | CryoGuard: A Near Refresh-Free Robust DRAM Design for Cryogenic Computing. Gyu-hyeon Lee, Seongmin Na, Ilkwon Byun, Dongmoon Min, Jangwoo Kim |
| 2021 | Demystifying the System Vulnerability Stack: Transient Fault Effects Across the Layers. George Papadimitriou, Dimitris Gizopoulos |
| 2021 | Designing Calibration and Expressivity-Efficient Instruction Sets for Quantum Computing. Lingling Lao, Prakash Murali, Margaret Martonosi, Dan E. Browne |
| 2021 | Don't Forget the I/O When Allocating Your LLC. Yifan Yuan, Mohammad Alian, Yipeng Wang, Ren Wang, Ilia Kurakin, Charlie Tai, Nam Sung Kim |
| 2021 | Dual-side Sparse Tensor Core. Yang Wang, Chen Zhang, Zhiqiang Xie, Cong Guo, Yunxin Liu, Jingwen Leng |
| 2021 | Dvé: Improving DRAM Reliability and Performance On-Demand via Coherent Replication. Adarsh Patil, Vijay Nagarajan, Rajeev Balasubramonian, Nicolai Oswald |
| 2021 | ELSA: Hardware-Software Co-design for Efficient, Lightweight Self-Attention Mechanism in Neural Networks. Tae Jun Ham, Yejin Lee, Seong Hoon Seo, Soosung Kim, Hyunji Choi, Sung Jun Jung, Jae W. Lee |
| 2021 | Efficient Multi-GPU Shared Memory via Automatic Optimization of Fine-Grained Transfers. Harini Muthukrishnan, David W. Nellans, Daniel Lustig, Jeffrey A. Fessler, Thomas F. Wenisch |
| 2021 | Enabling Compute-Communication Overlap in Distributed Deep Learning Training Platforms. Saeed Rashidi, Matthew Denton, Srinivas Sridharan, Sudarshan Srinivasan, Amoghavarsha Suresh, Jade Nie, Tushar Krishna |
| 2021 | Energy Efficiency Boost in the AI-Infused POWER10 Processor. Brian W. Thompto, Dung Q. Nguyen, José E. Moreira, Ramon Bertran, Hans M. Jacobson, Richard J. Eickemeyer, Rahul M. Rao, Michael Goulet, Marcy Byers, Christopher J. Gonzalez, Karthik Swaminathan, Nagu R. Dhanwada, Silvia M. Müller, Andreas Wagner, Satish Kumar Sadasivam, Robert K. Montoye, William J. Starke, Christian G. Zoellin, Michael S. Floyd, Jeffrey Stuecheli, Nandhini Chandramoorthy, John-David Wellman, Alper Buyuktosunoglu, Matthias Pflanz, Balaram Sinharoy, Pradip Bose |
| 2021 | Execution Dependence Extension (EDE): ISA Support for Eliminating Fences. Thomas Shull, Ilias Vougioukas, Nikos Nikoleris, Wendy Elsasser, Josep Torrellas |
| 2021 | Exploiting Long-Distance Interactions and Tolerating Atom Loss in Neutral Atom Quantum Architectures. Jonathan M. Baker, Andrew Litteken, Casey Duckering, Henry Hoffmann, Hannes Bernien, Frederic T. Chong |
| 2021 | Exploiting Page Table Locality for Agile TLB Prefetching. Georgios Vavouliotis, Lluc Alvarez, Vasileios Karakostas, Konstantinos Nikas, Nectarios Koziris, Daniel A. Jiménez, Marc Casas |
| 2021 | FORMS: Fine-grained Polarized ReRAM-based In-situ Computation for Mixed-signal DNN Accelerator. Geng Yuan, Payman Behnam, Zhengang Li, Ali Shafiee, Sheng Lin, Xiaolong Ma, Hang Liu, Xuehai Qian, Mahdi Nazm Bojnordi, Yanzhi Wang, Caiwen Ding |
| 2021 | Failure Sentinels: Ubiquitous Just-in-time Intermittent Computation via Low-cost Hardware Support for Voltage Monitoring. Harrison Williams, Michael Moukarzel, Matthew Hicks |
| 2021 | Flex: High-Availability Datacenters With Zero Reserved Power. Chaojie Zhang, Alok Gautam Kumbhare, Ioannis Manousakis, Deli Zhang, Pulkit A. Misra, Rod Assis, Kyle Woolcock, Nithish Mahalingam, Brijesh Warrier, David Gauthier, Lalu Kunnath, Steve Solomon, Osvaldo Morales, Marcus Fontoura, Ricardo Bianchini |
| 2021 | FlexMiner: A Pattern-Aware Accelerator for Graph Pattern Mining. Xuhao Chen, Tianhao Huang, Shuotao Xu, Thomas Bourgeat, Chanwoo Chung, Arvind |
| 2021 | Ghost Routing to Enable Oblivious Computation on Memory-centric Networks. Yeonju Ro, Seongwook Jin, Jaehyuk Huh, John Kim |
| 2021 | GoSPA: An Energy-efficient High-performance Globally Optimized SParse Convolutional Neural Network Accelerator. Chunhua Deng, Yang Sui, Siyu Liao, Xuehai Qian, Bo Yuan |
| 2021 | HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation. Qingcheng Xiao, Size Zheng, Bingzhe Wu, Pengcheng Xu, Xuehai Qian, Yun Liang |
| 2021 | Hardware Architecture and Software Stack for PIM Based on Commercial DRAM Technology : Industrial Product. Sukhan Lee, Shinhaeng Kang, Jaehoon Lee, Hyeonsu Kim, Eojin Lee, Seungwoo Seo, Hosang Yoon, Seungwon Lee, Kyounghwan Lim, Hyunsung Shin, JinHyun Kim, Seongil O, Anand Iyer, David Wang, Kyomin Sohn, Nam Sung Kim |
| 2021 | Hetero-ViTAL: A Virtualization Stack for Heterogeneous FPGA Clusters. Yue Zha, Jing Li |
| 2021 | I See Dead µops: Leaking Secrets via Intel/AMD Micro-Op Caches. Xida Ren, Logan Moody, Mohammadkazem Taram, Matthew Jordan, Dean M. Tullsen, Ashish Venkat |
| 2021 | IChannels: Exploiting Current Management Mechanisms to Create Covert Channels in Modern Processors. Jawad Haj-Yahya, Lois Orosa, Jeremie S. Kim, Juan Gómez-Luna, Abdullah Giray Yaglikçi, Mohammed Alser, Ivan Puddu, Onur Mutlu |
| 2021 | INTROSPECTRE: A Pre-Silicon Framework for Discovery and Analysis of Transient Execution Vulnerabilities. Moein Ghaniyoun, Kristin Barber, Yinqian Zhang, Radu Teodorescu |
| 2021 | Large-Scale Graph Processing on FPGAs with Caches for Thousands of Simultaneous Misses. Mikhail Asiatici, Paolo Ienne |
| 2021 | Leaky Buddies: Cross-Component Covert Channels on Integrated CPU-GPU Systems. Sankha Baran Dutta, Hoda Naghibijouybari, Nael B. Abu-Ghazaleh, Andres Marquez, Kevin J. Barker |
| 2021 | Maya: Using Formal Control to Obfuscate Power Side Channels. Raghavendra Pradyumna Pothukuchi, Sweta Yamini Pothukuchi, Petros G. Voulgaris, Alexander G. Schwing, Josep Torrellas |
| 2021 | NASA: Accelerating Neural Network Design with a NAS Processor. Xiaohan Ma, Chang Si, Ying Wang, Cheng Liu, Lei Zhang |
| 2021 | NASGuard: A Novel Accelerator Architecture for Robust Neural Architecture Search (NAS) Networks. Xingbin Wang, Boyan Zhao, Rui Hou, Amro Awad, Zhihong Tian, Dan Meng |
| 2021 | NN-Baton: DNN Workload Orchestration and Chiplet Granularity Exploration for Multichip Accelerators. Zhanhong Tan, Hongyu Cai, Runpei Dong, Kaisheng Ma |
| 2021 | NVOverlay: Enabling Efficient and Scalable High-Frequency Snapshotting to NVM. Ziqi Wang, Chul-Hwan Choo, Michael A. Kozuch, Todd C. Mowry, Gennady Pekhimenko, Vivek Seshadri, Dimitrios Skarlatos |
| 2021 | No-FAT: Architectural Support for Low Overhead Memory Safety Checks. Mohamed Tarek Ibn Ziad, Miguel A. Arroyo, Evgeny Manzhosov, Ryan Piersma, Simha Sethumadhavan |
| 2021 | Opening Pandora's Box: A Systematic Study of New Ways Microarchitecture Can Leak Private Data. Jose Rodrigo Sanchez Vicarte, Pradyumna Shome, Nandeeka Nayak, Caroline Trippel, Adam Morrison, David Kohlbrenner, Christopher W. Fletcher |
| 2021 | PF-DRAM: A Precharge-Free DRAM Structure. Nezam Rohbani, Sina Darabi, Hamid Sarbazi-Azad |
| 2021 | PMNet: In-Network Data Persistence. Korakit Seemakhupt, Sihang Liu, Yasas Senevirathne, Muhammad Shahbaz, Samira Manabi Khan |
| 2021 | Pioneering Chiplet Technology and Design for the AMD EPYC™ and Ryzen™ Processor Families : Industrial Product. Samuel Naffziger, Noah Beck, Thomas Burd, Kevin Lepak, Gabriel H. Loh, Mahesh Subramony, Sean White |
| 2021 | PipeZK: Accelerating Zero-Knowledge Proof with a Pipelined Architecture. Ye Zhang, Shuo Wang, Xian Zhang, Jiangbin Dong, Xingzhong Mao, Fan Long, Cong Wang, Dong Zhou, Mingyu Gao, Guangyu Sun |
| 2021 | PolyGraph: Exposing the Value of Flexibility for Graph Processing Accelerators. Vidushi Dadu, Sihao Liu, Tony Nowatzki |
| 2021 | QUAC-TRNG: High-Throughput True Random Number Generation Using Quadruple Row Activation in Commodity DRAM Chips. Ataberk Olgun, Minesh Patel, Abdullah Giray Yaglikçi, Haocong Luo, Jeremie S. Kim, Nisa Bostanci, Nandita Vijaykumar, Oguz Ergin, Onur Mutlu |
| 2021 | Quantifying Server Memory Frequency Margin and Using It to Improve Performance in HPC Systems. Da Zhang, Gagandeep Panwar, Jagadish B. Kotra, Nathan DeBardeleben, Sean Blanchard, Xun Jian |
| 2021 | REDUCT: Keep it Close, Keep it Cool! : Efficient Scaling of DNN Inference on Multi-core CPUs with Near-Cache Compute. Anant V. Nori, Rahul Bera, Shankar Balachandran, Joydeep Rakshit, Om Ji Omer, Avishaii Abuhatzera, Belliappa Kuttanna, Sreenivas Subramoney |
| 2021 | RaPiD: AI Accelerator for Ultra-low Precision Training and Inference. Swagath Venkataramani, Vijayalakshmi Srinivasan, Wei Wang, Sanchari Sen, Jintao Zhang, Ankur Agrawal, Monodeep Kar, Shubham Jain, Alberto Mannari, Hoang Tran, Yulong Li, Eri Ogawa, Kazuaki Ishizaki, Hiroshi Inoue, Marcel Schaal, Mauricio J. Serrano, Jungwook Choi, Xiao Sun, Naigang Wang, Chia-Yu Chen, Allison Allain, James Bonanno, Nianzheng Cao, Robert Casatuta, Matthew Cohen, Bruce M. Fleischer, Michael Guillorn, Howard Haynie, Jinwook Jung, Mingu Kang, Kyu-Hyoun Kim, Siyu Koswatta, Sae Kyu Lee, Martin Lutz, Silvia M. Mueller, Jinwook Oh, Ashish Ranjan, Zhibin Ren, Scot Rider, Kerstin Schelm, Michael Scheuermann, Joel Silberman, Jie Yang, Vidhi Zalani, Xin Zhang, Ching Zhou, Matthew M. Ziegler, Vinay Shah, Moriyoshi Ohara, Pong-Fei Lu, Brian W. Curran, Sunil Shukla, Leland Chang, Kailash Gopalakrishnan |
| 2021 | Rebooting Virtual Memory with Midgard. Siddharth Gupta, Atri Bhattacharyya, Yunho Oh, Abhishek Bhattacharjee, Babak Falsafi, Mathias Payer |
| 2021 | Revamping Storage Class Memory With Hardware Automated Memory-Over-Storage Solution. Jie Zhang, Miryeong Kwon, Donghyun Gouk, Sungjoon Koh, Nam Sung Kim, Mahmut Taylan Kandemir, Myoungsoo Jung |
| 2021 | RingCNN: Exploiting Algebraically-Sparse Ring Tensors for Energy-Efficient CNN-Based Computational Imaging. Chao-Tsung Huang |
| 2021 | Ripple: Profile-Guided Instruction Cache Replacement for Data Center Applications. Tanvir Ahmed Khan, Dexin Zhang, Akshitha Sriraman, Joseph Devietti, Gilles Pokam, Heiner Litz, Baris Kasikci |
| 2021 | SARA: Scaling a Reconfigurable Dataflow Accelerator. Yaqi Zhang, Nathan Zhang, Tian Zhao, Matt Vilim, Muhammad Shahbaz, Kunle Olukotun |
| 2021 | SATORI: Efficient and Fair Resource Partitioning by Sacrificing Short-Term Benefits for Long-Term Gains. Rohan Basu Roy, Tirthak Patel, Devesh Tiwari |
| 2021 | SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations. Hongju Kal, Seokmin Lee, Gun Ko, Won Woo Ro |
| 2021 | Sieve: Scalable In-situ DRAM-based Accelerator Designs for Massively Parallel k-mer Matching. Lingxi Wu, Rasool Sharifi, Marzieh Lenjani, Kevin Skadron, Ashish Venkat |
| 2021 | Snafu: An Ultra-Low-Power, Energy-Minimal CGRA-Generation Framework and Architecture. Graham Gobieski, Ahmet Oguz Atli, Kenneth Mai, Brandon Lucia, Nathan Beckmann |
| 2021 | Software-Hardware Co-Optimization for Computational Chemistry on Superconducting Quantum Processors. Gushu Li, Yunong Shi, Ali Javadi-Abhari |
| 2021 | SpZip: Architectural Support for Effective Data Compression In Irregular Applications. Yifan Yang, Joel S. Emer, Daniel Sánchez |
| 2021 | Sparsity-Aware and Re-configurable NPU Architecture for Samsung Flagship Mobile SoC. Jun-Woo Jang, Sehwan Lee, Dongyoung Kim, Hyunsun Park, Ali Shafiee Ardestani, Yeongjae Choi, Channoh Kim, Yoojin Kim, Hyeongseok Yu, Hamzah Abdel-Aziz, Jun-Seok Park, Heonsoo Lee, Dongwoo Lee, Myeong Woo Kim, Hanwoong Jung, Heewoo Nam, Dongguen Lim, Seungwon Lee, Joon-Ho Song, Suknam Kwon, Joseph Hassoun, Sukhwan Lim, Changkyu Choi |
| 2021 | Speculative Vectorisation with Selective Replay. Peng Sun, Giacomo Gabrielli, Timothy M. Jones |
| 2021 | Superconducting Computing with Alternating Logic Elements. Georgios Tzimpragos, Jennifer Volk, Alex Wynn, James E. Smith, Timothy Sherwood |
| 2021 | Supporting Legacy Libraries on Non-Volatile Memory: A User-Transparent Approach. Chencheng Ye, Yuanchao Xu, Xipeng Shen, Xiaofei Liao, Hai Jin, Yan Solihin |
| 2021 | TENET: A Framework for Modeling Tensor Dataflow Based on Relation-centric Notation. Liqiang Lu, Naiqing Guan, Yuyue Wang, Liancheng Jia, Zizhang Luo, Jieming Yin, Jason Cong, Yun Liang |
| 2021 | Taming the Zoo: The Unified GraphIt Compiler Framework for Novel Architectures. Ajay Brahmakshatriya, Emily Furst, Victor A. Ying, Claire Hsu, Changwan Hong, Max Ruttenberg, Yunming Zhang, Dai Cheol Jung, Dustin Richmond, Michael B. Taylor, Julian Shun, Mark Oskin, Daniel Sánchez, Saman P. Amarasinghe |
| 2021 | Ten Lessons From Three Generations Shaped Google's TPUv4i : Industrial Product. Norman P. Jouppi, Doe Hyun Yoon, Matthew Ashcraft, Mark Gottscho, Thomas B. Jablin, George Kurian, James Laudon, Sheng Li, Peter C. Ma, Xiaoyu Ma, Thomas Norrie, Nishant Patil, Sushma Prasad, Cliff Young, Zongwei Zhou, David A. Patterson |
| 2021 | TimeCache: Using Time to Eliminate Cache Side Channels when Sharing Software. Divya Ojha, Sandhya Dwarkadas |
| 2021 | Unlimited Vector Extension with Data Streaming Support. Joao Mario Domingos, Nuno Neves, Nuno Roma, Pedro Tomás |
| 2021 | Vector Runahead. Ajeya Naithani, Sam Ainsworth, Timothy M. Jones, Lieven Eeckhout |
| 2021 | ZeRØ: Zero-Overhead Resilient Operation Under Pointer Integrity Attacks. Mohamed Tarek Ibn Ziad, Miguel A. Arroyo, Evgeny Manzhosov, Simha Sethumadhavan |
| 2021 | Zero Inclusion Victim: Isolating Core Caches from Inclusive Last-level Cache Evictions. Mainak Chaudhuri |
| 2021 | η-LSTM: Co-Designing Highly-Efficient Large LSTM Training via Exploiting Memory-Saving and Architectural Design Opportunities. Xingyao Zhang, Haojun Xia, Donglin Zhuang, Hao Sun, Xin Fu, Michael B. Taylor, Shuaiwen Leon Song |