| 2021 | A Computational Stack for Cross-Domain Acceleration. Sean Kinzer, Joon Kyung Kim, Soroush Ghodrati, Brahmendra Reddy Yatham, Alric Althoff, Divya Mahajan, Sorin Lerner, Hadi Esmaeilzadeh |
| 2021 | A Write-Friendly and Fast-Recovery Scheme for Security Metadata in Non-Volatile Memories. Jianming Huang, Yu Hua |
| 2021 | Adapt-NoC: A Flexible Network-on-Chip Design for Heterogeneous Manycore Architectures. Hao Zheng, Ke Wang, Ahmed Louri |
| 2021 | An Analog Preconditioner for Solving Linear Systems. Ben Feinberg, Ryan Wong, T. Patrick Xiao, Christopher H. Bennett, Jacob N. Rohan, Erik G. Boman, Matthew J. Marinella, Sapan Agarwal, Engin Ipek |
| 2021 | Analyzing and Leveraging Decoupled L1 Caches in GPUs. Mohamed Assem Ibrahim, Onur Kayiran, Yasuko Eckert, Gabriel H. Loh, Adwait Jog |
| 2021 | Ascend: a Scalable and Unified Architecture for Ubiquitous Deep Neural Network Computing: Industry Track Paper. Heng Liao, Jiajin Tu, Jing Xia, Hu Liu, Xiping Zhou, Honghui Yuan, Yuxing Hu |
| 2021 | Automatic Microprocessor Performance Bug Detection. Erick Carvajal Barboza, Sara Jacob, Mahesh Ketkar, Michael Kishinevsky, Paul Gratz, Jiang Hu |
| 2021 | BBB: Simplifying Persistent Programming using Battery-Backed Buffers. Mohammad A. Alshboul, Prakash Ramrakhyani, William Wang, James Tuck, Yan Solihin |
| 2021 | BRIM: Bistable Resistively-Coupled Ising Machine. Richard Afoakwa, Yiqiao Zhang, Uday Kumar Reddy Vengalam, Zeljko Ignjatovic, Michael C. Huang |
| 2021 | BlockHammer: Preventing RowHammer at Low Cost by Blacklisting Rapidly-Accessed DRAM Rows. Abdullah Giray Yaglikçi, Minesh Patel, Jeremie S. Kim, Roknoddin Azizi, Ataberk Olgun, Lois Orosa, Hasan Hassan, Jisung Park, Konstantinos Kanellopoulos, Taha Shahroodi, Saugata Ghose, Onur Mutlu |
| 2021 | BoomGate: Deadlock Avoidance in Non-Minimal Routing for High-Radix Networks. Gyuyoung Kwauk, Seungkwan Kang, Hans Kasan, Hyojun Son, John Kim |
| 2021 | CAPE: A Content-Addressable Processing Engine. Helena Caminal, Kailin Yang, Srivatsa Srinivasa, Akshay Krishna Ramanathan, Khalid Al-Hawaj, Tianshu Wu, Vijaykrishnan Narayanan, Christopher Batten, José F. Martínez |
| 2021 | CARE: Coordinated Augmentation for Elastic Resilience on DRAM Errors in Data Centers. Jian Chen, Xiaowei Jiang, Ying Zhang, Liyin Liu, Huifeng Xu, Qiang Liu |
| 2021 | CHOPIN: Scalable Graphics Rendering in Multi-GPU Systems via Parallel Image Composition. Xiaowei Ren, Mieszko Lis |
| 2021 | CSCNN: Algorithm-hardware Co-design for CNN Accelerators using Centrosymmetric Filters. Jiajun Li, Ahmed Louri, Avinash Karanth, Razvan C. Bunescu |
| 2021 | Chasing Carbon: The Elusive Environmental Footprint of Computing. Udit Gupta, Young Geun Kim, Sylvia Lee, Jordan Tse, Hsien-Hsin S. Lee, Gu-Yeon Wei, David Brooks, Carole-Jean Wu |
| 2021 | Cheetah: Optimizing and Accelerating Homomorphic Encryption for Private Inference. Brandon Reagen, Wooseok Choi, Yeongil Ko, Vincent T. Lee, Hsien-Hsin S. Lee, Gu-Yeon Wei, David Brooks |
| 2021 | Common Counters: Compressed Encryption Counters for Secure GPU Memory. Seonjin Na, Sunho Lee, Yeonjae Kim, Jongse Park, Jaehyuk Huh |
| 2021 | DeACT: Architecture-Aware Virtual Memory Support for Fabric Attached Memory Systems. Vamsee Reddy Kommareddy, Clayton Hughes, Simon David Hammond, Amro Awad |
| 2021 | Dead Page and Dead Block Predictors: Cleaning TLBs and Caches Together. Chandrashis Mazumdar, Prachatos Mitra, Arkaprava Basu |
| 2021 | Deadline-Aware Offloading for High-Throughput Accelerators. Tsung Tai Yeh, Matthew D. Sinclair, Bradford M. Beckmann, Timothy G. Rogers |
| 2021 | DepGraph: A Dependency-Driven Accelerator for Efficient Iterative Graph Processing. Yu Zhang, Xiaofei Liao, Hai Jin, Ligang He, Bingsheng He, Haikun Liu, Lin Gu |
| 2021 | Designing a Cost-Effective Cache Replacement Policy using Machine Learning. Subhash Sethumurugan, Jieming Yin, John Sartori |
| 2021 | EXMA: A Genomics Accelerator for Exact-Matching. Lei Jiang, Farzaneh Zokaee |
| 2021 | Eudoxus: Characterizing and Accelerating Localization in Autonomous Machines: Industry Track Paper. Yiming Gan, Bo Yu, Boyuan Tian, Leimeng Xu, Wei Hu, Shaoshan Liu, Qiang Liu, Yanjun Zhang, Jie Tang, Yuhao Zhu |
| 2021 | FAFNIR: Accelerating Sparse Gathering by Using Efficient Near-Memory Intelligent Reduction. Bahar Asgari, Ramyad Hadidi, Jiashen Cao, Da Eun Shim, Sung Kyu Lim, Hyesoon Kim |
| 2021 | Faster Schrödinger-Style Simulation of Quantum Circuits. Aneeqa Fatima, Igor L. Markov |
| 2021 | FuseKNA: Fused Kernel Convolution based Accelerator for Deep Neural Networks. Jianxun Yang, Zhao Zhang, Zhuangzhi Liu, Jing Zhou, Leibo Liu, Shaojun Wei, Shouyi Yin |
| 2021 | GCNAX: A Flexible and Energy-efficient Accelerator for Graph Convolutional Neural Networks. Jiajun Li, Ahmed Louri, Avinash Karanth, Razvan C. Bunescu |
| 2021 | GSSA: A Resource Allocation Scheme Customized for 3D NAND SSDs. Chun-Yi Liu, Yunju Lee, Wonil Choi, Myoungsoo Jung, Mahmut Taylan Kandemir, Chita R. Das |
| 2021 | GradPIM: A Practical Processing-in-DRAM Architecture for Gradient Descent. Heesu Kim, Hanmin Park, Taehyun Kim, Kwanheum Cho, Eojin Lee, Soojung Ryu, Hyuk-Jae Lee, Kiyoung Choi, Jinho Lee |
| 2021 | Hardware-Based Address-Centric Acceleration of Key-Value Store. Chencheng Ye, Yuanchao Xu, Xipeng Shen, Xiaofei Liao, Hai Jin, Yan Solihin |
| 2021 | Heat Behind the Meter: A Hidden Threat of Thermal Attacks in Edge Colocation Data Centers. Zhihui Shao, Mohammad A. Islam, Shaolei Ren |
| 2021 | Heterogeneous Dataflow Accelerators for Multi-DNN Workloads. Hyoukjun Kwon, Liangzhen Lai, Michael Pellauer, Tushar Krishna, Yu-Hsin Chen, Vikas Chandra |
| 2021 | IEEE International Symposium on High-Performance Computer Architecture, HPCA 2021, Seoul, South Korea, February 27 - March 3, 2021 |
| 2021 | Improving GPU Multi-tenancy with Page Walk Stealing. B Pratheek, Neha Jawalkar, Arkaprava Basu |
| 2021 | LIBRA: Clearing the Cloud Through Dynamic Memory Bandwidth Management. Ying Zhang, Jian Chen, Xiaowei Jiang, Qiang Liu, Ian M. Steiner, Andrew J. Herdrich, Kevin Shu, Ripan Das, Long Cui, Litrin Jiang |
| 2021 | Layerweaver: Maximizing Resource Utilization of Neural Processing Units via Layer-Wise Scheduling. Young H. Oh, Seonghak Kim, Yunho Jin, Sam Son, Jonghyun Bae, Jongsung Lee, Yeonhong Park, Dong Uk Kim, Tae Jun Ham, Jae W. Lee |
| 2021 | Lazy Batching: An SLA-aware Batching System for Cloud Machine Learning Inference. Yujeong Choi, Yunseong Kim, Minsoo Rhu |
| 2021 | Memristive Data Ranking. Ananth Krishna Prasad, Morteza Rezaalipour, Masoud Dehyadegari, Mahdi Nazm Bojnordi |
| 2021 | Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework. Sung-En Chang, Yanyu Li, Mengshu Sun, Runbin Shi, Hayden K. H. So, Xuehai Qian, Yanzhi Wang, Xue Lin |
| 2021 | Need for Speed: Experiences Building a Trustworthy System-Level GPU Simulator. Oreste Villa, Daniel Lustig, Zi Yan, Evgeny Bolotin, Yaosheng Fu, Niladrish Chatterjee, Nan Jiang, David W. Nellans |
| 2021 | NeuroMeter: An Integrated Power, Area, and Timing Modeling Framework for Machine Learning Accelerators: Industry Track Paper. Tianqi Tang, Sheng Li, Lifeng Nai, Norman P. Jouppi, Yuan Xie |
| 2021 | New Models for Understanding and Reasoning about Speculative Execution Attacks. Zecheng He, Guangyuan Hu, Ruby B. Lee |
| 2021 | Operating Liquid-Cooled Large-Scale Systems: Long-Term Monitoring, Reliability Analysis, and Efficiency Measures. Rohan Basu Roy, Tirthak Patel, Raj Kettimuthu, William E. Allcock, Paul Rich, Adam Scovel, Devesh Tiwari |
| 2021 | P-OPT: Practical Optimal Cache Replacement for Graph Analytics. Vignesh Balaji, Neal Clayton Crago, Aamer Jaleel, Brandon Lucia |
| 2021 | ParaDox: Eliminating Voltage Margins via Heterogeneous Fault Tolerance. Sam Ainsworth, Lionel Zoubritzky, Alan Mycroft, Timothy M. Jones |
| 2021 | Pitstop: Enabling a Virtual Network Free Network-on-Chip. Hossein Farrokhbakht, Henry Kao, Kamran Hasan, Paul V. Gratz, Tushar Krishna, Joshua San Miguel, Natalie D. Enright Jerger |
| 2021 | Prodigy: Improving the Memory Latency of Data-Indirect Irregular Workloads Using Hardware-Software Co-Design. Nishil Talati, Kyle May, Armand Behroozi, Yichen Yang, Kuba Kaszyk, Christos Vasiladiotis, Tarunesh Verma, Lu Li, Brandon Nguyen, Jiawen Sun, John Magnus Morton, Agreen Ahmadi, Todd M. Austin, Michael F. P. O'Boyle, Scott A. Mahlke, Trevor N. Mudge, Ronald G. Dreslinski |
| 2021 | QEI: Query Acceleration Can be Generic and Efficient in the Cloud. Yifan Yuan, Yipeng Wang, Ren Wang, Rangeen Basu Roy Chowdhury, Charlie Tai, Nam Sung Kim |
| 2021 | QuCloud: A New Qubit Mapping Mechanism for Multi-programming Quantum Computing in Cloud Environment. Lei Liu, Xinglei Dou |
| 2021 | Revisiting HyperDimensional Learning for FPGA and Low-Power Architectures. Mohsen Imani, Zhuowen Zou, Samuel Bosch, Sanjay Anantha Rao, Sahand Salamat, Venkatesh Kumar, Yeseong Kim, Tajana Rosing |
| 2021 | SPAGHETTI: Streaming Accelerators for Highly Sparse GEMM on FPGAs. Reza Hojabr, Ali Sedaghati, Amirali Sharifian, Ahmad Khonsari, Arrvindh Shriraman |
| 2021 | Sentinel: Efficient Tensor Migration and Allocation on Heterogeneous Memory Systems for Deep Learning. Jie Ren, Jiaolin Luo, Kai Wu, Minjia Zhang, Hyeran Jeon, Dong Li |
| 2021 | SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning. Hanrui Wang, Zhekai Zhang, Song Han |
| 2021 | SpaceA: Sparse Matrix Vector Multiplication on Processing-in-Memory Accelerator. Xinfeng Xie, Zheng Liang, Peng Gu, Abanti Basak, Lei Deng, Ling Liang, Xing Hu, Yuan Xie |
| 2021 | Stealth-Persist: Architectural Support for Persistent Applications in Hybrid Memory Systems. Mazen Al-Wadi, Vamsee Reddy Kommareddy, Clayton Hughes, Simon David Hammond, Amro Awad |
| 2021 | Stream Floating: Enabling Proactive and Decentralized Cache Optimizations. Zhengrong Wang, Jian Weng, Jason Lowe-Power, Jayesh Gaur, Tony Nowatzki |
| 2021 | Streamline Ring ORAM Accesses through Spatial and Temporal Optimization. Dingyuan Cao, Mingzhe Zhang, Hang Lu, Xiaochun Ye, Dongrui Fan, Yuezhi Che, Rujia Wang |
| 2021 | SynCron: Efficient Synchronization Support for Near-Data-Processing Architectures. Christina Giannoula, Nandita Vijaykumar, Nikela Papadopoulou, Vasileios Karakostas, Ivan Fernandez, Juan Gómez-Luna, Lois Orosa, Nectarios Koziris, Georgios I. Goumas, Onur Mutlu |
| 2021 | Systematic Approaches for Precise and Approximate Quantum State Runtime Assertion. Ji Liu, Huiyang Zhou |
| 2021 | TILT: Achieving Higher Fidelity on a Trapped-Ion Linear-Tape Quantum Computing Architecture. Xin-Chuan Wu, Dripto M. Debroy, Yongshan Ding, Jonathan M. Baker, Yuri Alexeev, Kenneth R. Brown, Frederic T. Chong |
| 2021 | TSOPER: Efficient Coherence-Based Strict Persistency. Per Ekemark, Yuan Yao, Alberto Ros, Konstantinos Sagonas, Stefanos Kaxiras |
| 2021 | Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training. Youngeun Kwon, Yunjae Lee, Minsoo Rhu |
| 2021 | Trident: A Hybrid Correlation-Collision GPU Cache Timing Attack for AES Key Recovery. Jaeguk Ahn, Cheolgyu Jin, Jiho Kim, Minsoo Rhu, Yunsi Fei, David R. Kaeli, John Kim |
| 2021 | Ultra-Elastic CGRAs for Irregular Loop Specialization. Christopher Torng, Peitian Pan, Yanghui Ou, Cheng Tan, Christopher Batten |
| 2021 | Understanding Training Efficiency of Deep Learning Recommendation Models at Scale. Bilge Acun, Matthew Murphy, Xiaodong Wang, Jade Nie, Carole-Jean Wu, Kim M. Hazelwood |
| 2021 | VIA: A Smart Scratchpad for Vector Units with Application to Sparse Matrix Computations. Julian Pavon, Iván Vargas Valdivieso, Adrián Barredo, Joan Marimon, Miquel Moretó, Francesc Moll, Osman S. Unsal, Mateo Valero, Adrián Cristal |
| 2021 | WiDir: A Wireless-Enabled Directory Cache Coherence Protocol. Antonio Franques, Apostolos Kokolis, Sergi Abadal, Vimuth Fernando, Sasa Misailovic, Josep Torrellas |
| 2021 | Zero Directory Eviction Victim: Unbounded Coherence Directory and Core Cache Isolation. Mainak Chaudhuri |