| 2021 | 2-in-1 Accelerator: Enabling Random Precision Switch for Winning Both Adversarial Robustness and Efficiency. Yonggan Fu, Yang Zhao, Qixuan Yu, Chaojian Li, Yingyan Lin |
| 2021 | : Near-Storage Accelerator for High-Performance Log Analytics. Seongyoung Kang, Jiyoung An, Jinpyo Kim, Sang-Woo Jun |
| 2021 | A Deeper Look into RowHammer's Sensitivities: Experimental Analysis of Real DRAM Chipsand Implications on Future Attacks and Defenses. Lois Orosa, Abdullah Giray Yaglikçi, Haocong Luo, Ataberk Olgun, Jisung Park, Hasan Hassan, Minesh Patel, Jeremie S. Kim, Onur Mutlu |
| 2021 | A Hardware Accelerator for Protocol Buffers. Sagar Karandikar, Chris Leary, Chris Kennelly, Jerry Zhao, Dinesh Parimi, Borivoje Nikolic, Krste Asanovic, Parthasarathy Ranganathan |
| 2021 | ADAPT: Mitigating Idling Errors in Qubits via Adaptive Dynamical Decoupling. Poulami Das, Swamit S. Tannu, Siddharth Dangwal, Moinuddin K. Qureshi |
| 2021 | APOLLO: An Automated Power Modeling Framework for Runtime Power Introspection in High-Volume Commercial Microprocessors. Zhiyao Xie, Xiaoqing Xu, Matt Walker, Joshua Knebel, Kumaraguru Palaniswamy, Nicolas Hebert, Jiang Hu, Huanrui Yang, Yiran Chen, Shidhartha Das |
| 2021 | AccelWattch: A Power Modeling Framework for Modern GPUs. Vijay Kandiah, Scott Peverelle, Mahmoud Khairy, Junrui Pan, Amogh Manjunath, Timothy G. Rogers, Tor M. Aamodt, Nikos Hardavellas |
| 2021 | Archytas: A Framework for Synthesizing and Dynamically Optimizing Accelerators for Robotic Localization. Weizhuang Liu, Bo Yu, Yiming Gan, Qiang Liu, Jie Tang, Shaoshan Liu, Yuhao Zhu |
| 2021 | AutoBraid: A Framework for Enabling Efficient Surface Code Communication in Quantum Computing. Fei Hua, Yan-Hao Chen, Yuwei Jin, Chi Zhang, Ari B. Hayes, Youtao Zhang, Eddy Z. Zhang |
| 2021 | AutoFL: Enabling Heterogeneity-Aware Energy Efficient Federated Learning. Young Geun Kim, Carole-Jean Wu |
| 2021 | Bonsai Merkle Forests: Efficiently Achieving Crash Consistency in Secure Persistent Memory. Alexander Freij, Huiyang Zhou, Yan Solihin |
| 2021 | Branch Runahead: An Alternative to Branch Prediction for Impossible to Predict Branches. Stephen Pruett, Yale N. Patt |
| 2021 | BurstLink: Techniques for Energy-Efficient Video Display for Conventional and Virtual Reality Systems. Jawad Haj-Yahya, Jisung Park, Rahul Bera, Juan Gómez-Luna, Efraim Rotem, Taha Shahroodi, Jeremie S. Kim, Onur Mutlu |
| 2021 | COSPlay: Leveraging Task-Level Parallelism for High-Throughput Synchronous Persistence. Marina Vemmou, Alexandros Daglis |
| 2021 | Capstan: A Vector RDA for Sparsity. Alexander Rucker, Matthew Vilim, Tian Zhao, Yaqi Zhang, Raghu Prabhakar, Kunle Olukotun |
| 2021 | Cerebros: Evading the RPC Tax in Datacenters. Arash Pourhabibi Zarandi, Mark Sutherland, Alexandros Daglis, Babak Falsafi |
| 2021 | Characterizing and Mitigating Soft Errors in GPU DRAM. Michael B. Sullivan, Nirmal R. Saxena, Mike O'Connor, Donghyuk Lee, Paul Racunas, Saurabh Hukerikar, Timothy Tsai, Siva Kumar Sastry Hari, Stephen W. Keckler |
| 2021 | Cohmeleon: Learning-Based Orchestration of Accelerator Coherence in Heterogeneous SoCs. Joseph Zuckerman, Davide Giri, Jihye Kwon, Paolo Mantovani, Luca P. Carloni |
| 2021 | Criticality Driven Fetch. Aniket Deshmukh, Yale N. Patt |
| 2021 | Cryptographic Capability Computing. Michael LeMay, Joydeep Rakshit, Sergej Deutsch, David M. Durham, Santosh Ghosh, Anant Nori, Jayesh Gaur, Andrew Weiler, Salmin Sultana, Karanvir Grewal, Sreenivas Subramoney |
| 2021 | DarKnight: An Accelerated Framework for Privacy and Integrity Preserving Deep Learning Using Trusted Hardware. Hanieh Hashemi, Yongqin Wang, Murali Annavaram |
| 2021 | Distilling Bit-level Sparsity Parallelism for General Purpose Deep Learning Acceleration. Hang Lu, Liang Chang, Chenglong Li, Zixuan Zhu, Shengjian Lu, Yanhuan Liu, Mingzhe Zhang |
| 2021 | Distributed Data Persistency. Apostolos Kokolis, Antonis Psistakis, Benjamin Reidys, Jian Huang, Josep Torrellas |
| 2021 | Dolos: Improving the Performance of Persistent Applications in ADR-Supported Secure Memory. Xijing Han, James Tuck, Amro Awad |
| 2021 | ENMC: Extreme Near-Memory Classification via Approximate Screening. Liu Liu, Jilan Lin, Zheng Qu, Yufei Ding, Yuan Xie |
| 2021 | ESCALATE: Boosting the Efficiency of Sparse CNN Accelerator with Kernel Decomposition. Shiyu Li, Edward Hanson, Xuehai Qian, Hai (Helen) Li, Yiran Chen |
| 2021 | EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference. Thierry Tambe, Coleman Hooper, Lillian Pentecost, Tianyu Jia, En-Yu Yang, Marco Donato, Victor Sanh, Paul N. Whatmough, Alexander M. Rush, David Brooks, Gu-Yeon Wei |
| 2021 | Effective Processor Verification with Logic Fuzzer Enhanced Co-simulation. Nursultan Kabylkas, Tommy Thorn, Shreesha Srinath, Polychronis Xekalakis, Jose Renau |
| 2021 | Efficient, Distributed, and Non-Speculative Multi-Address Atomic Operations. Eduardo José Gómez-Hernández, Juan M. Cebrian, J. Rubén Titos Gil, Stefanos Kaxiras, Alberto Ros |
| 2021 | Enabling Branch-Mispredict Level Parallelism by Selectively Flushing Instructions. Stijn Eyerman, Wim Heirman, Sam Van den Steen, Ibrahim Hur |
| 2021 | Equinox: Training (for Free) on a Custom Inference Accelerator. Mario Drumond, Louis Coulon, Arash Pourhabibi Zarandi, Ahmet Caner Yüzügüler, Babak Falsafi, Martin Jaggi |
| 2021 | Exploiting Different Levels of Parallelism in the Quantum Control Microarchitecture for Superconducting Qubits. Mengyu Zhang, Lei Xie, Zhenxing Zhang, Qiaonian Yu, Guanglei Xi, Hualiang Zhang, Fuming Liu, Yarui Zheng, Yicong Zheng, Shengyu Zhang |
| 2021 | F1: A Fast and Programmable Accelerator for Fully Homomorphic Encryption. Nikola Samardzic, Axel Feldmann, Aleksandar Krastev, Srinivas Devadas, Ronald G. Dreslinski, Christopher Peikert, Daniel Sánchez |
| 2021 | FPRaker: A Processing Element For Accelerating Neural Network Training. Omar Mohamed Awad, Mostafa Mahmoud, Isak Edo, Ali Hadi Zadeh, Ciaran Bannon, Anand Jayarajan, Gennady Pekhimenko, Andreas Moshovos |
| 2021 | Fat Loads: Exploiting Locality Amongst Contemporaneous Load Operations to Optimize Cache Accesses. Vanshika Baoni, Adarsh Mittal, Gurindar S. Sohi |
| 2021 | Fifer: Practical Acceleration of Irregular Applications on Reconfigurable Architectures. Quan M. Nguyen, Daniel Sánchez |
| 2021 | GPS: A Global Publish-Subscribe Model for Multi-GPU Memory Management. Harini Muthukrishnan, Daniel Lustig, David W. Nellans, Thomas F. Wenisch |
| 2021 | GhostMinion: A Strictness-Ordered Cache System for Spectre Mitigation. Sam Ainsworth |
| 2021 | GreenDIMM: OS-assisted DRAM Power Management for DRAM with a Sub-array Granularity Power-Down State. Seunghak Lee, Ki-Dong Kang, Hwanjun Lee, Hyungwon Park, Young Hoon Son, Nam Sung Kim, Daehoon Kim |
| 2021 | HARP: Practically and Effectively Identifying Uncorrectable Errors in Memory Chips That Use On-Die Error-Correcting Codes. Minesh Patel, Geraldo F. Oliveira, Onur Mutlu |
| 2021 | HiMA: A Fast and Scalable History-based Memory Access Engine for Differentiable Neural Computer. Yaoyu Tao, Zhengya Zhang |
| 2021 | HoloAR: On-the-fly Optimization of 3D Holographic Processing for Augmented Reality. Shulin Zhao, Haibo Zhang, Cyan Subhra Mishra, Sandeepa Bhuyan, Ziyu Ying, Mahmut Taylan Kandemir, Anand Sivasubramaniam, Chita R. Das |
| 2021 | I-GCN: A Graph Convolutional Network Accelerator with Runtime Locality Enhancement through Islandization. Tong Geng, Chunshu Wu, Yongan Zhang, Cheng Tan, Chenhao Xie, Haoran You, Martin C. Herbordt, Yingyan Lin, Ang Li |
| 2021 | ITSLF: Inter-Thread Store-to-Load Forwardingin Simultaneous Multithreading. Josué Feliu, Alberto Ros, Manuel E. Acacio, Stefanos Kaxiras |
| 2021 | IceClave: A Trusted Execution Environment for In-Storage Computing. Luyi Kang, Yuqi Xue, Weiwei Jia, Xiaohao Wang, Jongryool Kim, Changhwan Youn, Myeong Joon Kang, Hyung Jin Lim, Bruce L. Jacob, Jian Huang |
| 2021 | Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design. Bingyao Li, Jieming Yin, Youtao Zhang, Xulong Tang |
| 2021 | Improving Streaming Graph Processing Performance using Input Knowledge. Abanti Basak, Zheng Qu, Jilan Lin, Alaa R. Alameldeen, Zeshan Chishti, Yufei Ding, Yuan Xie |
| 2021 | Increasing GPU Translation Reach by Leveraging Under-Utilized On-Chip Resources. Jagadish B. Kotra, Michael LeBeane, Mahmut T. Kandemir, Gabriel H. Loh |
| 2021 | Intersection Prediction for Accelerated GPU Ray Tracing. Lufei Liu, Wesley Chang, Francois Demoullin, Yuan-Hsi Chou, Mohammadreza Saed, David Pankratz, Tyler Nowicki, Tor M. Aamodt |
| 2021 | JetStream: Graph Analytics on Streaming Data with Event-Driven Hardware Accelerator. Shafiur Rahman, Mahbod Afarin, Nael B. Abu-Ghazaleh, Rajiv Gupta |
| 2021 | JigSaw: Boosting Fidelity of NISQ Programs via Measurement Subsetting. Poulami Das, Swamit S. Tannu, Moinuddin K. Qureshi |
| 2021 | LADDER: Architecting Content and Location-aware Writes for Crossbar Resistive Memories. Md Hafizul Islam Chowdhuryy, Muhammad Rashedul Haq Rashed, Amro Awad, Rickard Ewetz, Fan Yao |
| 2021 | Leveraging Targeted Value Prediction to Unlock New Hardware Strength Reduction Potential. Arthur Perais |
| 2021 | MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, Virtual Event, Greece, October 18-22, 2021 |
| 2021 | Morrigan: A Composite Instruction TLB Prefetcher. Georgios Vavouliotis, Lluc Alvarez, Boris Grot, Daniel A. Jiménez, Marc Casas |
| 2021 | NDS: N-Dimensional Storage. Yu-Chia Liu, Hung-Wei Tseng |
| 2021 | NMAP: Power Management Based on Network Packet Processing Mode Transition for Latency-Critical Workloads. Ki-Dong Kang, Gyeongseo Park, Hyosang Kim, Mohammad Alian, Nam Sung Kim, Daehoon Kim |
| 2021 | NOVIA: A Framework for Discovering Non-Conventional Inline Accelerators. David Trilla, John-David Wellman, Alper Buyuktosunoglu, Pradip Bose |
| 2021 | Network-on-Chip Microarchitecture-based Covert Channel in GPUs. Jaeguk Ahn, Jiho Kim, Hans Kasan, Zhixian Jin, Leila Delshadtehrani, WonJun Song, Ajay Joshi, John Kim |
| 2021 | Noema: Hardware-Efficient Template Matching for Neural Population Pattern Detection. Ameer M. S. Abdelhadi, Eugene Sha, Ciaran Bannon, Hendrik Steenland, Andreas Moshovos |
| 2021 | Ohm-GPU: Integrating New Optical Network and Heterogeneous Memory into GPU Multi-Processors. Jie Zhang, Myoungsoo Jung |
| 2021 | OrderLight: Lightweight Memory-Ordering Primitive for Efficient Fine-Grained PIM Computations. Anirban Nag, Rajeev Balasubramonian |
| 2021 | PCCS: Processor-Centric Contention-aware Slowdown Model for Heterogeneous System-on-Chips. Yuanchao Xu, Mehmet Esat Belviranli, Xipeng Shen, Jeffrey S. Vetter |
| 2021 | PDede: Partitioned, Deduplicated, Delta Branch Target Buffer. Niranjan K. Soundararajan, Peter Braun, Tanvir Ahmed Khan, Baris Kasikci, Heiner Litz, Sreenivas Subramoney |
| 2021 | ParaBit: Processing Parallel Bitwise Operations in NAND Flash Memory based SSDs. Congming Gao, Xin Xin, Youyou Lu, Youtao Zhang, Jun Yang, Jiwu Shu |
| 2021 | Point-X: A Spatial-Locality-Aware Architecture for Energy-Efficient Graph-Based Point-Cloud Deep Learning. Jie-Fang Zhang, Zhengya Zhang |
| 2021 | PointAcc: Efficient Point Cloud Accelerator. Yujun Lin, Zhekai Zhang, Haotian Tang, Hanrui Wang, Song Han |
| 2021 | Post-Fabrication Microarchitecture. Chanchal Kumar, Anirudh Seshadri, Aayush Chaudhary, Shubham Bhawalkar, Rohit Singh, Eric Rotenberg |
| 2021 | Principal Kernel Analysis: A Tractable Methodology to Simulate Scaled GPU Workloads. Cesar Avalos Baddouh, Mahmoud Khairy, Roland N. Green, Mathias Payer, Timothy G. Rogers |
| 2021 | Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning. Rahul Bera, Konstantinos Kanellopoulos, Anant Nori, Taha Shahroodi, Sreenivas Subramoney, Onur Mutlu |
| 2021 | RACER: Bit-Pipelined Processing Using Resistive Memory. Minh S. Q. Truong, Eric Chen, Deanyone Su, Liting Shen, Alexander Glass, L. Richard Carley, James A. Bain, Saugata Ghose |
| 2021 | RecPipe: Co-designing Models and Hardware to Jointly Optimize Recommendation Quality and Performance. Udit Gupta, Samuel Hsia, Jeff Zhang, Mark Wilkening, Javin Pombra, Hsien-Hsin Sean Lee, Gu-Yeon Wei, Carole-Jean Wu, David Brooks |
| 2021 | ReplayCache: Enabling Volatile Cachesfor Energy Harvesting Systems. Jianping Zeng, Jongouk Choi, Xinwei Fu, Ajay Paddayuru Shreepathi, Dongyoon Lee, Changwoo Min, Changhee Jung |
| 2021 | SAM: Accelerating Strided Memory Accesses. Xin Xin, Yanan Guo, Youtao Zhang, Jun Yang |
| 2021 | SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems. Maciej Besta, Raghavendra Kanakagiri, Grzegorz Kwasniewski, Rachata Ausavarungnirun, Jakub Beránek, Konstantinos Kanellopoulos, Kacper Janda, Zur Vonarburg-Shmaria, Lukas Gianinazzi, Ioana Stefan, Juan Gómez-Luna, Jakub Golinowski, Marcin Copik, Lukas Kapp-Schwoerer, Salvatore Di Girolamo, Nils Blach, Marek Konieczny, Onur Mutlu, Torsten Hoefler |
| 2021 | SMART: A Heterogeneous Scratchpad Memory Architecture for Superconductor SFQ-based Systolic CNN Accelerators. Farzaneh Zokaee, Lei Jiang |
| 2021 | Sanger: A Co-Design Framework for Enabling Sparse Attention using Reconfigurable Architecture. Liqiang Lu, Yicheng Jin, Hangrui Bi, Zizhang Luo, Peng Li, Tao Wang, Yun Liang |
| 2021 | Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network Training via Memory-Friendly Pattern Retrieving. Qiyu Wan, Haojun Xia, Xingyao Zhang, Lening Wang, Shuaiwen Leon Song, Xin Fu |
| 2021 | Software-Defined Vector Processing on Manycore Fabrics. Philip Bedoukian, Neil Adit, Edwin Peguero, Adrian Sampson |
| 2021 | Soteria: Towards Resilient Integrity-Protected and Encrypted Non-Volatile Memories. Kazi Abu Zubair, Sudhanva Gurumurthi, Vilas Sridharan, Amro Awad |
| 2021 | SparseAdapt: Runtime Control for Sparse Linear Algebra on a Reconfigurable Accelerator. Subhankar Pal, Aporva Amarnath, Siying Feng, Michael F. P. O'Boyle, Ronald G. Dreslinski, Christophe Dubach |
| 2021 | Speculative Privacy Tracking (SPT): Leaking Information From Speculative Execution Without Compromising Privacy. Rutvik Choudhary, Jiyong Yu, Christopher W. Fletcher, Adam Morrison |
| 2021 | SquiggleFilter: An Accelerator for Portable Virus Detection. Timothy Dunn, Harisankar Sadasivan, Jack Wadden, Kush Goliya, Kuan-Yu Chen, David T. Blaauw, Reetuparna Das, Satish Narayanasamy |
| 2021 | Sunder: Enabling Low-Overhead and Scalable Near-Data Pattern Matching Acceleration. Elaheh Sadredini, Reza Rahimi, Mohsen Imani, Kevin Skadron |
| 2021 | Synthesizing Formal Models of Hardware from RTL for Efficient Verification of Memory Model Implementations. Yao Hsiao, Dominic P. Mulligan, Nikos Nikoleris, Gustavo Petri, Caroline Trippel |
| 2021 | TIP: Time-Proportional Instruction Profiling. Björn Gottschall, Lieven Eeckhout, Magnus Jahre |
| 2021 | TRiM: Enhancing Processor-Memory Interfaces with Scalable Tensor Reduction in Memory. Jaehyun Park, Byeongho Kim, Sungmin Yun, Eojin Lee, Minsoo Rhu, Jung Ho Ahn |
| 2021 | The Laplace Microarchitecture for Tracking Data Uncertainty and Its Implementation in a RISC-V Processor. Vasileios Tsoutsouras, Orestis Kaparounakis, Bilgesu Arif Bilgin, Chatura Samarakoon, James Timothy Meech, Jan Heck, Phillip Stanley-Marbell |
| 2021 | Trident: Harnessing Architectural Resources for All Page Sizes in x86 Processors. Venkat Sri Sai Ram, Ashish Panwar, Arkaprava Basu |
| 2021 | Turnpike: Lightweight Soft Error Resilience for In-Order Cores. Jianping Zeng, Hongjune Kim, Jaejin Lee, Changhee Jung |
| 2021 | Twig: Profile-Guided BTB Prefetching for Data Center Applications. Tanvir Ahmed Khan, Nathan Brown, Akshitha Sriraman, Niranjan K. Soundararajan, Rakesh Kumar, Joseph Devietti, Sreenivas Subramoney, Gilles A. Pokam, Heiner Litz, Baris Kasikci |
| 2021 | UC-Check: Characterizing Micro-operation Caches in x86 Processors and Implications in Security and Performance. Joonsung Kim, Hamin Jang, Hunjun Lee, Seungho Lee, Jangwoo Kim |
| 2021 | Uncovering In-DRAM RowHammer Protection Mechanisms: A New Methodology, Custom RowHammer Patterns, and Implications. Hasan Hassan, Yahya Can Tugrul, Jeremie S. Kim, Victor van der Veen, Kaveh Razavi, Onur Mutlu |
| 2021 | Validation of Side-Channel Models via Observation Refinement. Pablo Buiras, Hamed Nemati, Andreas Lindner, Roberto Guanciale |
| 2021 | Vortex: Extending the RISC-V ISA for GPGPU and 3D-Graphics. Blaise Tine, Krishna Praveen Yalamarthy, Fares Elsabbagh, Hyesoon Kim |