| 2021 | A Case Against Hardware Managed DRAM Caches for NVRAM Based Systems. Mark Hildebrand, Julian T. Angeles, Jason Lowe-Power, Venkatesh Akella |
| 2021 | A Defense-Inspired Benchmark Suite. Pete Ehrett, Nathan Block, Bing Schaefer, Adrian Berding, John Paul Koenig, Pranav Srinivasan, Valeria Bertacco, Todd M. Austin |
| 2021 | AI Tax in Mobile SoCs: End-to-end Performance Analysis of Machine Learning in Smartphones. Michael Buch, Zahra Azad, Ajay Joshi, Vijay Janapa Reddi |
| 2021 | AIBench Training: Balanced Industry-Standard AI Training Benchmarking. Fei Tang, Wanling Gao, Jianfeng Zhan, Chuanxin Lan, Xu Wen, Lei Wang, Chunjie Luo, Zheng Cao, Xingwang Xiong, Zihan Jiang, Tianshu Hao, Fanda Fan, Fan Zhang, Yunyou Huang, Jianan Chen, Mengjia Du, Rui Ren, Chen Zheng, Daoyi Zheng, Haoning Tang, Kunlin Zhan, Biao Wang, Defei Kong, Minghe Yu, Chongkang Tan, Huan Li, Xinhui Tian, Yatao Li, Junchao Shao, Zhenyu Wang, Xiaoyu Wang, Jiahui Dai, Hainan Ye |
| 2021 | Accelerating Fully Homomorphic Encryption Through Microarchitecture-Aware Analysis and Optimization. Wonkyung Jung, Eojin Lee, Sangpyo Kim, Namhoon Kim, Keewoo Lee, Chohong Min, Jung Hee Cheon, Jung Ho Ahn |
| 2021 | An Automated Traffic Generation Framework for Performance Evaluation of Networks-on-Chip for Real World Use Cases. Sri Harsha Gade, Anup Gangwar, Ambica Prasad, Nitin Kumar Agarwal, Ravishankar Sreedharan |
| 2021 | Analysis of Factors Affecting Power Consumption and Energy Efficiency of SGEMM on the Low-Power Myriad-2 VPU. Suyash Bakshi, S. Lennart Johnsson |
| 2021 | Analyzing Secure Memory Architecture for GPUs. Shougang Yuan, Ardhi Wiratama Baskara Yudha, Yan Solihin, Huiyang Zhou |
| 2021 | Analyzing the Interplay Between Random Shuffling and Storage Devices for Efficient Machine Learning. Zhi-Lin Ke, Hsiang-Yun Cheng, Chia-Lin Yang, Han-wei Huang |
| 2021 | Architecture-Level Energy Estimation for Heterogeneous Computing Systems. Francis Wang, Yannan Nellie Wu, Matthew E. Woicik, Joel S. Emer, Vivienne Sze |
| 2021 | COBRA: A Framework for Evaluating Compositions of Hardware Branch Predictors. Jerry Zhao, Abraham Gonzalez, Alon Amid, Sagar Karandikar, Krste Asanovic |
| 2021 | Characterizing Massively Parallel Polymorphism. Mengchi Zhang, Ahmad Alawneh, Timothy G. Rogers |
| 2021 | CoCoPeLia: Communication-Computation Overlap Prediction for Efficient Linear Algebra on GPUs. Petros Anastasiadis, Nikela Papadopoulou, Georgios I. Goumas, Nectarios Koziris |
| 2021 | Comparative Code Structure Analysis using Deep Learning for Performance Prediction. Tarek Ramadan, Tanzima Z. Islam, Chase Phelps, Nathan Pinnow, Jayaraman J. Thiagarajan |
| 2021 | Designing GPU Architecture for Memory Bandwidth Reservation. Emir C. Marangoz, Kyoung-Don Kang, Seunghee Shin |
| 2021 | E3: A HW/SW Co-design Neuroevolution Platform for Autonomous Learning in Edge Device. Sheng-Chun Kao, Tushar Krishna |
| 2021 | Efficient Management of Scratch-Pad Memories in Deep Learning Accelerators. Subhankar Pal, Swagath Venkataramani, Viji Srinivasan, Kailash Gopalakrishnan |
| 2021 | Efficient Split Counter Mode Encryption for NVM. Qi Pei, Seunghee Shin |
| 2021 | Enabling Reproducible and Agile Full-System Simulation. Bobby R. Bruce, Ayaz Akram, Hoa Nguyen, Kyle Roarty, Mahyar Samani, Marjan Fariborz, Trivikram Reddy, Matthew D. Sinclair, Jason Lowe-Power |
| 2021 | FireMarshal: Making HW/SW Co-Design Reproducible and Reliable. Nathan Pemberton, Alon Amid |
| 2021 | GNNMark: A Benchmark Suite to Characterize Graph Neural Network Training on GPUs. Trinayan Baruah, Kaustubh Shivdikar, Shi Dong, Yifan Sun, Saiful A. Mojumder, Kihoon Jung, José L. Abellán, Yash Ukidave, Ajay Joshi, John Kim, David R. Kaeli |
| 2021 | GenomicsBench: A Benchmark Suite for Genomics. Arun Subramaniyan, Yufeng Gu, Timothy Dunn, Somnath Paul, Md. Vasimuddin, Sanchit Misra, David T. Blaauw, Satish Narayanasamy, Reetuparna Das |
| 2021 | Hardware Acceleration for DBMS Machine Learning Scoring: Is It Worth the Overheads? Zahra Azad, Rathijit Sen, Kwanghyun Park, Ajay Joshi |
| 2021 | How Do Graph Relabeling Algorithms Improve Memory Locality? Mohsen Koohi Esfahani, Peter Kilpatrick, Hans Vandierendonck |
| 2021 | IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2021, Stony Brook, NY, USA, March 28-30, 2021 |
| 2021 | Learning Sparse Matrix Row Permutations for Efficient SpMM on GPU Architectures. Atefeh Mehrabi, Donghyuk Lee, Niladrish Chatterjee, Daniel J. Sorin, Benjamin C. Lee, Mike O'Connor |
| 2021 | Loopapalooza: Investigating Limits of Loop-Level Parallelism with a Compiler-Driven Approach. Ali Mustafa Zaidi, Konstantinos Iordanou, Mikel Luján, Giacomo Gabrielli |
| 2021 | Memory-Efficient Hardware Performance Counters with Approximate-Counting Algorithms. Jingyi Xu, Sehoon Kim, Borivoje Nikolic, Yakun Sophia Shao |
| 2021 | MicroGrad: A Centralized Framework for Workload Cloning and Stress Testing. Gokul Subramanian Ravi, Ramon Bertran, Pradip Bose, Mikko H. Lipasti |
| 2021 | Performance Analysis of Graph Neural Network Frameworks. Junwei Wu, Jingwei Sun, Hao Sun, Guangzhong Sun |
| 2021 | Performance Characterization of .NET Benchmarks. Aniket Deshmukh, Ruihao Li, Rathijit Sen, Robert R. Henry, Monica Beckwith, Gagan Gupta |
| 2021 | Pinpointing the Memory Behaviors of DNN Training. Jiansong Li, Xiao Dong, Guangli Li, Peng Zhao, Xueying Wang, Xiaobing Chen, Xianzhi Yu, Yongxin Yang, Zihan Jiang, Wei Cao, Lei Liu, Xiaobing Feng |
| 2021 | Pitfalls of InfiniBand with On-Demand Paging. Takuya Fukuoka, Shigeyuki Sato, Kenjiro Taura |
| 2021 | Re-establishing Fetch-Directed Instruction Prefetching: An Industry Perspective. Yasuo Ishii, Jaekyu Lee, Krishnendra Nathella, Dam Sunwoo |
| 2021 | Real-Time Characterization of Data Access Correlations. Bryan Harris, Michael Marzullo, Nihat Altiparmak |
| 2021 | Reducing BERT Computation by Padding Removal and Curriculum Learning. Wei Zhang, Wei Wei, Wen Wang, Lingling Jin, Zheng Cao |
| 2021 | Sparseloop: An Analytical, Energy-Focused Design Space Exploration Methodology for Sparse Tensor Accelerators. Yannan Nellie Wu, Po-An Tsai, Angshuman Parashar, Vivienne Sze, Joel S. Emer |
| 2021 | Splash-4: Improving Scalability with Lock-Free Constructs. Eduardo José Gómez-Hernández, Ruixiang Shao, Christos Sakalis, Stefanos Kaxiras, Alberto Ros |
| 2021 | TPUPoint: Automatic Characterization of Hardware-Accelerated Machine-Learning Behavior for Cloud Computing. Abenezer Wudenhe, Hung-Wei Tseng |
| 2021 | The Impact of SoC Integration and OS Deployment on the Reliability of Arm Processors. Pablo Bodmann, George Papadimitriou, Dimitris Gizopoulos, Paolo Rech |
| 2021 | Thermal-Aware Overclocking for Smartphones. Guru Prasad Srinivasa, David Werner, Mark Hempstead, Geoffrey Challen |
| 2021 | Understanding Capacity-Driven Scale-Out Neural Recommendation Inference. Michael Lui, Yavuz Yetim, Özgür Özkan, Zhuoran Zhao, Shin-Yeh Tsai, Carole-Jean Wu, Mark Hempstead |
| 2021 | ViStA: Video Streaming and Analytics Benchmark. Navneet Raju, Rahul M. Koushik, Hari Om, Subramaniam Kalambur |