ISPASS B

43 papers

YearTitle / Authors
2021A Case Against Hardware Managed DRAM Caches for NVRAM Based Systems.
Mark Hildebrand, Julian T. Angeles, Jason Lowe-Power, Venkatesh Akella
2021A Defense-Inspired Benchmark Suite.
Pete Ehrett, Nathan Block, Bing Schaefer, Adrian Berding, John Paul Koenig, Pranav Srinivasan, Valeria Bertacco, Todd M. Austin
2021AI Tax in Mobile SoCs: End-to-end Performance Analysis of Machine Learning in Smartphones.
Michael Buch, Zahra Azad, Ajay Joshi, Vijay Janapa Reddi
2021AIBench Training: Balanced Industry-Standard AI Training Benchmarking.
Fei Tang, Wanling Gao, Jianfeng Zhan, Chuanxin Lan, Xu Wen, Lei Wang, Chunjie Luo, Zheng Cao, Xingwang Xiong, Zihan Jiang, Tianshu Hao, Fanda Fan, Fan Zhang, Yunyou Huang, Jianan Chen, Mengjia Du, Rui Ren, Chen Zheng, Daoyi Zheng, Haoning Tang, Kunlin Zhan, Biao Wang, Defei Kong, Minghe Yu, Chongkang Tan, Huan Li, Xinhui Tian, Yatao Li, Junchao Shao, Zhenyu Wang, Xiaoyu Wang, Jiahui Dai, Hainan Ye
2021Accelerating Fully Homomorphic Encryption Through Microarchitecture-Aware Analysis and Optimization.
Wonkyung Jung, Eojin Lee, Sangpyo Kim, Namhoon Kim, Keewoo Lee, Chohong Min, Jung Hee Cheon, Jung Ho Ahn
2021An Automated Traffic Generation Framework for Performance Evaluation of Networks-on-Chip for Real World Use Cases.
Sri Harsha Gade, Anup Gangwar, Ambica Prasad, Nitin Kumar Agarwal, Ravishankar Sreedharan
2021Analysis of Factors Affecting Power Consumption and Energy Efficiency of SGEMM on the Low-Power Myriad-2 VPU.
Suyash Bakshi, S. Lennart Johnsson
2021Analyzing Secure Memory Architecture for GPUs.
Shougang Yuan, Ardhi Wiratama Baskara Yudha, Yan Solihin, Huiyang Zhou
2021Analyzing the Interplay Between Random Shuffling and Storage Devices for Efficient Machine Learning.
Zhi-Lin Ke, Hsiang-Yun Cheng, Chia-Lin Yang, Han-wei Huang
2021Architecture-Level Energy Estimation for Heterogeneous Computing Systems.
Francis Wang, Yannan Nellie Wu, Matthew E. Woicik, Joel S. Emer, Vivienne Sze
2021COBRA: A Framework for Evaluating Compositions of Hardware Branch Predictors.
Jerry Zhao, Abraham Gonzalez, Alon Amid, Sagar Karandikar, Krste Asanovic
2021Characterizing Massively Parallel Polymorphism.
Mengchi Zhang, Ahmad Alawneh, Timothy G. Rogers
2021CoCoPeLia: Communication-Computation Overlap Prediction for Efficient Linear Algebra on GPUs.
Petros Anastasiadis, Nikela Papadopoulou, Georgios I. Goumas, Nectarios Koziris
2021Comparative Code Structure Analysis using Deep Learning for Performance Prediction.
Tarek Ramadan, Tanzima Z. Islam, Chase Phelps, Nathan Pinnow, Jayaraman J. Thiagarajan
2021Designing GPU Architecture for Memory Bandwidth Reservation.
Emir C. Marangoz, Kyoung-Don Kang, Seunghee Shin
2021E3: A HW/SW Co-design Neuroevolution Platform for Autonomous Learning in Edge Device.
Sheng-Chun Kao, Tushar Krishna
2021Efficient Management of Scratch-Pad Memories in Deep Learning Accelerators.
Subhankar Pal, Swagath Venkataramani, Viji Srinivasan, Kailash Gopalakrishnan
2021Efficient Split Counter Mode Encryption for NVM.
Qi Pei, Seunghee Shin
2021Enabling Reproducible and Agile Full-System Simulation.
Bobby R. Bruce, Ayaz Akram, Hoa Nguyen, Kyle Roarty, Mahyar Samani, Marjan Fariborz, Trivikram Reddy, Matthew D. Sinclair, Jason Lowe-Power
2021FireMarshal: Making HW/SW Co-Design Reproducible and Reliable.
Nathan Pemberton, Alon Amid
2021GNNMark: A Benchmark Suite to Characterize Graph Neural Network Training on GPUs.
Trinayan Baruah, Kaustubh Shivdikar, Shi Dong, Yifan Sun, Saiful A. Mojumder, Kihoon Jung, José L. Abellán, Yash Ukidave, Ajay Joshi, John Kim, David R. Kaeli
2021GenomicsBench: A Benchmark Suite for Genomics.
Arun Subramaniyan, Yufeng Gu, Timothy Dunn, Somnath Paul, Md. Vasimuddin, Sanchit Misra, David T. Blaauw, Satish Narayanasamy, Reetuparna Das
2021Hardware Acceleration for DBMS Machine Learning Scoring: Is It Worth the Overheads?
Zahra Azad, Rathijit Sen, Kwanghyun Park, Ajay Joshi
2021How Do Graph Relabeling Algorithms Improve Memory Locality?
Mohsen Koohi Esfahani, Peter Kilpatrick, Hans Vandierendonck
2021IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2021, Stony Brook, NY, USA, March 28-30, 2021
2021Learning Sparse Matrix Row Permutations for Efficient SpMM on GPU Architectures.
Atefeh Mehrabi, Donghyuk Lee, Niladrish Chatterjee, Daniel J. Sorin, Benjamin C. Lee, Mike O'Connor
2021Loopapalooza: Investigating Limits of Loop-Level Parallelism with a Compiler-Driven Approach.
Ali Mustafa Zaidi, Konstantinos Iordanou, Mikel Luján, Giacomo Gabrielli
2021Memory-Efficient Hardware Performance Counters with Approximate-Counting Algorithms.
Jingyi Xu, Sehoon Kim, Borivoje Nikolic, Yakun Sophia Shao
2021MicroGrad: A Centralized Framework for Workload Cloning and Stress Testing.
Gokul Subramanian Ravi, Ramon Bertran, Pradip Bose, Mikko H. Lipasti
2021Performance Analysis of Graph Neural Network Frameworks.
Junwei Wu, Jingwei Sun, Hao Sun, Guangzhong Sun
2021Performance Characterization of .NET Benchmarks.
Aniket Deshmukh, Ruihao Li, Rathijit Sen, Robert R. Henry, Monica Beckwith, Gagan Gupta
2021Pinpointing the Memory Behaviors of DNN Training.
Jiansong Li, Xiao Dong, Guangli Li, Peng Zhao, Xueying Wang, Xiaobing Chen, Xianzhi Yu, Yongxin Yang, Zihan Jiang, Wei Cao, Lei Liu, Xiaobing Feng
2021Pitfalls of InfiniBand with On-Demand Paging.
Takuya Fukuoka, Shigeyuki Sato, Kenjiro Taura
2021Re-establishing Fetch-Directed Instruction Prefetching: An Industry Perspective.
Yasuo Ishii, Jaekyu Lee, Krishnendra Nathella, Dam Sunwoo
2021Real-Time Characterization of Data Access Correlations.
Bryan Harris, Michael Marzullo, Nihat Altiparmak
2021Reducing BERT Computation by Padding Removal and Curriculum Learning.
Wei Zhang, Wei Wei, Wen Wang, Lingling Jin, Zheng Cao
2021Sparseloop: An Analytical, Energy-Focused Design Space Exploration Methodology for Sparse Tensor Accelerators.
Yannan Nellie Wu, Po-An Tsai, Angshuman Parashar, Vivienne Sze, Joel S. Emer
2021Splash-4: Improving Scalability with Lock-Free Constructs.
Eduardo José Gómez-Hernández, Ruixiang Shao, Christos Sakalis, Stefanos Kaxiras, Alberto Ros
2021TPUPoint: Automatic Characterization of Hardware-Accelerated Machine-Learning Behavior for Cloud Computing.
Abenezer Wudenhe, Hung-Wei Tseng
2021The Impact of SoC Integration and OS Deployment on the Reliability of Arm Processors.
Pablo Bodmann, George Papadimitriou, Dimitris Gizopoulos, Paolo Rech
2021Thermal-Aware Overclocking for Smartphones.
Guru Prasad Srinivasa, David Werner, Mark Hempstead, Geoffrey Challen
2021Understanding Capacity-Driven Scale-Out Neural Recommendation Inference.
Michael Lui, Yavuz Yetim, Özgür Özkan, Zhuoran Zhao, Shin-Yeh Tsai, Carole-Jean Wu, Mark Hempstead
2021ViStA: Video Streaming and Analytics Benchmark.
Navneet Raju, Rahul M. Koushik, Hari Om, Subramaniam Kalambur