ISCA A*

82 papers

YearTitle / Authors
202148th ACM/IEEE Annual International Symposium on Computer Architecture, ISCA 2021, Virtual Event / Valencia, Spain, June 14-18, 2021
2021A Cost-Effective Entangling Prefetcher for Instructions.
Alberto Ros, Alexandra Jimborean
2021A RISC-V in-network accelerator for flexible high-performance low-power packet processing.
Salvatore Di Girolamo, Andreas Kurth, Alexandru Calotoiu, Thomas Benz, Timo Schneider, Jakub Beránek, Luca Benini, Torsten Hoefler
2021ABC-DIMM: Alleviating the Bottleneck of Communication in DIMM-based Near-Memory Processing with Inter-DIMM Broadcast.
Weiyi Sun, Zhaoshi Li, Shouyi Yin, Shaojun Wei, Leibo Liu
2021Accelerated Seeding for Genome Sequence Alignment with Enumerated Radix Trees.
Arun Subramaniyan, Jack Wadden, Kush Goliya, Nathan Ozog, Xiao Wu, Satish Narayanasamy, David T. Blaauw, Reetuparna Das
2021Albireo: Energy-Efficient Acceleration of Convolutional Neural Networks via Silicon Photonics.
Kyle Shiflett, Avinash Karanth, Razvan C. Bunescu, Ahmed Louri
2021Aurochs: An Architecture for Dataflow Threads.
Matthew Vilim, Alexander Rucker, Kunle Olukotun
2021BOSS: Bandwidth-Optimized Search Accelerator for Storage-Class Memory.
Jun Heo, Seung Yul Lee, Sunhong Min, Yeonhong Park, Sungjun Jung, Tae Jun Ham, Jae W. Lee
2021BlockMaestro: Enabling Programmer-Transparent Task-based Execution in GPU Systems.
AmirAli Abdolrashidi, Hodjat Asghari Esfeden, Ali Jahanshahi, Kaustubh Singh, Nael B. Abu-Ghazaleh, Daniel Wong
2021CODIC: A Low-Cost Substrate for Enabling Custom In-DRAM Functionalities and Optimizations.
Lois Orosa, Yaohua Wang, Mohammad Sadrosadati, Jeremie S. Kim, Minesh Patel, Ivan Puddu, Haocong Luo, Kaveh Razavi, Juan Gómez-Luna, Hasan Hassan, Nika Mansouri-Ghiasi, Saugata Ghose, Onur Mutlu
2021Cambricon-Q: A Hybrid Architecture for Efficient Training.
Yongwei Zhao, Chang Liu, Zidong Du, Qi Guo, Xing Hu, Yimin Zhuang, Zhenxing Zhang, Xinkai Song, Wei Li, Xishan Zhang, Ling Li, Zhiwei Xu, Tianshi Chen
2021CoSA: Scheduling by Constrained Optimization for Spatial Accelerators.
Qijing Huang, Aravind Kalaiah, Minwoo Kang, James Demmel, Grace Dinh, John Wawrzynek, Thomas Norell, Yakun Sophia Shao
2021Communication Algorithm-Architecture Co-Design for Distributed Deep Learning.
Jiayi Huang, Pritam Majumder, Sungkeun Kim, Abdullah Muzahid, Ki Hwan Yum, Eun Jung Kim
2021Confidential Serverless Made Efficient with Plug-In Enclaves.
Mingyu Li, Yubin Xia, Haibo Chen
2021Cost-Efficient Overclocking in Immersion-Cooled Datacenters.
Majid Jalili, Ioannis Manousakis, Iñigo Goiri, Pulkit A. Misra, Ashish Raniwala, Husam Alissa, Bharath Ramakrishnan, Phillip Tuma, Christian Belady, Marcus Fontoura, Ricardo Bianchini
2021CryoGuard: A Near Refresh-Free Robust DRAM Design for Cryogenic Computing.
Gyu-hyeon Lee, Seongmin Na, Ilkwon Byun, Dongmoon Min, Jangwoo Kim
2021Demystifying the System Vulnerability Stack: Transient Fault Effects Across the Layers.
George Papadimitriou, Dimitris Gizopoulos
2021Designing Calibration and Expressivity-Efficient Instruction Sets for Quantum Computing.
Lingling Lao, Prakash Murali, Margaret Martonosi, Dan E. Browne
2021Don't Forget the I/O When Allocating Your LLC.
Yifan Yuan, Mohammad Alian, Yipeng Wang, Ren Wang, Ilia Kurakin, Charlie Tai, Nam Sung Kim
2021Dual-side Sparse Tensor Core.
Yang Wang, Chen Zhang, Zhiqiang Xie, Cong Guo, Yunxin Liu, Jingwen Leng
2021Dvé: Improving DRAM Reliability and Performance On-Demand via Coherent Replication.
Adarsh Patil, Vijay Nagarajan, Rajeev Balasubramonian, Nicolai Oswald
2021ELSA: Hardware-Software Co-design for Efficient, Lightweight Self-Attention Mechanism in Neural Networks.
Tae Jun Ham, Yejin Lee, Seong Hoon Seo, Soosung Kim, Hyunji Choi, Sung Jun Jung, Jae W. Lee
2021Efficient Multi-GPU Shared Memory via Automatic Optimization of Fine-Grained Transfers.
Harini Muthukrishnan, David W. Nellans, Daniel Lustig, Jeffrey A. Fessler, Thomas F. Wenisch
2021Enabling Compute-Communication Overlap in Distributed Deep Learning Training Platforms.
Saeed Rashidi, Matthew Denton, Srinivas Sridharan, Sudarshan Srinivasan, Amoghavarsha Suresh, Jade Nie, Tushar Krishna
2021Energy Efficiency Boost in the AI-Infused POWER10 Processor.
Brian W. Thompto, Dung Q. Nguyen, José E. Moreira, Ramon Bertran, Hans M. Jacobson, Richard J. Eickemeyer, Rahul M. Rao, Michael Goulet, Marcy Byers, Christopher J. Gonzalez, Karthik Swaminathan, Nagu R. Dhanwada, Silvia M. Müller, Andreas Wagner, Satish Kumar Sadasivam, Robert K. Montoye, William J. Starke, Christian G. Zoellin, Michael S. Floyd, Jeffrey Stuecheli, Nandhini Chandramoorthy, John-David Wellman, Alper Buyuktosunoglu, Matthias Pflanz, Balaram Sinharoy, Pradip Bose
2021Execution Dependence Extension (EDE): ISA Support for Eliminating Fences.
Thomas Shull, Ilias Vougioukas, Nikos Nikoleris, Wendy Elsasser, Josep Torrellas
2021Exploiting Long-Distance Interactions and Tolerating Atom Loss in Neutral Atom Quantum Architectures.
Jonathan M. Baker, Andrew Litteken, Casey Duckering, Henry Hoffmann, Hannes Bernien, Frederic T. Chong
2021Exploiting Page Table Locality for Agile TLB Prefetching.
Georgios Vavouliotis, Lluc Alvarez, Vasileios Karakostas, Konstantinos Nikas, Nectarios Koziris, Daniel A. Jiménez, Marc Casas
2021FORMS: Fine-grained Polarized ReRAM-based In-situ Computation for Mixed-signal DNN Accelerator.
Geng Yuan, Payman Behnam, Zhengang Li, Ali Shafiee, Sheng Lin, Xiaolong Ma, Hang Liu, Xuehai Qian, Mahdi Nazm Bojnordi, Yanzhi Wang, Caiwen Ding
2021Failure Sentinels: Ubiquitous Just-in-time Intermittent Computation via Low-cost Hardware Support for Voltage Monitoring.
Harrison Williams, Michael Moukarzel, Matthew Hicks
2021Flex: High-Availability Datacenters With Zero Reserved Power.
Chaojie Zhang, Alok Gautam Kumbhare, Ioannis Manousakis, Deli Zhang, Pulkit A. Misra, Rod Assis, Kyle Woolcock, Nithish Mahalingam, Brijesh Warrier, David Gauthier, Lalu Kunnath, Steve Solomon, Osvaldo Morales, Marcus Fontoura, Ricardo Bianchini
2021FlexMiner: A Pattern-Aware Accelerator for Graph Pattern Mining.
Xuhao Chen, Tianhao Huang, Shuotao Xu, Thomas Bourgeat, Chanwoo Chung, Arvind
2021Ghost Routing to Enable Oblivious Computation on Memory-centric Networks.
Yeonju Ro, Seongwook Jin, Jaehyuk Huh, John Kim
2021GoSPA: An Energy-efficient High-performance Globally Optimized SParse Convolutional Neural Network Accelerator.
Chunhua Deng, Yang Sui, Siyu Liao, Xuehai Qian, Bo Yuan
2021HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation.
Qingcheng Xiao, Size Zheng, Bingzhe Wu, Pengcheng Xu, Xuehai Qian, Yun Liang
2021Hardware Architecture and Software Stack for PIM Based on Commercial DRAM Technology : Industrial Product.
Sukhan Lee, Shinhaeng Kang, Jaehoon Lee, Hyeonsu Kim, Eojin Lee, Seungwoo Seo, Hosang Yoon, Seungwon Lee, Kyounghwan Lim, Hyunsung Shin, JinHyun Kim, Seongil O, Anand Iyer, David Wang, Kyomin Sohn, Nam Sung Kim
2021Hetero-ViTAL: A Virtualization Stack for Heterogeneous FPGA Clusters.
Yue Zha, Jing Li
2021I See Dead µops: Leaking Secrets via Intel/AMD Micro-Op Caches.
Xida Ren, Logan Moody, Mohammadkazem Taram, Matthew Jordan, Dean M. Tullsen, Ashish Venkat
2021IChannels: Exploiting Current Management Mechanisms to Create Covert Channels in Modern Processors.
Jawad Haj-Yahya, Lois Orosa, Jeremie S. Kim, Juan Gómez-Luna, Abdullah Giray Yaglikçi, Mohammed Alser, Ivan Puddu, Onur Mutlu
2021INTROSPECTRE: A Pre-Silicon Framework for Discovery and Analysis of Transient Execution Vulnerabilities.
Moein Ghaniyoun, Kristin Barber, Yinqian Zhang, Radu Teodorescu
2021Large-Scale Graph Processing on FPGAs with Caches for Thousands of Simultaneous Misses.
Mikhail Asiatici, Paolo Ienne
2021Leaky Buddies: Cross-Component Covert Channels on Integrated CPU-GPU Systems.
Sankha Baran Dutta, Hoda Naghibijouybari, Nael B. Abu-Ghazaleh, Andres Marquez, Kevin J. Barker
2021Maya: Using Formal Control to Obfuscate Power Side Channels.
Raghavendra Pradyumna Pothukuchi, Sweta Yamini Pothukuchi, Petros G. Voulgaris, Alexander G. Schwing, Josep Torrellas
2021NASA: Accelerating Neural Network Design with a NAS Processor.
Xiaohan Ma, Chang Si, Ying Wang, Cheng Liu, Lei Zhang
2021NASGuard: A Novel Accelerator Architecture for Robust Neural Architecture Search (NAS) Networks.
Xingbin Wang, Boyan Zhao, Rui Hou, Amro Awad, Zhihong Tian, Dan Meng
2021NN-Baton: DNN Workload Orchestration and Chiplet Granularity Exploration for Multichip Accelerators.
Zhanhong Tan, Hongyu Cai, Runpei Dong, Kaisheng Ma
2021NVOverlay: Enabling Efficient and Scalable High-Frequency Snapshotting to NVM.
Ziqi Wang, Chul-Hwan Choo, Michael A. Kozuch, Todd C. Mowry, Gennady Pekhimenko, Vivek Seshadri, Dimitrios Skarlatos
2021No-FAT: Architectural Support for Low Overhead Memory Safety Checks.
Mohamed Tarek Ibn Ziad, Miguel A. Arroyo, Evgeny Manzhosov, Ryan Piersma, Simha Sethumadhavan
2021Opening Pandora's Box: A Systematic Study of New Ways Microarchitecture Can Leak Private Data.
Jose Rodrigo Sanchez Vicarte, Pradyumna Shome, Nandeeka Nayak, Caroline Trippel, Adam Morrison, David Kohlbrenner, Christopher W. Fletcher
2021PF-DRAM: A Precharge-Free DRAM Structure.
Nezam Rohbani, Sina Darabi, Hamid Sarbazi-Azad
2021PMNet: In-Network Data Persistence.
Korakit Seemakhupt, Sihang Liu, Yasas Senevirathne, Muhammad Shahbaz, Samira Manabi Khan
2021Pioneering Chiplet Technology and Design for the AMD EPYC™ and Ryzen™ Processor Families : Industrial Product.
Samuel Naffziger, Noah Beck, Thomas Burd, Kevin Lepak, Gabriel H. Loh, Mahesh Subramony, Sean White
2021PipeZK: Accelerating Zero-Knowledge Proof with a Pipelined Architecture.
Ye Zhang, Shuo Wang, Xian Zhang, Jiangbin Dong, Xingzhong Mao, Fan Long, Cong Wang, Dong Zhou, Mingyu Gao, Guangyu Sun
2021PolyGraph: Exposing the Value of Flexibility for Graph Processing Accelerators.
Vidushi Dadu, Sihao Liu, Tony Nowatzki
2021QUAC-TRNG: High-Throughput True Random Number Generation Using Quadruple Row Activation in Commodity DRAM Chips.
Ataberk Olgun, Minesh Patel, Abdullah Giray Yaglikçi, Haocong Luo, Jeremie S. Kim, Nisa Bostanci, Nandita Vijaykumar, Oguz Ergin, Onur Mutlu
2021Quantifying Server Memory Frequency Margin and Using It to Improve Performance in HPC Systems.
Da Zhang, Gagandeep Panwar, Jagadish B. Kotra, Nathan DeBardeleben, Sean Blanchard, Xun Jian
2021REDUCT: Keep it Close, Keep it Cool! : Efficient Scaling of DNN Inference on Multi-core CPUs with Near-Cache Compute.
Anant V. Nori, Rahul Bera, Shankar Balachandran, Joydeep Rakshit, Om Ji Omer, Avishaii Abuhatzera, Belliappa Kuttanna, Sreenivas Subramoney
2021RaPiD: AI Accelerator for Ultra-low Precision Training and Inference.
Swagath Venkataramani, Vijayalakshmi Srinivasan, Wei Wang, Sanchari Sen, Jintao Zhang, Ankur Agrawal, Monodeep Kar, Shubham Jain, Alberto Mannari, Hoang Tran, Yulong Li, Eri Ogawa, Kazuaki Ishizaki, Hiroshi Inoue, Marcel Schaal, Mauricio J. Serrano, Jungwook Choi, Xiao Sun, Naigang Wang, Chia-Yu Chen, Allison Allain, James Bonanno, Nianzheng Cao, Robert Casatuta, Matthew Cohen, Bruce M. Fleischer, Michael Guillorn, Howard Haynie, Jinwook Jung, Mingu Kang, Kyu-Hyoun Kim, Siyu Koswatta, Sae Kyu Lee, Martin Lutz, Silvia M. Mueller, Jinwook Oh, Ashish Ranjan, Zhibin Ren, Scot Rider, Kerstin Schelm, Michael Scheuermann, Joel Silberman, Jie Yang, Vidhi Zalani, Xin Zhang, Ching Zhou, Matthew M. Ziegler, Vinay Shah, Moriyoshi Ohara, Pong-Fei Lu, Brian W. Curran, Sunil Shukla, Leland Chang, Kailash Gopalakrishnan
2021Rebooting Virtual Memory with Midgard.
Siddharth Gupta, Atri Bhattacharyya, Yunho Oh, Abhishek Bhattacharjee, Babak Falsafi, Mathias Payer
2021Revamping Storage Class Memory With Hardware Automated Memory-Over-Storage Solution.
Jie Zhang, Miryeong Kwon, Donghyun Gouk, Sungjoon Koh, Nam Sung Kim, Mahmut Taylan Kandemir, Myoungsoo Jung
2021RingCNN: Exploiting Algebraically-Sparse Ring Tensors for Energy-Efficient CNN-Based Computational Imaging.
Chao-Tsung Huang
2021Ripple: Profile-Guided Instruction Cache Replacement for Data Center Applications.
Tanvir Ahmed Khan, Dexin Zhang, Akshitha Sriraman, Joseph Devietti, Gilles Pokam, Heiner Litz, Baris Kasikci
2021SARA: Scaling a Reconfigurable Dataflow Accelerator.
Yaqi Zhang, Nathan Zhang, Tian Zhao, Matt Vilim, Muhammad Shahbaz, Kunle Olukotun
2021SATORI: Efficient and Fair Resource Partitioning by Sacrificing Short-Term Benefits for Long-Term Gains
Rohan Basu Roy, Tirthak Patel, Devesh Tiwari
2021SPACE: Locality-Aware Processing in Heterogeneous Memory for Personalized Recommendations.
Hongju Kal, Seokmin Lee, Gun Ko, Won Woo Ro
2021Sieve: Scalable In-situ DRAM-based Accelerator Designs for Massively Parallel k-mer Matching.
Lingxi Wu, Rasool Sharifi, Marzieh Lenjani, Kevin Skadron, Ashish Venkat
2021Snafu: An Ultra-Low-Power, Energy-Minimal CGRA-Generation Framework and Architecture.
Graham Gobieski, Ahmet Oguz Atli, Kenneth Mai, Brandon Lucia, Nathan Beckmann
2021Software-Hardware Co-Optimization for Computational Chemistry on Superconducting Quantum Processors.
Gushu Li, Yunong Shi, Ali Javadi-Abhari
2021SpZip: Architectural Support for Effective Data Compression In Irregular Applications.
Yifan Yang, Joel S. Emer, Daniel Sánchez
2021Sparsity-Aware and Re-configurable NPU Architecture for Samsung Flagship Mobile SoC.
Jun-Woo Jang, Sehwan Lee, Dongyoung Kim, Hyunsun Park, Ali Shafiee Ardestani, Yeongjae Choi, Channoh Kim, Yoojin Kim, Hyeongseok Yu, Hamzah Abdel-Aziz, Jun-Seok Park, Heonsoo Lee, Dongwoo Lee, Myeong Woo Kim, Hanwoong Jung, Heewoo Nam, Dongguen Lim, Seungwon Lee, Joon-Ho Song, Suknam Kwon, Joseph Hassoun, Sukhwan Lim, Changkyu Choi
2021Speculative Vectorisation with Selective Replay.
Peng Sun, Giacomo Gabrielli, Timothy M. Jones
2021Superconducting Computing with Alternating Logic Elements.
Georgios Tzimpragos, Jennifer Volk, Alex Wynn, James E. Smith, Timothy Sherwood
2021Supporting Legacy Libraries on Non-Volatile Memory: A User-Transparent Approach.
Chencheng Ye, Yuanchao Xu, Xipeng Shen, Xiaofei Liao, Hai Jin, Yan Solihin
2021TENET: A Framework for Modeling Tensor Dataflow Based on Relation-centric Notation.
Liqiang Lu, Naiqing Guan, Yuyue Wang, Liancheng Jia, Zizhang Luo, Jieming Yin, Jason Cong, Yun Liang
2021Taming the Zoo: The Unified GraphIt Compiler Framework for Novel Architectures.
Ajay Brahmakshatriya, Emily Furst, Victor A. Ying, Claire Hsu, Changwan Hong, Max Ruttenberg, Yunming Zhang, Dai Cheol Jung, Dustin Richmond, Michael B. Taylor, Julian Shun, Mark Oskin, Daniel Sánchez, Saman P. Amarasinghe
2021Ten Lessons From Three Generations Shaped Google's TPUv4i : Industrial Product.
Norman P. Jouppi, Doe Hyun Yoon, Matthew Ashcraft, Mark Gottscho, Thomas B. Jablin, George Kurian, James Laudon, Sheng Li, Peter C. Ma, Xiaoyu Ma, Thomas Norrie, Nishant Patil, Sushma Prasad, Cliff Young, Zongwei Zhou, David A. Patterson
2021TimeCache: Using Time to Eliminate Cache Side Channels when Sharing Software.
Divya Ojha, Sandhya Dwarkadas
2021Unlimited Vector Extension with Data Streaming Support.
Joao Mario Domingos, Nuno Neves, Nuno Roma, Pedro Tomás
2021Vector Runahead.
Ajeya Naithani, Sam Ainsworth, Timothy M. Jones, Lieven Eeckhout
2021ZeRØ: Zero-Overhead Resilient Operation Under Pointer Integrity Attacks.
Mohamed Tarek Ibn Ziad, Miguel A. Arroyo, Evgeny Manzhosov, Simha Sethumadhavan
2021Zero Inclusion Victim: Isolating Core Caches from Inclusive Last-level Cache Evictions.
Mainak Chaudhuri
2021η-LSTM: Co-Designing Highly-Efficient Large LSTM Training via Exploiting Memory-Saving and Architectural Design Opportunities.
Xingyao Zhang, Haojun Xia, Donglin Zhuang, Hao Sun, Xin Fu, Michael B. Taylor, Shuaiwen Leon Song