HPCA A*

70 papers

YearTitle / Authors
2021A Computational Stack for Cross-Domain Acceleration.
Sean Kinzer, Joon Kyung Kim, Soroush Ghodrati, Brahmendra Reddy Yatham, Alric Althoff, Divya Mahajan, Sorin Lerner, Hadi Esmaeilzadeh
2021A Write-Friendly and Fast-Recovery Scheme for Security Metadata in Non-Volatile Memories.
Jianming Huang, Yu Hua
2021Adapt-NoC: A Flexible Network-on-Chip Design for Heterogeneous Manycore Architectures.
Hao Zheng, Ke Wang, Ahmed Louri
2021An Analog Preconditioner for Solving Linear Systems.
Ben Feinberg, Ryan Wong, T. Patrick Xiao, Christopher H. Bennett, Jacob N. Rohan, Erik G. Boman, Matthew J. Marinella, Sapan Agarwal, Engin Ipek
2021Analyzing and Leveraging Decoupled L1 Caches in GPUs.
Mohamed Assem Ibrahim, Onur Kayiran, Yasuko Eckert, Gabriel H. Loh, Adwait Jog
2021Ascend: a Scalable and Unified Architecture for Ubiquitous Deep Neural Network Computing : Industry Track Paper.
Heng Liao, Jiajin Tu, Jing Xia, Hu Liu, Xiping Zhou, Honghui Yuan, Yuxing Hu
2021Automatic Microprocessor Performance Bug Detection.
Erick Carvajal Barboza, Sara Jacob, Mahesh Ketkar, Michael Kishinevsky, Paul Gratz, Jiang Hu
2021BBB: Simplifying Persistent Programming using Battery-Backed Buffers.
Mohammad A. Alshboul, Prakash Ramrakhyani, William Wang, James Tuck, Yan Solihin
2021BRIM: Bistable Resistively-Coupled Ising Machine.
Richard Afoakwa, Yiqiao Zhang, Uday Kumar Reddy Vengalam, Zeljko Ignjatovic, Michael C. Huang
2021BlockHammer: Preventing RowHammer at Low Cost by Blacklisting Rapidly-Accessed DRAM Rows.
Abdullah Giray Yaglikçi, Minesh Patel, Jeremie S. Kim, Roknoddin Azizi, Ataberk Olgun, Lois Orosa, Hasan Hassan, Jisung Park, Konstantinos Kanellopoulos, Taha Shahroodi, Saugata Ghose, Onur Mutlu
2021BoomGate: Deadlock Avoidance in Non-Minimal Routing for High-Radix Networks.
Gyuyoung Kwauk, Seungkwan Kang, Hans Kasan, Hyojun Son, John Kim
2021CAPE: A Content-Addressable Processing Engine.
Helena Caminal, Kailin Yang, Srivatsa Srinivasa, Akshay Krishna Ramanathan, Khalid Al-Hawaj, Tianshu Wu, Vijaykrishnan Narayanan, Christopher Batten, José F. Martínez
2021CARE: Coordinated Augmentation for Elastic Resilience on DRAM Errors in Data Centers.
Jian Chen, Xiaowei Jiang, Ying Zhang, Liyin Liu, Huifeng Xu, Qiang Liu
2021CHOPIN: Scalable Graphics Rendering in Multi-GPU Systems via Parallel Image Composition.
Xiaowei Ren, Mieszko Lis
2021CSCNN: Algorithm-hardware Co-design for CNN Accelerators using Centrosymmetric Filters.
Jiajun Li, Ahmed Louri, Avinash Karanth, Razvan C. Bunescu
2021Chasing Carbon: The Elusive Environmental Footprint of Computing.
Udit Gupta, Young Geun Kim, Sylvia Lee, Jordan Tse, Hsien-Hsin S. Lee, Gu-Yeon Wei, David Brooks, Carole-Jean Wu
2021Cheetah: Optimizing and Accelerating Homomorphic Encryption for Private Inference.
Brandon Reagen, Wooseok Choi, Yeongil Ko, Vincent T. Lee, Hsien-Hsin S. Lee, Gu-Yeon Wei, David Brooks
2021Common Counters: Compressed Encryption Counters for Secure GPU Memory.
Seonjin Na, Sunho Lee, Yeonjae Kim, Jongse Park, Jaehyuk Huh
2021DeACT: Architecture-Aware Virtual Memory Support for Fabric Attached Memory Systems.
Vamsee Reddy Kommareddy, Clayton Hughes, Simon David Hammond, Amro Awad
2021Dead Page and Dead Block Predictors: Cleaning TLBs and Caches Together.
Chandrashis Mazumdar, Prachatos Mitra, Arkaprava Basu
2021Deadline-Aware Offloading for High-Throughput Accelerators.
Tsung Tai Yeh, Matthew D. Sinclair, Bradford M. Beckmann, Timothy G. Rogers
2021DepGraph: A Dependency-Driven Accelerator for Efficient Iterative Graph Processing.
Yu Zhang, Xiaofei Liao, Hai Jin, Ligang He, Bingsheng He, Haikun Liu, Lin Gu
2021Designing a Cost-Effective Cache Replacement Policy using Machine Learning.
Subhash Sethumurugan, Jieming Yin, John Sartori
2021EXMA: A Genomics Accelerator for Exact-Matching.
Lei Jiang, Farzaneh Zokaee
2021Eudoxus: Characterizing and Accelerating Localization in Autonomous Machines Industry Track Paper.
Yiming Gan, Bo Yu, Boyuan Tian, Leimeng Xu, Wei Hu, Shaoshan Liu, Qiang Liu, Yanjun Zhang, Jie Tang, Yuhao Zhu
2021FAFNIR: Accelerating Sparse Gathering by Using Efficient Near-Memory Intelligent Reduction.
Bahar Asgari, Ramyad Hadidi, Jiashen Cao, Da Eun Shim, Sung Kyu Lim, Hyesoon Kim
2021Faster Schrödinger-style simulation of quantum circuits.
Aneeqa Fatima, Igor L. Markov
2021FuseKNA: Fused Kernel Convolution based Accelerator for Deep Neural Networks.
Jianxun Yang, Zhao Zhang, Zhuangzhi Liu, Jing Zhou, Leibo Liu, Shaojun Wei, Shouyi Yin
2021GCNAX: A Flexible and Energy-efficient Accelerator for Graph Convolutional Neural Networks.
Jiajun Li, Ahmed Louri, Avinash Karanth, Razvan C. Bunescu
2021GSSA: A Resource Allocation Scheme Customized for 3D NAND SSDs.
Chun-Yi Liu, Yunju Lee, Wonil Choi, Myoungsoo Jung, Mahmut Taylan Kandemir, Chita R. Das
2021GradPIM: A Practical Processing-in-DRAM Architecture for Gradient Descent.
Heesu Kim, Hanmin Park, Taehyun Kim, Kwanheum Cho, Eojin Lee, Soojung Ryu, Hyuk-Jae Lee, Kiyoung Choi, Jinho Lee
2021Hardware-Based Address-Centric Acceleration of Key-Value Store.
Chencheng Ye, Yuanchao Xu, Xipeng Shen, Xiaofei Liao, Hai Jin, Yan Solihin
2021Heat Behind the Meter: A Hidden Threat of Thermal Attacks in Edge Colocation Data Centers.
Zhihui Shao, Mohammad A. Islam, Shaolei Ren
2021Heterogeneous Dataflow Accelerators for Multi-DNN Workloads.
Hyoukjun Kwon, Liangzhen Lai, Michael Pellauer, Tushar Krishna, Yu-Hsin Chen, Vikas Chandra
2021IEEE International Symposium on High-Performance Computer Architecture, HPCA 2021, Seoul, South Korea, February 27 - March 3, 2021
2021Improving GPU Multi-tenancy with Page Walk Stealing.
B Pratheek, Neha Jawalkar, Arkaprava Basu
2021LIBRA: Clearing the Cloud Through Dynamic Memory Bandwidth Management.
Ying Zhang, Jian Chen, Xiaowei Jiang, Qiang Liu, Ian M. Steiner, Andrew J. Herdrich, Kevin Shu, Ripan Das, Long Cui, Litrin Jiang
2021Layerweaver: Maximizing Resource Utilization of Neural Processing Units via Layer-Wise Scheduling.
Young H. Oh, Seonghak Kim, Yunho Jin, Sam Son, Jonghyun Bae, Jongsung Lee, Yeonhong Park, Dong Uk Kim, Tae Jun Ham, Jae W. Lee
2021Lazy Batching: An SLA-aware Batching System for Cloud Machine Learning Inference.
Yujeong Choi, Yunseong Kim, Minsoo Rhu
2021Memristive Data Ranking.
Ananth Krishna Prasad, Morteza Rezaalipour, Masoud Dehyadegari, Mahdi Nazm Bojnordi
2021Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework.
Sung-En Chang, Yanyu Li, Mengshu Sun, Runbin Shi, Hayden K. H. So, Xuehai Qian, Yanzhi Wang, Xue Lin
2021Need for Speed: Experiences Building a Trustworthy System-Level GPU Simulator.
Oreste Villa, Daniel Lustig, Zi Yan, Evgeny Bolotin, Yaosheng Fu, Niladrish Chatterjee, Nan Jiang, David W. Nellans
2021NeuroMeter: An Integrated Power, Area, and Timing Modeling Framework for Machine Learning Accelerators Industry Track Paper.
Tianqi Tang, Sheng Li, Lifeng Nai, Norman P. Jouppi, Yuan Xie
2021New Models for Understanding and Reasoning about Speculative Execution Attacks.
Zecheng He, Guangyuan Hu, Ruby B. Lee
2021Operating Liquid-Cooled Large-Scale Systems: Long-Term Monitoring, Reliability Analysis, and Efficiency Measures.
Rohan Basu Roy, Tirthak Patel, Raj Kettimuthu, William E. Allcock, Paul Rich, Adam Scovel, Devesh Tiwari
2021P-OPT: Practical Optimal Cache Replacement for Graph Analytics.
Vignesh Balaji, Neal Clayton Crago, Aamer Jaleel, Brandon Lucia
2021ParaDox: Eliminating Voltage Margins via Heterogeneous Fault Tolerance.
Sam Ainsworth, Lionel Zoubritzky, Alan Mycroft, Timothy M. Jones
2021Pitstop: Enabling a Virtual Network Free Network-on-Chip.
Hossein Farrokhbakht, Henry Kao, Kamran Hasan, Paul V. Gratz, Tushar Krishna, Joshua San Miguel, Natalie D. Enright Jerger
2021Prodigy: Improving the Memory Latency of Data-Indirect Irregular Workloads Using Hardware-Software Co-Design.
Nishil Talati, Kyle May, Armand Behroozi, Yichen Yang, Kuba Kaszyk, Christos Vasiladiotis, Tarunesh Verma, Lu Li, Brandon Nguyen, Jiawen Sun, John Magnus Morton, Agreen Ahmadi, Todd M. Austin, Michael F. P. O'Boyle, Scott A. Mahlke, Trevor N. Mudge, Ronald G. Dreslinski
2021QEI: Query Acceleration Can be Generic and Efficient in the Cloud.
Yifan Yuan, Yipeng Wang, Ren Wang, Rangeen Basu Roy Chowdhury, Charlie Tai, Nam Sung Kim
2021QuCloud: A New Qubit Mapping Mechanism for Multi-programming Quantum Computing in Cloud Environment.
Lei Liu, Xinglei Dou
2021Revisiting HyperDimensional Learning for FPGA and Low-Power Architectures.
Mohsen Imani, Zhuowen Zou, Samuel Bosch, Sanjay Anantha Rao, Sahand Salamat, Venkatesh Kumar, Yeseong Kim, Tajana Rosing
2021SPAGHETTI: Streaming Accelerators for Highly Sparse GEMM on FPGAs.
Reza Hojabr, Ali Sedaghati, Amirali Sharifian, Ahmad Khonsari, Arrvindh Shriraman
2021Sentinel: Efficient Tensor Migration and Allocation on Heterogeneous Memory Systems for Deep Learning.
Jie Ren, Jiaolin Luo, Kai Wu, Minjia Zhang, Hyeran Jeon, Dong Li
2021SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning.
Hanrui Wang, Zhekai Zhang, Song Han
2021SpaceA: Sparse Matrix Vector Multiplication on Processing-in-Memory Accelerator.
Xinfeng Xie, Zheng Liang, Peng Gu, Abanti Basak, Lei Deng, Ling Liang, Xing Hu, Yuan Xie
2021Stealth-Persist: Architectural Support for Persistent Applications in Hybrid Memory Systems.
Mazen Al-Wadi, Vamsee Reddy Kommareddy, Clayton Hughes, Simon David Hammond, Amro Awad
2021Stream Floating: Enabling Proactive and Decentralized Cache Optimizations.
Zhengrong Wang, Jian Weng, Jason Lowe-Power, Jayesh Gaur, Tony Nowatzki
2021Streamline Ring ORAM Accesses through Spatial and Temporal Optimization.
Dingyuan Cao, Mingzhe Zhang, Hang Lu, Xiaochun Ye, Dongrui Fan, Yuezhi Che, Rujia Wang
2021SynCron: Efficient Synchronization Support for Near-Data-Processing Architectures.
Christina Giannoula, Nandita Vijaykumar, Nikela Papadopoulou, Vasileios Karakostas, Ivan Fernandez, Juan Gómez-Luna, Lois Orosa, Nectarios Koziris, Georgios I. Goumas, Onur Mutlu
2021Systematic Approaches for Precise and Approximate Quantum State Runtime Assertion.
Ji Liu, Huiyang Zhou
2021TILT: Achieving Higher Fidelity on a Trapped-Ion Linear-Tape Quantum Computing Architecture.
Xin-Chuan Wu, Dripto M. Debroy, Yongshan Ding, Jonathan M. Baker, Yuri Alexeev, Kenneth R. Brown, Frederic T. Chong
2021TSOPER: Efficient Coherence-Based Strict Persistency.
Per Ekemark, Yuan Yao, Alberto Ros, Konstantinos Sagonas, Stefanos Kaxiras
2021Tensor Casting: Co-Designing Algorithm-Architecture for Personalized Recommendation Training.
Youngeun Kwon, Yunjae Lee, Minsoo Rhu
2021Trident: A Hybrid Correlation-Collision GPU Cache Timing Attack for AES Key Recovery.
Jaeguk Ahn, Cheolgyu Jin, Jiho Kim, Minsoo Rhu, Yunsi Fei, David R. Kaeli, John Kim
2021Ultra-Elastic CGRAs for Irregular Loop Specialization.
Christopher Torng, Peitian Pan, Yanghui Ou, Cheng Tan, Christopher Batten
2021Understanding Training Efficiency of Deep Learning Recommendation Models at Scale.
Bilge Acun, Matthew Murphy, Xiaodong Wang, Jade Nie, Carole-Jean Wu, Kim M. Hazelwood
2021VIA: A Smart Scratchpad for Vector Units with Application to Sparse Matrix Computations.
Julian Pavon, Iván Vargas Valdivieso, Adrián Barredo, Joan Marimon, Miquel Moretó, Francesc Moll, Osman S. Unsal, Mateo Valero, Adrián Cristal
2021WiDir: A Wireless-Enabled Directory Cache Coherence Protocol.
Antonio Franques, Apostolos Kokolis, Sergi Abadal, Vimuth Fernando, Sasa Misailovic, Josep Torrellas
2021Zero Directory Eviction Victim: Unbounded Coherence Directory and Core Cache Isolation.
Mainak Chaudhuri