HPCA A*

86 papers

YearTitle / Authors
2022AFS: Accurate, Fast, and Scalable Error-Decoding for Fault-Tolerant Quantum Computers.
Poulami Das, Christopher A. Pattison, Srilatha Manne, Douglas M. Carmean, Krysta M. Svore, Moinuddin K. Qureshi, Nicolas Delfosse
2022AI-Enabling Workloads on Large-Scale GPU-Accelerated System: Characterization, Opportunities, and Implications.
Baolin Li, Rohin Arora, Siddharth Samsi, Tirthak Patel, William Arcand, David Bestor, Chansup Byun, Rohan Basu Roy, Bill Bergeron, John T. Holodnak, Michael Houle, Matthew Hubbell, Michael Jones, Jeremy Kepner, Anna Klein, Peter Michaleas, Joseph McDonald, Lauren Milechin, Julie Mullen, Andrew Prout, Benjamin Price, Albert Reuther, Antonio Rosa, Matthew L. Weiss, Charles Yee, Daniel Edelman, Allan Vanterpool, Anson Cheng, Vijay Gadepally, Devesh Tiwari
2022ANNA: Specialized Architecture for Approximate Nearest Neighbor Search.
Yejin Lee, Hyunji Choi, Sunhong Min, Hyunseung Lee, Sangwon Beak, Dawoon Jeong, Jae W. Lee, Tae Jun Ham
2022ASAP: A Speculative Approach to Persistence.
Sujay Yadalam, Nisarg Shah, Xiangyao Yu, Michael M. Swift
2022Abusing Cache Line Dirty States to Leak Information in Commercial Processors.
Yujie Cui, Chun Yang, Xu Cheng
2022Accelerating Graph Convolutional Networks Using Crossbar-based Processing-In-Memory Architectures.
Yu Huang, Long Zheng, Pengcheng Yao, Qinggang Wang, Xiaofei Liao, Hai Jin, Jingling Xue
2022Adaptable Register File Organization for Vector Processors.
Cristóbal Ramírez Lazo, Enrico Reggiani, Carlos Rojas Morales, Roger Figueras Bagué, Luis A. Villa Vargas, Marco Antonio Ramírez Salinas, Mateo Valero Cortés, Osman Sabri Unsal, Adrián Cristal
2022Adaptive Security Support for Heterogeneous Memory on GPUs.
Shougang Yuan, Amro Awad, Ardhi Wiratama Baskara Yudha, Yan Solihin, Huiyang Zhou
2022Application Defined On-chip Networks for Heterogeneous Chiplets: An Implementation Perspective.
Tianqi Wang, Fan Feng, Shaolin Xiang, Qi Li, Jing Xia
2022Atomic Dataflow based Graph-Level Workload Orchestration for Scalable DNN Accelerators.
Shixuan Zheng, Xianjue Zhang, Leibo Liu, Shaojun Wei, Shouyi Yin
2022CAMA: Energy and Memory Efficient Automata Processing in Content-Addressable Memories.
Yi Huang, Zhiyu Chen, Dai Li, Kaiyuan Yang
2022CANDLES: Channel-Aware Novel Dataflow-Microarchitecture Co-Design for Low Energy Sparse Neural Network Acceleration.
Sumanth Gudaparthi, Sarabjeet Singh, Surya Narayanan, Rajeev Balasubramonian, Visvesh Sathe
2022Compiler-Driven Simulation of Reconfigurable Hardware Accelerators.
Zhijing Li, Yuwei Ye, Stephen Neuendorffer, Adrian Sampson
2022CoopMC: Algorithm-Architecture Co-Optimization for Markov Chain Monte Carlo Accelerators.
Yuji Chai, Glenn G. Ko, Wei-Te Mark Ting, Luke Bailey, David Brooks, Gu-Yeon Wei
2022Cottage: Coordinated Time Budget Assignment for Latency, Quality and Power Optimization in Web Search.
Liang Zhou, Laxmi N. Bhuyan, K. K. Ramakrishnan
2022DPrime+DAbort: A High-Precision and Timer-Free Directory-Based Side-Channel Attack in Non-Inclusive Cache Hierarchies using Intel TSX.
Sowoong Kim, Myeonggyun Han, Woongki Baek
2022DR-STRaNGe: End-to-End System Design for DRAM-based True Random Number Generators.
F. Nisa Bostanci, Ataberk Olgun, Lois Orosa, Abdullah Giray Yaglikçi, Jeremie S. Kim, Hasan Hassan, Oguz Ergin, Onur Mutlu
2022DRIPS: Dynamic Rebalancing of Pipelined Streaming Applications on CGRAs.
Cheng Tan, Nicolas Bohm Agostini, Tong Geng, Chenhao Xie, Jiajia Li, Ang Li, Kevin J. Barker, Antonino Tumeo
2022DarkGates: A Hybrid Power-Gating Architecture to Mitigate the Performance Impact of Dark-Silicon in High Performance Processors.
Jawad Haj-Yahya, Jeremie S. Kim, Abdullah Giray Yaglikçi, Jisung Park, Efraim Rotem, Yanos Sazeides, Onur Mutlu
2022Delegated Replies: Alleviating Network Clogging in Heterogeneous Architectures.
Xia Zhao, Lieven Eeckhout, Magnus Jahre
2022Detecting Qubit-coupling Faults in Ion-trap Quantum Computers.
Andrii Maksymov, Jason Nguyen, Vandiver Chaplin, Yun Seong Nam, Igor L. Markov
2022DigiQ: A Scalable Digital Controller for Quantum Computers Using SFQ Logic.
Mohammad Reza Jokar, Richard Rines, Ghasem Pasandi, Haolin Cong, Adam Holmes, Yunong Shi, Massoud Pedram, Frederic T. Chong
2022Direct Spatial Implementation of Sparse Matrix Multipliers for Reservoir Computing.
Matthew Denton, Herman Schmit
2022Effective Mimicry of Belady's MIN Policy.
Ishan Shah, Akanksha Jain, Calvin Lin
2022Efficient Bad Block Management with Cluster Similarity.
Jui-Nan Yen, Yao-Ching Hsieh, Cheng-Yu Chen, Tseng-Yi Chen, Chia-Lin Yang, Hsiang-Yun Cheng, Yixin Luo
2022Enabling Efficient Large-Scale Deep Learning Training with Cache Coherent Disaggregated Memory Systems.
Zixuan Wang, Joonseop Sim, Euicheol Lim, Jishen Zhao
2022Enabling High-Quality Uncertainty Quantification in a PIM Designed for Bayesian Neural Network.
Xingchen Li, Bingzhe Wu, Guangyu Sun, Zhe Zhang, Zhihang Yuan, Runsheng Wang, Ru Huang, Dimin Niu, Hongzhong Zheng, Zhichao Lu, Liang Zhao, Meng-Fan Marvin Chang, Tianchan Guan, Xin Si
2022Exploiting Inter-block Entropy to Enhance the Compressibility of Blocks with Diverse Data.
Jinkwon Kim, Mincheol Kang, Jeongkyu Hong, Soontae Kim
2022FAST: DNN Training Under Variable Precision Block Floating Point with Stochastic Rounding.
Sai Qian Zhang, Bradley McDanel, H. T. Kung
2022FastTrackNoC: A NoC with FastTrack Router Datapaths.
Ahsen Ejaz, Ioannis Sourdis
2022Filesystem Encryption or Direct-Access for NVM Filesystems? Let's Have Both!
Kazi Abu Zubair, David Mohaisen, Amro Awad
2022GBDI: Going Beyond Base-Delta-Immediate Compression with Global Bases.
Alexandra Angerd, Angelos Arelakis, Vasilis Spiliopoulos, Erik Sintorn, Per Stenström
2022GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design.
Haoran You, Tong Geng, Yongan Zhang, Ang Li, Yingyan Lin
2022GPU Subwarp Interleaving.
Sana Damani, Mark Stephenson, Ram Rangan, Daniel R. Johnson, Rishkul Kulkami, Stephen W. Keckler
2022Griffin: Rethinking Sparse Optimization for Deep Learning Architectures.
Jong Hoon Shin, Ali Shafiee, Ardavan Pedram, Hamzah Abdel-Aziz, Ling Li, Joseph Hassoun
2022HD-CPS: Hardware-assisted Drift-aware Concurrent Priority Scheduler for Shared Memory Multicores.
Mohsin Shan, Omer Khan
2022Hardware-Accelerated Hypergraph Processing with Chain-Driven Scheduling.
Qinggang Wang, Long Zheng, Jingrui Yuan, Yu Huang, Pengcheng Yao, Chuangyi Gui, Ao Hu, Xiaofei Liao, Hai Jin
2022Hercules: Heterogeneity-Aware Inference Serving for At-Scale Personalized Recommendation.
Liu Ke, Udit Gupta, Mark Hempstead, Carole-Jean Wu, Hsien-Hsin S. Lee, Xuan Zhang
2022HeteroGen: Automatic Synthesis of Heterogeneous Cache Coherence Protocols.
Nicolai Oswald, Vijay Nagarajan, Daniel J. Sorin, Vasilis Gavrielatos, Theo Olausson, Reece Carr
2022HiPerRF: A Dual-Bit Dense Storage SFQ Register File.
Haipeng Zha, Naveen Kumar Katam, Massoud Pedram, Murali Annavaram
2022HyBP: Hybrid Isolation-Randomization Secure Branch Predictor.
Lutan Zhao, Peinan Li, Rui Hou, Michael C. Huang, Xuehai Qian, Lixin Zhang, Dan Meng
2022IEEE International Symposium on High-Performance Computer Architecture, HPCA 2022, Seoul, South Korea, April 2-6, 2022
2022IR-ORAM: Path Access Type Based Memory Intensity Reduction for Path-ORAM.
Mehrnoosh Raoufi, Youtao Zhang, Jun Yang
2022Improving Locality of Irregular Updates with Hardware Assisted Propagation Blocking.
Vignesh Balaji, Brandon Lucia
2022LISA: Graph Neural Network based Portable Mapping on Spatial Accelerators.
Zhaoying Li, Dan Wu, Dhananjaya Wijerathne, Tulika Mitra
2022Leaky Frontends: Security Vulnerabilities in Processor Frontends.
Shuwen Deng, Bowen Huang, Jakub Szefer
2022LoopPoint: Checkpoint-driven Sampled Simulation for Multi-threaded Applications.
Alen Sabu, Harish Patil, Wim Heirman, Trevor E. Carlson
2022MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple Accelerator Cores.
Sheng-Chun Kao, Tushar Krishna
2022MULTI-CLOCK: Dynamic Tiering for Hybrid Memory Systems.
Adnan Maruf, Ashikee Ghosh, Janki Bhimani, Daniel Campello, Andy Rudoff, Raju Rangaswami
2022Mithril: Cooperative Row Hammer Protection on Commodity DRAM Leveraging Managed Refresh.
Michael Jaemin Kim, Jaehyun Park, Yeonhong Park, Wanju Doh, Namhoon Kim, Tae Jun Ham, Jae W. Lee, Jung Ho Ahn
2022NVMExplorer: A Framework for Cross-Stack Comparisons of Embedded Non-Volatile Memories.
Lillian Pentecost, Alexander Hankin, Marco Donato, Mark Hempstead, Gu-Yeon Wei, David Brooks
2022Near-Stream Computing: General and Transparent Near-Cache Acceleration.
Zhengrong Wang, Jian Weng, Sihao Liu, Tony Nowatzki
2022NeuroSync: A Scalable and Accurate Brain Simulator Using Safe and Efficient Speculation.
Hunjun Lee, Chanmyeong Kim, Minseop Kim, Yujin Chung, Jangwoo Kim
2022Not All SWAPs Have the Same Cost: A Case for Optimization-Aware Qubit Routing.
Ji Liu, Peiyi Li, Huiyang Zhou
2022Only Buffer When You Need To: Reducing On-chip GPU Traffic with Reconfigurable Local Atomic Buffers.
Preyesh Dalmia, Rohan Mahapatra, Matthew D. Sinclair
2022PIMCloud: QoS-Aware Resource Management of Latency-Critical Applications in Clouds with Processing-in-Memory.
Shuang Chen, Yi Jiang, Christina Delimitrou, José F. Martínez
2022Parallel Time Batching: Systolic-Array Acceleration of Sparse Spiking Neural Computation.
Jeong-Jun Lee, Wenrui Zhang, Peng Li
2022Q-GPU: A Recipe of Optimizations for Quantum Circuit Simulation Using GPUs.
Yilun Zhao, Yanan Guo, Yuan Yao, Amanda Dumi, Devin M. Mulvey, Shiv Upadhyay, Youtao Zhang, Kenneth D. Jordan, Jun Yang, Xulong Tang
2022QULATIS: A Quantum Error Correction Methodology toward Lattice Surgery.
Yosuke Ueno, Masaaki Kondo, Masamitsu Tanaka, Yasunari Suzuki, Yutaka Tabuchi
2022QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits.
Hanrui Wang, Yongshan Ding, Jiaqi Gu, Yujun Lin, David Z. Pan, Frederic T. Chong, Song Han
2022RM-SSD: In-Storage Computing for Large-Scale Recommendation Inference.
Xuan Sun, Hu Wan, Qiao Li, Chia-Lin Yang, Tei-Wei Kuo, Chun Jason Xue
2022ReGNN: A Redundancy-Eliminated Graph Neural Networks Accelerator.
Cen Chen, Kenli Li, Yangfan Li, Xiaofeng Zou
2022ReTail: Opting for Learning Simplicity to Enable QoS-Aware Power Management in the Cloud.
Shuang Chen, Angela Jin, Christina Delimitrou, José F. Martínez
2022Reducing Load Latency with Cache Level Prediction.
Majid Jalili, Mattan Erez
2022Reliability-Aware Runahead.
Ajeya Naithani, Lieven Eeckhout
2022S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN Acceleration.
Zhi Gang Liu, Paul N. Whatmough, Yuhao Zhu, Matthew Mattina
2022SPACX: Silicon Photonics-based Scalable Chiplet Accelerator for DNN Inference.
Yuan Li, Ahmed Louri, Avinash Karanth
2022SafeGuard: Reducing the Security Risk from Row-Hammer via Low-Cost Integrity Protection.
Ali Fakhrzadehgan, Yale N. Patt, Prashant J. Nair, Moinuddin K. Qureshi
2022Saving PAM4 Bus Energy with SMOREs: Sparse Multi-level Opportunistic Restricted Encodings.
Mike O'Connor, Donghyuk Lee, Niladrish Chatterjee, Michael B. Sullivan, Stephen W. Keckler
2022ScalaGraph: A Scalable Accelerator for Massively Parallel Graph Processing.
Pengcheng Yao, Long Zheng, Yu Huang, Qinggang Wang, Chuangyi Gui, Zhen Zeng, Xiaofei Liao, Hai Jin, Jingling Xue
2022ScaleHLS: A New Scalable High-Level Synthesis Framework on Multi-Level Intermediate Representation.
Hanchen Ye, Cong Hao, Jianyi Cheng, Hyunmin Jeong, Jack Huang, Stephen Neuendorffer, Deming Chen
2022SecNDP: Secure Near-Data Processing with Untrusted Memory.
Wenjie Xiong, Liu Ke, Dimitrije Jankov, Michael Kounavis, Xiaochen Wang, Eric Northup, Jie Amy Yang, Bilge Acun, Carole-Jean Wu, Ping Tak Peter Tang, G. Edward Suh, Xuan Zhang, Hsien-Hsin S. Lee
2022Stay in your Lane: A NoC with Low-overhead Multi-packet Bypassing.
Hossein Farrokhbakht, Paul V. Gratz, Tushar Krishna, Joshua San Miguel, Natalie D. Enright Jerger
2022SupermarQ: A Scalable Quantum Benchmark Suite.
Teague Tomesh, Pranav Gokhale, Victory Omole, Gokul Subramanian Ravi, Kaitlin N. Smith, Joshua Viszlai, Xin-Chuan Wu, Nikos Hardavellas, Margaret Martonosi, Frederic T. Chong
2022TCOR: A Tile Cache with Optimal Replacement.
Diya Joseph, Juan L. Aragón, Joan-Manuel Parcerisa, Antonio González
2022TNPU: Supporting Trusted Execution with Tree-less Integrity Protection for Neural Processing Unit.
Sunho Lee, Jungwoo Kim, Seonjin Na, Jongse Park, Jaehyuk Huh
2022Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS.
Han Zhao, Weihao Cui, Quan Chen, Youtao Zhang, Yanchao Lu, Chao Li, Jingwen Leng, Minyi Guo
2022Temporal Exposure Reduction Protection for Persistent Memory.
Yuanchao Xu, Chencheng Ye, Xipeng Shen, Yan Solihin
2022The Specialized High-Performance Network on Anton 3.
Keun Sup Shim, Brian Greskamp, Brian Towles, Bruce Edwards, J. P. Grossman, David E. Shaw
2022TransPIM: A Memory-based Acceleration via Software-Hardware Co-Design for Transformer.
Minxuan Zhou, Weihong Xu, Jaeyoung Kang, Tajana Rosing
2022Upward Packet Popup for Deadlock Freedom in Modular Chiplet-Based Systems.
Yibo Wu, Liang Wang, Xiaohang Wang, Jie Han, Jianfeng Zhu, Honglan Jiang, Shouyi Yin, Shaojun Wei, Leibo Liu
2022Using Psychophysics to Guide Power Adaptation for Input Methods on Mobile Architectures.
Xueliang Li, Shicong Hong, Junyang Chen, Guihai Yan, Kaishun Wu
2022VAQEM: A Variational Approach to Quantum Error Mitigation.
Gokul Subramanian Ravi, Kaitlin N. Smith, Pranav Gokhale, Andrea Mari, Nathan Earnest, Ali Javadi-Abhari, Frederic T. Chong
2022Virtual Coset Coding for Encrypted Non-Volatile Memories with Multi-Level Cells.
Stephen Longofono, Seyed Mohammad Seyedzadeh, Alex K. Jones
2022uSystolic: Byte-Crawling Unary Systolic Array.
Di Wu, Joshua San Miguel
2022unXpec: Breaking Undo-based Safe Speculation.
Mengming Li, Chenlu Miao, Yilong Yang, Kai Bu