HPCA A*

95 papers

Year Title / Authors
2023 A Pulse Generation Framework with Augmented Program-aware Basis Gates and Criticality Analysis.
Yan-Hao Chen, Yuwei Jin, Fei Hua, Ari B. Hayes, Ang Li, Yunong Shi, Eddy Z. Zhang
2023 A Scalable Methodology for Designing Efficient Interconnection Network of Chiplets.
Yinxiao Feng, Dong Xiang, Kaisheng Ma
2023 A Storage-Effective BTB Organization for Servers.
Truls Asheim, Boris Grot, Rakesh Kumar
2023 A Systematic Study of DDR4 DRAM Faults in the Field.
Majed Valad Beigi, Yi Cao, Sudhanva Gurumurthi, Charles Recchia, Andrew C. Walton, Vilas Sridharan
2023 AB-ORAM: Constructing Adjustable Buckets for Space Reduction in Ring ORAM.
Mehrnoosh Raoufi, Jun Yang, Xulong Tang, Youtao Zhang
2023 ACIC: Admission-Controlled Instruction Cache.
Yunjin Wang, Chia-Hao Chang, Anand Sivasubramaniam, Niranjan Soundararajan
2023 AVGI: Microarchitecture-Driven, Fast and Accurate Vulnerability Assessment.
George Papadimitriou, Dimitris Gizopoulos
2023 Adrias: Interference-Aware Memory Orchestration for Disaggregated Cloud Infrastructures.
Dimosthenis Masouros, Christian Pinto, Michele Gazzetti, Sotirios Xydis, Dimitrios Soudris
2023 Ah-Q: Quantifying and Handling the Interference within a Datacenter from a System Perspective.
Yuhang Liu, Xin Deng, Jiapeng Zhou, Mingyu Chen, Yungang Bao
2023 Are Randomized Caches Truly Random? Formal Analysis of Randomized-Partitioned Caches.
Anirban Chakraborty, Sarani Bhattacharya, Sayandeep Saha, Debdeep Mukhopadhyay
2023 AstriFlash: A Flash-Based System for Online Services.
Siddharth Gupta, Yunho Oh, Lei Yan, Mark Sutherland, Abhishek Bhattacharjee, Babak Falsafi, Peter Hsu
2023 AutoCAT: Reinforcement Learning for Automated Exploration of Cache-Timing Attacks.
Mulong Luo, Wenjie Xiong, Geunbae Lee, Yueying Li, Xiaomeng Yang, Amy Zhang, Yuandong Tian, Hsien-Hsin S. Lee, G. Edward Suh
2023 BM-Store: A Transparent and High-performance Local Storage Architecture for Bare-metal Clouds Enabling Large-scale Deployment.
Yiquan Chen, Jiexiong Xu, Chengkun Wei, Yijing Wang, Xin Yuan, Yangming Zhang, Xulin Yu, Yi Chen, Zeke Wang, Shuibing He, Wenzhi Chen
2023 Baryon: Efficient Hybrid Memory Management with Compression and Sub-Blocking.
Yiwei Li, Mingyu Gao
2023 CARE: A Concurrency-Aware Enhanced Lightweight Cache Management Framework.
Xiaoyang Lu, Rujia Wang, Xian-He Sun
2023 CEGMA: Coordinated Elastic Graph Matching Acceleration for Graph Matching Networks.
Yue Dai, Youtao Zhang, Xulong Tang
2023 CHOPPER: A Compiler Infrastructure for Programmable Bit-serial SIMD Processing Using Memory in DRAM.
Xiangjun Peng, Yaohua Wang, Ming-Chang Yang
2023 CTA: Hardware-Software Co-design for Compressed Token Attention Mechanism.
Haoran Wang, Haobo Xu, Ying Wang, Yinhe Han
2023 Chimera: An Analytical Optimizing Framework for Effective Compute-intensive Operators Fusion.
Size Zheng, Siyuan Chen, Peidi Song, Renze Chen, Xiuhong Li, Shengen Yan, Dahua Lin, Jingwen Leng, Yun Liang
2023 Co-Designed Architectures for Modular Superconducting Quantum Computers.
Evan McKinney, Mingkang Xia, Chao Zhou, Pinlei Lu, Michael Hatridge, Alex K. Jones
2023 Compression-Aware and Performance-Efficient Insertion Policies for Long-Lasting Hybrid LLCs.
Carlos Escuin, Asif Ali Khan, Pablo Ibáñez, Teresa Monreal, Jerónimo Castrillón, Víctor Viñals
2023 D-Shield: Enabling Processor-side Encryption and Integrity Verification for Secure NVMe Drives.
Md Hafizul Islam Chowdhuryy, Myoungsoo Jung, Fan Yao, Amro Awad
2023 DIMM-Link: Enabling Efficient Inter-DIMM Communication for Near-Memory Processing.
Zhe Zhou, Cong Li, Fan Yang, Guangyu Sun
2023 Dalorex: A Data-Local Program Execution and Architecture for Memory-bound Applications.
Marcelo Orenes-Vera, Esin Tureci, David Wentzlaff, Margaret Martonosi
2023 DeFiNES: Enabling Fast Exploration of the Depth-first Scheduling Space for DNN Accelerators through Analytical Modeling.
Linyan Mei, Koen Goetschalckx, Arne Symons, Marian Verhelst
2023 Duet: Creating Harmony between Processors and Embedded FPGAs.
Ang Li, August Ning, David Wentzlaff
2023 ESD: An ECC-assisted and Selective Deduplication for Encrypted Non-Volatile Main Memory.
Chunfeng Du, Suzhen Wu, Jiapeng Wu, Bo Mao, Shengzhe Wang
2023 EVE: Ephemeral Vector Engines.
Khalid Al-Hawaj, Tuan Ta, Nick Cebry, Shady Agwa, Olalekan Afuye, Eric Hall, Courtney Golden, Alyssa B. Apsel, Christopher Batten
2023 Efficient Distributed Secure Memory with Migratable Merkle Tree.
Erhu Feng, Dong Du, Yubin Xia, Haibo Chen
2023 Efficient Supernet Training Using Path Parallelism.
Ying Xu, Long Cheng, Xuyi Cai, Xiaohan Ma, Weiwei Chen, Lei Zhang, Ying Wang
2023 FAB: An FPGA-based Accelerator for Bootstrappable Fully Homomorphic Encryption.
Rashmi Agrawal, Leo de Castro, Guowei Yang, Chiraag Juvekar, Rabia Tugce Yazicigil, Anantha P. Chandrakasan, Vinod Vaikuntanathan, Ajay Joshi
2023 FinePack: Transparently Improving the Efficiency of Fine-Grained Transfers in Multi-GPU Systems.
Harini Muthukrishnan, Daniel Lustig, Oreste Villa, Thomas F. Wenisch, David W. Nellans
2023 FlowGNN: A Dataflow Architecture for Real-Time Workload-Agnostic Graph Neural Network Inference.
Rishov Sarkar, Stefan Abi-Karam, Yuqi He, Lakshmi Sathidevi, Cong Hao
2023 FxHENN: FPGA-based acceleration framework for homomorphic encrypted CNN inference.
Yilan Zhu, Xinyao Wang, Lei Ju, Shanqing Guo
2023 GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks.
Ranggi Hwang, Minhoo Kang, Jiwon Lee, Dongyun Kam, Youngjoo Lee, Minsoo Rhu
2023 HIRAC: A Hierarchical Accelerator with Sorting-based Packing for SpGEMMs in DNN Applications.
Hesam Shabani, Abhishek Singh, Bishoy Youhana, Xiaochen Guo
2023 HeatViT: Hardware-Efficient Adaptive Token Pruning for Vision Transformers.
Peiyan Dong, Mengshu Sun, Alec Lu, Yanyue Xie, Kenneth Liu, Zhenglun Kong, Xin Meng, Zhengang Li, Xue Lin, Zhenman Fang, Yanzhi Wang
2023 High Performance and Power Efficient Accelerator for Cloud Inference.
Jianguo Yao, Hao Zhou, Yalin Zhang, Ying Li, Chuang Feng, Shi Chen, Jiaoyan Chen, Yongdong Wang, Qiaojuan Hu
2023 HoPP: Hardware-Software Co-Designed Page Prefetching for Disaggregated Memory.
Haifeng Li, Ke Liu, Ting Liang, Zuojun Li, Tianyue Lu, Hui Yuan, Yinben Xia, Yungang Bao, Mingyu Chen, Yizhou Shan
2023 HyQSAT: A Hybrid Approach for 3-SAT Problems by Integrating Quantum Annealer with CDCL.
Siwei Tan, Mingqian Yu, Andre Python, Yongheng Shang, Tingting Li, Liqiang Lu, Jianwei Yin
2023 IEEE International Symposium on High-Performance Computer Architecture, HPCA 2023, Montreal, QC, Canada, February 25 - March 1, 2023
2023 INCA: Input-stationary Dataflow at Outside-the-box Thinking about Deep Learning Accelerators.
Bokyung Kim, Shiyu Li, Hai Li
2023 ISOSceles: Accelerating Sparse CNNs through Inter-Layer Pipelining.
Yifan Yang, Joel S. Emer, Daniel Sánchez
2023 KRISP: Enabling Kernel-wise RIght-sizing for Spatial Partitioned GPU Inference Servers.
Marcus Chow, Ali Jahanshahi, Daniel Wong
2023 Know Your Enemy To Save Cloud Energy: Energy-Performance Characterization of Machine Learning Serving.
Junyeol Yu, Jongseok Kim, Euiseong Seo
2023 Leveraging Domain Information for the Efficient Automated Design of Deep Learning Accelerators.
Chirag Sakhuja, Zhan Shi, Calvin Lin
2023 LightTrader: A Standalone High-Frequency Trading System with Deep Learning Inference Accelerators and Proactive Scheduler.
Sungyeob Yoo, Hyunsung Kim, Jinseok Kim, Sunghyun Park, Joo-Young Kim, Jinwook Oh
2023 Logical/Physical Topology-Aware Collective Communication in Deep Learning Training.
Jo Sanghoon, Hyojun Son, John Kim
2023 MERCURY: Accelerating DNN Training By Exploiting Input Similarity.
Vahid Janfaza, Kevin Weston, Moein Razavi, Shantanu Mandal, Farabi Mahmud, Alex Hilty, Abdullah Muzahid
2023 MGC: Multiple-Gray-Code for 3D NAND Flash based High-Density SSDs.
Yina Lv, Liang Shi, Qiao Li, Congming Gao, Yunpeng Song, Longfei Luo, Youtao Zhang
2023 MPress: Democratizing Billion-Scale Model Training on Multi-GPU Servers via Memory-Saving Inter-Operator Parallelism.
Quan Zhou, Haiquan Wang, Xiaoyan Yu, Cheng Li, Youhui Bai, Feng Yan, Yinlong Xu
2023 Market Mechanism-Based User-in-the-Loop Scalable Power Oversubscription for HPC Systems.
Md Rajib Hossen, Kishwar Ahmed, Mohammad A. Islam
2023 Memory-Efficient Hashed Page Tables.
Jovan Stojkovic, Namrata Mantri, Dimitrios Skarlatos, Tianyin Xu, Josep Torrellas
2023 Mitigating GPU Core Partitioning Performance Effects.
Aaron Barnes, Fangjia Shen, Timothy G. Rogers
2023 Mix-GEMM: An efficient HW-SW Architecture for Mixed-Precision Quantized Deep Neural Networks Inference on Edge Devices.
Enrico Reggiani, Alessandro Pappalardo, Max Doblas, Miquel Moretó, Mauro Olivieri, Osman Sabri Unsal, Adrián Cristal
2023 MoCA: Memory-Centric, Adaptive Execution for Multi-Tenant Deep Neural Networks.
Seah Kim, Hasan Genc, Vadim Vadimovich Nikiforov, Krste Asanovic, Borivoje Nikolic, Yakun Sophia Shao
2023 Multi-Granularity Shadow Paging with NVM Write Optimization for Crash-Consistent Memory-Mapped I/O.
Hongchao Du, Qiao Li, Riwei Pan, Tei-Wei Kuo, Chun Jason Xue
2023 NOMAD: Enabling Non-blocking OS-managed DRAM Cache via Tag-Data Decoupling.
Youngin Kim, Hyeonjin Kim, William J. Song
2023 NvWa: Enhancing Sequence Alignment Accelerator Throughput via Hardware Scheduling.
Yewen Li, Xueqi Li, Ruihao Gao, Wanqi Liu, Guangming Tan
2023 On Consistency for Bulk-Bitwise Processing-in-Memory.
Ben Perach, Ronny Ronen, Shahar Kvatinsky
2023 OptimStore: In-Storage Optimization of Large Scale DNNs with On-Die Processing.
Junkyum Kim, Myeonggu Kang, Yunki Han, Yanggon Kim, Lee-Sup Kim
2023 ParallelNN: A Parallel Octree-based Nearest Neighbor Search Accelerator for 3D Point Clouds.
Faquan Chen, Rendong Ying, Jianwei Xue, Fei Wen, Peilin Liu
2023 Phloem: Automatic Acceleration of Irregular Applications with Fine-Grain Pipeline Parallelism.
Quan M. Nguyen, Daniel Sánchez
2023 PhotoFourier: A Photonic Joint Transform Correlator-Based Neural Network Accelerator.
Shurui Li, Hangbo Yang, Chee Wei Wong, Volker J. Sorger, Puneet Gupta
2023 Plutus: Bandwidth-Efficient Memory Security for GPUs.
Rahaf Abdullah, Huiyang Zhou, Amro Awad
2023 Poseidon: Practical Homomorphic Encryption Accelerator.
Yinghao Yang, Huaizhi Zhang, Shengyu Fan, Hang Lu, Mingzhe Zhang, Xiaowei Li
2023 Post0-VR: Enabling Universal Realistic Rendering for Modern VR via Exploiting Architectural Similarity and Data Sharing.
Yu Wen, Chenhao Xie, Shuaiwen Leon Song, Xin Fu
2023 Rambda: RDMA-driven Acceleration Framework for Memory-intensive µs-scale Datacenter Applications.
Yifan Yuan, Jinghan Huang, Yan Sun, Tianchen Wang, Jacob Nelson, Dan R. K. Ports, Yipeng Wang, Ren Wang, Charlie Tai, Nam Sung Kim
2023 Realizing Extreme Endurance Through Fault-aware Wear Leveling and Improved Tolerance.
Jiangwei Zhang, Chong Wang, Zhenhua Zhu, Donald Kline, Alex K. Jones, Huazhong Yang, Yu Wang
2023 Reconciling Selective Logging and Hardware Persistent Memory Transaction.
Chencheng Ye, Yuanchao Xu, Xipeng Shen, Yan Sha, Xiaofei Liao, Hai Jin, Yan Solihin
2023 Root Crash Consistency of SGX-style Integrity Trees in Secure Non-Volatile Memory Systems.
Jianming Huang, Yu Hua
2023 SGCN: Exploiting Compressed-Sparse Features in Deep Graph Convolutional Network Accelerators.
Mingi Yoo, Jaeyong Song, Jounghoo Lee, Namhyung Kim, Youngsok Kim, Jinho Lee
2023 SHADOW: Preventing Row Hammer in DRAM with Intra-Subarray Row Shuffling.
Minbok Wi, Jaehyun Park, Seoyoung Ko, Michael Jaemin Kim, Nam Sung Kim, Eojin Lee, Jung Ho Ahn
2023 Safety Hints for HTM Capacity Abort Mitigation.
Anirudh Jain, Divya Kiran Kadiyala, Alexandros Daglis
2023 Scalable and Secure Row-Swap: Efficient and Safe Row Hammer Mitigation in Memory Systems.
Jeonghyun Woo, Gururaj Saileshwar, Prashant J. Nair
2023 SecPB: Architectures for Secure Non-Volatile Memory with Battery-Backed Persist Buffers.
Alexander Freij, Huiyang Zhou, Yan Solihin
2023 Securator: A Fast and Secure Neural Processing Unit.
Nivedita Shrivastava, Smruti Ranjan Sarangi
2023 Sibia: Signed Bit-slice Architecture for Dense DNN Acceleration with Slice-level Sparsity Exploitation.
Dongseok Im, Gwangtae Park, Zhiyong Li, Junha Ryu, Hoi-Jun Yoo
2023 Silo: Speculative Hardware Logging for Atomic Durability in Persistent Memory.
Ming Zhang, Yu Hua
2023 SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs.
Jiwon Lee, Ju Min Lee, Yunho Oh, William J. Song, Won Woo Ro
2023 SpecFaaS: Accelerating Serverless Applications with Speculative Function Execution.
Jovan Stojkovic, Tianyin Xu, Hubertus Franke, Josep Torrellas
2023 Speculative Register Reclamation.
Sanyam Mehta
2023 Tensor Movement Orchestration in Multi-GPU Training Systems.
Shao-Fu Lin, Yi-Jung Chen, Hsiang-Yun Cheng, Chia-Lin Yang
2023 TensorFHE: Achieving Practical Computation on Encrypted Data Using GPGPU.
Shengyu Fan, Zhiwei Wang, Weizhi Xu, Rui Hou, Dan Meng, Mingzhe Zhang
2023 The Imitation Game: Leveraging CopyCats for Robust Native Gate Selection in NISQ Programs.
Poulami Das, Eric Kessler, Yunong Shi
2023 Thoth: Bridging the Gap Between Persistently Secure Memories and Memory Interfaces of Emerging NVMs.
Xijing Han, James Tuck, Amro Awad
2023 Trans-FW: Short Circuiting Page Table Walk in Multi-GPU Systems via Remote Forwarding.
Bingyao Li, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, Xulong Tang
2023 Turbo: SmartNIC-enabled Dynamic Load Balancing of µs-scale RPCs.
Hamed Seyedroudbari, Srikar Vanavasam, Alexandros Daglis
2023 VAQUERO: A Scratchpad-based Vector Accelerator for Query Processing.
Julián Pavón, Iván Vargas Valdivieso, Joan Marimon, Roger Figueras, Francesc Moll, Osman S. Unsal, Mateo Valero, Adrián Cristal
2023 VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs.
Geonhwa Jeong, Sana Damani, Abhimanyu Rajeshkumar Bambhaniya, Eric Qin, Christopher J. Hughes, Sreenivas Subramoney, Hyesoon Kim, Tushar Krishna
2023 VVQ: Virtualizing Virtual Channel for Cost-Efficient Protocol Deadlock Avoidance.
Hans Kasan, John Kim
2023 ViTALiTy: Unifying Low-rank and Sparse Approximation for Vision Transformer Acceleration with a Linear Taylor Attention.
Jyotikrishna Dass, Shang Wu, Huihong Shi, Chaojian Li, Zhifan Ye, Zhongfeng Wang, Yingyan Lin
2023 ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design.
Haoran You, Zhanyi Sun, Huihong Shi, Zhongzhi Yu, Yang Zhao, Yongan Zhang, Chaojian Li, Baopu Li, Yingyan Lin
2023 eNODE: Energy-Efficient and Low-Latency Edge Inference and Training of Neural ODEs.
Junkang Zhu, Yaoyu Tao, Zhengya Zhang
2023 iCache: An Importance-Sampling-Informed Cache for Accelerating I/O-Bound DNN Model Training.
Weijian Chen, Shuibing He, Yaowen Xu, Xuechen Zhang, Siling Yang, Shuang Hu, Xian-He Sun, Gang Chen