HPCA A*

55 papers

YearTitle / Authors
2020A
Tae Jun Ham, Sungjun Jung, Seonghak Kim, Young H. Oh, Yeonhong Park, Yoonho Song, Jung-Hun Park, Sanghee Lee, Kyoung Park, Jae W. Lee, Deog-Kyoon Jeong
2020A Deep Reinforcement Learning Framework for Architectural Exploration: A Routerless NoC Case Study.
Ting-Ru Lin, Drew Penney, Massoud Pedram, Lizhong Chen
2020A Hybrid Systolic-Dataflow Architecture for Inductive Matrix Algorithms.
Jian Weng, Sihao Liu, Zhengrong Wang, Vidushi Dadu, Tony Nowatzki
2020A New Side-Channel Vulnerability on Modern Computers by Exploiting Electromagnetic Emanations from the Power Management Unit.
Nader Sehatbakhsh, Baki Berkay Yilmaz, Alenka G. Zajic, Milos Prvulovic
2020ACR: Amnesic Checkpointing and Recovery.
Ismail Akturk, Ulya R. Karpuzcu
2020ALRESCHA: A Lightweight Reconfigurable Sparse-Computation Accelerator.
Bahar Asgari, Ramyad Hadidi, Tushar Krishna, Hyesoon Kim, Sudhakar Yalamanchili
2020AccPar: Tensor Partitioning for Heterogeneous Deep Learning Accelerators.
Linghao Song, Fan Chen, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen
2020Asymmetric Resilience: Exploiting Task-Level Idempotency for Transient Error Recovery in Accelerator-Based Systems.
Jingwen Leng, Alper Buyuktosunoglu, Ramon Bertran, Pradip Bose, Quan Chen, Minyi Guo, Vijay Janapa Reddi
2020BBS: Micro-Architecture Benchmarking Blockchain Systems through Machine Learning and Fuzzy Set.
Liang Zhu, Chao Chen, Zihao Su, Weiguang Chen, Tao Li, Zhibin Yu
2020BCoal: Bucketing-Based Memory Coalescing for Efficient and Secure GPUs.
Gurunath Kadam, Danfeng Zhang, Adwait Jog
2020Baldur: A Power-Efficient and Scalable Network Using All-Optical Switches.
Mohammad Reza Jokar, Junyi Qiu, Frederic T. Chong, Lynford L. Goddard, John M. Dallesasse, Milton Feng, Yanjing Li
2020CASINO Core Microarchitecture: Generating Out-of-Order Schedules Using Cascaded In-Order Scheduling Windows.
Ipoom Jeong, Seihoon Park, Changmin Lee, Won Woo Ro
2020CLITE: Efficient and QoS-Aware Co-Location of Multiple Latency-Critical Jobs for Warehouse Scale Computers.
Tirthak Patel, Devesh Tiwari
2020Charge-Aware DRAM Refresh Reduction with Value Transformation.
Seikwon Kim, Wonsang Kwak, Changdae Kim, Daehyeon Baek, Jaehyuk Huh
2020Communication Lower Bound in Convolution Accelerators.
Xiaoming Chen, Yinhe Han, Yu Wang
2020DRAIN: Deadlock Removal for Arbitrary Irregular Networks.
Mayank Parasar, Hossein Farrokhbakht, Natalie D. Enright Jerger, Paul V. Gratz, Tushar Krishna, Joshua San Miguel
2020DRAM-Less: Hardware Acceleration of Data Processing with New Memory.
Jie Zhang, Gyuyoung Park, David Donofrio, John Shalf, Myoungsoo Jung
2020DWT: Decoupled Workload Tracing for Data Centers.
Jian Chen, Ying Zhang, Xiaowei Jiang, Li Zhao, Zheng Cao, Qiang Liu
2020Deep Learning Acceleration with Neuron-to-Memory Transformation.
Mohsen Imani, Mohammad Samragh Razlighi, Yeseong Kim, Saransh Gupta, Farinaz Koushanfar, Tajana Rosing
2020Delay and Bypass: Ready and Criticality Aware Instruction Scheduling in Out-of-Order Processors.
Mehdi Alipour, Stefanos Kaxiras, David Black-Schaffer, Rakesh Kumar
2020Domain-Specialized Cache Management for Graph Analytics.
Priyank Faldu, Jeff Diamond, Boris Grot
2020EFLOPS: Algorithm and System Co-Design for a High Performance Distributed Training Platform.
Jianbo Dong, Zheng Cao, Tao Zhang, Jianxi Ye, Shaochuang Wang, Fei Feng, Li Zhao, Xiaoyong Liu, Liuyihan Song, Liwei Peng, Yiqun Guo, Xiaowei Jiang, Lingbo Tang, Yin Du, Yingya Zhang, Pan Pan, Yuan Xie
2020ELP2IM: Efficient and Low Power Bitwise Operation Processing in DRAM.
Xin Xin, Youtao Zhang, Jun Yang
2020EMSim: A Microarchitecture-Level Simulation Tool for Modeling Electromagnetic Side-Channel Signals.
Nader Sehatbakhsh, Baki Berkay Yilmaz, Alenka G. Zajic, Milos Prvulovic
2020Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design.
Xingyao Zhang, Shuaiwen Leon Song, Chenhao Xie, Jing Wang, Weigong Zhang, Xin Fu
2020EquiNox: Equivalent NoC Injection Routers for Silicon Interposer-Based Throughput Processors.
Yunfan Li, Lizhong Chen
2020Experiences with ML-Driven Design: A NoC Case Study.
Jieming Yin, Subhash Sethumurugan, Yasuko Eckert, Chintan Patel, Alan Smith, Eric Morton, Mark Oskin, Natalie D. Enright Jerger, Gabriel H. Loh
2020FLOWER and FaME: A Low Overhead Bit-Level Fault-map and Fault-Tolerance Approach for Deeply Scaled Memories.
Donald Kline Jr., Jiangwei Zhang, Rami G. Melhem, Alex K. Jones
2020Fulcrum: A Simplified Control and Access Mechanism Toward Flexible and Practical In-Situ Accelerators.
Marzieh Lenjani, Patricia Gonzalez-Guerrero, Elaheh Sadredini, Shuangchen Li, Yuan Xie, Ameen Akel, Sean Eilert, Mircea R. Stan, Kevin Skadron
2020Griffin: Hardware-Software Support for Efficient Page Migration in Multi-GPU Systems.
Trinayan Baruah, Yifan Sun, Ali Tolga Dinçer, Saiful A. Mojumder, José L. Abellán, Yash Ukidave, Ajay Joshi, Norman Rubin, John Kim, David R. Kaeli
2020HMG: Extending Cache Coherence Protocols Across Modern Hierarchical Multi-GPU Systems.
Xiaowei Ren, Daniel Lustig, Evgeny Bolotin, Aamer Jaleel, Oreste Villa, David W. Nellans
2020HyGCN: A GCN Accelerator with Hybrid Architecture.
Mingyu Yan, Lei Deng, Xing Hu, Ling Liang, Yujing Feng, Xiaochun Ye, Zhimin Zhang, Dongrui Fan, Yuan Xie
2020Hybrid2: Combining Caching and Migration in Hybrid Memory Systems.
Evangelos Vasilakis, Vassilis Papaefstathiou, Pedro Trancoso, Ioannis Sourdis
2020IEEE International Symposium on High Performance Computer Architecture, HPCA 2020, San Diego, CA, USA, February 22-26, 2020
2020IRONHIDE: A Secure Multicore that Efficiently Mitigates Microarchitecture State Attacks for Interactive Applications.
Hamza Omar, Omer Khan
2020Impala: Algorithm/Architecture Co-Design for In-Memory Multi-Stride Pattern Matching.
Elaheh Sadredini, Reza Rahimi, Marzieh Lenjani, Mircea Stan, Kevin Skadron
2020Improving Predication Efficiency through Compaction/Restoration of SIMD Instructions.
Adrián Barredo, Juan M. Cebrian, Miquel Moretó, Marc Casas, Mateo Valero
2020Leaking Information Through Cache LRU States.
Wenjie Xiong, Jakub Szefer
2020Missing the Forest for the Trees: End-to-End AI Application Performance in Edge Data Centers.
Daniel Richins, Dharmisha Doshi, Matthew Blackmore, Aswathy Thulaseedharan Nair, Neha Pathapati, Ankit Patel, Brainard Daguman, Daniel Dobrijalowski, Ramesh Illikkal, Kevin Long, David Zimmerman, Vijay Janapa Reddi
2020Mitigating Voltage Drop in Resistive Memories by Dynamic RESET Voltage Regulation and Partition RESET.
Farzaneh Zokaee, Lei Jiang
2020Multi-Range Supported Oblivious RAM for Efficient Block Data Retrieval.
Yuezhi Che, Rujia Wang
2020NVDIMM-C: A Byte-Addressable Non-Volatile Memory Module for Compatibility with Standard DDR Memory Interfaces.
Changmin Lee, Wonjae Shin, Dae Jeong Kim, Yongjun Yu, Sung-Joon Kim, TaeKyeong Ko, Deokho Seo, Jongmin Park, Kwanghee Lee, Seongho Choi, Namhyung Kim, Vishak G, Arun George, Vishwas V, Donghun Lee, Kang-Woo Choi, Changbin Song, Dohan Kim, Insu Choi, Ilgyu Jung, Yong Ho Song, Jinman Han
2020PIXEL: Photonic Neural Network Accelerator.
Kyle Shiflett, Dylan Wright, Avinash Karanth, Ahmed Louri
2020PREMA: A Predictive Multi-Task Scheduling Algorithm For Preemptible Neural Processing Units.
Yujeong Choi, Minsoo Rhu
2020Precise Runahead Execution.
Ajeya Naithani, Josué Feliu, Almutaz Adileh, Lieven Eeckhout
2020Q-Zilla: A Scheduling Framework and Core Microarchitecture for Tail-Tolerant Microservices.
Amirhossein Mirhosseini, Brendan L. West, Geoffrey W. Blake, Thomas F. Wenisch
2020QuickNN: Memory and Performance Optimization of k-d Tree Based Nearest Neighbor Search for 3D Point Clouds.
Reid Pinkham, Shuqing Zeng, Zhengya Zhang
2020ResiRCA: A Resilient Energy Harvesting ReRAM Crossbar-Based Accelerator for Intelligent Embedded Processors.
Keni Qiu, Nicholas Jao, Mengying Zhao, Cyan Subhra Mishra, Gulsum Gudukbay, Sethu Jose, Jack Sampson, Mahmut Taylan Kandemir, Vijaykrishnan Narayanan
2020SIGMA: A Sparse and Irregular GEMM Accelerator with Flexible Interconnects for DNN Training.
Eric Qin, Ananda Samajdar, Hyoukjun Kwon, Vineet Nadella, Sudarshan Srinivasan, Dipankar Das, Bharat Kaul, Tushar Krishna
2020SnackNoC: Processing in the Communication Layer.
Karthik Sangaiah, Michael Lui, Ragh Kuttappa, Baris Taskin, Mark Hempstead
2020SpArch: Efficient Architecture for Sparse Matrix Multiplication.
Zhekai Zhang, Hanrui Wang, Song Han, William J. Dally
2020Techniques for Reducing the Connected-Standby Energy Consumption of Mobile Devices.
Jawad Haj-Yahya, Yanos Sazeides, Mohammed Alser, Efraim Rotem, Onur Mutlu
2020Tensaurus: A Versatile Accelerator for Mixed Sparse-Dense Tensor Computations.
Nitish Kumar Srivastava, Hanchen Jin, Shaden Smith, Hongbo Rong, David H. Albonesi, Zhiru Zhang
2020The Architectural Implications of Facebook's DNN-Based Personalized Recommendation.
Udit Gupta, Carole-Jean Wu, Xiaodong Wang, Maxim Naumov, Brandon Reagen, David Brooks, Bradford Cottel, Kim M. Hazelwood, Mark Hempstead, Bill Jia, Hsien-Hsin S. Lee, Andrey Malevich, Dheevatsa Mudigere, Mikhail Smelyanskiy, Liang Xiong, Xuan Zhang
2020Twig: Multi-Agent Task Management for Colocated Latency-Critical Cloud Services.
Rajiv Nishtala, Vinicius Petrucci, Paul M. Carpenter, Magnus Själander