MICRO A*

83 papers

YearTitle / Authors
2020(Almost) Fence-less Persist Ordering.
Sara Mahdizadeh-Shahri, Seyed Armin Vakil-Ghahani, Aasheesh Kolli
202053rd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2020, Athens, Greece, October 17-21, 2020
2020A Benchmarking Framework for Interactive 3D Applications in the Cloud.
Tianyi Liu, Sen He, Sunzhou Huang, Danny H. K. Tsang, Lingjia Tang, Jason Mars, Wei Wang
2020A Locality-Aware Energy-Efficient Accelerator for Graph Mining Applications.
Pengcheng Yao, Long Zheng, Zhen Zeng, Yu Huang, Chuangyi Gui, Xiaofei Liao, Hai Jin, Jingling Xue
2020AQUOMAN: An Analytic-Query Offloading Machine.
Shuotao Xu, Thomas Bourgeat, Tianhao Huang, Hojun Kim, Sungjin Lee, Arvind
2020AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing.
Tong Geng, Ang Li, Runbin Shi, Chunshu Wu, Tianqi Wang, Yanfei Li, Pouya Haghi, Antonino Tumeo, Shuai Che, Steven K. Reinhardt, Martin C. Herbordt
2020AutoScale: Energy Efficiency Optimization for Stochastic Edge Inference Using Reinforcement Learning.
Young Geun Kim, Carole-Jean Wu
2020BOW: Breathing Operand Windows to Exploit Bypassing in GPUs.
Hodjat Asghari Esfeden, AmirAli Abdolrashidi, Shafiur Rahman, Daniel Wong, Nael B. Abu-Ghazaleh
2020Bit-Exact ECC Recovery (BEER): Determining DRAM On-Die ECC Functions by Exploiting DRAM Data Retention Characteristics.
Minesh Patel, Jeremie S. Kim, Taha Shahroodi, Hasan Hassan, Onur Mutlu
2020Boosting Store Buffer Efficiency with Store-Prefetch Bursts.
Juan M. Cebrian, Stefanos Kaxiras, Alberto Ros
2020BranchNet: A Convolutional Neural Network to Predict Hard-To-Predict Branches.
Siavash Zangeneh, Stephen Pruett, Sangkug Lym, Yale N. Patt
2020Building the Computing System for Autonomous Micromobility Vehicles: Design Constraints and Architectural Optimizations.
Bo Yu, Wei Hu, Leimeng Xu, Jie Tang, Shaoshan Liu, Yuhao Zhu
2020CATCAM: Constant-time Alteration Ternary CAM with Scalable In-Memory Architecture.
Dibei Chen, Zhaoshi Li, Tianzhu Xiong, Zhiwei Liu, Jun Yang, Shouyi Yin, Shaojun Wei, Leibo Liu
2020CHiRP: Control-Flow History Reuse Prediction.
Samira Mirbagher Ajorpaz, Elba Garza, Gilles Pokam, Daniel A. Jiménez
2020CaSA: End-to-end Quantitative Security Analysis of Randomly Mapped Caches.
Thomas Bourgeat, Jules Drean, Yuheng Yang, Lillian Tsai, Joel S. Emer, Mengjia Yan
2020Characterizing and Modeling Non-Volatile Memory Systems.
Zixuan Wang, Xiao Liu, Jian Yang, Theodore Michailidis, Steven Swanson, Jishen Zhao
2020Circuit Compilation Methodologies for Quantum Approximate Optimization Algorithm.
Mahabubul Alam, Abdullah Ash-Saki, Swaroop Ghosh
2020ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning.
Sheng-Chun Kao, Geonhwa Jeong, Tushar Krishna
2020Coordinated Priority-aware Charging of Distributed Batteries in Oversubscribed Data Centers.
Sulav Malla, Qingyuan Deng, Zoh Ebrahimzadeh, Joe Gasperetti, Sajal Jain, Parimala Kondety, Thiara Ortiz, Debra Vieira
2020CuttleSys: Data-Driven Resource Management for Interactive Services on Reconfigurable Multicores.
Neeraj Kulkarni, Gonzalo Gonzalez-Pumariega, Amulya Khurana, Christine A. Shoemaker, Christina Delimitrou, David H. Albonesi
2020D-SOAP: Dynamic Spatial Orientation Affinity Prediction for Caching in Multi-Orientation Memory Systems.
Minli Julie Liao, Jack Sampson
2020DStress: Automatic Synthesis of DRAM Reliability Stress Viruses using Genetic Algorithms.
Lev Mukhanov, Dimitrios S. Nikolopoulos, Georgios Karakonstantis
2020DUAL: Acceleration of Clustering Algorithms using Digital-based Processing In-Memory.
Mohsen Imani, Saikishan Pampana, Saransh Gupta, Minxuan Zhou, Yeseong Kim, Tajana Rosing
2020DUET: Boosting Deep Neural Network Efficiency on Dual-Module Architecture.
Liu Liu, Zheng Qu, Lei Deng, Fengbin Tu, Shuangchen Li, Xing Hu, Zhenyu Gu, Yufei Ding, Yuan Xie
2020Deterministic Atomic Buffering.
Yuan-Hsi Chou, Christopher Ng, Shaylin Cattell, Jeremy Intan, Matthew D. Sinclair, Joseph Devietti, Timothy G. Rogers, Tor M. Aamodt
2020DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates.
Alex Renda, Yishen Chen, Charith Mendis, Michael Carbin
2020Draco: Architectural and Operating System Support for System Call Security.
Dimitrios Skarlatos, Qingrong Chen, Jianyan Chen, Tianyin Xu, Josep Torrellas
2020Duplo: Lifting Redundant Memory Accesses of Deep Neural Networks for GPU Tensor Cores.
Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, William J. Song
2020FIGARO: Improving System Performance via Fine-Grained In-DRAM Data Relocation and Caching.
Yaohua Wang, Lois Orosa, Xiangjun Peng, Yang Guo, Saugata Ghose, Minesh Patel, Jeremie S. Kim, Juan Gómez-Luna, Mohammad Sadrosadati, Nika Mansouri-Ghiasi, Onur Mutlu
2020FIdelity: Efficient Resilience Analysis Framework for Deep Learning Accelerators.
Yi He, Prasanna Balaprakash, Yanjing Li
2020FReaC Cache: Folded-logic Reconfigurable Computing in the Last Level Cache.
Ashutosh Dhar, Xiaohao Wang, Hubertus Franke, Jinjun Xiong, Jian Huang, Wen-mei W. Hwu, Nam Sung Kim, Deming Chen
2020Fast-BCNN: Massive Neuron Skipping in Bayesian Convolutional Neural Networks.
Qiyu Wan, Xin Fu
2020FlexWatts: A Power- and Workload-Aware Hybrid Power Delivery Network for Energy-Efficient Microprocessors.
Jawad Haj-Yahya, Mohammed Alser, Jeremie S. Kim, Lois Orosa, Efraim Rotem, Avi Mendelson, Anupam Chattopadhyay, Onur Mutlu
2020GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference.
Ali Hadi Zadeh, Isak Edo, Omar Mohamed Awad, Andreas Moshovos
2020Gemini: Learning to Manage CPU Power for Latency-Critical Search Engines.
Liang Zhou, Laxmi N. Bhuyan, K. K. Ramakrishnan
2020GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis.
Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gómez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu
2020GraphPulse: An Event-Driven Hardware Accelerator for Asynchronous Graph Processing.
Shafiur Rahman, Nael B. Abu-Ghazaleh, Rajiv Gupta
2020Graphene: Strong yet Lightweight Row Hammer Protection.
Yeonhong Park, Woosuk Kwon, Eojin Lee, Tae Jun Ham, Jung Ho Ahn, Jae W. Lee
2020Hardware-based Always-On Heap Memory Safety.
Yonghae Kim, Jaekyu Lee, Hyesoon Kim
2020HyperPlane: A Scalable Low-Latency Notification Accelerator for Software Data Planes.
Amirhossein Mirhosseini, Hossein Golestani, Thomas F. Wenisch
2020I-SPY: Context-Driven Conditional Instruction Prefetching with Coalescing.
Tanvir Ahmed Khan, Akshitha Sriraman, Joseph Devietti, Gilles Pokam, Heiner Litz, Baris Kasikci
2020Improving the Utilization of Micro-operation Caches in x86 Processors.
Jagadish B. Kotra, John Kalamatianos
2020Jumanji: The Case for Dynamic NUCA in the Datacenter.
Brian C. Schwedock, Nathan Beckmann
2020Locality-Centric Data and Threadblock Management for Massive GPUs.
Mahmoud Khairy, Vadim Nikiforov, David W. Nellans, Timothy G. Rogers
2020Look-Up Table based Energy Efficient Processing in Cache Support for Neural Network Acceleration.
Akshay Krishna Ramanathan, Gurpreet S. Kalsi, Srivatsa Srinivasa, Tarun Makesh Chandran, Kamlesh R. Pillai, Om Ji Omer, Vijaykrishnan Narayanan, Sreenivas Subramoney
2020MDM: The GPU Memory Divergence Model.
Lu Wang, Magnus Jahre, Almutaz Adileh, Lieven Eeckhout
2020MOUSE: Inference In Non-volatile Memory for Energy Harvesting Applications.
Salonik Resch, S. Karen Khatamifard, Zamshed I. Chowdhury, Masoud Zabihi, Zhengyang Zhao, M. Hüsrev Cilasun, Jianping Wang, Sachin S. Sapatnekar, Ulya R. Karpuzcu
2020MatRaptor: A Sparse-Sparse Matrix Multiplication Accelerator Based on Row-Wise Product.
Nitish Kumar Srivastava, Hanchen Jin, Jie Liu, David H. Albonesi, Zhiru Zhang
2020Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-Aggregation.
Yu Feng, Boyuan Tian, Tiancheng Xu, Paul N. Whatmough, Yuhao Zhu
2020More with Less - Deriving More Translation Rules with Less Training Data for DBTs Using Parameterization.
Jinhu Jiang, Rongchao Dong, Zhongjun Zhou, Changheng Song, Wenwen Wang, Pen-Chung Yew, Weihua Zhang
2020NCPU: An Embedded Neural CPU Architecture on Resource-Constrained Low Power Devices for Real-time End-to-End Performance.
Tianyu Jia, Yuhao Ju, Russ Joseph, Jie Gu
2020Newton: A DRAM-maker's Accelerator-in-Memory (AiM) Architecture for Machine Learning.
Mingxuan He, Choungki Song, Ilkon Kim, Chunseok Jeong, Seho Kim, Il Park, Mithuna Thottethodi, T. N. Vijaykumar
2020Non-Blocking Simultaneous Multithreading: Embracing the Resiliency of Deep Neural Networks.
Gil Shomron, Uri C. Weiser
2020Optimized Quantum Compilation for Near-Term Algorithms with OpenPulse.
Pranav Gokhale, Ali Javadi-Abhari, Nathan Earnest, Yunong Shi, Frederic T. Chong
2020Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and Data.
Jie Zhao, Peng Di
2020P-INSPECT: Architectural Support for Programmable Non-Volatile Memory Frameworks.
Apostolos Kokolis, Thomas Shull, Jian Huang, Josep Torrellas
2020PThammer: Cross-User-Kernel-Boundary Rowhammer through Implicit Accesses.
Zhi Zhang, Yueqiang Cheng, Dongxi Liu, Surya Nepal, Zhi Wang, Yuval Yarom
2020PerSpectron: Detecting Invariant Footprints of Microarchitectural Attacks with Perceptron.
Samira Mirbagher Ajorpaz, Gilles Pokam, Esmaeil Mohammadian Koruyeh, Elba Garza, Nael B. Abu-Ghazaleh, Daniel A. Jiménez
2020PerpLE: Improving the Speed and Effectiveness of Memory Consistency Testing.
Themis Melissaris, Markos Markakis, Kelly A. Shaw, Margaret Martonosi
2020Persist Level Parallelism: Streamlining Integrity Tree Updates for Secure Persistent Memory.
Alexander Freij, Shougang Yuan, Huiyang Zhou, Yan Solihin
2020Pipette: Improving Core Utilization on Irregular Applications through Intra-Core Pipeline Parallelism.
Quan M. Nguyen, Daniel Sánchez
2020Planaria: Dynamic Architecture Fission for Spatial Multi-Tenant Acceleration of Deep Neural Networks.
Soroush Ghodrati, Byung Hoon Ahn, Joon Kyung Kim, Sean Kinzer, Brahmendra Reddy Yatham, Navateja Alla, Hardik Sharma, Mohammad Alian, Eiman Ebrahimi, Nam Sung Kim, Cliff Young, Hadi Esmaeilzadeh
2020Predicting Execution Times With Partial Simulations in Virtual Memory Research: Why and How.
Mohammad Agbarya, Idan Yaniv, Jayneel Gandhi, Dan Tsafrir
2020Printed Machine Learning Classifiers.
Muhammad Husnain Mubarik, Dennis D. Weller, Nathaniel Bleier, Matthew Tomei, Jasmin Aghassi-Hagmann, Mehdi B. Tahoori, Rakesh Kumar
2020Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network Training.
Dingqing Yang, Amin Ghasemazar, Xiaowei Ren, Maximilian Golub, Guy Lemieux, Mieszko Lis
2020Ptolemy: Architecture Support for Robust Deep Learning.
Yiming Gan, Yuxian Qiu, Jingwen Leng, Minyi Guo, Yuhao Zhu
2020RnR: A Software-Assisted Record-and-Replay Hardware Prefetcher.
Chao Zhang, Yuan Zeng, John Shalf, Xiaochen Guo
2020SAVE: Sparsity-Aware Vector Engine for Accelerating DNN Training and Inference on CPUs.
Zhangxiaowen Gong, Houxiang Ji, Christopher W. Fletcher, Christopher J. Hughes, Sara S. Baghsorkhi, Josep Torrellas
2020SeedEx: A Genome Sequencing Accelerator for Optimal Alignments in Subminimal Space.
Daichi Fujiki, Shunhao Wu, Nathan Ozog, Kush Goliya, David T. Blaauw, Satish Narayanasamy, Reetuparna Das
2020Selective Replication in Memory-Side GPU Caches.
Xia Zhao, Magnus Jahre, Lieven Eeckhout
2020Shaving Retries with Sentinels for Fast Read over High-Density 3D Flash.
Qiao Li, Min Ye, Yufei Cui, Liang Shi, Xiaoqiang Li, Tei-Wei Kuo, Chun Jason Xue
2020Speculation Invariance (InvarSpec): Faster Safe Execution Through Program Analysis.
Zirui Neil Zhao, Houxiang Ji, Mengjia Yan, Jiyong Yu, Christopher W. Fletcher, Adam Morrison, Darko Marinov, Josep Torrellas
2020Speculative Enforcement of Store Atomicity.
Alberto Ros, Stefanos Kaxiras
2020SuperNPU: An Extremely Fast Neural Processing Unit Using Superconducting Logic Devices.
Koki Ishida, Ilkwon Byun, Ikki Nagaoka, Kosuke Fukumitsu, Masamitsu Tanaka, Satoshi Kawakami, Teruo Tanimoto, Takatsugu Ono, Jangwoo Kim, Koji Inoue
2020Systematic Crosstalk Mitigation for Superconducting Qubits via Frequency-Aware Compilation.
Yongshan Ding, Pranav Gokhale, Sophia Fuhui Lin, Richard Rines, Thomas Propson, Frederic T. Chong
2020TFE: Energy-efficient Transferred Filter-based Engine to Compress and Accelerate Convolutional Neural Networks.
Huiyu Mo, Leibo Liu, Wenjing Hu, Wenping Zhu, Qiang Li, Ang Li, Shouyi Yin, Jian Chen, Xiaowei Jiang, Shaojun Wei
2020TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training.
Mostafa Mahmoud, Isak Edo, Ali Hadi Zadeh, Omar Mohamed Awad, Gennady Pekhimenko, Jorge Albericio, Andreas Moshovos
2020ThymesisFlow: A Software-Defined, HW/SW co-Designed Interconnect Stack for Rack-Scale Memory Disaggregation.
Christian Pinto, Dimitris Syrivelis, Michele Gazzetti, Panos K. Koutsovasilis, Andrea Reale, Kostas Katrinis, H. Peter Hofstee
2020TrainBox: An Extreme-Scale Neural Network Training Server Architecture by Systematically Balancing Operations.
Pyeongsu Park, Heetaek Jeong, Jangwoo Kim
2020Unbounded Hardware Transactional Memory for a Hybrid DRAM/NVM Memory System.
Jungi Jeong, Jaewan Hong, Seungryoul Maeng, Changhee Jung, Youngjin Kwon
2020VR-DANN: Real-Time Video Recognition via Decoder-Assisted Neural Network Acceleration.
Zhuoran Song, Feiyang Wu, Xueyuan Liu, Jing Ke, Naifeng Jing, Xiaoyao Liang
2020Virtualized Logical Qubits: A 2.5D Architecture for Error-Corrected Quantum Computing.
Casey Duckering, Jonathan M. Baker, David I. Schuster, Frederic T. Chong
2020gem5-SALAM: A System Architecture for LLVM-based Accelerator Modeling.
Samuel Rogers, Joshua Slycord, Mohammadreza Baharani, Hamed Tabkhi