MICRO A*

95 papers

YearTitle / Authors
20212-in-1 Accelerator: Enabling Random Precision Switch for Winning Both Adversarial Robustness and Efficiency.
Yonggan Fu, Yang Zhao, Qixuan Yu, Chaojian Li, Yingyan Lin
2021: Near-Storage Accelerator for High-Performance Log Analytics.
Seongyoung Kang, Jiyoung An, Jinpyo Kim, Sang-Woo Jun
2021A Deeper Look into RowHammer's Sensitivities: Experimental Analysis of Real DRAM Chipsand Implications on Future Attacks and Defenses.
Lois Orosa, Abdullah Giray Yaglikçi, Haocong Luo, Ataberk Olgun, Jisung Park, Hasan Hassan, Minesh Patel, Jeremie S. Kim, Onur Mutlu
2021A Hardware Accelerator for Protocol Buffers.
Sagar Karandikar, Chris Leary, Chris Kennelly, Jerry Zhao, Dinesh Parimi, Borivoje Nikolic, Krste Asanovic, Parthasarathy Ranganathan
2021ADAPT: Mitigating Idling Errors in Qubits via Adaptive Dynamical Decoupling.
Poulami Das, Swamit S. Tannu, Siddharth Dangwal, Moinuddin K. Qureshi
2021APOLLO: An Automated Power Modeling Framework for Runtime Power Introspection in High-Volume Commercial Microprocessors.
Zhiyao Xie, Xiaoqing Xu, Matt Walker, Joshua Knebel, Kumaraguru Palaniswamy, Nicolas Hebert, Jiang Hu, Huanrui Yang, Yiran Chen, Shidhartha Das
2021AccelWattch: A Power Modeling Framework for Modern GPUs.
Vijay Kandiah, Scott Peverelle, Mahmoud Khairy, Junrui Pan, Amogh Manjunath, Timothy G. Rogers, Tor M. Aamodt, Nikos Hardavellas
2021Archytas: A Framework for Synthesizing and Dynamically Optimizing Accelerators for Robotic Localization.
Weizhuang Liu, Bo Yu, Yiming Gan, Qiang Liu, Jie Tang, Shaoshan Liu, Yuhao Zhu
2021AutoBraid: A Framework for Enabling Efficient Surface Code Communication in Quantum Computing.
Fei Hua, Yan-Hao Chen, Yuwei Jin, Chi Zhang, Ari B. Hayes, Youtao Zhang, Eddy Z. Zhang
2021AutoFL: Enabling Heterogeneity-Aware Energy Efficient Federated Learning.
Young Geun Kim, Carole-Jean Wu
2021Bonsai Merkle Forests: Efficiently Achieving Crash Consistency in Secure Persistent Memory.
Alexander Freij, Huiyang Zhou, Yan Solihin
2021Branch Runahead: An Alternative to Branch Prediction for Impossible to Predict Branches.
Stephen Pruett, Yale N. Patt
2021BurstLink: Techniques for Energy-Efficient Video Display for Conventional and Virtual Reality Systems.
Jawad Haj-Yahya, Jisung Park, Rahul Bera, Juan Gómez-Luna, Efraim Rotem, Taha Shahroodi, Jeremie S. Kim, Onur Mutlu
2021COSPlay: Leveraging Task-Level Parallelism for High-Throughput Synchronous Persistence.
Marina Vemmou, Alexandros Daglis
2021Capstan: A Vector RDA for Sparsity.
Alexander Rucker, Matthew Vilim, Tian Zhao, Yaqi Zhang, Raghu Prabhakar, Kunle Olukotun
2021Cerebros: Evading the RPC Tax in Datacenters.
Arash Pourhabibi Zarandi, Mark Sutherland, Alexandros Daglis, Babak Falsafi
2021Characterizing and Mitigating Soft Errors in GPU DRAM.
Michael B. Sullivan, Nirmal R. Saxena, Mike O'Connor, Donghyuk Lee, Paul Racunas, Saurabh Hukerikar, Timothy Tsai, Siva Kumar Sastry Hari, Stephen W. Keckler
2021Cohmeleon: Learning-Based Orchestration of Accelerator Coherence in Heterogeneous SoCs.
Joseph Zuckerman, Davide Giri, Jihye Kwon, Paolo Mantovani, Luca P. Carloni
2021Criticality Driven Fetch.
Aniket Deshmukh, Yale N. Patt
2021Cryptographic Capability Computing.
Michael LeMay, Joydeep Rakshit, Sergej Deutsch, David M. Durham, Santosh Ghosh, Anant Nori, Jayesh Gaur, Andrew Weiler, Salmin Sultana, Karanvir Grewal, Sreenivas Subramoney
2021DarKnight: An Accelerated Framework for Privacy and Integrity Preserving Deep Learning Using Trusted Hardware.
Hanieh Hashemi, Yongqin Wang, Murali Annavaram
2021Distilling Bit-level Sparsity Parallelism for General Purpose Deep Learning Acceleration.
Hang Lu, Liang Chang, Chenglong Li, Zixuan Zhu, Shengjian Lu, Yanhuan Liu, Mingzhe Zhang
2021Distributed Data Persistency.
Apostolos Kokolis, Antonis Psistakis, Benjamin Reidys, Jian Huang, Josep Torrellas
2021Dolos: Improving the Performance of Persistent Applications in ADR-Supported Secure Memory.
Xijing Han, James Tuck, Amro Awad
2021ENMC: Extreme Near-Memory Classification via Approximate Screening.
Liu Liu, Jilan Lin, Zheng Qu, Yufei Ding, Yuan Xie
2021ESCALATE: Boosting the Efficiency of Sparse CNN Accelerator with Kernel Decomposition.
Shiyu Li, Edward Hanson, Xuehai Qian, Hai (Helen) Li, Yiran Chen
2021EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference.
Thierry Tambe, Coleman Hooper, Lillian Pentecost, Tianyu Jia, En-Yu Yang, Marco Donato, Victor Sanh, Paul N. Whatmough, Alexander M. Rush, David Brooks, Gu-Yeon Wei
2021Effective Processor Verification with Logic Fuzzer Enhanced Co-simulation.
Nursultan Kabylkas, Tommy Thorn, Shreesha Srinath, Polychronis Xekalakis, Jose Renau
2021Efficient, Distributed, and Non-Speculative Multi-Address Atomic Operations.
Eduardo José Gómez-Hernández, Juan M. Cebrian, J. Rubén Titos Gil, Stefanos Kaxiras, Alberto Ros
2021Enabling Branch-Mispredict Level Parallelism by Selectively Flushing Instructions.
Stijn Eyerman, Wim Heirman, Sam Van den Steen, Ibrahim Hur
2021Equinox: Training (for Free) on a Custom Inference Accelerator.
Mario Drumond, Louis Coulon, Arash Pourhabibi Zarandi, Ahmet Caner Yüzügüler, Babak Falsafi, Martin Jaggi
2021Exploiting Different Levels of Parallelism in the Quantum Control Microarchitecture for Superconducting Qubits.
Mengyu Zhang, Lei Xie, Zhenxing Zhang, Qiaonian Yu, Guanglei Xi, Hualiang Zhang, Fuming Liu, Yarui Zheng, Yicong Zheng, Shengyu Zhang
2021F1: A Fast and Programmable Accelerator for Fully Homomorphic Encryption.
Nikola Samardzic, Axel Feldmann, Aleksandar Krastev, Srinivas Devadas, Ronald G. Dreslinski, Christopher Peikert, Daniel Sánchez
2021FPRaker: A Processing Element For Accelerating Neural Network Training.
Omar Mohamed Awad, Mostafa Mahmoud, Isak Edo, Ali Hadi Zadeh, Ciaran Bannon, Anand Jayarajan, Gennady Pekhimenko, Andreas Moshovos
2021Fat Loads: Exploiting Locality Amongst Contemporaneous Load Operations to Optimize Cache Accesses.
Vanshika Baoni, Adarsh Mittal, Gurindar S. Sohi
2021Fifer: Practical Acceleration of Irregular Applications on Reconfigurable Architectures.
Quan M. Nguyen, Daniel Sánchez
2021GPS: A Global Publish-Subscribe Model for Multi-GPU Memory Management.
Harini Muthukrishnan, Daniel Lustig, David W. Nellans, Thomas F. Wenisch
2021GhostMinion: A Strictness-Ordered Cache System for Spectre Mitigation.
Sam Ainsworth
2021GreenDIMM: OS-assisted DRAM Power Management for DRAM with a Sub-array Granularity Power-Down State.
Seunghak Lee, Ki-Dong Kang, Hwanjun Lee, Hyungwon Park, Young Hoon Son, Nam Sung Kim, Daehoon Kim
2021HARP: Practically and Effectively Identifying Uncorrectable Errors in Memory Chips That Use On-Die Error-Correcting Codes.
Minesh Patel, Geraldo F. Oliveira, Onur Mutlu
2021HiMA: A Fast and Scalable History-based Memory Access Engine for Differentiable Neural Computer.
Yaoyu Tao, Zhengya Zhang
2021HoloAR: On-the-fly Optimization of 3D Holographic Processing for Augmented Reality.
Shulin Zhao, Haibo Zhang, Cyan Subhra Mishra, Sandeepa Bhuyan, Ziyu Ying, Mahmut Taylan Kandemir, Anand Sivasubramaniam, Chita R. Das
2021I-GCN: A Graph Convolutional Network Accelerator with Runtime Locality Enhancement through Islandization.
Tong Geng, Chunshu Wu, Yongan Zhang, Cheng Tan, Chenhao Xie, Haoran You, Martin C. Herbordt, Yingyan Lin, Ang Li
2021ITSLF: Inter-Thread Store-to-Load Forwardingin Simultaneous Multithreading.
Josué Feliu, Alberto Ros, Manuel E. Acacio, Stefanos Kaxiras
2021IceClave: A Trusted Execution Environment for In-Storage Computing.
Luyi Kang, Yuqi Xue, Weiwei Jia, Xiaohao Wang, Jongryool Kim, Changhwan Youn, Myeong Joon Kang, Hyung Jin Lim, Bruce L. Jacob, Jian Huang
2021Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design.
Bingyao Li, Jieming Yin, Youtao Zhang, Xulong Tang
2021Improving Streaming Graph Processing Performance using Input Knowledge.
Abanti Basak, Zheng Qu, Jilan Lin, Alaa R. Alameldeen, Zeshan Chishti, Yufei Ding, Yuan Xie
2021Increasing GPU Translation Reach by Leveraging Under-Utilized On-Chip Resources.
Jagadish B. Kotra, Michael LeBeane, Mahmut T. Kandemir, Gabriel H. Loh
2021Intersection Prediction for Accelerated GPU Ray Tracing.
Lufei Liu, Wesley Chang, Francois Demoullin, Yuan-Hsi Chou, Mohammadreza Saed, David Pankratz, Tyler Nowicki, Tor M. Aamodt
2021JetStream: Graph Analytics on Streaming Data with Event-Driven Hardware Accelerator.
Shafiur Rahman, Mahbod Afarin, Nael B. Abu-Ghazaleh, Rajiv Gupta
2021JigSaw: Boosting Fidelity of NISQ Programs via Measurement Subsetting.
Poulami Das, Swamit S. Tannu, Moinuddin K. Qureshi
2021LADDER: Architecting Content and Location-aware Writes for Crossbar Resistive Memories.
Md Hafizul Islam Chowdhuryy, Muhammad Rashedul Haq Rashed, Amro Awad, Rickard Ewetz, Fan Yao
2021Leveraging Targeted Value Prediction to Unlock New Hardware Strength Reduction Potential.
Arthur Perais
2021MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, Virtual Event, Greece, October 18-22, 2021
2021Morrigan: A Composite Instruction TLB Prefetcher.
Georgios Vavouliotis, Lluc Alvarez, Boris Grot, Daniel A. Jiménez, Marc Casas
2021NDS: N-Dimensional Storage.
Yu-Chia Liu, Hung-Wei Tseng
2021NMAP: Power Management Based on Network Packet Processing Mode Transition for Latency-Critical Workloads.
Ki-Dong Kang, Gyeongseo Park, Hyosang Kim, Mohammad Alian, Nam Sung Kim, Daehoon Kim
2021NOVIA: A Framework for Discovering Non-Conventional Inline Accelerators.
David Trilla, John-David Wellman, Alper Buyuktosunoglu, Pradip Bose
2021Network-on-Chip Microarchitecture-based Covert Channel in GPUs.
Jaeguk Ahn, Jiho Kim, Hans Kasan, Zhixian Jin, Leila Delshadtehrani, WonJun Song, Ajay Joshi, John Kim
2021Noema: Hardware-Efficient Template Matching for Neural Population Pattern Detection.
Ameer M. S. Abdelhadi, Eugene Sha, Ciaran Bannon, Hendrik Steenland, Andreas Moshovos
2021Ohm-GPU: Integrating New Optical Network and Heterogeneous Memory into GPU Multi-Processors.
Jie Zhang, Myoungsoo Jung
2021OrderLight: Lightweight Memory-Ordering Primitive for Efficient Fine-Grained PIM Computations.
Anirban Nag, Rajeev Balasubramonian
2021PCCS: Processor-Centric Contention-aware Slowdown Model for Heterogeneous System-on-Chips.
Yuanchao Xu, Mehmet Esat Belviranli, Xipeng Shen, Jeffrey S. Vetter
2021PDede: Partitioned, Deduplicated, Delta Branch Target Buffer.
Niranjan K. Soundararajan, Peter Braun, Tanvir Ahmed Khan, Baris Kasikci, Heiner Litz, Sreenivas Subramoney
2021ParaBit: Processing Parallel Bitwise Operations in NAND Flash Memory based SSDs.
Congming Gao, Xin Xin, Youyou Lu, Youtao Zhang, Jun Yang, Jiwu Shu
2021Point-X: A Spatial-Locality-Aware Architecture for Energy-Efficient Graph-Based Point-Cloud Deep Learning.
Jie-Fang Zhang, Zhengya Zhang
2021PointAcc: Efficient Point Cloud Accelerator.
Yujun Lin, Zhekai Zhang, Haotian Tang, Hanrui Wang, Song Han
2021Post-Fabrication Microarchitecture.
Chanchal Kumar, Anirudh Seshadri, Aayush Chaudhary, Shubham Bhawalkar, Rohit Singh, Eric Rotenberg
2021Principal Kernel Analysis: A Tractable Methodology to Simulate Scaled GPU Workloads.
Cesar Avalos Baddouh, Mahmoud Khairy, Roland N. Green, Mathias Payer, Timothy G. Rogers
2021Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning.
Rahul Bera, Konstantinos Kanellopoulos, Anant Nori, Taha Shahroodi, Sreenivas Subramoney, Onur Mutlu
2021RACER: Bit-Pipelined Processing Using Resistive Memory.
Minh S. Q. Truong, Eric Chen, Deanyone Su, Liting Shen, Alexander Glass, L. Richard Carley, James A. Bain, Saugata Ghose
2021RecPipe: Co-designing Models and Hardware to Jointly Optimize Recommendation Quality and Performance.
Udit Gupta, Samuel Hsia, Jeff Zhang, Mark Wilkening, Javin Pombra, Hsien-Hsin Sean Lee, Gu-Yeon Wei, Carole-Jean Wu, David Brooks
2021ReplayCache: Enabling Volatile Cachesfor Energy Harvesting Systems.
Jianping Zeng, Jongouk Choi, Xinwei Fu, Ajay Paddayuru Shreepathi, Dongyoon Lee, Changwoo Min, Changhee Jung
2021SAM: Accelerating Strided Memory Accesses.
Xin Xin, Yanan Guo, Youtao Zhang, Jun Yang
2021SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems.
Maciej Besta, Raghavendra Kanakagiri, Grzegorz Kwasniewski, Rachata Ausavarungnirun, Jakub Beránek, Konstantinos Kanellopoulos, Kacper Janda, Zur Vonarburg-Shmaria, Lukas Gianinazzi, Ioana Stefan, Juan Gómez-Luna, Jakub Golinowski, Marcin Copik, Lukas Kapp-Schwoerer, Salvatore Di Girolamo, Nils Blach, Marek Konieczny, Onur Mutlu, Torsten Hoefler
2021SMART: A Heterogeneous Scratchpad Memory Architecture for Superconductor SFQ-based Systolic CNN Accelerators.
Farzaneh Zokaee, Lei Jiang
2021Sanger: A Co-Design Framework for Enabling Sparse Attention using Reconfigurable Architecture.
Liqiang Lu, Yicheng Jin, Hangrui Bi, Zizhang Luo, Peng Li, Tao Wang, Yun Liang
2021Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network Training via Memory-Friendly Pattern Retrieving.
Qiyu Wan, Haojun Xia, Xingyao Zhang, Lening Wang, Shuaiwen Leon Song, Xin Fu
2021Software-Defined Vector Processing on Manycore Fabrics.
Philip Bedoukian, Neil Adit, Edwin Peguero, Adrian Sampson
2021Soteria: Towards Resilient Integrity-Protected and Encrypted Non-Volatile Memories.
Kazi Abu Zubair, Sudhanva Gurumurthi, Vilas Sridharan, Amro Awad
2021SparseAdapt: Runtime Control for Sparse Linear Algebra on a Reconfigurable Accelerator.
Subhankar Pal, Aporva Amarnath, Siying Feng, Michael F. P. O'Boyle, Ronald G. Dreslinski, Christophe Dubach
2021Speculative Privacy Tracking (SPT): Leaking Information From Speculative Execution Without Compromising Privacy.
Rutvik Choudhary, Jiyong Yu, Christopher W. Fletcher, Adam Morrison
2021SquiggleFilter: An Accelerator for Portable Virus Detection.
Timothy Dunn, Harisankar Sadasivan, Jack Wadden, Kush Goliya, Kuan-Yu Chen, David T. Blaauw, Reetuparna Das, Satish Narayanasamy
2021Sunder: Enabling Low-Overhead and Scalable Near-Data Pattern Matching Acceleration.
Elaheh Sadredini, Reza Rahimi, Mohsen Imani, Kevin Skadron
2021Synthesizing Formal Models of Hardware from RTL for Efficient Verification of Memory Model Implementations.
Yao Hsiao, Dominic P. Mulligan, Nikos Nikoleris, Gustavo Petri, Caroline Trippel
2021TIP: Time-Proportional Instruction Profiling.
Björn Gottschall, Lieven Eeckhout, Magnus Jahre
2021TRiM: Enhancing Processor-Memory Interfaces with Scalable Tensor Reduction in Memory.
Jaehyun Park, Byeongho Kim, Sungmin Yun, Eojin Lee, Minsoo Rhu, Jung Ho Ahn
2021The Laplace Microarchitecture for Tracking Data Uncertainty and Its Implementation in a RISC-V Processor.
Vasileios Tsoutsouras, Orestis Kaparounakis, Bilgesu Arif Bilgin, Chatura Samarakoon, James Timothy Meech, Jan Heck, Phillip Stanley-Marbell
2021Trident: Harnessing Architectural Resources for All Page Sizes in x86 Processors.
Venkat Sri Sai Ram, Ashish Panwar, Arkaprava Basu
2021Turnpike: Lightweight Soft Error Resilience for In-Order Cores.
Jianping Zeng, Hongjune Kim, Jaejin Lee, Changhee Jung
2021Twig: Profile-Guided BTB Prefetching for Data Center Applications.
Tanvir Ahmed Khan, Nathan Brown, Akshitha Sriraman, Niranjan K. Soundararajan, Rakesh Kumar, Joseph Devietti, Sreenivas Subramoney, Gilles A. Pokam, Heiner Litz, Baris Kasikci
2021UC-Check: Characterizing Micro-operation Caches in x86 Processors and Implications in Security and Performance.
Joonsung Kim, Hamin Jang, Hunjun Lee, Seungho Lee, Jangwoo Kim
2021Uncovering In-DRAM RowHammer Protection Mechanisms: A New Methodology, Custom RowHammer Patterns, and Implications.
Hasan Hassan, Yahya Can Tugrul, Jeremie S. Kim, Victor van der Veen, Kaveh Razavi, Onur Mutlu
2021Validation of Side-Channel Models via Observation Refinement.
Pablo Buiras, Hamed Nemati, Andreas Lindner, Roberto Guanciale
2021Vortex: Extending the RISC-V ISA for GPGPU and 3D-Graphics.
Blaise Tine, Krishna Praveen Yalamarthy, Fares Elsabbagh, Hyesoon Kim