| 2020 | A Methodology for Principled Approximation in Visual SLAM. Yan Pei, Swarnendu Biswas, Donald S. Fussell, Keshav Pingali |
| 2020 | A New Qubits Mapping Mechanism for Multi-programming Quantum Computing. Xinglei Dou, Lei Liu |
| 2020 | ATTC (@C): Addressable-TLB based Translation Coherence. Harsh Gugale, Nagendra Gulur, Yashwant Marathe, Lizy K. John |
| 2020 | Accelerating Sparse CNN Inference on GPUs with Performance-Aware Weight Pruning. Masuma Akter Rumi, Xiaolong Ma, Yanzhi Wang, Peng Jiang |
| 2020 | Analyzing and Leveraging Shared L1 Caches in GPUs. Mohamed Assem Ibrahim, Onur Kayiran, Yasuko Eckert, Gabriel H. Loh, Adwait Jog |
| 2020 | Approximate Pattern Matching for On-Chip Interconnect Traffic Prediction. Vignesh Adhinarayanan, Wu-chun Feng |
| 2020 | AutoHOOT: Automatic High-Order Optimization for Tensors. Linjian Ma, Jiayu Ye, Edgar Solomonik |
| 2020 | Automatic Generation of Multi-Objective Polyhedral Compiler Transformations. Lorenzo Chelini, Tobias Gysi, Tobias Grosser, Martin Kong, Henk Corporaal |
| 2020 | Bandwidth Bottleneck in Network-on-Chip for High-Throughput Processors. Jiho Kim, Sanghun Cho, Minsoo Rhu, Ali Bakhoda, Tor M. Aamodt, John Kim |
| 2020 | Bandwidth-Aware Loop Tiling for DMA-Supported Scratchpad Memory. Mingchuan Wu, Ying Liu, Huimin Cui, Qingfu Wei, Quanfeng Li, Limin Li, Fang Lv, Jingling Xue, Xiaobing Feng |
| 2020 | Clearing the Shadows: Recovering Lost Performance for Invisible Speculative Execution through HW/SW Co-Design. Kim-Anh Tran, Christos Sakalis, Magnus Själander, Alberto Ros, Stefanos Kaxiras, Alexandra Jimborean |
| 2020 | Collective Affinity Aware Computation Mapping. Mahmut T. Kandemir, Jihyun Ryoo, Hui Zhao, Myoungsoo Jung, Mustafa Karaköy |
| 2020 | Compiling Chapel: Keys to Making Parallel Programming Productive at Scale. Bradford L. Chamberlain |
| 2020 | Decoupled Address Translation for Heterogeneous Memory Systems. Bokyeong Kim, Soojin Hwang, Sanghoon Cha, Chang Hyun Park, Jongse Park, Jaehyuk Huh |
| 2020 | Deep Learning Assisted Resource Partitioning for Improving Performance on Commodity Servers. Ruobing Chen, Jinping Wu, Haosen Shi, Yusen Li, Haiyan Yin, Shanjiang Tang, Xiaoguang Liu, Gang Wang |
| 2020 | Deep Program Structure Modeling Through Multi-Relational Graph-based Learning. Guixin Ye, Zhanyong Tang, Huanting Wang, Dingyi Fang, Jianbin Fang, Songfang Huang, Zheng Wang |
| 2020 | DeepSwapper: A Deep Learning Based Page Swap Management Scheme for Hybrid Memory Systems. Majed Valad Beigi, Bahareh Pourshirazi, Gokhan Memik, Zhichun Zhu |
| 2020 | Enhancing Address Translations in Throughput Processors via Compression. Xulong Tang, Ziyu Zhang, Weizheng Xu, Mahmut Taylan Kandemir, Rami G. Melhem, Jun Yang |
| 2020 | Exploiting Locality in Scalable Ordered Maps. Matthew Rodriguez, Ahmed Hassan, Michael F. Spear |
| 2020 | Exploring the Design Space of Static and Incremental Graph Connectivity Algorithms on GPUs. Changwan Hong, Laxman Dhulipala, Julian Shun |
| 2020 | Fast Convolutional Neural Networks with Fine-Grained FFTs. Yulin Zhang, Xiaoming Li |
| 2020 | Fireiron: A Data-Movement-Aware Scheduling Language for GPUs. Bastian Hagedorn, Archibald Samuel Elliott, Henrik Barthels, Rastislav Bodík, Vinod Grover |
| 2020 | GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU. Chanyoung Oh, Zhen Zheng, Xipeng Shen, Jidong Zhai, Youngmin Yi |
| 2020 | Helix: Algorithm/Architecture Co-design for Accelerating Nanopore Genome Base-calling. Qian Lou, Sarath Chandra Janga, Lei Jiang |
| 2020 | Intelligent Data Placement on Discrete GPU Nodes with Unified Memory. Tanzima Sultana, Blake Allen, Apan Qasem |
| 2020 | Low-Latency Proactive Continuous Vision. Yiming Gan, Yuxian Qiu, Lele Chen, Jingwen Leng, Yuhao Zhu |
| 2020 | MEPHESTO: Modeling Energy-Performance in Heterogeneous SoCs and Their Trade-Offs. Mohammad Alaul Haque Monil, Mehmet E. Belviranli, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony |
| 2020 | Memory-Equipped Quantum Architectures: The Power of Random Access. Jonathan M. Baker, David I. Schuster, Frederic T. Chong |
| 2020 | Mixed-Signal Charge-Domain Acceleration of Deep Neural Networks through Interleaved Bit-Partitioned Arithmetic. Soroush Ghodrati, Hardik Sharma, Sean Kinzer, Amir Yazdanbakhsh, Jongse Park, Nam Sung Kim, Doug Burger, Hadi Esmaeilzadeh |
| 2020 | Model-Based Warp Overlapped Tiling for Image Processing Programs on GPUs. Abhinav Jangda, Arjun Guha |
| 2020 | Opportunistic Early Pipeline Re-steering for Data-dependent Branches. Saurabh Gupta, Niranjan Soundararajan, Ragavendra Natarajan, Sreenivas Subramoney |
| 2020 | Overview of HPC and AI Computing for COVID-19 in the US. Rick Stevens |
| 2020 | PACT '20: International Conference on Parallel Architectures and Compilation Techniques, Virtual Event, GA, USA, October 3-7, 2020. Vivek Sarkar, Hyesoon Kim |
| 2020 | PRISM: Architectural Support for Variable-granularity Memory Metadata. Rachata Ausavarungnirun, Timothy Merrifield, Jayneel Gandhi, Christopher J. Rossbach |
| 2020 | Parallel and Scalable Precise Clustering. Stuart Byma, Akash Balasaheb Dhasade, Adrian M. Altenhoff, Christophe Dessimoz, James R. Larus |
| 2020 | Parallelizing Parallel Programs: A Dynamic Pattern Analysis for Modernization of Legacy Parallel Code. Roberto Castañeda Lozano, Murray Cole, Björn Franke |
| 2020 | RackMem: A Tailored Caching Layer for Rack Scale Computing. Changyeon Jo, Hyunik Kim, Hexiang Geng, Bernhard Egger |
| 2020 | Regional Out-of-Order Writes in Total Store Order. Sawan Singh, Alexandra Jimborean, Alberto Ros |
| 2020 | Ribbon: High Performance Cache Line Flushing for Persistent Memory. Kai Wu, Ivy Bo Peng, Jie Ren, Dong Li |
| 2020 | Scalable Specialization: Architectures, Interfaces, & Applications. Sarita V. Adve |
| 2020 | SecSched: Flexible Scheduling in Secure Processors. Omais Shafi, Janibul Bashir |
| 2020 | SparseRT: Accelerating Unstructured Sparsity on GPUs for Deep Learning Inference. Ziheng Wang |
| 2020 | SparseTrain: Leveraging Dynamic Sparsity in Software for Training DNNs on General-Purpose SIMD Processors. Zhangxiaowen Gong, Houxiang Ji, Christopher W. Fletcher, Christopher J. Hughes, Josep Torrellas |
| 2020 | TAFE: Thread Address Footprint Estimation for Capturing Data/Thread Locality in GPU Systems. Kishore Punniyamurthy, Andreas Gerstlauer |
| 2020 | The Forward Slice Core Microarchitecture. Kartik Lakshminarasimhan, Ajeya Naithani, Josué Feliu, Lieven Eeckhout |
| 2020 | Transmuter: Bridging the Efficiency Gap using Memory and Dataflow Reconfiguration. Subhankar Pal, Siying Feng, Dong-Hyeon Park, Sung Kim, Aporva Amarnath, Chi-Sheng Yang, Xin He, Jonathan Beaumont, Kyle May, Yan Xiong, Kuba Kaszyk, John Magnus Morton, Jiawen Sun, Michael F. P. O'Boyle, Murray Cole, Chaitali Chakrabarti, David T. Blaauw, Hun-Seok Kim, Trevor N. Mudge, Ronald G. Dreslinski |
| 2020 | VP Float: First Class Treatment for Variable Precision Floating Point Arithmetic. Tiago T. Jost, Yves Durand, Christian Fabre, Albert Cohen, Frédéric Pétrot |
| 2020 | VTensor: Using Virtual Tensors to Build a Layout-oblivious AI Programming Framework. Feng Yu, Jiacheng Zhao, Huimin Cui, Xiaobing Feng, Jingling Xue |
| 2020 | Valkyrie: Leveraging Inter-TLB Locality to Enhance GPU Performance. Trinayan Baruah, Yifan Sun, Saiful A. Mojumder, José L. Abellán, Yash Ukidave, Ajay Joshi, Norman Rubin, John Kim, David R. Kaeli |
| 2020 | cuSZ: An Efficient GPU-Based Error-Bounded Lossy Compression Framework for Scientific Data. Jiannan Tian, Sheng Di, Kai Zhao, Cody Rivera, Megan Hickman Fulp, Robert Underwood, Sian Jin, Xin Liang, Jon Calhoun, Dingwen Tao, Franck Cappello |