PACT B

50 papers

YearTitle / Authors
2020A Methodology for Principled Approximation in Visual SLAM.
Yan Pei, Swarnendu Biswas, Donald S. Fussell, Keshav Pingali
2020A New Qubits Mapping Mechanism for Multi-programming Quantum Computing.
Xinglei Dou, Lei Liu
2020ATTC (@C): Addressable-TLB based Translation Coherence.
Harsh Gugale, Nagendra Gulur, Yashwant Marathe, Lizy K. John
2020Accelerating Sparse CNN Inference on GPUs with Performance-Aware Weight Pruning.
Masuma Akter Rumi, Xiaolong Ma, Yanzhi Wang, Peng Jiang
2020Analyzing and Leveraging Shared L1 Caches in GPUs.
Mohamed Assem Ibrahim, Onur Kayiran, Yasuko Eckert, Gabriel H. Loh, Adwait Jog
2020Approximate Pattern Matching for On-Chip Interconnect Traffic Prediction.
Vignesh Adhinarayanan, Wu-chun Feng
2020AutoHOOT: Automatic High-Order Optimization for Tensors.
Linjian Ma, Jiayu Ye, Edgar Solomonik
2020Automatic Generation of Multi-Objective Polyhedral Compiler Transformations.
Lorenzo Chelini, Tobias Gysi, Tobias Grosser, Martin Kong, Henk Corporaal
2020Bandwidth Bottleneck in Network-on-Chip for High-Throughput Processors.
Jiho Kim, Sanghun Cho, Minsoo Rhu, Ali Bakhoda, Tor M. Aamodt, John Kim
2020Bandwidth-Aware Loop Tiling for DMA-Supported Scratchpad Memory.
Mingchuan Wu, Ying Liu, Huimin Cui, Qingfu Wei, Quanfeng Li, Limin Li, Fang Lv, Jingling Xue, Xiaobing Feng
2020Clearing the Shadows: Recovering Lost Performance for Invisible Speculative Execution through HW/SW Co-Design.
Kim-Anh Tran, Christos Sakalis, Magnus Själander, Alberto Ros, Stefanos Kaxiras, Alexandra Jimborean
2020Collective Affinity Aware Computation Mapping.
Mahmut T. Kandemir, Jihyun Ryoo, Hui Zhao, Myoungsoo Jung, Mustafa Karaköy
2020Compiling Chapel: Keys to Making Parallel Programming Productive at Scale.
Bradford L. Chamberlain
2020Decoupled Address Translation for Heterogeneous Memory Systems.
Bokyeong Kim, Soojin Hwang, Sanghoon Cha, Chang Hyun Park, Jongse Park, Jaehyuk Huh
2020Deep Learning Assisted Resource Partitioning for Improving Performance on Commodity Servers.
Ruobing Chen, Jinping Wu, Haosen Shi, Yusen Li, Haiyan Yin, Shanjiang Tang, Xiaoguang Liu, Gang Wang
2020Deep Program Structure Modeling Through Multi-Relational Graph-based Learning.
Guixin Ye, Zhanyong Tang, Huanting Wang, Dingyi Fang, Jianbin Fang, Songfang Huang, Zheng Wang
2020DeepSwapper: A Deep Learning Based Page Swap Management Scheme for Hybrid Memory Systems.
Majed Valad Beigi, Bahareh Pourshirazi, Gokhan Memik, Zhichun Zhu
2020Enhancing Address Translations in Throughput Processors via Compression.
Xulong Tang, Ziyu Zhang, Weizheng Xu, Mahmut Taylan Kandemir, Rami G. Melhem, Jun Yang
2020Exploiting Locality in Scalable Ordered Maps.
Matthew Rodriguez, Ahmed Hassan, Michael F. Spear
2020Exploring the Design Space of Static and Incremental Graph Connectivity Algorithms on GPUs.
Changwan Hong, Laxman Dhulipala, Julian Shun
2020Fast Convolutional Neural Networks with Fine-Grained FFTs.
Yulin Zhang, Xiaoming Li
2020Fireiron: A Data-Movement-Aware Scheduling Language for GPUs.
Bastian Hagedorn, Archibald Samuel Elliott, Henrik Barthels, Rastislav Bodík, Vinod Grover
2020GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU.
Chanyoung Oh, Zhen Zheng, Xipeng Shen, Jidong Zhai, Youngmin Yi
2020Helix: Algorithm/Architecture Co-design for Accelerating Nanopore Genome Base-calling.
Qian Lou, Sarath Chandra Janga, Lei Jiang
2020Intelligent Data Placement on Discrete GPU Nodes with Unified Memory.
Tanzima Sultana, Blake Allen, Apan Qasem
2020Low-Latency Proactive Continuous Vision.
Yiming Gan, Yuxian Qiu, Lele Chen, Jingwen Leng, Yuhao Zhu
2020MEPHESTO: Modeling Energy-Performance in Heterogeneous SoCs and Their Trade-Offs.
Mohammad Alaul Haque Monil, Mehmet E. Belviranli, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony
2020Memory-Equipped Quantum Architectures: The Power of Random Access.
Jonathan M. Baker, David I. Schuster, Frederic T. Chong
2020Mixed-Signal Charge-Domain Acceleration of Deep Neural Networks through Interleaved Bit-Partitioned Arithmetic.
Soroush Ghodrati, Hardik Sharma, Sean Kinzer, Amir Yazdanbakhsh, Jongse Park, Nam Sung Kim, Doug Burger, Hadi Esmaeilzadeh
2020Model-Based Warp Overlapped Tiling for Image Processing Programs on GPUs.
Abhinav Jangda, Arjun Guha
2020Opportunistic Early Pipeline Re-steering for Data-dependent Branches.
Saurabh Gupta, Niranjan Soundararajan, Ragavendra Natarajan, Sreenivas Subramoney
2020Overview of HPC and AI Computing for COVID-19 in the US.
Rick Stevens
2020PACT '20: International Conference on Parallel Architectures and Compilation Techniques, Virtual Event, GA, USA, October 3-7, 2020
Vivek Sarkar, Hyesoon Kim
2020PRISM: Architectural Support for Variable-granularity Memory Metadata.
Rachata Ausavarungnirun, Timothy Merrifield, Jayneel Gandhi, Christopher J. Rossbach
2020Parallel and Scalable Precise Clustering.
Stuart Byma, Akash Balasaheb Dhasade, Adrian M. Altenhoff, Christophe Dessimoz, James R. Larus
2020Parallelizing Parallel Programs: A Dynamic Pattern Analysis for Modernization of Legacy Parallel Code.
Roberto Castañeda Lozano, Murray Cole, Björn Franke
2020RackMem: A Tailored Caching Layer for Rack Scale Computing.
Changyeon Jo, Hyunik Kim, Hexiang Geng, Bernhard Egger
2020Regional Out-of-Order Writes in Total Store Order.
Sawan Singh, Alexandra Jimborean, Alberto Ros
2020Ribbon: High Performance Cache Line Flushing for Persistent Memory.
Kai Wu, Ivy Bo Peng, Jie Ren, Dong Li
2020Scalable Specialization: Architectures, Interfaces, & Applications.
Sarita V. Adve
2020SecSched: Flexible Scheduling in Secure Processors.
Omais Shafi, Janibul Bashir
2020SparseRT: Accelerating Unstructured Sparsity on GPUs for Deep Learning Inference.
Ziheng Wang
2020SparseTrain: Leveraging Dynamic Sparsity in Software for Training DNNs on General-Purpose SIMD Processors.
Zhangxiaowen Gong, Houxiang Ji, Christopher W. Fletcher, Christopher J. Hughes, Josep Torrellas
2020TAFE: Thread Address Footprint Estimation for Capturing Data/Thread Locality in GPU Systems.
Kishore Punniyamurthy, Andreas Gerstlauer
2020The Forward Slice Core Microarchitecture.
Kartik Lakshminarasimhan, Ajeya Naithani, Josué Feliu, Lieven Eeckhout
2020Transmuter: Bridging the Efficiency Gap using Memory and Dataflow Reconfiguration.
Subhankar Pal, Siying Feng, Dong-Hyeon Park, Sung Kim, Aporva Amarnath, Chi-Sheng Yang, Xin He, Jonathan Beaumont, Kyle May, Yan Xiong, Kuba Kaszyk, John Magnus Morton, Jiawen Sun, Michael F. P. O'Boyle, Murray Cole, Chaitali Chakrabarti, David T. Blaauw, Hun-Seok Kim, Trevor N. Mudge, Ronald G. Dreslinski
2020VP Float: First Class Treatment for Variable Precision Floating Point Arithmetic.
Tiago T. Jost, Yves Durand, Christian Fabre, Albert Cohen, Frédéric Pétrot
2020VTensor: Using Virtual Tensors to Build a Layout-oblivious AI Programming Framework.
Feng Yu, Jiacheng Zhao, Huimin Cui, Xiaobing Feng, Jingling Xue
2020Valkyrie: Leveraging Inter-TLB Locality to Enhance GPU Performance.
Trinayan Baruah, Yifan Sun, Saiful A. Mojumder, José L. Abellán, Yash Ukidave, Ajay Joshi, Norman Rubin, John Kim, David R. Kaeli
2020cuSZ: An Efficient GPU-Based Error-Bounded Lossy Compression Framework for Scientific Data.
Jiannan Tian, Sheng Di, Kai Zhao, Cody Rivera, Megan Hickman Fulp, Robert Underwood, Sian Jin, Xin Liang, Jon Calhoun, Dingwen Tao, Franck Cappello