| 2017 | 26th International Conference on Parallel Architectures and Compilation Techniques, PACT 2017, Portland, OR, USA, September 9-13, 2017 |
| 2017 | A DSL for Performance Orchestration. Thiago Santos Faria Xavier Teixeira, David A. Padua, William Gropp |
| 2017 | A GPU-Friendly Skiplist Algorithm. Nurit Moscovici, Nachshon Cohen, Erez Petrank |
| 2017 | A Generalized Framework for Automatic Scripting Language Parallelization. Taewook Oh, Stephen R. Beard, Nick P. Johnson, Sergiy Popovych, David I. August |
| 2017 | An Ultra Low-Power Hardware Accelerator for Acoustic Scoring in Speech Recognition. Hamid Tabani, José-María Arnau, Jordi Tubella, Antonio González |
| 2017 | Application Clustering Policies to Address System Fairness with Intel's Cache Allocation Technology. Vicent Selfa, Julio Sahuquillo, Lieven Eeckhout, Salvador Petit, María Engracia Gómez |
| 2017 | Architecting a Novel Hybrid Cache with Low Energy. Jiacong He, Joseph Callenes-Sloan |
| 2017 | Avoiding TLB Shootdowns Through Self-Invalidating TLB Entries. Amro Awad, Arkaprava Basu, Sergey Blagodurov, Yan Solihin, Gabriel H. Loh |
| 2017 | Cache Automaton: Repurposing Caches for Automata Processing. Arun Subramaniyan, Jingcheng Wang, Ezhil R. M. Balasubramanian, David T. Blaauw, Dennis Sylvester, Reetuparna Das |
| 2017 | DRUT: An Efficient Turbo Boost Solution via Load Balancing in Decoupled Look-Ahead Architecture. Raj Parihar, Michael C. Huang |
| 2017 | DrMP: Mixed Precision-Aware DRAM for High Performance Approximate and Precise Computing. Xianwei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang |
| 2017 | Efficient Checkpointing of Loop-Based Codes for Non-volatile Main Memory. Hussein Elnawawy, Mohammad A. Alshboul, James Tuck, Yan Solihin |
| 2017 | End-to-End Deep Learning of Optimization Heuristics. Chris Cummins, Pavlos Petoumenos, Zheng Wang, Hugh Leather |
| 2017 | Exploiting Asymmetric SIMD Register Configurations in ARM-to-x86 Dynamic Binary Translation. Yu-Ping Liu, Ding-Yong Hong, Jan-Jan Wu, Sheng-Yu Fu, Wei-Chung Hsu |
| 2017 | Graphie: Large-Scale Asynchronous Graph Traversals on Just a GPU. Wei Han, Daniel Mawhirter, Bo Wu, Matthew Buland |
| 2017 | In-memory Data Flow Processor. Daichi Fujiki, Scott A. Mahlke, Reetuparna Das |
| 2017 | Introspective Computing. Karl Taht, Rajeev Balasubramonian |
| 2017 | Large Scale Data Clustering Using Memristive k-Median Computation. Yomi Karthik Rupesh, Mahdi Nazm Bojnordi |
| 2017 | Leeway: Addressing Variability in Dead-Block Prediction for Last-Level Caches. Priyank Faldu, Boris Grot |
| 2017 | Lightweight Provenance Service for High-Performance Computing. Dong Dai, Yong Chen, Philip H. Carns, John Jenkins, Robert B. Ross |
| 2017 | MultiGraph: Efficient Graph Processing on GPUs. Changwan Hong, Aravind Sukumaran-Rajam, Jinsung Kim, P. Sadayappan |
| 2017 | Multilayer Compute Resource Management with Robust Control Theory. Raghavendra Pradyumna Pothukuchi, Sweta Yamini Pothukuchi, Petros G. Voulgaris, Josep Torrellas |
| 2017 | Near-Memory Address Translation. Javier Picorel, Djordje Jevdjic, Babak Falsafi |
| 2017 | Nexus: A New Approach to Replication in Distributed Shared Caches. Po-An Tsai, Nathan Beckmann, Daniel Sánchez |
| 2017 | POSTER: Accelerate GPU Concurrent Kernel Execution by Mitigating Memory Pipeline Stalls. Hongwen Dai, Zhen Lin, Chao Li, Chen Zhao, Fei Wang, Nanning Zheng, Huiyang Zhou |
| 2017 | POSTER: Application-Driven Near-Data Processing for Similarity Search. Vincent T. Lee, Amrita Mazumdar, Carlo C. del Mundo, Armin Alaghi, Luis Ceze, Mark Oskin |
| 2017 | POSTER: BACM: Barrier-Aware Cache Management for Irregular Memory-Intensive GPGPU Workloads. Yuxi Liu, Xia Zhao, Zhibin Yu, Zhenlin Wang, Xiaolin Wang, Yingwei Luo, Lieven Eeckhout |
| 2017 | POSTER: BigBus: A Scalable Optical Interconnect. Eldhose Peter, Janibul Bashir, Smruti R. Sarangi |
| 2017 | POSTER: Bridge the Gap Between Neural Networks and Neuromorphic Hardware. Yu Ji, Youhui Zhang, Wenguang Chen, Yuan Xie |
| 2017 | POSTER: Bridging the Gap Between Deep Learning and Sparse Matrix Format Selection. Yue Zhao, Jiajia Li, Chunhua Liao, Xipeng Shen |
| 2017 | POSTER: Cutting the Fat: Speeding Up RBM for Fast Deep Learning Through Generalized Redundancy Elimination. Lin Ning, Randall Pittman, Xipeng Shen |
| 2017 | POSTER: DaQueue: A Data-Aware Work-Queue Design for GPGPUs. Ya-Shuai Lü, Libo Huang, Li Shen |
| 2017 | POSTER: Design Space Exploration for Performance Optimization of Deep Neural Networks on Shared Memory Accelerators. Swagath Venkataramani, Jungwook Choi, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan, Leland Chang |
| 2017 | POSTER: Elastic Reconfiguration for Heterogeneous NoCs with BiNoCHS. Amirhossein Mirhosseini, Mohammad Sadrosadati, Behnaz Soltani, Hamid Sarbazi-Azad, Thomas F. Wenisch |
| 2017 | POSTER: Exploiting Approximations for Energy/Quality Tradeoffs in Service-Based Applications. Liu Liu, Sibren Isaacman, Abhishek Bhattacharjee, Ulrich Kremer |
| 2017 | POSTER: Improving Datacenter Efficiency Through Partitioning-Aware Scheduling. Harshad Kasture, Xu Ji, Nosayba El-Sayed, Nathan Beckmann, Xiaosong Ma, Daniel Sánchez |
| 2017 | POSTER: Improving NUMA System Efficiency with a Utilization-Based Co-scheduling. Younghyun Cho, Camilo A. Celis Guzman, Bernhard Egger |
| 2017 | POSTER: Location-Aware Computation Mapping for Manycore Processors. Orhan Kislal, Jagadish Kotra, Xulong Tang, Mahmut Taylan Kandemir, Myoungsoo Jung |
| 2017 | POSTER: NUMA-Aware Power Management for Chip Multiprocessors. Changmin Ahn, Camilo A. Celis Guzman, Bernhard Egger |
| 2017 | POSTER: Putting the G back into GPU/CPU Systems Research. Andreas Sembrant, Trevor E. Carlson, Erik Hagersten, David Black-Schaffer |
| 2017 | POSTER: Statement Reordering to Alleviate Register Pressure for Stencils on GPUs. Prashant Singh Rawat, Aravind Sukumaran-Rajam, Atanas Rountev, Fabrice Rastello, Louis-Noël Pouchet, P. Sadayappan |
| 2017 | POSTER: The Liberation Day of Nondeterministic Programs. Enrico Armenio Deiana, Vincent St-Amour, Peter A. Dinda, Nikos Hardavellas, Simone Campanoni |
| 2017 | Performance Improvement via Always-Abort HTM. Joseph Izraelevitz, Lingxiang Xiang, Michael L. Scott |
| 2017 | Proxy Benchmarks for Emerging Big-Data Workloads. Reena Panda, Lizy Kurian John |
| 2017 | RCU-HTM: Combining RCU with HTM to Implement Highly Efficient Concurrent Binary Search Trees. Dimitrios Siakavaras, Konstantinos Nikas, Georgios I. Goumas, Nectarios Koziris |
| 2017 | Redesigning Go's Built-In Map to Support Concurrent Operations. Louis Jenkins, Tingzhe Zhou, Michael F. Spear |
| 2017 | SAM: Optimizing Multithreaded Cores for Speculative Parallelism. Maleen Abeydeera, Suvinay Subramanian, Mark C. Jeffrey, Joel S. Emer, Daniel Sánchez |
| 2017 | Sthira: A Formal Approach to Minimize Voltage Guardbands under Variation in Networks-on-Chip for Energy Efficiency. Raghavendra Pradyumna Pothukuchi, Amin Ansari, Bhargava Gopireddy, Josep Torrellas |
| 2017 | SuperGraph-SLP Auto-Vectorization. Vasileios Porpodas |
| 2017 | Transparent Dual Memory Compression Architecture. Seikwon Kim, Seonyoung Lee, Taehoon Kim, Jaehyuk Huh |
| 2017 | Weak Memory Models: Balancing Definitional Simplicity and Implementation Flexibility. Sizhuo Zhang, Muralidaran Vijayaraghavan, Arvind |