| 2017 | A Multicore Path to Connectomics-on-Demand. Alexander Matveev, Yaron Meirovitch, Hayk Saribekyan, Wiktor Jakubiuk, Tim Kaler, Gergely Ódor, David M. Budden, Aleksandar Zlateski, Nir Shavit |
| 2017 | An Efficient Abortable-locking Protocol for Multi-level NUMA Systems. Milind Chabbi, Abdelhalim Amer, Shasha Wen, Xu Liu |
| 2017 | Checking Concurrent Data Structures Under the C/C++11 Memory Model. Peizhao Ou, Brian Demsky |
| 2017 | Combining SIMD and Many/Multi-core Parallelism for Finite State Machines with Enumerative Speculation. Peng Jiang, Gagan Agrawal |
| 2017 | Contention in Structured Concurrency: Provably Efficient Dynamic Non-Zero Indicators for Nested Parallelism. Umut A. Acar, Naama Ben-David, Mike Rainey |
| 2017 | EffiSha: A Software Framework for Enabling Effficient Preemptive Scheduling of GPU. Guoyang Chen, Yue Zhao, Xipeng Shen, Huiyang Zhou |
| 2017 | Eunomia: Scaling Concurrent Search Trees under Contention Using HTM. Xin Wang, Weihua Zhang, Zhaoguo Wang, Ziyun Wei, Haibo Chen, Wenyun Zhao |
| 2017 | Exploiting Vector and Multicore Parallelism for Recursive, Data- and Task-Parallel Programs. Bin Ren, Sriram Krishnamoorthy, Kunal Agrawal, Milind Kulkarni |
| 2017 | Function Call Re-Vectorization. Rubens E. A. Moreira, Caroline Collange, Fernando Magno Quintão Pereira |
| 2017 | Grammar-aware Parallelization for Scalable XPath Querying. Lin Jiang, Zhijia Zhao |
| 2017 | Groute: An Asynchronous Multi-GPU Programming Model for Irregular Computations. Tal Ben-Nun, Michael Sutton, Sreepathi Pai, Keshav Pingali |
| 2017 | Isoefficiency in Practice: Configuring and Understanding the Performance of Task-based Applications. Sergei Shudler, Alexandru Calotoiu, Torsten Hoefler, Felix Wolf |
| 2017 | It's Time for a New Old Language. Guy L. Steele Jr. |
| 2017 | KiWi: A Key-Value Map for Scalable Real-Time Analytics. Dmitry Basin, Edward Bortnikov, Anastasia Braginsky, Guy Golan-Gueta, Eshcar Hillel, Idit Keidar, Moshe Sulamy |
| 2017 | Layout Lock: A Scalable Locking Paradigm for Concurrent Data Layout Modifications. Nachshon Cohen, Arie Tal, Erez Petrank |
| 2017 | Model-based Iterative CT Image Reconstruction on GPUs. Amit Sabne, Xiao Wang, Sherman J. Kisner, Charles A. Bouman, Anand Raghunathan, Samuel P. Midkiff |
| 2017 | Noise Injection Techniques to Expose Subtle and Unintended Message Races. Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz, Christopher M. Chambreau |
| 2017 | Optimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis. Samyam Rajbhandari, Fabrice Rastello, Karol Kowalski, Sriram Krishnamoorthy, P. Sadayappan |
| 2017 | POSTER: A GPU-Friendly Skiplist Algorithm. Nurit Moscovici, Nachshon Cohen, Erez Petrank |
| 2017 | POSTER: A Wait-Free Queue with Wait-Free Memory Reclamation. Pedro Ramalhete, Andreia Correia |
| 2017 | POSTER: An Architecture and Programming Model for Accelerating Parallel Commutative Computations via Privatization. Vignesh Balaji, Dhruva Tirumala, Brandon Lucia |
| 2017 | POSTER: An Infrastructure for HPC Knowledge Sharing and Reuse. Yue Zhao, Chunhua Liao, Xipeng Shen |
| 2017 | POSTER: Automated Load Balancer Selection Based on Application Characteristics. Harshitha Menon, Kavitha Chandrasekar, Laxmikant V. Kalé |
| 2017 | POSTER: Cache-Oblivious MPI All-to-All Communications on Many-Core Architectures. Shigang Li, Yunquan Zhang, Torsten Hoefler |
| 2017 | POSTER: Distributed Control: The Benefits of Eliminating Global Synchronization via Effective Scheduling. Jesun Sahariar Firoz, Thejaka Amila Kanewala, Marcin Zalewski, Martina Barnas, Andrew Lumsdaine |
| 2017 | POSTER: HythTM: Extending the Applicability of Intel TSX Hardware Transactional Support. Arnamoy Bhattacharyya, Mike Dai Wang, Mihai Burcea, Yi Ding, Allen Deng, Sai Varikooty, Shafaaf Hossain, Cristiana Amza |
| 2017 | POSTER: IOGP: An Incremental Online Graph Partitioning for Large-Scale Distributed Graph Databases. Dong Dai, Wei Zhang, Yong Chen |
| 2017 | POSTER: MAPA: An Automatic Memory Access Pattern Analyzer for GPU Applications. Gangwon Jo, Jaehoon Jung, Jiyoung Park, Jaejin Lee |
| 2017 | POSTER: On the Problem of Consistency Exceptions in the Context of Strong Memory Models. Minjia Zhang, Swarnendu Biswas, Michael D. Bond |
| 2017 | POSTER: Poor Man's URCU. Pedro Ramalhete, Andreia Correia |
| 2017 | POSTER: Provably Efficient Scheduling of Cache-Oblivious Wavefront Algorithms. Rezaul Chowdhury, Pramod Ganapathi, Yuan Tang, Jesmin Jahan Tithi |
| 2017 | POSTER: Recovering Performance for Vector-based Machine Learning on Managed Runtime. Mingyu Wu, Haibing Guan, Binyu Zang, Haibo Chen |
| 2017 | POSTER: Reuse, don't Recycle: Transforming Algorithms that Throw Away Descriptors. Maya Arbel-Raviv, Trevor Brown |
| 2017 | POSTER: STAR (Space-Time Adaptive and Reductive) Algorithms for Real-World Space-Time Optimality. Yuan Tang, Ronghui You |
| 2017 | POSTER: State Teleportation via Hardware Transactional Memory. Nachshon Cohen, Maurice Herlihy, Erez Petrank, Elias Wald |
| 2017 | Pagoda: Fine-Grained GPU Resource Virtualization for Narrow Tasks. Tsung Tai Yeh, Amit Sabne, Putt Sakdhnagool, Rudolf Eigenmann, Timothy G. Rogers |
| 2017 | Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Austin, TX, USA, February 4-8, 2017 Vivek Sarkar, Lawrence Rauchwerger |
| 2017 | Processor-Oblivious Record and Replay. Robert Utterback, Kunal Agrawal, I-Ting Angelina Lee, Milind Kulkarni |
| 2017 | S-Caffe: Co-designing MPI Runtimes and Caffe for Scalable Deep Learning on Modern GPU Clusters. Ammar Ahmad Awan, Khaled Hamidouche, Jahanzeb Maqbool Hashmi, Dhabaleswar K. Panda |
| 2017 | SC-Haskell: Sequential Consistency in Languages That Minimize Mutable Shared Heap. Michael Vollmer, Ryan G. Scott, Madanlal Musuvathi, Ryan R. Newton |
| 2017 | Self-Checkpoint: An In-Memory Checkpoint Method Using Less Space and Its Practice on Fault-Tolerant HPL. Xiongchao Tang, Jidong Zhai, Bowen Yu, Wenguang Chen, Weimin Zheng |
| 2017 | Silent Data Corruption Resilient Two-sided Matrix Factorizations. Panruo Wu, Nathan DeBardeleben, Qiang Guan, Sean Blanchard, Jieyang Chen, Dingwen Tao, Xin Liang, Kaiming Ouyang, Zizhong Chen |
| 2017 | Simple, Accurate, Analytical Time Modeling and Optimal Tile Size Selection for GPGPU Stencils. Nirmal Prajapati, Waruna Ranasinghe, Sanjay V. Rajopadhye, Rumen Andonov, Hristo N. Djidjev, Tobias Grosser |
| 2017 | Synchronized-by-Default Concurrency for Shared-Memory Systems. Martin Bättig, Thomas R. Gross |
| 2017 | Tapir: Embedding Fork-Join Parallelism into LLVM's Intermediate Representation. Tao B. Schardl, William S. Moses, Charles E. Leiserson |
| 2017 | Thread Data Sharing in Cache: Theory and Measurement. Hao Luo, Pengcheng Li, Chen Ding |
| 2017 | Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning. Xiuxia Zhang, Guangming Tan, Shuangbai Xue, Jiajia Li, Keren Zhou, Mingyu Chen |
| 2017 | Using Butterfly-Patterned Partial Sums to Draw from Discrete Distributions. Guy L. Steele Jr., Jean-Baptiste Tristan |