| 2015 | A Nested Partitioning Algorithm for Adaptive Meshes on Heterogeneous Clusters. Hari Sundar, Omar Ghattas |
| 2015 | A Stall-Aware Warp Scheduling for Dynamically Optimizing Thread-level Parallelism in GPGPUs. Yulong Yu, Weijun Xiao, Xubin He, He Guo, Yuxin Wang, Xin Chen |
| 2015 | ASPaS: A Framework for Automatic SIMDization of Parallel Sorting on x86-based Many-core Processors. Kaixi Hou, Hao Wang, Wu-chun Feng |
| 2015 | Active Access: A Mechanism for High-Performance Distributed Data-Centric Computations. Maciej Besta, Torsten Hoefler |
| 2015 | Automatic Energy Efficient Parallelization of Uniform Dependence Computations. Yun Zou, Sanjay V. Rajopadhye |
| 2015 | Automatic Parallelization of Kernels in Shared-Memory Multi-GPU Nodes. Javier Cabezas, Lluís Vilanova, Isaac Gelado, Thomas B. Jablin, Nacho Navarro, Wen-mei W. Hwu |
| 2015 | Automatic Selection of Sparse Matrix Representation on GPUs. Naser Sedaghati, Te Mu, Louis-Noël Pouchet, Srinivasan Parthasarathy, P. Sadayappan |
| 2015 | Automatically Scalable Computation. Margo I. Seltzer |
| 2015 | Building Fuel Powered Supercomputing Data Center at Low Cost. Yiqing Hua, Chao Li, Weichao Tang, Li Jiang, Xiaoyao Liang |
| 2015 | COMPASS: A Framework for Automated Performance Modeling and Prediction. Seyong Lee, Jeremy S. Meredith, Jeffrey S. Vetter |
| 2015 | CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication. Weifeng Liu, Brian Vinter |
| 2015 | Composing Algorithmic Skeletons to Express High-Performance Scientific Applications. Mani Zandifar, Mustafa Abdul Jabbar, Alireza Majidi, David E. Keyes, Nancy M. Amato, Lawrence Rauchwerger |
| 2015 | Criticality-Aware Dynamic Task Scheduling for Heterogeneous Architectures. Kallia Chronaki, Alejandro Rico, Rosa M. Badia, Eduard Ayguadé, Jesús Labarta, Mateo Valero |
| 2015 | DASX: Hardware Accelerator for Software Data Structures. Snehasish Kumar, Naveen Vedula, Arrvindh Shriraman, Vijayalakshmi Srinivasan |
| 2015 | DaCache: Memory Divergence-Aware GPU Cache Management. Bin Wang, Weikuan Yu, Xian-He Sun, Xinning Wang |
| 2015 | Datacenter Efficiency: What's Next? Ricardo Bianchini |
| 2015 | Enabling and Exploiting Flexible Task Assignment on GPU through SM-Centric Program Transformations. Bo Wu, Guoyang Chen, Dong Li, Xipeng Shen, Jeffrey S. Vetter |
| 2015 | Exascaling Your Library: Will Your Implementation Meet Your Expectations? Sergei Shudler, Alexandru Calotoiu, Torsten Hoefler, Alexandre Strube, Felix Wolf |
| 2015 | Exploiting Process Imbalance to Improve MPI Collective Operations in Hierarchical Systems. Benjamin S. Parsons, Vijay S. Pai |
| 2015 | FAST: A Fast Stencil Autotuning Framework Based On An Optimal-solution Space Model. Yulong Luo, Guangming Tan, Zeyao Mo, Ninghui Sun |
| 2015 | Fine-Grained Synchronizations and Dataflow Programming on GPUs. Ang Li, Gert-Jan van den Braak, Henk Corporaal, Akash Kumar |
| 2015 | GreenPar: Scheduling Parallel High Performance Applications in Green Datacenters. Md. Enamul Haque, Iñigo Goiri, Ricardo Bianchini, Thu D. Nguyen |
| 2015 | Hadoop+: Modeling and Evaluating the Heterogeneity for MapReduce Applications in Heterogeneous Clusters. Wenting He, Huimin Cui, Binbin Lu, Jiacheng Zhao, Shengmei Li, Gong Ruan, Jingling Xue, Xiaobing Feng, Wensen Yang, Youliang Yan |
| 2015 | History-Assisted Adaptive-Granularity Caches (HAAG$) for High Performance 3D DRAM Architectures. Ke Chen, Sheng Li, Jung Ho Ahn, Naveen Muralimanohar, Jishen Zhao, Cong Xu, Seongil O, Yuan Xie, Jay B. Brockman, Norman P. Jouppi |
| 2015 | Leveraging Silicon-Photonic NoC for Designing Scalable GPUs. Amir Kavyan Ziabari, José L. Abellán, Rafael Ubal, Chao Chen, Ajay Joshi, David R. Kaeli |
| 2015 | Locality-Driven Dynamic GPU Cache Bypassing. Chao Li, Shuaiwen Leon Song, Hongwen Dai, Albert Sidelnik, Siva Kumar Sastry Hari, Huiyang Zhou |
| 2015 | MODESTO: Data-centric Analytic Optimization of Complex Stencil Programs on Heterogeneous Architectures. Tobias Gysi, Tobias Grosser, Torsten Hoefler |
| 2015 | Mower: A New Design for Non-blocking Misprediction Recovery. Zhaoxiang Jin, Görkem Asilioglu, Soner Önder |
| 2015 | Optimistic Delinearization of Parametrically Sized Arrays. Tobias Grosser, Jagannathan Ramanujam, Louis-Noël Pouchet, P. Sadayappan, Sebastian Pop |
| 2015 | Optimizing Overlapped Memory Accesses in User-directed Vectorization. Diego Caballero, Sara Royuela, Roger Ferrer, Alejandro Duran, Xavier Martorell |
| 2015 | PALMOS: A Transparent, Multi-tasking Acceleration Layer for Parallel Heterogeneous Systems. Christos Margiolas, Michael F. P. O'Boyle |
| 2015 | PaCMap: Topology Mapping of Unstructured Communication Patterns onto Non-contiguous Allocations. Ozan Tuncer, Vitus J. Leung, Ayse K. Coskun |
| 2015 | Parameterized Diamond Tiling for Stencil Computations with Chapel parallel iterators. Ian J. Bertolacci, Catherine Olschanowsky, Ben Harshbarger, Bradford L. Chamberlain, David G. Wonnacott, Michelle Mills Strout |
| 2015 | PeerWave: Exploiting Wavefront Parallelism on GPUs with Peer-SM Synchronization. Mehmet E. Belviranli, Peng Deng, Laxmi N. Bhuyan, Rajiv Gupta, Qi Zhu |
| 2015 | Proceedings of the 29th ACM on International Conference on Supercomputing, ICS'15, Newport Beach/Irvine, CA, USA, June 08 - 11, 2015 Laxmi N. Bhuyan, Fred Chong, Vivek Sarkar |
| 2015 | Quantifying Performance Bottlenecks of Stencil Computations Using the Execution-Cache-Memory Model. Holger Stengel, Jan Treibig, Georg Hager, Gerhard Wellein |
| 2015 | Real-Time In-Memory Checkpointing for Future Hybrid Memory Systems. Shen Gao, Bingsheng He, Jianliang Xu |
| 2015 | STAPL-RTS: An Application Driven Runtime System. Ioannis Papadopoulos, Nathan L. Thomas, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger |
| 2015 | SemCache++: Semantics-Aware Caching for Efficient Multi-GPU Offloading. Nabeel AlSaber, Milind Kulkarni |
| 2015 | Streaming Task Parallelism. Albert Cohen |
| 2015 | Towards Lightweight and Swift Storage Resource Management in Big Data Cloud Era. Ruijin Zhou, Huixiang Chen, Tao Li |
| 2015 | Underprovisioning the Grid Power Infrastructure for Green Datacenters. Xu Zhou, Qiang Cao, Hong Jiang, Changsheng Xie |
| 2015 | Unique Worker model for OpenMP. Raghesh Aloor, V. Krishna Nandivada |
| 2015 | zFENCE: Data-less Coherence for Efficient Fences. Shaizeen Aga, Abhayendra Singh, Satish Narayanasamy |