ICS A

44 papers

YearTitle / Authors
2015A Nested Partitioning Algorithm for Adaptive Meshes on Heterogeneous Clusters.
Hari Sundar, Omar Ghattas
2015A Stall-Aware Warp Scheduling for Dynamically Optimizing Thread-level Parallelism in GPGPUs.
Yulong Yu, Weijun Xiao, Xubin He, He Guo, Yuxin Wang, Xin Chen
2015ASPaS: A Framework for Automatic SIMDization of Parallel Sorting on x86-based Many-core Processors.
Kaixi Hou, Hao Wang, Wu-chun Feng
2015Active Access: A Mechanism for High-Performance Distributed Data-Centric Computations.
Maciej Besta, Torsten Hoefler
2015Automatic Energy Efficient Parallelization of Uniform Dependence Computations.
Yun Zou, Sanjay V. Rajopadhye
2015Automatic Parallelization of Kernels in Shared-Memory Multi-GPU Nodes.
Javier Cabezas, Lluís Vilanova, Isaac Gelado, Thomas B. Jablin, Nacho Navarro, Wen-mei W. Hwu
2015Automatic Selection of Sparse Matrix Representation on GPUs.
Naser Sedaghati, Te Mu, Louis-Noël Pouchet, Srinivasan Parthasarathy, P. Sadayappan
2015Automatically Scalable Computation.
Margo I. Seltzer
2015Building Fuel Powered Supercomputing Data Center at Low Cost.
Yiqing Hua, Chao Li, Weichao Tang, Li Jiang, Xiaoyao Liang
2015COMPASS: A Framework for Automated Performance Modeling and Prediction.
Seyong Lee, Jeremy S. Meredith, Jeffrey S. Vetter
2015CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication.
Weifeng Liu, Brian Vinter
2015Composing Algorithmic Skeletons to Express High-Performance Scientific Applications.
Mani Zandifar, Mustafa Abdul Jabbar, Alireza Majidi, David E. Keyes, Nancy M. Amato, Lawrence Rauchwerger
2015Criticality-Aware Dynamic Task Scheduling for Heterogeneous Architectures.
Kallia Chronaki, Alejandro Rico, Rosa M. Badia, Eduard Ayguadé, Jesús Labarta, Mateo Valero
2015DASX: Hardware Accelerator for Software Data Structures.
Snehasish Kumar, Naveen Vedula, Arrvindh Shriraman, Vijayalakshmi Srinivasan
2015DaCache: Memory Divergence-Aware GPU Cache Management.
Bin Wang, Weikuan Yu, Xian-He Sun, Xinning Wang
2015Datacenter Efficiency: What's Next?
Ricardo Bianchini
2015Enabling and Exploiting Flexible Task Assignment on GPU through SM-Centric Program Transformations.
Bo Wu, Guoyang Chen, Dong Li, Xipeng Shen, Jeffrey S. Vetter
2015Exascaling Your Library: Will Your Implementation Meet Your Expectations?
Sergei Shudler, Alexandru Calotoiu, Torsten Hoefler, Alexandre Strube, Felix Wolf
2015Exploiting Process Imbalance to Improve MPI Collective Operations in Hierarchical Systems.
Benjamin S. Parsons, Vijay S. Pai
2015FAST: A Fast Stencil Autotuning Framework Based On An Optimal-solution Space Model.
Yulong Luo, Guangming Tan, Zeyao Mo, Ninghui Sun
2015Fine-Grained Synchronizations and Dataflow Programming on GPUs.
Ang Li, Gert-Jan van den Braak, Henk Corporaal, Akash Kumar
2015GreenPar: Scheduling Parallel High Performance Applications in Green Datacenters.
Md. Enamul Haque, Iñigo Goiri, Ricardo Bianchini, Thu D. Nguyen
2015Hadoop+: Modeling and Evaluating the Heterogeneity for MapReduce Applications in Heterogeneous Clusters.
Wenting He, Huimin Cui, Binbin Lu, Jiacheng Zhao, Shengmei Li, Gong Ruan, Jingling Xue, Xiaobing Feng, Wensen Yang, Youliang Yan
2015History-Assisted Adaptive-Granularity Caches (HAAG$) for High Performance 3D DRAM Architectures.
Ke Chen, Sheng Li, Jung Ho Ahn, Naveen Muralimanohar, Jishen Zhao, Cong Xu, Seongil O, Yuan Xie, Jay B. Brockman, Norman P. Jouppi
2015Leveraging Silicon-Photonic NoC for Designing Scalable GPUs.
Amir Kavyan Ziabari, José L. Abellán, Rafael Ubal, Chao Chen, Ajay Joshi, David R. Kaeli
2015Locality-Driven Dynamic GPU Cache Bypassing.
Chao Li, Shuaiwen Leon Song, Hongwen Dai, Albert Sidelnik, Siva Kumar Sastry Hari, Huiyang Zhou
2015MODESTO: Data-centric Analytic Optimization of Complex Stencil Programs on Heterogeneous Architectures.
Tobias Gysi, Tobias Grosser, Torsten Hoefler
2015Mower: A New Design for Non-blocking Misprediction Recovery.
Zhaoxiang Jin, Görkem Asilioglu, Soner Önder
2015Optimistic Delinearization of Parametrically Sized Arrays.
Tobias Grosser, Jagannathan Ramanujam, Louis-Noël Pouchet, P. Sadayappan, Sebastian Pop
2015Optimizing Overlapped Memory Accesses in User-directed Vectorization.
Diego Caballero, Sara Royuela, Roger Ferrer, Alejandro Duran, Xavier Martorell
2015PALMOS: A Transparent, Multi-tasking Acceleration Layer for Parallel Heterogeneous Systems.
Christos Margiolas, Michael F. P. O'Boyle
2015PaCMap: Topology Mapping of Unstructured Communication Patterns onto Non-contiguous Allocations.
Ozan Tuncer, Vitus J. Leung, Ayse K. Coskun
2015Parameterized Diamond Tiling for Stencil Computations with Chapel parallel iterators.
Ian J. Bertolacci, Catherine Olschanowsky, Ben Harshbarger, Bradford L. Chamberlain, David G. Wonnacott, Michelle Mills Strout
2015PeerWave: Exploiting Wavefront Parallelism on GPUs with Peer-SM Synchronization.
Mehmet E. Belviranli, Peng Deng, Laxmi N. Bhuyan, Rajiv Gupta, Qi Zhu
2015Proceedings of the 29th ACM on International Conference on Supercomputing, ICS'15, Newport Beach/Irvine, CA, USA, June 08 - 11, 2015
Laxmi N. Bhuyan, Fred Chong, Vivek Sarkar
2015Quantifying Performance Bottlenecks of Stencil Computations Using the Execution-Cache-Memory Model.
Holger Stengel, Jan Treibig, Georg Hager, Gerhard Wellein
2015Real-Time In-Memory Checkpointing for Future Hybrid Memory Systems.
Shen Gao, Bingsheng He, Jianliang Xu
2015STAPL-RTS: An Application Driven Runtime System.
Ioannis Papadopoulos, Nathan L. Thomas, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger
2015SemCache++: Semantics-Aware Caching for Efficient Multi-GPU Offloading.
Nabeel AlSaber, Milind Kulkarni
2015Streaming Task Parallelism.
Albert Cohen
2015Towards Lightweight and Swift Storage Resource Management in Big Data Cloud Era.
Ruijin Zhou, Huixiang Chen, Tao Li
2015Underprovisioning the Grid Power Infrastructure for Green Datacenters.
Xu Zhou, Qiang Cao, Hong Jiang, Changsheng Xie
2015Unique Worker model for OpenMP.
Raghesh Aloor, V. Krishna Nandivada
2015zFENCE: Data-less Coherence for Efficient Fences.
Shaizeen Aga, Abhayendra Singh, Satish Narayanasamy