| 2017 | 46th International Conference on Parallel Processing, ICPP 2017, Bristol, United Kingdom, August 14-17, 2017 |
| 2017 | A Coflow-Based Co-Optimization Framework for High-Performance Data Analytics. Long Cheng, Ying Wang, Yulong Pei, Dick H. J. Epema |
| 2017 | A Dynamic Resource Controller for a Lambda Architecture. MohammadReza HoseinyFarahabady, Javid Taheri, Zahir Tari, Albert Y. Zomaya |
| 2017 | A Machine Learning Approach for Efficient Parallel Simulation of Beam Dynamics on GPUs. Kamesh Arumugam, Desh Ranjan, Mohammad Zubair, Balsa Terzic, Alexander N. Godunov, Tunazzina Islam |
| 2017 | A Novel Minimum Time Parallel 2-D Discrete Wavelet Transform Algorithm for General Purpose Processors. Eduardo Moscoso Rubino, Alberto Jose Alvares, Raúl Marín Prades, Pedro Sanz Valero |
| 2017 | A Parallel TSP-Based Algorithm for Balanced Graph Partitioning. Harshvardhan Das, Subodh Kumar |
| 2017 | A Pareto Framework for Data Analytics on Heterogeneous Systems: Implications for Green Energy Usage and Performance. Aniket Chakrabarti, Srinivasan Parthasarathy, Christopher Stewart |
| 2017 | A Scalable Hierarchical Semi-Separable Library for Heterogeneous Clusters. Isuru Dilanka Fernando, Sanath Jayasena, Milinda Fernando, Hari Sundar |
| 2017 | Accelerating Graph Analytics by Utilising the Memory Locality of Graph Partitioning. Jiawen Sun, Hans Vandierendonck, Dimitrios S. Nikolopoulos |
| 2017 | An Efficient, Distributed Stochastic Gradient Descent Algorithm for Deep-Learning Applications. Guojing Cong, Onkar Bhardwaj, Minwei Feng |
| 2017 | Application-Aware Power Coordination on Power Bounded NUMA Multicore Systems. Rong Ge, Pengfei Zou, Xizhou Feng |
| 2017 | Autotuning GPU Kernels via Static and Predictive Analysis. Robert V. Lim, Boyana Norris, Allen D. Malony |
| 2017 | Bitslice Vectors: A Software Approach to Customizable Data Precision on Processors with SIMD Extensions. Shixiong Xu, David Gregg |
| 2017 | Boosting the Efficiency of HPCG and Graph500 with Near-Data Processing. Erik Vermij, Leandro Fiorin, Christoph Hagleitner, Koen Bertels |
| 2017 | CELIA: Cost-Time Performance of Elastic Applications on Cloud. Sunimal Rathnayake, Dumitrel Loghin, Yong Meng Teo |
| 2017 | Constrained Tensor Factorization with Accelerated AO-ADMM. Shaden Smith, Alec Beri, George Karypis |
| 2017 | Data Caching in Next Generation Mobile Cloud Services, Online vs. Off-Line. Yang Wang, Shuibing He, Xiaopeng Fan, Chengzhong Xu, Joseph C. Culberson, Joseph Horton |
| 2017 | E-Storm: Replication-Based State Management in Distributed Stream Processing Systems. Xunyun Liu, Aaron Harwood, Shanika Karunasekera, Benjamin I. P. Rubinstein, Rajkumar Buyya |
| 2017 | ES2: Aiming at an Optimal Virtual I/O Event Path. Xiaokang Hu, Wang Zhang, Jian Li, Ruhui Ma, Feng Wu, Haibing Guan |
| 2017 | Efficient Data Sharing on Heterogeneous Systems. Victor Garcia-Flores, Eduard Ayguadé, Antonio J. Peña |
| 2017 | Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning. Ching-Hsiang Chu, Xiaoyi Lu, Ammar Ahmad Awan, Hari Subramoni, Jahanzeb Maqbool Hashmi, Bracy Elton, Dhabaleswar K. Panda |
| 2017 | Exploiting GPUs for Fast Force-Directed Visualization of Large-Scale Networks. Govert G. Brinkmann, Kristian F. D. Rietveld, Frank W. Takes |
| 2017 | Fading-Resistant Link Scheduling in Wireless Networks. Chenxi Qiu, Haiying Shen |
| 2017 | Favorable Block First: A Comprehensive Cache Scheme to Accelerate Partial Stripe Recovery of Triple Disk Failure Tolerant Arrays. Luyu Li, Houxiang Ji, Chentao Wu, Jie Li, Minyi Guo |
| 2017 | GCN: GPU-Based Cube CNN Framework for Hyperspectral Image Classification. Han Dong, Tao Li, Jiabing Leng, Lingyan Kong, Gang Bai |
| 2017 | GLTO: On the Adequacy of Lightweight Thread Approaches for OpenMP Implementations. Adrián Castelló, Sangmin Seo, Rafael Mayo, Pavan Balaji, Enrique S. Quintana-Ortí, Antonio J. Peña |
| 2017 | Greed Is Good: Parallel Algorithms for Bipartite-Graph Partial Coloring on Multicore Architectures. Mustafa Kemal Tas, Kamer Kaya, Erik Saule |
| 2017 | High Performance Query Processing for Web Scale RDF Data using BSP Style Communication and Balanced Distribution. Minho Bae, Junho Eum, Donghoon Kim, Sangyoon Oh |
| 2017 | High-Performance Recommender System Training Using Co-Clustering on CPU/GPU Clusters. Kubilay Atasu, Thomas P. Parnell, Celestine Dünner, Michail Vlachos, Haralampos Pozidis |
| 2017 | High-Performance and Memory-Saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPU. Yusuke Nagasaka, Akira Nukada, Satoshi Matsuoka |
| 2017 | HyPPI NoC: Bringing Hybrid Plasmonics to an Opto-Electronic Network-on-Chip. Vikram K. Narayana, Shuai Sun, Armin Mehrabian, Volker J. Sorger, Tarek A. El-Ghazawi |
| 2017 | Large-Scale Parallelization of Smoothed Particle Hydrodynamics Method on Heterogeneous Cluster. Yingrui Wang, Leisheng Li, Rong Tian |
| 2017 | Locality-Aware Dynamic Task Graph Scheduling. Jordyn Maglalang, Sriram Krishnamoorthy, Kunal Agrawal |
| 2017 | MPI-GDS: High Performance MPI Designs with GPUDirect-aSync for CPU-GPU Control Flow Decoupling. Akshay Venkatesh, Khaled Hamidouche, Sreeram Potluri, Davide Rossetti, Ching-Hsiang Chu, Dhabaleswar K. Panda |
| 2017 | Multiple Pattern Matching for Network Security Applications: Acceleration through Vectorization. Charalampos Stylianopoulos, Magnus Almgren, Olaf Landsiedel, Marina Papatriantafilou |
| 2017 | Nearly Balanced Work Partitioning for Heterogeneous Algorithms. Mallipeddi Hardhik, Dip Sankar Banerjee, Kiran Raj Ramamoorthy, Kishore Kothapalli, Kannan Srinathan |
| 2017 | Network Aware Multi-User Computation Partitioning in Mobile Edge Clouds. Lei Yang, Jiannong Cao, Zhenyu Wang, Weigang Wu |
| 2017 | Non-Sequential Striping for Distributed Storage Systems with Different Redundancy Schemes. Yanwen Xie, Dan Feng, Fang Wang |
| 2017 | OptiMatch: Enabling an Optimal Match between Green Power and Various Workloads for Renewable-Energy Powered Storage Systems. Xiaoyang Qu, Jiguang Wan, Fengguang Song, Xiaozhao Zhuang, Fei Wu, Changsheng Xie |
| 2017 | Optimizations of Two Compute-Bound Scientific Kernels on the SW26010 Many-Core Processor. James Lin, Zhigeng Xu, Akira Nukada, Naoya Maruyama, Satoshi Matsuoka |
| 2017 | Order/Radix Problem: Towards Low End-to-End Latency Interconnection Networks. Ryota Yasudo, Michihiro Koibuchi, Koji Nakano, Hiroki Matsutani, Hideharu Amano |
| 2017 | Overlapping Data Transfers with Computation on GPU with Tiles. Burak Bastem, Didem Unat, Weiqun Zhang, Ann S. Almgren, John Shalf |
| 2017 | PDS: An I/O-Efficient Scaling Scheme for Parity Declustered Data Layout. Zhipeng Li, Yinlong Xu, Yongkun Li, Chengjin Tian, Youhui Bai |
| 2017 | Parallel Algorithm for Single-Source Earliest-Arrival Problem in Temporal Graphs. Peng Ni, Masatoshi Hanai, Wen Jun Tan, Chen Wang, Wentong Cai |
| 2017 | Parallel Algorithms for the Computation of Cycles in Relative Neighborhood Graphs. Hari Sundar, Parmeshwar Khurd |
| 2017 | Parallel Construction of Simultaneous Deterministic Finite Automata on Shared-Memory Multicores. Minyoung Jung, Jinwoo Park, Johann Blieberger, Bernd Burgstaller |
| 2017 | Parallel Reconstruction of Three Dimensional Magnetohydrodynamic Equilibria in Plasma Confinement Devices. Sudip K. Seal, Mark R. Cianciosa, Steven P. Hirshman, Andreas Wingen, Robert S. Wilcox, Ezekial A. Unterberg |
| 2017 | Parallel Space-Time Kernel Density Estimation. Erik Saule, Dinesh Panchananam, Alexander Hohl, Wenwu Tang, Eric Delmelle |
| 2017 | Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors. Athena Elafrou, Georgios I. Goumas, Nectarios Koziris |
| 2017 | Practical Experience with Transactional Lock Elision. Tingzhe Zhou, Pantea Zardoshti, Michael F. Spear |
| 2017 | Predicting Response Latency Percentiles for Cloud Object Storage Systems. Yi Su, Dan Feng, Yu Hua, Zhan Shi |
| 2017 | Preparing HPC Applications for the Exascale Era: A Decoupling Strategy. Ivy Bo Peng, Roberto Gioiosa, Gokcen Kestor, Erwin Laure, Stefano Markidis |
| 2017 | Resilience for Stencil Computations with Latent Errors. Aiman Fang, Aurélien Cavelan, Yves Robert, Andrew A. Chien |
| 2017 | Runtime Data Layout Scheduling for Machine Learning Dataset. Yang You, James Demmel |
| 2017 | Scalable Write Allocation in the WAFL File System. Matthew Curtis-Maury, Ram Kesavan, Mrinal K. Bhattacharjee |
| 2017 | Scheduling Independent Tasks in Parallel under Power Constraints. Ayham Kassab, Jean-Marc Nicod, Laurent Philippe, Veronika Rehn-Sonigo |
| 2017 | Simple and Fast Parallel Algorithms for the Voronoi Map and the Euclidean Distance Map, with GPU Implementations. Takumi Honda, Shinnosuke Yamamoto, Hiroaki Honda, Koji Nakano, Yasuaki Ito |
| 2017 | The Cloud as an OpenMP Offloading Device. Hervé Yviquel, Guido Araujo |
| 2017 | Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor. Lijuan Jiang, Chao Yang, Yulong Ao, Wanwang Yin, Wenjing Ma, Qiao Sun, Fangfang Liu, Rongfen Lin, Peng Zhang |
| 2017 | Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning. Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí |
| 2017 | WA-Dataspaces: Exploring the Data Staging Abstractions for Wide-Area Distributed Scientific Workflows. Mehmet Fatih Aktas, Javier Diaz Montes, Ivan Rodero, Manish Parashar |