ICPP B

61 papers

Year Title / Authors
2017 46th International Conference on Parallel Processing, ICPP 2017, Bristol, United Kingdom, August 14-17, 2017
2017 A Coflow-Based Co-Optimization Framework for High-Performance Data Analytics.
Long Cheng, Ying Wang, Yulong Pei, Dick H. J. Epema
2017 A Dynamic Resource Controller for a Lambda Architecture.
MohammadReza HoseinyFarahabady, Javid Taheri, Zahir Tari, Albert Y. Zomaya
2017 A Machine Learning Approach for Efficient Parallel Simulation of Beam Dynamics on GPUs.
Kamesh Arumugam, Desh Ranjan, Mohammad Zubair, Balsa Terzic, Alexander N. Godunov, Tunazzina Islam
2017 A Novel Minimum Time Parallel 2-D Discrete Wavelet Transform Algorithm for General Purpose Processors.
Eduardo Moscoso Rubino, Alberto Jose Alvares, Raúl Marín Prades, Pedro Sanz Valero
2017 A Parallel TSP-Based Algorithm for Balanced Graph Partitioning.
Harshvardhan Das, Subodh Kumar
2017 A Pareto Framework for Data Analytics on Heterogeneous Systems: Implications for Green Energy Usage and Performance.
Aniket Chakrabarti, Srinivasan Parthasarathy, Christopher Stewart
2017 A Scalable Hierarchical Semi-Separable Library for Heterogeneous Clusters.
Isuru Dilanka Fernando, Sanath Jayasena, Milinda Fernando, Hari Sundar
2017 Accelerating Graph Analytics by Utilising the Memory Locality of Graph Partitioning.
Jiawen Sun, Hans Vandierendonck, Dimitrios S. Nikolopoulos
2017 An Efficient, Distributed Stochastic Gradient Descent Algorithm for Deep-Learning Applications.
Guojing Cong, Onkar Bhardwaj, Minwei Feng
2017 Application-Aware Power Coordination on Power Bounded NUMA Multicore Systems.
Rong Ge, Pengfei Zou, Xizhou Feng
2017 Autotuning GPU Kernels via Static and Predictive Analysis.
Robert V. Lim, Boyana Norris, Allen D. Malony
2017 Bitslice Vectors: A Software Approach to Customizable Data Precision on Processors with SIMD Extensions.
Shixiong Xu, David Gregg
2017 Boosting the Efficiency of HPCG and Graph500 with Near-Data Processing.
Erik Vermij, Leandro Fiorin, Christoph Hagleitner, Koen Bertels
2017 CELIA: Cost-Time Performance of Elastic Applications on Cloud.
Sunimal Rathnayake, Dumitrel Loghin, Yong Meng Teo
2017 Constrained Tensor Factorization with Accelerated AO-ADMM.
Shaden Smith, Alec Beri, George Karypis
2017 Data Caching in Next Generation Mobile Cloud Services, Online vs. Off-Line.
Yang Wang, Shuibing He, Xiaopeng Fan, Chengzhong Xu, Joseph C. Culberson, Joseph Horton
2017 E-Storm: Replication-Based State Management in Distributed Stream Processing Systems.
Xunyun Liu, Aaron Harwood, Shanika Karunasekera, Benjamin I. P. Rubinstein, Rajkumar Buyya
2017 ES2: Aiming at an Optimal Virtual I/O Event Path.
Xiaokang Hu, Wang Zhang, Jian Li, Ruhui Ma, Feng Wu, Haibing Guan
2017 Efficient Data Sharing on Heterogeneous Systems.
Victor Garcia-Flores, Eduard Ayguadé, Antonio J. Peña
2017 Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning.
Ching-Hsiang Chu, Xiaoyi Lu, Ammar Ahmad Awan, Hari Subramoni, Jahanzeb Maqbool Hashmi, Bracy Elton, Dhabaleswar K. Panda
2017 Exploiting GPUs for Fast Force-Directed Visualization of Large-Scale Networks.
Govert G. Brinkmann, Kristian F. D. Rietveld, Frank W. Takes
2017 Fading-Resistant Link Scheduling in Wireless Networks.
Chenxi Qiu, Haiying Shen
2017 Favorable Block First: A Comprehensive Cache Scheme to Accelerate Partial Stripe Recovery of Triple Disk Failure Tolerant Arrays.
Luyu Li, Houxiang Ji, Chentao Wu, Jie Li, Minyi Guo
2017 GCN: GPU-Based Cube CNN Framework for Hyperspectral Image Classification.
Han Dong, Tao Li, Jiabing Leng, Lingyan Kong, Gang Bai
2017 GLTO: On the Adequacy of Lightweight Thread Approaches for OpenMP Implementations.
Adrián Castelló, Sangmin Seo, Rafael Mayo, Pavan Balaji, Enrique S. Quintana-Ortí, Antonio J. Peña
2017 Greed Is Good: Parallel Algorithms for Bipartite-Graph Partial Coloring on Multicore Architectures.
Mustafa Kemal Tas, Kamer Kaya, Erik Saule
2017 High Performance Query Processing for Web Scale RDF Data using BSP Style Communication and Balanced Distribution.
Minho Bae, Junho Eum, Donghoon Kim, Sangyoon Oh
2017 High-Performance Recommender System Training Using Co-Clustering on CPU/GPU Clusters.
Kubilay Atasu, Thomas P. Parnell, Celestine Dünner, Michail Vlachos, Haralampos Pozidis
2017 High-Performance and Memory-Saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPU.
Yusuke Nagasaka, Akira Nukada, Satoshi Matsuoka
2017 HyPPI NoC: Bringing Hybrid Plasmonics to an Opto-Electronic Network-on-Chip.
Vikram K. Narayana, Shuai Sun, Armin Mehrabian, Volker J. Sorger, Tarek A. El-Ghazawi
2017 Large-Scale Parallelization of Smoothed Particle Hydrodynamics Method on Heterogeneous Cluster.
Yingrui Wang, Leisheng Li, Rong Tian
2017 Locality-Aware Dynamic Task Graph Scheduling.
Jordyn Maglalang, Sriram Krishnamoorthy, Kunal Agrawal
2017 MPI-GDS: High Performance MPI Designs with GPUDirect-aSync for CPU-GPU Control Flow Decoupling.
Akshay Venkatesh, Khaled Hamidouche, Sreeram Potluri, Davide Rossetti, Ching-Hsiang Chu, Dhabaleswar K. Panda
2017 Multiple Pattern Matching for Network Security Applications: Acceleration through Vectorization.
Charalampos Stylianopoulos, Magnus Almgren, Olaf Landsiedel, Marina Papatriantafilou
2017 Nearly Balanced Work Partitioning for Heterogeneous Algorithms.
Mallipeddi Hardhik, Dip Sankar Banerjee, Kiran Raj Ramamoorthy, Kishore Kothapalli, Kannan Srinathan
2017 Network Aware Multi-User Computation Partitioning in Mobile Edge Clouds.
Lei Yang, Jiannong Cao, Zhenyu Wang, Weigang Wu
2017 Non-Sequential Striping for Distributed Storage Systems with Different Redundancy Schemes.
Yanwen Xie, Dan Feng, Fang Wang
2017 OptiMatch: Enabling an Optimal Match between Green Power and Various Workloads for Renewable-Energy Powered Storage Systems.
Xiaoyang Qu, Jiguang Wan, Fengguang Song, Xiaozhao Zhuang, Fei Wu, Changsheng Xie
2017 Optimizations of Two Compute-Bound Scientific Kernels on the SW26010 Many-Core Processor.
James Lin, Zhigeng Xu, Akira Nukada, Naoya Maruyama, Satoshi Matsuoka
2017 Order/Radix Problem: Towards Low End-to-End Latency Interconnection Networks.
Ryota Yasudo, Michihiro Koibuchi, Koji Nakano, Hiroki Matsutani, Hideharu Amano
2017 Overlapping Data Transfers with Computation on GPU with Tiles.
Burak Bastem, Didem Unat, Weiqun Zhang, Ann S. Almgren, John Shalf
2017 PDS: An I/O-Efficient Scaling Scheme for Parity Declustered Data Layout.
Zhipeng Li, Yinlong Xu, Yongkun Li, Chengjin Tian, Youhui Bai
2017 Parallel Algorithm for Single-Source Earliest-Arrival Problem in Temporal Graphs.
Peng Ni, Masatoshi Hanai, Wen Jun Tan, Chen Wang, Wentong Cai
2017 Parallel Algorithms for the Computation of Cycles in Relative Neighborhood Graphs.
Hari Sundar, Parmeshwar Khurd
2017 Parallel Construction of Simultaneous Deterministic Finite Automata on Shared-Memory Multicores.
Minyoung Jung, Jinwoo Park, Johann Blieberger, Bernd Burgstaller
2017 Parallel Reconstruction of Three Dimensional Magnetohydrodynamic Equilibria in Plasma Confinement Devices.
Sudip K. Seal, Mark R. Cianciosa, Steven P. Hirshman, Andreas Wingen, Robert S. Wilcox, Ezekial A. Unterberg
2017 Parallel Space-Time Kernel Density Estimation.
Erik Saule, Dinesh Panchananam, Alexander Hohl, Wenwu Tang, Eric Delmelle
2017 Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors.
Athena Elafrou, Georgios I. Goumas, Nectarios Koziris
2017 Practical Experience with Transactional Lock Elision.
Tingzhe Zhou, Pantea Zardoshti, Michael F. Spear
2017 Predicting Response Latency Percentiles for Cloud Object Storage Systems.
Yi Su, Dan Feng, Yu Hua, Zhan Shi
2017 Preparing HPC Applications for the Exascale Era: A Decoupling Strategy.
Ivy Bo Peng, Roberto Gioiosa, Gokcen Kestor, Erwin Laure, Stefano Markidis
2017 Resilience for Stencil Computations with Latent Errors.
Aiman Fang, Aurélien Cavelan, Yves Robert, Andrew A. Chien
2017 Runtime Data Layout Scheduling for Machine Learning Dataset.
Yang You, James Demmel
2017 Scalable Write Allocation in the WAFL File System.
Matthew Curtis-Maury, Ram Kesavan, Mrinal K. Bhattacharjee
2017 Scheduling Independent Tasks in Parallel under Power Constraints.
Ayham Kassab, Jean-Marc Nicod, Laurent Philippe, Veronika Rehn-Sonigo
2017 Simple and Fast Parallel Algorithms for the Voronoi Map and the Euclidean Distance Map, with GPU Implementations.
Takumi Honda, Shinnosuke Yamamoto, Hiroaki Honda, Koji Nakano, Yasuaki Ito
2017 The Cloud as an OpenMP Offloading Device.
Hervé Yviquel, Guido Araujo
2017 Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor.
Lijuan Jiang, Chao Yang, Yulong Ao, Wanwang Yin, Wenjing Ma, Qiao Sun, Fangfang Liu, Rongfen Lin, Peng Zhang
2017 Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning.
Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí
2017 WA-Dataspaces: Exploring the Data Staging Abstractions for Wide-Area Distributed Scientific Workflows.
Mehmet Fatih Aktas, Javier Diaz Montes, Ivan Rodero, Manish Parashar