| 2018 | Cellular automata beyond 100k cores: MPI vs Fortran coarrays. Anton Shterenlikht, Luis Cebamanos |
| 2018 | Efficient Asynchronous Communication Progress for MPI without Dedicated Resources. Amit Ruhela, Hari Subramoni, Sourav Chakraborty, Mohammadreza Bayatpour, Pouya Kousha, Dhabaleswar K. Panda |
| 2018 | Enabling callback-driven runtime introspection via MPI_T. Marc-André Hermanns, Nathan T. Hjelm, Michael Knobloch, Kathryn M. Mohror, Martin Schulz |
| 2018 | Energy-efficient localised rollback via data flow analysis and frequency scaling. Kiril Dichev, Kirk W. Cameron, Dimitrios S. Nikolopoulos |
| 2018 | Full-Duplex Inter-Group All-to-All Broadcast Algorithms with Optimal Bandwidth. Qiao Kang, Jesper Larsson Träff, Reda Al-Bahrani, Ankit Agrawal, Alok N. Choudhary, Wei-keng Liao |
| 2018 | Improving Performance Models for Irregular Point-to-Point Communication. Amanda Bienz, William D. Gropp, Luke N. Olson |
| 2018 | Improving the Interoperability between MPI and Task-Based Programming Models. Kevin Sala, Jorge Bellón, Pau Farré, Xavier Teruel, Josep M. Pérez, Antonio J. Peña, Daniel J. Holmes, Vicenç Beltran, Jesús Labarta |
| 2018 | MC-CChecker: A Clock-Based Approach to Detect Memory Consistency Errors in MPI One-Sided Applications. Thanh-Dang Diep, Karl Fürlinger, Nam Thoai |
| 2018 | MPI Derived Datatypes: Performance and Portability Issues. Qingqing Xiong, Purushotham V. Bangalore, Anthony Skjellum, Martin C. Herbordt |
| 2018 | MPI Stages: Checkpointing MPI State for Bulk Synchronous Applications. Nawrin Sultana, Anthony Skjellum, Ignacio Laguna, Matthew Shane Farmer, Kathryn M. Mohror, Murali Emani |
| 2018 | MPI+OpenMP Tasking Scalability for the Simulation of the Human Brain: Human Brain Project. Pedro Valero-Lara, Raül Sirvent, Antonio J. Peña, Xavier Martorell, Jesús Labarta |
| 2018 | Multi-Threading and Lock-Free MPI RMA Based Graph Processing on KNL and POWER Architectures. Mingzhe Li, Xiaoyi Lu, Hari Subramoni, Dhabaleswar K. Panda |
| 2018 | Optimized Broadcast for Deep Learning Workloads on Dense-GPU InfiniBand Clusters: MPI or NCCL? Ammar Ahmad Awan, Ching-Hsiang Chu, Hari Subramoni, Dhabaleswar K. Panda |
| 2018 | Performance model for mesh optimization on distributed-memory computers. Domingo Benitez, José María Escobar, Rafael Montenegro, Eduardo Rodríguez |
| 2018 | Proceedings of the 25th European MPI Users' Group Meeting, Barcelona, Spain, September 23-26, 2018 |
| 2018 | Supporting MPI-distributed stream parallel patterns in GrPPI. Javier Fernández Muñoz, Manuel F. Dolz, David del Rio Astorga, Javier Prieto Cepeda, José Daniel García |
| 2018 | Transparent High-Speed Network Checkpoint/Restart in MPI. Julien Adam, Jean-Baptiste Besnard, Allen D. Malony, Sameer Shende, Marc Pérache, Patrick Carribault, Julien Jaeger |
| 2018 | Using Node Information to Implement MPI Cartesian Topologies. William D. Gropp |
| 2018 | Using Simulation to Examine the Effect of MPI Message Matching Costs on Application Performance. Scott Levy, Kurt B. Ferreira |