| 2018 | A Case for Granularity Aware Page Migration. Jee Ho Ryoo, Lizy K. John, Arkaprava Basu |
| 2018 | A two-phase recovery mechanism. Zhaoxiang Jin, Soner Önder |
| 2018 | Accurate, Fast and Scalable Kernel Ridge Regression on Parallel and Distributed Systems. Yang You, James Demmel, Cho-Jui Hsieh, Richard W. Vuduc |
| 2018 | Analysis-driven Engineering of Comparison-based Sorting Algorithms on GPUs. Ben Karsin, Volker Weichert, Henri Casanova, John Iacono, Nodari Sitchinava |
| 2018 | Automated Analysis of Time Series Data to Understand Parallel Program Behaviors. Lai Wei, John M. Mellor-Crummey |
| 2018 | Bootstrapping Parameter Space Exploration for Fast Tuning. Jayaraman J. Thiagarajan, Nikhil Jain, Rushil Anirudh, Alfredo Giménez, Rahul Sridhar, Aniruddha Marathe, Tao Wang, Murali Emani, Abhinav Bhatele, Todd Gamblin |
| 2018 | CELIA: A Device and Architecture Co-Design Framework for STT-MRAM-Based Deep Learning Acceleration. Hao Yan, Hebin R. Cherian, Ethan C. Ahn, Lide Duan |
| 2018 | ChplBlamer: A Data-centric and Code-centric Combined Profiler for Multi-locale Chapel Programs. Hui Zhang, Jeffrey K. Hollingsworth |
| 2018 | Classification-Driven Search for Effective SM Partitioning in Multitasking GPUs. Xia Zhao, Zhiying Wang, Lieven Eeckhout |
| 2018 | ComPEND: Computation Pruning through Early Negative Detection for ReLU in a Deep Neural Network Accelerator. Dongwoo Lee, Sungbum Kang, Kiyoung Choi |
| 2018 | Demystifying Cache Policies for Photo Stores at Scale: A Tencent Case Study. Ke Zhou, Si Sun, Hua Wang, Ping Huang, Xubin He, Rui Lan, Wenyan Li, Wenjie Liu, Tianming Yang |
| 2018 | Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs. Jacob Lambert, Seyong Lee, Jungwon Kim, Jeffrey S. Vetter, Allen D. Malony |
| 2018 | Dynamic Load Balancing for Compressible Multiphase Turbulence. Keke Zhai, Tania Banerjee, David Zwick, Jason Hackl, Sanjay Ranka |
| 2018 | GRU: Exploring Computation and Data Redundancy via Partial GPU Computing Result Reuse. Husheng Zhou, Soroush Bateni, Cong Liu |
| 2018 | HALO: A Hierarchical Memory Access Locality Modeling Technique For Memory System Explorations. Reena Panda, Lizy K. John |
| 2018 | High-Performance, Low-Complexity Deadlock Avoidance for Arbitrary Topologies/Routings. Jose Antonio Pascual, Javier Navaridas |
| 2018 | IRIS: I/O Redirection via Integrated Storage. Anthony Kougkas, Hariharan Devarajan, Xian-He Sun |
| 2018 | Isometry: A Path-Based Distributed Data Transfer System. Zhihao Jia, Sean Treichler, Galen M. Shipman, Patrick S. McCormick, Alex Aiken |
| 2018 | On Optimizing Distributed Tucker Decomposition for Sparse Tensors. Venkatesan T. Chakaravarthy, Jee W. Choi, Douglas J. Joseph, Prakash Murali, Shivmaran S. Pandian, Yogish Sabharwal, Dheeraj Sreedhar |
| 2018 | Optimizing Data Aggregation by Leveraging the Deep Memory Hierarchy on Large-scale Systems. François Tessier, Paul Gressier, Venkatram Vishwanath |
| 2018 | Optimizing Tensor Contractions in CCSD(T) for Efficient Execution on GPUs. Jinsung Kim, Aravind Sukumaran-Rajam, Changwan Hong, Ajay Panyala, Rohit Kumar Srivastava, Sriram Krishnamoorthy, P. Sadayappan |
| 2018 | PA-SSD: A Page-Type Aware TLC SSD for Improved Write/Read Performance and Storage Efficiency. Wenhui Zhang, Qiang Cao, Hong Jiang, Jie Yao |
| 2018 | PFault: A General Framework for Analyzing the Reliability of High-Performance Parallel File Systems. Jinrui Cao, Om Rameshwar Gatla, Mai Zheng, Dong Dai, Vidya Eswarappa, Yan Mu, Yong Chen |
| 2018 | Phase-Aware Web Browser Power Management on HMP Platforms. Nadja Peters, Sangyoung Park, Daniel Clifford, S. Kyostila, Ross McIlroy, Benedikt Meurer, Hannes Payer, Samarjit Chakraborty |
| 2018 | Proceedings of the 32nd International Conference on Supercomputing, ICS 2018, Beijing, China, June 12-15, 2018 |
| 2018 | ProfDP: A Lightweight Profiler to Guide Data Placement in Heterogeneous Memory Systems. Shasha Wen, Lucy Cherkasova, Felix Xiaozhu Lin, Xu Liu |
| 2018 | ReGraph: A Graph Processing Framework that Alternately Shrinks and Repartitions the Graph. Xue Li, Mingxing Zhang, Kang Chen, Yongwei Wu |
| 2018 | Reducing Data Movement on Large Shared Memory Systems by Exploiting Computation Dependencies. Isaac Sánchez Barrera, Miquel Moretó, Eduard Ayguadé, Jesús Labarta, Mateo Valero, Marc Casas |
| 2018 | Rethinking Node Allocation Strategy for Data-intensive Applications in Consideration of Spatially Bursty I/O. Jie Yu, Guangming Liu, Xin Liu, Wenrui Dong, Xiaoyong Li, Yusheng Liu |
| 2018 | Revisiting Loop Tiling for Datacenters: Live and Let Live. Jiacheng Zhao, Huimin Cui, Yalin Zhang, Jingling Xue, Xiaobing Feng |
| 2018 | Runtime-Guided Management of Stacked DRAM Memories in Task Parallel Programs. Lluc Alvarez, Marc Casas, Jesús Labarta, Eduard Ayguadé, Mateo Valero, Miquel Moretó |
| 2018 | Sculptor: Flexible Approximation with Selective Dynamic Loop Perforation. Shikai Li, Sunghyun Park, Scott A. Mahlke |
| 2018 | The Broker Queue: A Fast, Linearizable FIFO Queue for Fine-Granular Work Distribution on the GPU. Bernhard Kerbl, Michael Kenzel, Joerg H. Mueller, Dieter Schmalstieg, Markus Steinberger |
| 2018 | Towards Efficient SpMV on Sunway Manycore Architectures. Changxi Liu, Biwei Xie, Xin Liu, Wei Xue, Hailong Yang, Xu Liu |
| 2018 | Warp-Consolidation: A Novel Execution Model for GPUs. Ang Li, Weifeng Liu, Linnan Wang, Kevin J. Barker, Shuaiwen Leon Song |
| 2018 | Zwift: A Programming Framework for High Performance Text Analytics on Compressed Data. Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Wenguang Chen |
| 2018 | cuMBIR: An Efficient Framework for Low-dose X-ray CT Image Reconstruction on GPUs. Xiuhong Li, Yun Liang, Wentai Zhang, Taide Liu, Haochen Li, Guojie Luo, Ming Jiang |