ICS A

37 papers

YearTitle / Authors
2018A Case for Granularity Aware Page Migration.
Jee Ho Ryoo, Lizy K. John, Arkaprava Basu
2018A two-phase recovery mechanism.
Zhaoxiang Jin, Soner Önder
2018Accurate, Fast and Scalable Kernel Ridge Regression on Parallel and Distributed Systems.
Yang You, James Demmel, Cho-Jui Hsieh, Richard W. Vuduc
2018Analysis-driven Engineering of Comparison-based Sorting Algorithms on GPUs.
Ben Karsin, Volker Weichert, Henri Casanova, John Iacono, Nodari Sitchinava
2018Automated Analysis of Time Series Data to Understand Parallel Program Behaviors.
Lai Wei, John M. Mellor-Crummey
2018Bootstrapping Parameter Space Exploration for Fast Tuning.
Jayaraman J. Thiagarajan, Nikhil Jain, Rushil Anirudh, Alfredo Giménez, Rahul Sridhar, Aniruddha Marathe, Tao Wang, Murali Emani, Abhinav Bhatele, Todd Gamblin
2018CELIA: A Device and Architecture Co-Design Framework for STT-MRAM-Based Deep Learning Acceleration.
Hao Yan, Hebin R. Cherian, Ethan C. Ahn, Lide Duan
2018ChplBlamer: A Data-centric and Code-centric Combined Profiler for Multi-locale Chapel Programs.
Hui Zhang, Jeffrey K. Hollingsworth
2018Classification-Driven Search for Effective SM Partitioning in Multitasking GPUs.
Xia Zhao, Zhiying Wang, Lieven Eeckhout
2018ComPEND: Computation Pruning through Early Negative Detection for ReLU in a Deep Neural Network Accelerator.
Dongwoo Lee, Sungbum Kang, Kiyoung Choi
2018Demystifying Cache Policies for Photo Stores at Scale: A Tencent Case Study.
Ke Zhou, Si Sun, Hua Wang, Ping Huang, Xubin He, Rui Lan, Wenyan Li, Wenjie Liu, Tianming Yang
2018Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs.
Jacob Lambert, Seyong Lee, Jungwon Kim, Jeffrey S. Vetter, Allen D. Malony
2018Dynamic Load Balancing for Compressible Multiphase Turbulence.
Keke Zhai, Tania Banerjee, David Zwick, Jason Hackl, Sanjay Ranka
2018GRU: Exploring Computation and Data Redundancy via Partial GPU Computing Result Reuse.
Husheng Zhou, Soroush Bateni, Cong Liu
2018HALO: A Hierarchical Memory Access Locality Modeling Technique For Memory System Explorations.
Reena Panda, Lizy K. John
2018High-Performance, Low-Complexity Deadlock Avoidance for Arbitrary Topologies/Routings.
Jose Antonio Pascual, Javier Navaridas
2018IRIS: I/O Redirection via Integrated Storage.
Anthony Kougkas, Hariharan Devarajan, Xian-He Sun
2018Isometry: A Path-Based Distributed Data Transfer System.
Zhihao Jia, Sean Treichler, Galen M. Shipman, Patrick S. McCormick, Alex Aiken
2018On Optimizing Distributed Tucker Decomposition for Sparse Tensors.
Venkatesan T. Chakaravarthy, Jee W. Choi, Douglas J. Joseph, Prakash Murali, Shivmaran S. Pandian, Yogish Sabharwal, Dheeraj Sreedhar
2018Optimizing Data Aggregation by Leveraging the Deep Memory Hierarchy on Large-scale Systems.
François Tessier, Paul Gressier, Venkatram Vishwanath
2018Optimizing Tensor Contractions in CCSD(T) for Efficient Execution on GPUs.
Jinsung Kim, Aravind Sukumaran-Rajam, Changwan Hong, Ajay Panyala, Rohit Kumar Srivastava, Sriram Krishnamoorthy, P. Sadayappan
2018PA-SSD: A Page-Type Aware TLC SSD for Improved Write/Read Performance and Storage Efficiency.
Wenhui Zhang, Qiang Cao, Hong Jiang, Jie Yao
2018PFault: A General Framework for Analyzing the Reliability of High-Performance Parallel File Systems.
Jinrui Cao, Om Rameshwar Gatla, Mai Zheng, Dong Dai, Vidya Eswarappa, Yan Mu, Yong Chen
2018Phase-Aware Web Browser Power Management on HMP Platforms.
Nadja Peters, Sangyoung Park, Daniel Clifford, S. Kyostila, Ross McIlroy, Benedikt Meurer, Hannes Payer, Samarjit Chakraborty
2018Proceedings of the 32nd International Conference on Supercomputing, ICS 2018, Beijing, China, June 12-15, 2018
2018ProfDP: A Lightweight Profiler to Guide Data Placement in Heterogeneous Memory Systems.
Shasha Wen, Lucy Cherkasova, Felix Xiaozhu Lin, Xu Liu
2018ReGraph: A Graph Processing Framework that Alternately Shrinks and Repartitions the Graph.
Xue Li, Mingxing Zhang, Kang Chen, Yongwei Wu
2018Reducing Data Movement on Large Shared Memory Systems by Exploiting Computation Dependencies.
Isaac Sánchez Barrera, Miquel Moretó, Eduard Ayguadé, Jesús Labarta, Mateo Valero, Marc Casas
2018Rethinking Node Allocation Strategy for Data-intensive Applications in Consideration of Spatially Bursty I/O.
Jie Yu, Guangming Liu, Xin Liu, Wenrui Dong, Xiaoyong Li, Yusheng Liu
2018Revisiting Loop Tiling for Datacenters: Live and Let Live.
Jiacheng Zhao, Huimin Cui, Yalin Zhang, Jingling Xue, Xiaobing Feng
2018Runtime-Guided Management of Stacked DRAM Memories in Task Parallel Programs.
Lluc Alvarez, Marc Casas, Jesús Labarta, Eduard Ayguadé, Mateo Valero, Miquel Moretó
2018Sculptor: Flexible Approximation with Selective Dynamic Loop Perforation.
Shikai Li, Sunghyun Park, Scott A. Mahlke
2018The Broker Queue: A Fast, Linearizable FIFO Queue for Fine-Granular Work Distribution on the GPU.
Bernhard Kerbl, Michael Kenzel, Joerg H. Mueller, Dieter Schmalstieg, Markus Steinberger
2018Towards Efficient SpMV on Sunway Manycore Architectures.
Changxi Liu, Biwei Xie, Xin Liu, Wei Xue, Hailong Yang, Xu Liu
2018Warp-Consolidation: A Novel Execution Model for GPUs.
Ang Li, Weifeng Liu, Linnan Wang, Kevin J. Barker, Shuaiwen Leon Song
2018Zwift: A Programming Framework for High Performance Text Analytics on Compressed Data.
Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Wenguang Chen
2018cuMBIR: An Efficient Framework for Low-dose X-ray CT Image Reconstruction on GPUs.
Xiuhong Li, Yun Liang, Wentai Zhang, Taide Liu, Haochen Li, Guojie Luo, Ming Jiang