ICS A

37 papers

Year	Title / Authors
2018	A Case for Granularity Aware Page Migration. Jee Ho Ryoo, Lizy K. John, Arkaprava Basu
2018	A two-phase recovery mechanism. Zhaoxiang Jin, Soner Önder
2018	Accurate, Fast and Scalable Kernel Ridge Regression on Parallel and Distributed Systems. Yang You, James Demmel, Cho-Jui Hsieh, Richard W. Vuduc
2018	Analysis-driven Engineering of Comparison-based Sorting Algorithms on GPUs. Ben Karsin, Volker Weichert, Henri Casanova, John Iacono, Nodari Sitchinava
2018	Automated Analysis of Time Series Data to Understand Parallel Program Behaviors. Lai Wei, John M. Mellor-Crummey
2018	Bootstrapping Parameter Space Exploration for Fast Tuning. Jayaraman J. Thiagarajan, Nikhil Jain, Rushil Anirudh, Alfredo Giménez, Rahul Sridhar, Aniruddha Marathe, Tao Wang, Murali Emani, Abhinav Bhatele, Todd Gamblin
2018	CELIA: A Device and Architecture Co-Design Framework for STT-MRAM-Based Deep Learning Acceleration. Hao Yan, Hebin R. Cherian, Ethan C. Ahn, Lide Duan
2018	ChplBlamer: A Data-centric and Code-centric Combined Profiler for Multi-locale Chapel Programs. Hui Zhang, Jeffrey K. Hollingsworth
2018	Classification-Driven Search for Effective SM Partitioning in Multitasking GPUs. Xia Zhao, Zhiying Wang, Lieven Eeckhout
2018	ComPEND: Computation Pruning through Early Negative Detection for ReLU in a Deep Neural Network Accelerator. Dongwoo Lee, Sungbum Kang, Kiyoung Choi
2018	Demystifying Cache Policies for Photo Stores at Scale: A Tencent Case Study. Ke Zhou, Si Sun, Hua Wang, Ping Huang, Xubin He, Rui Lan, Wenyan Li, Wenjie Liu, Tianming Yang
2018	Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs. Jacob Lambert, Seyong Lee, Jungwon Kim, Jeffrey S. Vetter, Allen D. Malony
2018	Dynamic Load Balancing for Compressible Multiphase Turbulence. Keke Zhai, Tania Banerjee, David Zwick, Jason Hackl, Sanjay Ranka
2018	GRU: Exploring Computation and Data Redundancy via Partial GPU Computing Result Reuse. Husheng Zhou, Soroush Bateni, Cong Liu
2018	HALO: A Hierarchical Memory Access Locality Modeling Technique For Memory System Explorations. Reena Panda, Lizy K. John
2018	High-Performance, Low-Complexity Deadlock Avoidance for Arbitrary Topologies/Routings. Jose Antonio Pascual, Javier Navaridas
2018	IRIS: I/O Redirection via Integrated Storage. Anthony Kougkas, Hariharan Devarajan, Xian-He Sun
2018	Isometry: A Path-Based Distributed Data Transfer System. Zhihao Jia, Sean Treichler, Galen M. Shipman, Patrick S. McCormick, Alex Aiken
2018	On Optimizing Distributed Tucker Decomposition for Sparse Tensors. Venkatesan T. Chakaravarthy, Jee W. Choi, Douglas J. Joseph, Prakash Murali, Shivmaran S. Pandian, Yogish Sabharwal, Dheeraj Sreedhar
2018	Optimizing Data Aggregation by Leveraging the Deep Memory Hierarchy on Large-scale Systems. François Tessier, Paul Gressier, Venkatram Vishwanath
2018	Optimizing Tensor Contractions in CCSD(T) for Efficient Execution on GPUs. Jinsung Kim, Aravind Sukumaran-Rajam, Changwan Hong, Ajay Panyala, Rohit Kumar Srivastava, Sriram Krishnamoorthy, P. Sadayappan
2018	PA-SSD: A Page-Type Aware TLC SSD for Improved Write/Read Performance and Storage Efficiency. Wenhui Zhang, Qiang Cao, Hong Jiang, Jie Yao
2018	PFault: A General Framework for Analyzing the Reliability of High-Performance Parallel File Systems. Jinrui Cao, Om Rameshwar Gatla, Mai Zheng, Dong Dai, Vidya Eswarappa, Yan Mu, Yong Chen
2018	Phase-Aware Web Browser Power Management on HMP Platforms. Nadja Peters, Sangyoung Park, Daniel Clifford, S. Kyostila, Ross McIlroy, Benedikt Meurer, Hannes Payer, Samarjit Chakraborty
2018	Proceedings of the 32nd International Conference on Supercomputing, ICS 2018, Beijing, China, June 12-15, 2018
2018	ProfDP: A Lightweight Profiler to Guide Data Placement in Heterogeneous Memory Systems. Shasha Wen, Lucy Cherkasova, Felix Xiaozhu Lin, Xu Liu
2018	ReGraph: A Graph Processing Framework that Alternately Shrinks and Repartitions the Graph. Xue Li, Mingxing Zhang, Kang Chen, Yongwei Wu
2018	Reducing Data Movement on Large Shared Memory Systems by Exploiting Computation Dependencies. Isaac Sánchez Barrera, Miquel Moretó, Eduard Ayguadé, Jesús Labarta, Mateo Valero, Marc Casas
2018	Rethinking Node Allocation Strategy for Data-intensive Applications in Consideration of Spatially Bursty I/O. Jie Yu, Guangming Liu, Xin Liu, Wenrui Dong, Xiaoyong Li, Yusheng Liu
2018	Revisiting Loop Tiling for Datacenters: Live and Let Live. Jiacheng Zhao, Huimin Cui, Yalin Zhang, Jingling Xue, Xiaobing Feng
2018	Runtime-Guided Management of Stacked DRAM Memories in Task Parallel Programs. Lluc Alvarez, Marc Casas, Jesús Labarta, Eduard Ayguadé, Mateo Valero, Miquel Moretó
2018	Sculptor: Flexible Approximation with Selective Dynamic Loop Perforation. Shikai Li, Sunghyun Park, Scott A. Mahlke
2018	The Broker Queue: A Fast, Linearizable FIFO Queue for Fine-Granular Work Distribution on the GPU. Bernhard Kerbl, Michael Kenzel, Joerg H. Mueller, Dieter Schmalstieg, Markus Steinberger
2018	Towards Efficient SpMV on Sunway Manycore Architectures. Changxi Liu, Biwei Xie, Xin Liu, Wei Xue, Hailong Yang, Xu Liu
2018	Warp-Consolidation: A Novel Execution Model for GPUs. Ang Li, Weifeng Liu, Linnan Wang, Kevin J. Barker, Shuaiwen Leon Song
2018	Zwift: A Programming Framework for High Performance Text Analytics on Compressed Data. Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, Wenguang Chen
2018	cuMBIR: An Efficient Framework for Low-dose X-ray CT Image Reconstruction on GPUs. Xiuhong Li, Yun Liang, Wentai Zhang, Taide Liu, Haochen Li, Guojie Luo, Ming Jiang