ISPASS - RankMe

35 papers

Year	Title / Authors
2019	A Detailed Model for Contemporary GPU Memory Systems. Mahmoud Khairy, Akshay Jain, Tor M. Aamodt, Timothy G. Rogers
2019	A Model Driven Approach Towards Improving the Performance of Apache Spark Applications. Kewen Wang, Mohammad Maifi Hasan Khan, Nhan Nguyen, Swapna S. Gokhale
2019	An Improved Dynamic Vertical Partitioning Technique for Semi-Structured Data. Sahel Sharify, Alan W. Lu, Jin Chen, Arnamoy Bhattacharyya, Ali B. Hashemi, Nick Koudas, Cristiana Amza
2019	Analyzing Machine Learning Workloads Using a Detailed GPU Simulator. Jonathan S. Lew, Deval A. Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew D. Sinclair, Timothy G. Rogers, Tor M. Aamodt
2019	Assessing the Effects of Low Voltage in Branch Prediction Units. Athanasios Chatzidimitriou, George Papadimitriou, Dimitris Gizopoulos, Shrikanth Ganapathy, John Kalamatianos
2019	Characterization of Unnecessary Computations in Web Applications. Hossein Golestani, Scott A. Mahlke, Satish Narayanasamy
2019	Characterizing Sources of Ineffectual Computations in Deep Learning Networks. Milos Nikolic, Mostafa Mahmoud, Andreas Moshovos, Yiren Zhao, Robert Mullins
2019	DSMM: A Dynamic Setting for Memory Management in Apache Spark. Suk-Joo Chae, Tae-Sun Chung
2019	DeLTA: GPU Performance Model for Deep Learning Applications with In-Depth Memory System Traffic Analysis. Sangkug Lym, Donghyuk Lee, Mike O'Connor, Niladrish Chatterjee, Mattan Erez
2019	Demystifying Bayesian Inference Workloads. Yu Emma Wang, Yuhao Zhu, Glenn G. Ko, Brandon Reagen, Gu-Yeon Wei, David Brooks
2019	Demystifying Crypto-Mining: Analysis and Optimizations of Memory-Hard PoW Algorithms. Runchao Han, Nikos Foutris, Christos Kotselidis
2019	Distributed Software Defined Networking Controller Failure Mode and Availability Analysis. Paul Reeser, Guilhem Tesseyre, Marcus Callaway
2019	Empirical Investigation of Stale Value Tolerance on Parallel RNN Training. Joo Hwan Lee, Hyesoon Kim
2019	Emulating and Evaluating Hybrid Memory for Managed Languages on NUMA Hardware. Shoaib Akram, Jennifer B. Sartor, Kathryn S. McKinley, Lieven Eeckhout
2019	Fast Modeling of the L2 Cache Reuse Distance Histograms from Software Traces. Jiancong Ge, Ming Ling
2019	FlexCPU: A Configurable Out-of-Order CPU Abstraction. Bradley Wang, Ayaz Akram, Jason Lowe-Power
2019	Full-System Simulation of Mobile CPU/GPU Platforms. Kuba Kaszyk, Harry Wagstaff, Tom Spink, Björn Franke, Michael F. P. O'Boyle, Bruno Bodin, Henrik Uhrenholt
2019	GeST: An Automatic Framework For Generating CPU Stress-Tests. Zacharias Hadjilambrou, Shidhartha Das, Paul N. Whatmough, David M. Bull, Yiannakis Sazeides
2019	HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-Accelerators. Masab Ahmad, Halit Dogan, Christopher J. Michael, Omer Khan
2019	Hierarchical Page Eviction Policy for Unified Memory in GPUs. Qi Yu, Bruce R. Childers, Libo Huang, Cheng Qian, Zhiying Wang
2019	IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2019, Madison, WI, USA, March 24-26, 2019
2019	Modeling Deep Learning Accelerator Enabled GPUs. Md Aamir Raihan, Negar Goli, Tor M. Aamodt
2019	On the Impact of Instruction Address Translation Overhead. Yufeng Zhou, Xiaowan Dong, Alan L. Cox, Sandhya Dwarkadas
2019	One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-Off in Machine Learning Cloud Service APIs via Tolerance Tiers. Matthew Halpern, Behzad Boroujerdian, Todd W. Mummert, Evelyn Duesterwald, Vijay Janapa Reddi
2019	PARADISE - Post-Moore Architecture and Accelerator Design Space Exploration Using Device Level Simulation and Experiments. Dilip P. Vasudevan, George Michelogiannakis, David Donofrio, John Shalf
2019	Parallelism Analysis of Prominent Desktop Applications: An 18- Year Perspective. Siying Feng, Subhankar Pal, Yichen Yang, Ronald G. Dreslinski
2019	Quantifying Process Variations and Its Impacts on Smartphones. Guru Prasad Srinivasa, Scott Haseley, Geoffrey Challen, Mark Hempstead
2019	RPPM: Rapid Performance Prediction of Multithreaded Workloads on Multicore Processors. Sander De Pestel, Sam Van den Steen, Shoaib Akram, Lieven Eeckhout
2019	Racing to Hardware-Validated Simulation. Almutaz Adileh, Cecilia González-Alvarez, Juan Miguel De Haro Ruiz, Lieven Eeckhout
2019	Tango: A Deep Neural Network Benchmark Suite for Various Accelerators. Aajna Karki, Chethan Palangotu Keshava, Spoorthi Mysore Shivakumar, Joshua Skow, Goutam Madhukeshwar Hegde, Hyeran Jeon
2019	The POP Detector: A Lightweight Online Program Phase Detection Framework. Karl Taht, James Greensky, Rajeev Balasubramonian
2019	Timeloop: A Systematic Approach to DNN Accelerator Evaluation. Angshuman Parashar, Priyanka Raina, Yakun Sophia Shao, Yu-Hsin Chen, Victor A. Ying, Anurag Mukkara, Rangharajan Venkatesan, Brucek Khailany, Stephen W. Keckler, Joel S. Emer
2019	Workload Characterization of Nondeterministic Programs Parallelized by STATS. Enrico Armenio Deiana, Simone Campanoni
2019	mRNA: Enabling Efficient Mapping Space Exploration for a Reconfiguration Neural Accelerator. Zhongyuan Zhao, Hyoukjun Kwon, Sachit Kuhar, Weiguang Sheng, Zhigang Mao, Tushar Krishna
2019	µqSim: Enabling Accurate and Scalable Simulation for Interactive Microservices. Yanqi Zhang, Yu Gan, Christina Delimitrou