ISPASS B

35 papers

Year Title / Authors
2019 A Detailed Model for Contemporary GPU Memory Systems.
Mahmoud Khairy, Akshay Jain, Tor M. Aamodt, Timothy G. Rogers
2019 A Model Driven Approach Towards Improving the Performance of Apache Spark Applications.
Kewen Wang, Mohammad Maifi Hasan Khan, Nhan Nguyen, Swapna S. Gokhale
2019 An Improved Dynamic Vertical Partitioning Technique for Semi-Structured Data.
Sahel Sharify, Alan W. Lu, Jin Chen, Arnamoy Bhattacharyya, Ali B. Hashemi, Nick Koudas, Cristiana Amza
2019 Analyzing Machine Learning Workloads Using a Detailed GPU Simulator.
Jonathan S. Lew, Deval A. Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew D. Sinclair, Timothy G. Rogers, Tor M. Aamodt
2019 Assessing the Effects of Low Voltage in Branch Prediction Units.
Athanasios Chatzidimitriou, George Papadimitriou, Dimitris Gizopoulos, Shrikanth Ganapathy, John Kalamatianos
2019 Characterization of Unnecessary Computations in Web Applications.
Hossein Golestani, Scott A. Mahlke, Satish Narayanasamy
2019 Characterizing Sources of Ineffectual Computations in Deep Learning Networks.
Milos Nikolic, Mostafa Mahmoud, Andreas Moshovos, Yiren Zhao, Robert Mullins
2019 DSMM: A Dynamic Setting for Memory Management in Apache Spark.
Suk-Joo Chae, Tae-Sun Chung
2019 DeLTA: GPU Performance Model for Deep Learning Applications with In-Depth Memory System Traffic Analysis.
Sangkug Lym, Donghyuk Lee, Mike O'Connor, Niladrish Chatterjee, Mattan Erez
2019 Demystifying Bayesian Inference Workloads.
Yu Emma Wang, Yuhao Zhu, Glenn G. Ko, Brandon Reagen, Gu-Yeon Wei, David Brooks
2019 Demystifying Crypto-Mining: Analysis and Optimizations of Memory-Hard PoW Algorithms.
Runchao Han, Nikos Foutris, Christos Kotselidis
2019 Distributed Software Defined Networking Controller Failure Mode and Availability Analysis.
Paul Reeser, Guilhem Tesseyre, Marcus Callaway
2019 Empirical Investigation of Stale Value Tolerance on Parallel RNN Training.
Joo Hwan Lee, Hyesoon Kim
2019 Emulating and Evaluating Hybrid Memory for Managed Languages on NUMA Hardware.
Shoaib Akram, Jennifer B. Sartor, Kathryn S. McKinley, Lieven Eeckhout
2019 Fast Modeling of the L2 Cache Reuse Distance Histograms from Software Traces.
Jiancong Ge, Ming Ling
2019 FlexCPU: A Configurable Out-of-Order CPU Abstraction.
Bradley Wang, Ayaz Akram, Jason Lowe-Power
2019 Full-System Simulation of Mobile CPU/GPU Platforms.
Kuba Kaszyk, Harry Wagstaff, Tom Spink, Björn Franke, Michael F. P. O'Boyle, Bruno Bodin, Henrik Uhrenholt
2019 GeST: An Automatic Framework For Generating CPU Stress-Tests.
Zacharias Hadjilambrou, Shidhartha Das, Paul N. Whatmough, David M. Bull, Yiannakis Sazeides
2019 HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-Accelerators.
Masab Ahmad, Halit Dogan, Christopher J. Michael, Omer Khan
2019 Hierarchical Page Eviction Policy for Unified Memory in GPUs.
Qi Yu, Bruce R. Childers, Libo Huang, Cheng Qian, Zhiying Wang
2019 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2019, Madison, WI, USA, March 24-26, 2019
2019 Modeling Deep Learning Accelerator Enabled GPUs.
Md Aamir Raihan, Negar Goli, Tor M. Aamodt
2019 On the Impact of Instruction Address Translation Overhead.
Yufeng Zhou, Xiaowan Dong, Alan L. Cox, Sandhya Dwarkadas
2019 One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-Off in Machine Learning Cloud Service APIs via Tolerance Tiers.
Matthew Halpern, Behzad Boroujerdian, Todd W. Mummert, Evelyn Duesterwald, Vijay Janapa Reddi
2019 PARADISE - Post-Moore Architecture and Accelerator Design Space Exploration Using Device Level Simulation and Experiments.
Dilip P. Vasudevan, George Michelogiannakis, David Donofrio, John Shalf
2019 Parallelism Analysis of Prominent Desktop Applications: An 18-Year Perspective.
Siying Feng, Subhankar Pal, Yichen Yang, Ronald G. Dreslinski
2019 Quantifying Process Variations and Its Impacts on Smartphones.
Guru Prasad Srinivasa, Scott Haseley, Geoffrey Challen, Mark Hempstead
2019 RPPM: Rapid Performance Prediction of Multithreaded Workloads on Multicore Processors.
Sander De Pestel, Sam Van den Steen, Shoaib Akram, Lieven Eeckhout
2019 Racing to Hardware-Validated Simulation.
Almutaz Adileh, Cecilia González-Alvarez, Juan Miguel De Haro Ruiz, Lieven Eeckhout
2019 Tango: A Deep Neural Network Benchmark Suite for Various Accelerators.
Aajna Karki, Chethan Palangotu Keshava, Spoorthi Mysore Shivakumar, Joshua Skow, Goutam Madhukeshwar Hegde, Hyeran Jeon
2019 The POP Detector: A Lightweight Online Program Phase Detection Framework.
Karl Taht, James Greensky, Rajeev Balasubramonian
2019 Timeloop: A Systematic Approach to DNN Accelerator Evaluation.
Angshuman Parashar, Priyanka Raina, Yakun Sophia Shao, Yu-Hsin Chen, Victor A. Ying, Anurag Mukkara, Rangharajan Venkatesan, Brucek Khailany, Stephen W. Keckler, Joel S. Emer
2019 Workload Characterization of Nondeterministic Programs Parallelized by STATS.
Enrico Armenio Deiana, Simone Campanoni
2019 mRNA: Enabling Efficient Mapping Space Exploration for a Reconfiguration Neural Accelerator.
Zhongyuan Zhao, Hyoukjun Kwon, Sachit Kuhar, Weiguang Sheng, Zhigang Mao, Tushar Krishna
2019 µqSim: Enabling Accurate and Scalable Simulation for Interactive Microservices.
Yanqi Zhang, Yu Gan, Christina Delimitrou