| 2019 | A Detailed Model for Contemporary GPU Memory Systems. Mahmoud Khairy, Akshay Jain, Tor M. Aamodt, Timothy G. Rogers |
| 2019 | A Model Driven Approach Towards Improving the Performance of Apache Spark Applications. Kewen Wang, Mohammad Maifi Hasan Khan, Nhan Nguyen, Swapna S. Gokhale |
| 2019 | An Improved Dynamic Vertical Partitioning Technique for Semi-Structured Data. Sahel Sharify, Alan W. Lu, Jin Chen, Arnamoy Bhattacharyya, Ali B. Hashemi, Nick Koudas, Cristiana Amza |
| 2019 | Analyzing Machine Learning Workloads Using a Detailed GPU Simulator. Jonathan S. Lew, Deval A. Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew D. Sinclair, Timothy G. Rogers, Tor M. Aamodt |
| 2019 | Assessing the Effects of Low Voltage in Branch Prediction Units. Athanasios Chatzidimitriou, George Papadimitriou, Dimitris Gizopoulos, Shrikanth Ganapathy, John Kalamatianos |
| 2019 | Characterization of Unnecessary Computations in Web Applications. Hossein Golestani, Scott A. Mahlke, Satish Narayanasamy |
| 2019 | Characterizing Sources of Ineffectual Computations in Deep Learning Networks. Milos Nikolic, Mostafa Mahmoud, Andreas Moshovos, Yiren Zhao, Robert Mullins |
| 2019 | DSMM: A Dynamic Setting for Memory Management in Apache Spark. Suk-Joo Chae, Tae-Sun Chung |
| 2019 | DeLTA: GPU Performance Model for Deep Learning Applications with In-Depth Memory System Traffic Analysis. Sangkug Lym, Donghyuk Lee, Mike O'Connor, Niladrish Chatterjee, Mattan Erez |
| 2019 | Demystifying Bayesian Inference Workloads. Yu Emma Wang, Yuhao Zhu, Glenn G. Ko, Brandon Reagen, Gu-Yeon Wei, David Brooks |
| 2019 | Demystifying Crypto-Mining: Analysis and Optimizations of Memory-Hard PoW Algorithms. Runchao Han, Nikos Foutris, Christos Kotselidis |
| 2019 | Distributed Software Defined Networking Controller Failure Mode and Availability Analysis. Paul Reeser, Guilhem Tesseyre, Marcus Callaway |
| 2019 | Empirical Investigation of Stale Value Tolerance on Parallel RNN Training. Joo Hwan Lee, Hyesoon Kim |
| 2019 | Emulating and Evaluating Hybrid Memory for Managed Languages on NUMA Hardware. Shoaib Akram, Jennifer B. Sartor, Kathryn S. McKinley, Lieven Eeckhout |
| 2019 | Fast Modeling of the L2 Cache Reuse Distance Histograms from Software Traces. Jiancong Ge, Ming Ling |
| 2019 | FlexCPU: A Configurable Out-of-Order CPU Abstraction. Bradley Wang, Ayaz Akram, Jason Lowe-Power |
| 2019 | Full-System Simulation of Mobile CPU/GPU Platforms. Kuba Kaszyk, Harry Wagstaff, Tom Spink, Björn Franke, Michael F. P. O'Boyle, Bruno Bodin, Henrik Uhrenholt |
| 2019 | GeST: An Automatic Framework For Generating CPU Stress-Tests. Zacharias Hadjilambrou, Shidhartha Das, Paul N. Whatmough, David M. Bull, Yiannakis Sazeides |
| 2019 | HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-Accelerators. Masab Ahmad, Halit Dogan, Christopher J. Michael, Omer Khan |
| 2019 | Hierarchical Page Eviction Policy for Unified Memory in GPUs. Qi Yu, Bruce R. Childers, Libo Huang, Cheng Qian, Zhiying Wang |
| 2019 | IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2019, Madison, WI, USA, March 24-26, 2019 |
| 2019 | Modeling Deep Learning Accelerator Enabled GPUs. Md Aamir Raihan, Negar Goli, Tor M. Aamodt |
| 2019 | On the Impact of Instruction Address Translation Overhead. Yufeng Zhou, Xiaowan Dong, Alan L. Cox, Sandhya Dwarkadas |
| 2019 | One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-Off in Machine Learning Cloud Service APIs via Tolerance Tiers. Matthew Halpern, Behzad Boroujerdian, Todd W. Mummert, Evelyn Duesterwald, Vijay Janapa Reddi |
| 2019 | PARADISE - Post-Moore Architecture and Accelerator Design Space Exploration Using Device Level Simulation and Experiments. Dilip P. Vasudevan, George Michelogiannakis, David Donofrio, John Shalf |
| 2019 | Parallelism Analysis of Prominent Desktop Applications: An 18- Year Perspective. Siying Feng, Subhankar Pal, Yichen Yang, Ronald G. Dreslinski |
| 2019 | Quantifying Process Variations and Its Impacts on Smartphones. Guru Prasad Srinivasa, Scott Haseley, Geoffrey Challen, Mark Hempstead |
| 2019 | RPPM: Rapid Performance Prediction of Multithreaded Workloads on Multicore Processors. Sander De Pestel, Sam Van den Steen, Shoaib Akram, Lieven Eeckhout |
| 2019 | Racing to Hardware-Validated Simulation. Almutaz Adileh, Cecilia González-Alvarez, Juan Miguel De Haro Ruiz, Lieven Eeckhout |
| 2019 | Tango: A Deep Neural Network Benchmark Suite for Various Accelerators. Aajna Karki, Chethan Palangotu Keshava, Spoorthi Mysore Shivakumar, Joshua Skow, Goutam Madhukeshwar Hegde, Hyeran Jeon |
| 2019 | The POP Detector: A Lightweight Online Program Phase Detection Framework. Karl Taht, James Greensky, Rajeev Balasubramonian |
| 2019 | Timeloop: A Systematic Approach to DNN Accelerator Evaluation. Angshuman Parashar, Priyanka Raina, Yakun Sophia Shao, Yu-Hsin Chen, Victor A. Ying, Anurag Mukkara, Rangharajan Venkatesan, Brucek Khailany, Stephen W. Keckler, Joel S. Emer |
| 2019 | Workload Characterization of Nondeterministic Programs Parallelized by STATS. Enrico Armenio Deiana, Simone Campanoni |
| 2019 | mRNA: Enabling Efficient Mapping Space Exploration for a Reconfiguration Neural Accelerator. Zhongyuan Zhao, Hyoukjun Kwon, Sachit Kuhar, Weiguang Sheng, Zhigang Mao, Tushar Krishna |
| 2019 | µqSim: Enabling Accurate and Scalable Simulation for Interactive Microservices. Yanqi Zhang, Yu Gan, Christina Delimitrou |