| 2017 | 2017 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2017, Santa Rosa, CA, USA, April 24-25, 2017 |
| 2017 | A taxonomy of out-of-order instruction commit. Mehdi Alipour, Trevor E. Carlson, Stefanos Kaxiras |
| 2017 | Accurate address streams for LLC and beyond (SLAB): A methodology to enable system exploration. Reena Panda, Xinnian Zheng, Lizy Kurian John |
| 2017 | Analyzing OpenCL 2.0 workloads using a heterogeneous CPU-GPU simulator. Li Wang, Ren-Wei Tsai, Shao-Chung Wang, Kun-Chih Chen, Po-Han Wang, Hsiang-Yun Cheng, Yi-Chung Lee, Sheng-Jie Shu, Chun-Chieh Yang, Min-Yih Hsu, Li-Chen Kan, Chao-Lin Lee, Tzu-Chieh Yu, Rih-Ding Peng, Chia-Lin Yang, Yuan-Shin Hwang, Jenq Kuen Lee, Shiao-Li Tsao, Ming Ouhyoung |
| 2017 | Analyzing the scalability of managed language applications with speedup stacks. Jennifer B. Sartor, Kristof Du Bois, Stijn Eyerman, Lieven Eeckhout |
| 2017 | Chai: Collaborative heterogeneous applications for integrated-architectures. Juan Gómez-Luna, Izzat El Hajj, Li-Wen Chang, Victor Garcia-Flores, Simon Garcia De Gonzalo, Thomas B. Jablin, Antonio J. Peña, Wen-mei W. Hwu |
| 2017 | Characterization of GPGPU workloads on a multidimensional heterogeneous processor. Matthew A. Watkins, Philip Bedoukian |
| 2017 | Clone morphing: Creating new workload behavior from existing applications. Yipeng Wang, Amro Awad, Yan Solihin |
| 2017 | Crossing the architectural barrier: Evaluating representative regions of parallel HPC applications. Alexandra Ferreron, Radhika Jagtap, Sascha Bischoff, Roxana Rusitoru |
| 2017 | DARTS: Performance-counter driven sampling using binary translators. Rajesh Kumar, Suchita Pati, Kanishka Lahiri |
| 2017 | Docker characterization on high performance SSDs. Qiumin Xu, Manu Awasthi, Krishna T. Malladi, Janki Bhimani, Jingpei Yang, Murali Annavaram |
| 2017 | Evaluating and mitigating bandwidth bottlenecks across the memory hierarchy in GPUs. Saumay Dublish, Vijay Nagarajan, Nigel P. Topham |
| 2017 | Exploring GPU performance, power and energy-efficiency bounds with Cache-aware Roofline Modeling. Andre Lopes, Frederico Pratas, Leonel Sousa, Aleksandar Ilic |
| 2017 | Fast IPC estimation for performance projections using proxy suites and decision trees. Kanishka Lahiri, Subhash Kunnoth |
| 2017 | GaaS workload characterization under NUMA architecture for virtualized GPU. Huixiang Chen, Meng Wang, Yang Hu, Mingcong Song, Tao Li |
| 2017 | HW/SW co-designed processors: Challenges, design choices and a simulation infrastructure for evaluation. Rakesh Kumar, José Cano, Aleksandar Brankovic, Demos Pavlou, Kyriakos Stavrou, Enric Gibert, Alejandro Martínez, Antonio Gonzalez |
| 2017 | Machine learning for performance and power modeling/prediction. Lizy Kurian John |
| 2017 | MaxSim: A simulation platform for managed applications. Andrey Rodchenko, Christos Kotselidis, Andy Nisbet, Antoniu Pop, Mikel Luján |
| 2017 | Microarchitecture level reliability comparison of modern GPU designs: First findings. Alessandro Vallero, Stefano Di Carlo, Sotiris Tselonis, Dimitris Gizopoulos |
| 2017 | Multi2Sim Kepler: A detailed architectural GPU simulator. Xun Gong, Rafael Ubal, David R. Kaeli |
| 2017 | OpenSMART: Single-cycle multi-hop NoC generator in BSV and Chisel. Hyoukjun Kwon, Tushar Krishna |
| 2017 | PMAL: Enabling lightweight adaptation of legacy file systems on persistent memory systems. Hyunsub Song, Young Je Moon, Se Kwon Lee, Sam H. Noh |
| 2017 | PTAT: An efficient and precise tool for collecting detailed TLB miss traces. Jiutian Zhang, Yuhang Liu, Xiaojing Zhu, Yuan Ruan, Mingyu Chen |
| 2017 | Performance analysis of CNN frameworks for GPUs. Heehoon Kim, Hyoungwook Nam, Wookeun Jung, Jaejin Lee |
| 2017 | Performance competitiveness of a statically compiled language for server-side Web applications. Yohei Ueda, Moriyoshi Ohara |
| 2017 | Predicting memory page stability and its application to memory deduplication and live migration. Karim Elghamrawy, Diana Franklin, Frederic T. Chong |
| 2017 | Prefetching for cloud workloads: An analysis based on address patterns. Jiajun Wang, Reena Panda, Lizy Kurian John |
| 2017 | Proxy benchmarks for emerging big-data workloads. Reena Panda, Lizy Kurian John |
| 2017 | SASSIFI: An architecture-level fault injection tool for GPU application resilience evaluation. Siva Kumar Sastry Hari, Timothy Tsai, Mark Stephenson, Stephen W. Keckler, Joel S. Emer |
| 2017 | Service capacity measurement by redlining with live production traffic. Susie Xia, Zhenyun Zhuang, Anant Rao, Haricharan Ramachandra, Yi Feng, Ramya Pasumarti |
| 2017 | Sharing the instruction cache among lean cores on an asymmetric CMP for HPC applications. Ugljesa Milic, Alejandro Rico, Paul M. Carpenter, Alex Ramírez |
| 2017 | SimBench: A portable benchmarking methodology for full-system simulators. Harry Wagstaff, Bruno Bodin, Tom Spink, Björn Franke |
| 2017 | StressRight: Finding the right stress for accurate in-development system evaluation. Jaewon Lee, Hanhwi Jang, Jae-Eon Jo, Gyu-hyeon Lee, Jangwoo Kim |
| 2017 | Toolbox for exploration of energy-efficient event processors for human-computer interaction. Tayyar Rzayev, David H. Albonesi, François Guimbretière, Rajit Manohar, Jaeyeon Kihm |
| 2017 | Treelogy: A benchmark suite for tree traversals. Nikhil Hegde, Jianqiao Liu, Kirshanthan Sundararajah, Milind Kulkarni |
| 2017 | dist-gem5: Distributed simulation of computer clusters. Mohammad Alian, Umur Darbaz, Gábor Dózsa, Stephan Diestelhorst, Daehoon Kim, Nam Sung Kim |