| 2010 | A study of hardware assisted IP over InfiniBand and its impact on enterprise data center performance. Ryan E. Grant, Pavan Balaji, Ahmad Afsahi |
| 2010 | An analysis of hard to predict branches. Celal Öztürk, Resit Sendag |
| 2010 | ArchExplorer.org: A methodology for facilitating a fair Comparison of research ideas. Veerle Desmet, Sylvain Girbal, Olivier Temam |
| 2010 | Cache contention and application performance prediction for multi-core systems. Chi Xu, Xi Chen, Robert P. Dick, Zhuoqing Morley Mao |
| 2010 | Characterizing the design and performance of interactive java applications. Dmitrijs Zaparanuks, Matthias Hauswirth |
| 2010 | Demystifying GPU microarchitecture through microbenchmarking. Henry Wong, Misel-Myrto Papadopoulou, Maryam Sadooghi-Alvandi, Andreas Moshovos |
| 2010 | Dynamic program analysis of Microsoft Windows applications. Alex Skaletsky, Tevi Devor, Nadav Chachmon, Robert S. Cohn, Kim M. Hazelwood, Vladimir Vladimirov, Moshe Bach |
| 2010 | Exploiting FPGAs for technology-aware system-level evaluation of multi-core architectures. Simone Secchi, Paolo Meloni, Luigi Raffo |
| 2010 | Hardware prediction of OS run-length for fine-grained resource customization. David W. Nellans, Kshitij Sudan, Rajeev Balasubramonian, Erik Brunvand |
| 2010 | High-level performance modeling of task-based algorithms. Alexei Alexandrov, Douglas Armstrong, Hrabri Rajic, Michael Voss, Donald Hayes |
| 2010 | IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2010, 28-30 March 2010, White Plains, NY, USA |
| 2010 | Incorporating Instruction-Based Sampling into AMD CodeAnalyst. Paul J. Drongowski, Lei Yu, Frank Swehosky, Suravee Suthikulpanit, Robert Richter |
| 2010 | Influences of SIMD architectures for scattered data interpolation algorithm. Jean-Charles Tournier, Martin Naef |
| 2010 | LagAlyzer: A latency profile analysis and visualization tool. Andrea Adamoli, Milan Jovic, Matthias Hauswirth |
| 2010 | Memphis: Finding and fixing NUMA-related performance problems on multi-core platforms. Collin McCurdy, Jeffrey S. Vetter |
| 2010 | Modeling memory concurrency for multi-socket multi-core systems. Anirban Mandal, Rob Fowler, Allan Porterfield |
| 2010 | PEBIL: Efficient static binary instrumentation for Linux. Michael Laurenzano, Mustafa M. Tikir, Laura Carrington, Allan Snavely |
| 2010 | Performance-effective operation below Vcc-min. Nikolas Ladas, Yiannakis Sazeides, Veerle Desmet |
| 2010 | Program behavior characterization in large memory systems. Parijat Dube, Michael Tsao, Dan E. Poff, Li Zhang, Alan Bivens |
| 2010 | Runahead execution vs. conventional data prefetching in the IBM POWER6 microprocessor. Harold W. Cain, Priya Nagpurkar |
| 2010 | Scalability comparison of commodity operating systems on multi-cores. Yan Cui, Yu Chen, Yuanchun Shi, Qingbo Wu |
| 2010 | Scaling OLTP applications on commodity multi-core platforms. Yan Cui, Yu Chen, Yuanchun Shi |
| 2010 | Simulation environment for studying overlap of communication and computation. Vladimir Subotic, Jesús Labarta, Mateo Valero |
| 2010 | StatStack: Efficient modeling of LRU caches. David Eklov, Erik Hagersten |
| 2010 | Synthesizing memory-level parallelism aware miniature clones for SPEC CPU2006 and ImplantBench workloads. Karthik Ganesan, Jungho Jo, Lizy K. John |
| 2010 | The Hadoop distributed filesystem: Balancing portability and performance. Jeffrey Shafer, Scott Rixner, Alan L. Cox |
| 2010 | The big pileup. Nick Mitchell |
| 2010 | Understanding transactional memory performance. Donald E. Porter, Emmett Witchel |
| 2010 | Using special-purpose hardware to achieve a hundred-fold speedup in molecular dynamics simulations of proteins. David Shaw |
| 2010 | Visualizing complex dynamics in many-core accelerator architectures. Aaron Ariel, Wilson W. L. Fung, Andrew E. Turner, Tor M. Aamodt |
| 2010 | Weak execution ordering - exploiting iterative methods on many-core GPUs. Jianmin Chen, Zhuo Huang, Feiqi Su, Jih-Kwon Peir, Jeff Ho, Lu Peng |