| 2006 | "MAMA!": a memory allocator for multithreaded architectures. Simon Kahan, Petr Konecny |
| 2006 | A case study in top-down performance estimation for a large-scale parallel application. Ilya Sharapov, Robert Kroeger, Guy Delamarter, Razvan Cheveresan, Matthew Ramsay |
| 2006 | Accurate and efficient runtime detection of atomicity errors in concurrent programs. Liqiang Wang, Scott D. Stoller |
| 2006 | Adaptive scheduling with parallelism feedback. Kunal Agrawal, Yuxiong He, Wen-Jing Hsu, Charles E. Leiserson |
| 2006 | Collective communication on architectures that support simultaneous communication over multiple links. Ernie Chan, Robert A. van de Geijn, William Gropp, Rajeev Thakur |
| 2006 | Exploiting distributed version concurrency in a transactional memory cluster. Kaloian Manassiev, Madalin Mihailescu, Cristiana Amza |
| 2006 | Fast and transparent recovery for continuous availability of cluster-based servers. Rosalia Christodoulopoulou, Kaloian Manassiev, Angelos Bilas, Cristiana Amza |
| 2006 | Global-view abstractions for user-defined reductions and scans. Steven J. Deitz, David Callahan, Bradford L. Chamberlain, Lawrence Snyder |
| 2006 | Hardware profile-guided automatic page placement for ccNUMA systems. Jaydeep Marathe, Frank Mueller |
| 2006 | High-performance IPv6 forwarding algorithm for multi-core and multithreaded network processor. Xianghui Hu, Xinan Tang, Bei Hua |
| 2006 | Hybrid transactional memory. Sanjeev Kumar, Michael Chu, Christopher J. Hughes, Partha Kundu, Anthony D. Nguyen |
| 2006 | McRT-STM: a high performance software transactional memory system for a multi-core runtime. Bratin Saha, Ali-Reza Adl-Tabatabai, Richard L. Hudson, Chi Cao Minh, Ben Hertzberg |
| 2006 | Minimizing execution time in MPI programs on an energy-constrained, power-scalable cluster. Robert Springer, David K. Lowenthal, Barry Rountree, Vincent W. Freeh |
| 2006 | Mobile MPI programs in computational grids. Rohit Fernandes, Keshav Pingali, Paul Stodghill |
| 2006 | On-line automated performance diagnosis on thousands of processes. Philip C. Roth, Barton P. Miller |
| 2006 | Optimizing irregular shared-memory applications for distributed-memory systems. Ayon Basumallik, Rudolf Eigenmann |
| 2006 | POSH: a TLS compiler that exploits program structure. Wei Liu, James Tuck, Luis Ceze, Wonsun Ahn, Karin Strauss, Jose Renau, Josep Torrellas |
| 2006 | Parallel programming and code selection in fortress. Guy L. Steele Jr. |
| 2006 | Parallel programming in modern web search engines. Raymie Stata |
| 2006 | Performance characterization of molecular dynamics techniques for biomolecular simulations. Sadaf R. Alam, Jeffrey S. Vetter, Pratul K. Agarwal, Al Geist |
| 2006 | Performance evaluation of adaptive MPI. Chao Huang, Gengbin Zheng, Laxmikant V. Kalé, Sameer Kumar |
| 2006 | Predicting bounds on queuing delay for batch-scheduled parallel machines. John Brevik, Daniel Nurmi, Richard Wolski |
| 2006 | Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2006, New York, New York, USA, March 29-31, 2006 Josep Torrellas, Siddhartha Chatterjee |
| 2006 | Programming for parallelism and locality with hierarchically tiled arrays. Ganesh Bikshandi, Jia Guo, Daniel Hoeflinger, Gheorghe Almási, Basilio B. Fraguela, María Jesús Garzarán, David A. Padua, Christoph von Praun |
| 2006 | Proving correctness of highly-concurrent linearisable objects. Viktor Vafeiadis, Maurice Herlihy, Tony Hoare, Marc Shapiro |
| 2006 | RDMA read based rendezvous protocol for MPI over InfiniBand: design alternatives and benefits. Sayantan Sur, Hyun-Wook Jin, Lei Chai, Dhabaleswar K. Panda |
| 2006 | Scalable synchronous queues. William N. Scherer III, Doug Lea, Michael L. Scott |
| 2006 | Teaching parallel computing to science faculty: best practices and common pitfalls. David A. Joiner, Paul Gray, Thomas Murphy, Charles Peck |