| 2011 | A domain-specific approach to heterogeneous parallelism. Hassan Chafi, Arvind K. Sujeeth, Kevin J. Brown, HyoukJoong Lee, Anand R. Atreya, Kunle Olukotun |
| 2011 | A wait-free NCAS library for parallel applications with timing constraints. Philippe Stellwag, Fabian Scheler, Jakob Krainz, Wolfgang Schröder-Preikschat |
| 2011 | Accelerating CUDA graph algorithms at maximum warp. Sungpack Hong, Sang Kyun Kim, Tayo Oguntebi, Kunle Olukotun |
| 2011 | Achieving a single compute device image in OpenCL for multiple GPUs. Jungwon Kim, Honggyu Kim, Joo Hwan Lee, Jaejin Lee |
| 2011 | Active pebbles: a programming model for highly parallel fine-grained data-driven computations. Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine |
| 2011 | Algorithm-based recovery for HPL. Teresa Davies, Zizhong Chen, Christer Karlsson, Hui Liu |
| 2011 | All-window profiling and composable models of cache sharing. Xiaoya Xiang, Bin Bao, Tongxin Bai, Chen Ding, Trishul M. Chilimbi |
| 2011 | Auto-tuning of fast Fourier transform on graphics processors. Yuri Dotsenko, Sara S. Baghsorkhi, Brandon Lloyd, Naga K. Govindaraju |
| 2011 | Automatic formal verification of MPI-based parallel programs. Stephen F. Siegel, Timothy K. Zirkel |
| 2011 | Automatic safety proofs for asynchronous memory operations. Matko Botincan, Mike Dodds, Alastair F. Donaldson, Matthew J. Parkinson |
| 2011 | COREMU: a scalable and portable parallel full-system emulator. Zhaoguo Wang, Ran Liu, Yufei Chen, Xi Wu, Haibo Chen, Weihua Zhang, Binyu Zang |
| 2011 | CSX: an extended compression format for SpMV on shared memory systems. Kornilios Kourtis, Vasileios Karakasis, Georgios I. Goumas, Nectarios Koziris |
| 2011 | Communicating memory transactions. Mohsen Lesani, Jens Palsberg |
| 2011 | Compact data structure and scalable algorithms for the sparse grid technique. Alin Florindor Murarasu, Josef Weidendorfer, Gerrit Buse, Daniel Butnaru, Dirk Pflüger |
| 2011 | Cooperative reasoning for preemptive execution. Jaeheon Yi, Caitlin Sadowski, Cormac Flanagan |
| 2011 | Copperhead: compiling an embedded data parallel language. Bryan Catanzaro, Michael Garland, Kurt Keutzer |
| 2011 | Enhanced speculative parallelization via incremental recovery. Chen Tian, Changhui Lin, Min Feng, Rajiv Gupta |
| 2011 | Evaluating graph coloring on GPUs. Andre Vincent Pascal Grosset, Peihong Zhu, Shusen Liu, Suresh Venkatasubramanian, Mary W. Hall |
| 2011 | GRace: a low-overhead mechanism for detecting data races in GPU programs. Mai Zheng, Vignesh T. Ravi, Feng Qin, Gagan Agrawal |
| 2011 | How's the parallel computing revolution going? Kathryn S. McKinley |
| 2011 | Inferring ownership transfer for efficient message passing. Stas Negara, Rajesh K. Karmani, Gul A. Agha |
| 2011 | Kremlin: like gprof, but for parallelization. Donghwan Jeon, Saturnino Garcia, Christopher M. Louie, Sravanthi Kota Venkata, Michael Bedford Taylor |
| 2011 | Lifeline-based global load balancing. Vijay A. Saraswat, Prabhanjan Kambadur, Sreedhar B. Kodali, David Grove, Sriram Krishnamoorthy |
| 2011 | Lock-free and scalable multi-version software transactional memory. Sergio Miguel Fernandes, João P. Cachopo |
| 2011 | OoOJava: software out-of-order execution. James Christopher Jenista, Yong Hun Eom, Brian Demsky |
| 2011 | Ordered vs. unordered: a comparison of parallelism and work-efficiency in irregular algorithms. Muhammad Amber Hassaan, Martin Burtscher, Keshav Pingali |
| 2011 | Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2011, San Antonio, TX, USA, February 12-16, 2011. Calin Cascaval, Pen-Chung Yew |
| 2011 | Programming the cloud. James R. Larus |
| 2011 | Programming the memory hierarchy revisited: supporting irregular parallelism in Sequoia. Michael Bauer, John Clark, Eric Schkufza, Alex Aiken |
| 2011 | QoS aware storage cache management in multi-server environments. Ramya Prabhakar, Shekhar Srikantaiah, Rajat Garg, Mahmut T. Kandemir |
| 2011 | SCRATCH: a tool for automatic analysis of DMA races. Alastair F. Donaldson, Daniel Kroening, Philipp Rümmer |
| 2011 | ScalaExtrap: trace-based communication extrapolation for SPMD programs. Xing Wu, Frank Mueller |
| 2011 | SpiceC: scalable parallelism via implicit copying and explicit commit. Min Feng, Rajiv Gupta, Yi Hu |
| 2011 | Symbolically modeling concurrent MCAPI executions. Topher Fischer, Eric Mercer, Neha Rungta |
| 2011 | The STAPL parallel container framework. Gabriel Tanase, Antal A. Buss, Adam Fidel, Harshvardhan, Ioannis Papadopoulos, Olga Pearce, Timmie G. Smith, Nathan L. Thomas, Xiabing Xu, Nedal Mourad, Jeremy Vu, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger |
| 2011 | Thread contracts for safe parallelism. Rajesh K. Karmani, P. Madhusudan, Brandon M. Moore |
| 2011 | Time skewing made simple. Robert Strzodka, Mohammed Shaheen, Dawid Pajak |
| 2011 | Transaction communicators: enabling cooperation among concurrent transactions. Victor Luchangco, Virendra J. Marathe |
| 2011 | Two examples of parallel programming without concurrency constructs (PP-CC). Chen Ding |
| 2011 | ULCC: a user-level facility for optimizing shared cache performance on multicores. Xiaoning Ding, Kaibo Wang, Xiaodong Zhang |
| 2011 | Wait-free queues with multiple enqueuers and dequeuers. Alex Kogan, Erez Petrank |
| 2011 | Weak atomicity under the x86 memory consistency model. Amitabha Roy, Steven Hand, Tim Harris |