| 2013 | A peta-scalable CPU-GPU algorithm for global atmospheric simulations. Chao Yang, Wei Xue, Haohuan Fu, Lin Gan, Linfeng Li, Yangtong Xu, Yutong Lu, Jiachang Sun, Guangwen Yang, Weimin Zheng |
| 2013 | ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP '13, Shenzhen, China, February 23-27, 2013 Alex Nicolau, Xiaowei Shen, Saman P. Amarasinghe, Richard W. Vuduc |
| 2013 | Adoption protocols for fanout-optimal fault-tolerant termination detection. Jonathan Lifflander, Phil Miller, Laxmikant V. Kalé |
| 2013 | Array dataflow analysis for polyhedral X10 programs. Tomofumi Yuki, Paul Feautrier, Sanjay V. Rajopadhye, Vijay A. Saraswat |
| 2013 | Automatic problem size sensitive task partitioning on heterogeneous parallel systems. Ivan Grasso, Klaus Kofler, Biagio Cosenza, Thomas Fahringer |
| 2013 | Betweenness centrality: algorithms and implementations. Dimitrios Prountzos, Keshav Pingali |
| 2013 | Compiler aided manual speculation for high performance concurrent data structures. Lingxiang Xiang, Michael Lee Scott |
| 2013 | Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPU. Bo Wu, Zhijia Zhao, Eddy Zheng Zhang, Yunlian Jiang, Xipeng Shen |
| 2013 | Correct and efficient work-stealing for weak memory models. Nhat Minh Lê, Antoniu Pop, Albert Cohen, Francesco Zappa Nardelli |
| 2013 | Data layout optimization for GPGPU architectures. Jun Liu, Wei Ding, Ohyoung Jang, Mahmut T. Kandemir |
| 2013 | Data-only flattening for nested data parallelism. Lars Bergstrom, Matthew Fluet, Mike Rainey, John H. Reppy, Stephen Rosen, Adam Shaw |
| 2013 | Decomposition techniques for optimal design-space exploration of streaming applications. Shobana Padmanabhan, Yixin Chen, Roger D. Chamberlain |
| 2013 | Distributed merge trees. Dmitriy Morozov, Gunther H. Weber |
| 2013 | Exploring different automata representations for efficient regular expression matching on GPUs. Xiaodong Yu, Michela Becchi |
| 2013 | Expressing graph algorithms using generalized active messages. Nick Edmonds, Jeremiah Willcock, Andrew Lumsdaine |
| 2013 | Fast concurrent queues for x86 processors. Adam Morrison, Yehuda Afek |
| 2013 | FastLane: improving performance of software transactional memory for low thread counts. Jons-Tobias Wamhoff, Christof Fetzer, Pascal Felber, Etienne Rivière, Gilles Muller |
| 2013 | From relational verification to SIMD loop synthesis. Gilles Barthe, Juan Manuel Crespo, Sumit Gulwani, César Kunz, Mark Marron |
| 2013 | Ligra: a lightweight graph processing framework for shared memory. Julian Shun, Guy E. Blelloch |
| 2013 | Morph algorithms on GPUs. Rupesh Nasre, Martin Burtscher, Keshav Pingali |
| 2013 | Multi-level parallel computing of reverse time migration for seismic imaging on blue Gene/Q. Ligang Lu, Karen A. Magerlein |
| 2013 | NUMA-aware reader-writer locks. Irina Calciu, David Dice, Yossi Lev, Victor Luchangco, Virendra J. Marathe, Nir Shavit |
| 2013 | Online-ABFT: an online algorithm based fault tolerance scheme for soft error detection in iterative methods. Zizhong Chen |
| 2013 | Ownership passing: efficient distributed memory programming on multi-core systems. Andrew Friedley, Torsten Hoefler, Greg Bronevetsky, Andrew Lumsdaine, Ching-Chen Ma |
| 2013 | Parallel programming with big operators. Changhee Park, Guy L. Steele Jr., Jean-Baptiste Tristan |
| 2013 | Parallel schedule synthesis for attribute grammars. Leo A. Meyerovich, Matthew E. Torok, Eric Atkinson, Rastislav Bodík |
| 2013 | Parallel suffix array and least common prefix for the GPU. Mrinal Deo, Sean Keely |
| 2013 | Programming with hardware lock elision. Yehuda Afek, Amir Levy, Adam Morrison |
| 2013 | RaceFree: an efficient multi-threading model for determinism. Kai Lu, Xu Zhou, Xiaoping Wang, Wenzhe Zhang, Gen Li |
| 2013 | Reducing contention through priority updates. Julian Shun, Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons |
| 2013 | Relational algorithms for multi-bulk-synchronous processors. Gregory Frederick Diamos, Haicheng Wu, Jin Wang, Ashwin Sanjay Lele, Sudhakar Yalamanchili |
| 2013 | Runtime elision of transactional barriers for captured memory. Fernando Miguel Carvalho, João P. Cachopo |
| 2013 | Scalable data race detection for partitioned global address space programs. Chang-Seo Park, Koushik Sen, Costin Iancu |
| 2013 | Scalable deterministic replay in a parallel full-system emulator. Yufei Chen, Haibo Chen |
| 2013 | Scalable statistics counters. Dave Dice, Yossi Lev, Mark Moir |
| 2013 | Scheduling parallel programs by work stealing with private deques. Umut A. Acar, Arthur Charguéraud, Mike Rainey |
| 2013 | StreamScan: fast scan algorithms for GPUs without global barrier synchronization. Shengen Yan, Guoping Long, Yunquan Zhang |
| 2013 | Swift/T: scalable data flow programming for many-task applications. Justin M. Wozniak, Timothy G. Armstrong, Michael Wilde, Daniel S. Katz, Ewing L. Lusk, Ian T. Foster |
| 2013 | TeamWork: synchronizing threads globally to detect real deadlocks for multithreaded programs. Yan Cai, Ke Zhai, Shangru Wu, Wing Kwong Chan |
| 2013 | The tasks with effects model for safe concurrency. Stephen Heumann, Vikram S. Adve, Shengjie Wang |
| 2013 | TigerQuoll: parallel event-based JavaScript. Daniele Bonetta, Walter Binder, Cesare Pautasso |
| 2013 | Towards an energy estimator for fault tolerance protocols. Mohammed el Mehdi Diouri, Olivier Glück, Laurent Lefèvre, Franck Cappello |
| 2013 | Using hardware transactional memory to correct and simplify and readers-writer lock algorithm. Dave Dice, Yossi Lev, Yujie Liu, Victor Luchangco, Mark Moir |
| 2013 | Work-stealing with configurable scheduling strategies. Martin Wimmer, Daniel Cederman, Jesper Larsson Träff, Philippas Tsigas |
| 2013 | WuKong: effective diagnosis of bugs at large system scales. Bowen Zhou, Milind Kulkarni, Saurabh Bagchi |
| 2013 | ZOOMM: a parallel web browser engine for multicore mobile devices. Calin Cascaval, Seth Fowler, Pablo Montesinos-Ortego, Wayne Piekarski, Mehrdad Reshadi, Behnam Robatmili, Michael Weber, Vrajesh Bhavsar |