PPoPP B

46 papers

YearTitle / Authors
2013A peta-scalable CPU-GPU algorithm for global atmospheric simulations.
Chao Yang, Wei Xue, Haohuan Fu, Lin Gan, Linfeng Li, Yangtong Xu, Yutong Lu, Jiachang Sun, Guangwen Yang, Weimin Zheng
2013ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP '13, Shenzhen, China, February 23-27, 2013
Alex Nicolau, Xiaowei Shen, Saman P. Amarasinghe, Richard W. Vuduc
2013Adoption protocols for fanout-optimal fault-tolerant termination detection.
Jonathan Lifflander, Phil Miller, Laxmikant V. Kalé
2013Array dataflow analysis for polyhedral X10 programs.
Tomofumi Yuki, Paul Feautrier, Sanjay V. Rajopadhye, Vijay A. Saraswat
2013Automatic problem size sensitive task partitioning on heterogeneous parallel systems.
Ivan Grasso, Klaus Kofler, Biagio Cosenza, Thomas Fahringer
2013Betweenness centrality: algorithms and implementations.
Dimitrios Prountzos, Keshav Pingali
2013Compiler aided manual speculation for high performance concurrent data structures.
Lingxiang Xiang, Michael Lee Scott
2013Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPU.
Bo Wu, Zhijia Zhao, Eddy Zheng Zhang, Yunlian Jiang, Xipeng Shen
2013Correct and efficient work-stealing for weak memory models.
Nhat Minh Lê, Antoniu Pop, Albert Cohen, Francesco Zappa Nardelli
2013Data layout optimization for GPGPU architectures.
Jun Liu, Wei Ding, Ohyoung Jang, Mahmut T. Kandemir
2013Data-only flattening for nested data parallelism.
Lars Bergstrom, Matthew Fluet, Mike Rainey, John H. Reppy, Stephen Rosen, Adam Shaw
2013Decomposition techniques for optimal design-space exploration of streaming applications.
Shobana Padmanabhan, Yixin Chen, Roger D. Chamberlain
2013Distributed merge trees.
Dmitriy Morozov, Gunther H. Weber
2013Exploring different automata representations for efficient regular expression matching on GPUs.
Xiaodong Yu, Michela Becchi
2013Expressing graph algorithms using generalized active messages.
Nick Edmonds, Jeremiah Willcock, Andrew Lumsdaine
2013Fast concurrent queues for x86 processors.
Adam Morrison, Yehuda Afek
2013FastLane: improving performance of software transactional memory for low thread counts.
Jons-Tobias Wamhoff, Christof Fetzer, Pascal Felber, Etienne Rivière, Gilles Muller
2013From relational verification to SIMD loop synthesis.
Gilles Barthe, Juan Manuel Crespo, Sumit Gulwani, César Kunz, Mark Marron
2013Ligra: a lightweight graph processing framework for shared memory.
Julian Shun, Guy E. Blelloch
2013Morph algorithms on GPUs.
Rupesh Nasre, Martin Burtscher, Keshav Pingali
2013Multi-level parallel computing of reverse time migration for seismic imaging on blue Gene/Q.
Ligang Lu, Karen A. Magerlein
2013NUMA-aware reader-writer locks.
Irina Calciu, David Dice, Yossi Lev, Victor Luchangco, Virendra J. Marathe, Nir Shavit
2013Online-ABFT: an online algorithm based fault tolerance scheme for soft error detection in iterative methods.
Zizhong Chen
2013Ownership passing: efficient distributed memory programming on multi-core systems.
Andrew Friedley, Torsten Hoefler, Greg Bronevetsky, Andrew Lumsdaine, Ching-Chen Ma
2013Parallel programming with big operators.
Changhee Park, Guy L. Steele Jr., Jean-Baptiste Tristan
2013Parallel schedule synthesis for attribute grammars.
Leo A. Meyerovich, Matthew E. Torok, Eric Atkinson, Rastislav Bodík
2013Parallel suffix array and least common prefix for the GPU.
Mrinal Deo, Sean Keely
2013Programming with hardware lock elision.
Yehuda Afek, Amir Levy, Adam Morrison
2013RaceFree: an efficient multi-threading model for determinism.
Kai Lu, Xu Zhou, Xiaoping Wang, Wenzhe Zhang, Gen Li
2013Reducing contention through priority updates.
Julian Shun, Guy E. Blelloch, Jeremy T. Fineman, Phillip B. Gibbons
2013Relational algorithms for multi-bulk-synchronous processors.
Gregory Frederick Diamos, Haicheng Wu, Jin Wang, Ashwin Sanjay Lele, Sudhakar Yalamanchili
2013Runtime elision of transactional barriers for captured memory.
Fernando Miguel Carvalho, João P. Cachopo
2013Scalable data race detection for partitioned global address space programs.
Chang-Seo Park, Koushik Sen, Costin Iancu
2013Scalable deterministic replay in a parallel full-system emulator.
Yufei Chen, Haibo Chen
2013Scalable statistics counters.
Dave Dice, Yossi Lev, Mark Moir
2013Scheduling parallel programs by work stealing with private deques.
Umut A. Acar, Arthur Charguéraud, Mike Rainey
2013StreamScan: fast scan algorithms for GPUs without global barrier synchronization.
Shengen Yan, Guoping Long, Yunquan Zhang
2013Swift/T: scalable data flow programming for many-task applications.
Justin M. Wozniak, Timothy G. Armstrong, Michael Wilde, Daniel S. Katz, Ewing L. Lusk, Ian T. Foster
2013TeamWork: synchronizing threads globally to detect real deadlocks for multithreaded programs.
Yan Cai, Ke Zhai, Shangru Wu, Wing Kwong Chan
2013The tasks with effects model for safe concurrency.
Stephen Heumann, Vikram S. Adve, Shengjie Wang
2013TigerQuoll: parallel event-based JavaScript.
Daniele Bonetta, Walter Binder, Cesare Pautasso
2013Towards an energy estimator for fault tolerance protocols.
Mohammed el Mehdi Diouri, Olivier Glück, Laurent Lefèvre, Franck Cappello
2013Using hardware transactional memory to correct and simplify and readers-writer lock algorithm.
Dave Dice, Yossi Lev, Yujie Liu, Victor Luchangco, Mark Moir
2013Work-stealing with configurable scheduling strategies.
Martin Wimmer, Daniel Cederman, Jesper Larsson Träff, Philippas Tsigas
2013WuKong: effective diagnosis of bugs at large system scales.
Bowen Zhou, Milind Kulkarni, Saurabh Bagchi
2013ZOOMM: a parallel web browser engine for multicore mobile devices.
Calin Cascaval, Seth Fowler, Pablo Montesinos-Ortego, Wayne Piekarski, Mehrdad Reshadi, Behnam Robatmili, Michael Weber, Vrajesh Bhavsar