PPoPP - RankMe

48 papers

Year	Title / Authors
2017	A Multicore Path to Connectomics-on-Demand. Alexander Matveev, Yaron Meirovitch, Hayk Saribekyan, Wiktor Jakubiuk, Tim Kaler, Gergely Ódor, David M. Budden, Aleksandar Zlateski, Nir Shavit
2017	An Efficient Abortable-locking Protocol for Multi-level NUMA Systems. Milind Chabbi, Abdelhalim Amer, Shasha Wen, Xu Liu
2017	Checking Concurrent Data Structures Under the C/C++11 Memory Model. Peizhao Ou, Brian Demsky
2017	Combining SIMD and Many/Multi-core Parallelism for Finite State Machines with Enumerative Speculation. Peng Jiang, Gagan Agrawal
2017	Contention in Structured Concurrency: Provably Efficient Dynamic Non-Zero Indicators for Nested Parallelism. Umut A. Acar, Naama Ben-David, Mike Rainey
2017	EffiSha: A Software Framework for Enabling Efficient Preemptive Scheduling of GPU. Guoyang Chen, Yue Zhao, Xipeng Shen, Huiyang Zhou
2017	Eunomia: Scaling Concurrent Search Trees under Contention Using HTM. Xin Wang, Weihua Zhang, Zhaoguo Wang, Ziyun Wei, Haibo Chen, Wenyun Zhao
2017	Exploiting Vector and Multicore Parallelism for Recursive, Data- and Task-Parallel Programs. Bin Ren, Sriram Krishnamoorthy, Kunal Agrawal, Milind Kulkarni
2017	Function Call Re-Vectorization. Rubens E. A. Moreira, Caroline Collange, Fernando Magno Quintão Pereira
2017	Grammar-aware Parallelization for Scalable XPath Querying. Lin Jiang, Zhijia Zhao
2017	Groute: An Asynchronous Multi-GPU Programming Model for Irregular Computations. Tal Ben-Nun, Michael Sutton, Sreepathi Pai, Keshav Pingali
2017	Isoefficiency in Practice: Configuring and Understanding the Performance of Task-based Applications. Sergei Shudler, Alexandru Calotoiu, Torsten Hoefler, Felix Wolf
2017	It's Time for a New Old Language. Guy L. Steele Jr.
2017	KiWi: A Key-Value Map for Scalable Real-Time Analytics. Dmitry Basin, Edward Bortnikov, Anastasia Braginsky, Guy Golan-Gueta, Eshcar Hillel, Idit Keidar, Moshe Sulamy
2017	Layout Lock: A Scalable Locking Paradigm for Concurrent Data Layout Modifications. Nachshon Cohen, Arie Tal, Erez Petrank
2017	Model-based Iterative CT Image Reconstruction on GPUs. Amit Sabne, Xiao Wang, Sherman J. Kisner, Charles A. Bouman, Anand Raghunathan, Samuel P. Midkiff
2017	Noise Injection Techniques to Expose Subtle and Unintended Message Races. Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz, Christopher M. Chambreau
2017	Optimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis. Samyam Rajbhandari, Fabrice Rastello, Karol Kowalski, Sriram Krishnamoorthy, P. Sadayappan
2017	POSTER: A GPU-Friendly Skiplist Algorithm. Nurit Moscovici, Nachshon Cohen, Erez Petrank
2017	POSTER: A Wait-Free Queue with Wait-Free Memory Reclamation. Pedro Ramalhete, Andreia Correia
2017	POSTER: An Architecture and Programming Model for Accelerating Parallel Commutative Computations via Privatization. Vignesh Balaji, Dhruva Tirumala, Brandon Lucia
2017	POSTER: An Infrastructure for HPC Knowledge Sharing and Reuse. Yue Zhao, Chunhua Liao, Xipeng Shen
2017	POSTER: Automated Load Balancer Selection Based on Application Characteristics. Harshitha Menon, Kavitha Chandrasekar, Laxmikant V. Kalé
2017	POSTER: Cache-Oblivious MPI All-to-All Communications on Many-Core Architectures. Shigang Li, Yunquan Zhang, Torsten Hoefler
2017	POSTER: Distributed Control: The Benefits of Eliminating Global Synchronization via Effective Scheduling. Jesun Sahariar Firoz, Thejaka Amila Kanewala, Marcin Zalewski, Martina Barnas, Andrew Lumsdaine
2017	POSTER: HythTM: Extending the Applicability of Intel TSX Hardware Transactional Support. Arnamoy Bhattacharyya, Mike Dai Wang, Mihai Burcea, Yi Ding, Allen Deng, Sai Varikooty, Shafaaf Hossain, Cristiana Amza
2017	POSTER: IOGP: An Incremental Online Graph Partitioning for Large-Scale Distributed Graph Databases. Dong Dai, Wei Zhang, Yong Chen
2017	POSTER: MAPA: An Automatic Memory Access Pattern Analyzer for GPU Applications. Gangwon Jo, Jaehoon Jung, Jiyoung Park, Jaejin Lee
2017	POSTER: On the Problem of Consistency Exceptions in the Context of Strong Memory Models. Minjia Zhang, Swarnendu Biswas, Michael D. Bond
2017	POSTER: Poor Man's URCU. Pedro Ramalhete, Andreia Correia
2017	POSTER: Provably Efficient Scheduling of Cache-Oblivious Wavefront Algorithms. Rezaul Chowdhury, Pramod Ganapathi, Yuan Tang, Jesmin Jahan Tithi
2017	POSTER: Recovering Performance for Vector-based Machine Learning on Managed Runtime. Mingyu Wu, Haibing Guan, Binyu Zang, Haibo Chen
2017	POSTER: Reuse, don't Recycle: Transforming Algorithms that Throw Away Descriptors. Maya Arbel-Raviv, Trevor Brown
2017	POSTER: STAR (Space-Time Adaptive and Reductive) Algorithms for Real-World Space-Time Optimality. Yuan Tang, Ronghui You
2017	POSTER: State Teleportation via Hardware Transactional Memory. Nachshon Cohen, Maurice Herlihy, Erez Petrank, Elias Wald
2017	Pagoda: Fine-Grained GPU Resource Virtualization for Narrow Tasks. Tsung Tai Yeh, Amit Sabne, Putt Sakdhnagool, Rudolf Eigenmann, Timothy G. Rogers
2017	Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Austin, TX, USA, February 4-8, 2017. Vivek Sarkar, Lawrence Rauchwerger
2017	Processor-Oblivious Record and Replay. Robert Utterback, Kunal Agrawal, I-Ting Angelina Lee, Milind Kulkarni
2017	S-Caffe: Co-designing MPI Runtimes and Caffe for Scalable Deep Learning on Modern GPU Clusters. Ammar Ahmad Awan, Khaled Hamidouche, Jahanzeb Maqbool Hashmi, Dhabaleswar K. Panda
2017	SC-Haskell: Sequential Consistency in Languages That Minimize Mutable Shared Heap. Michael Vollmer, Ryan G. Scott, Madanlal Musuvathi, Ryan R. Newton
2017	Self-Checkpoint: An In-Memory Checkpoint Method Using Less Space and Its Practice on Fault-Tolerant HPL. Xiongchao Tang, Jidong Zhai, Bowen Yu, Wenguang Chen, Weimin Zheng
2017	Silent Data Corruption Resilient Two-sided Matrix Factorizations. Panruo Wu, Nathan DeBardeleben, Qiang Guan, Sean Blanchard, Jieyang Chen, Dingwen Tao, Xin Liang, Kaiming Ouyang, Zizhong Chen
2017	Simple, Accurate, Analytical Time Modeling and Optimal Tile Size Selection for GPGPU Stencils. Nirmal Prajapati, Waruna Ranasinghe, Sanjay V. Rajopadhye, Rumen Andonov, Hristo N. Djidjev, Tobias Grosser
2017	Synchronized-by-Default Concurrency for Shared-Memory Systems. Martin Bättig, Thomas R. Gross
2017	Tapir: Embedding Fork-Join Parallelism into LLVM's Intermediate Representation. Tao B. Schardl, William S. Moses, Charles E. Leiserson
2017	Thread Data Sharing in Cache: Theory and Measurement. Hao Luo, Pengcheng Li, Chen Ding
2017	Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning. Xiuxia Zhang, Guangming Tan, Shuangbai Xue, Jiajia Li, Keren Zhou, Mingyu Chen
2017	Using Butterfly-Patterned Partial Sums to Draw from Discrete Distributions. Guy L. Steele Jr., Jean-Baptiste Tristan