| 2013 | 27th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2013, Cambridge, MA, USA, May 20-24, 2013 |
| 2013 | A Case for Handshake in Nanophotonic Interconnects. Lei Wang, Jagadish Jayabalan, Minseon Ahn, Haiyin Gu, Ki Hwan Yum, Eun Jung Kim |
| 2013 | A Communication-Optimal N-Body Algorithm for Direct Interactions. Michael B. Driscoll, Evangelos Georganas, Penporn Koanantakool, Edgar Solomonik, Katherine A. Yelick |
| 2013 | A Multi-partitioning Approach to Building Fast and Accurate Counting Bloom Filters. Kun Huang, Jie Zhang, Dafang Zhang, Gaogang Xie, Kavé Salamatian, Alex X. Liu, Wei Li |
| 2013 | A Network Configuration Algorithm Based on Optimization of Kirchhoff Index. Adam Hackett, Deepak Ajwani, Shoukat Ali, Steve Kirkland, John P. Morrison |
| 2013 | A Roofline Model of Energy. Jeewhan Choi, Daniel Bedard, Robert J. Fowler, Richard W. Vuduc |
| 2013 | A Scalable Heterogeneous Parallelization Framework for Iterative Local Searches. Martin Burtscher, Hassan Rabeti |
| 2013 | A Simplified and Accurate Model of Power-Performance Efficiency on Emergent GPU Architectures. Shuaiwen Song, Chun-Yi Su, Barry Rountree, Kirk W. Cameron |
| 2013 | A Study of the Behavior of Synchronization Methods in Commonly Used Languages and Systems. Daniel Cederman, Bapi Chatterjee, Nhan Nguyen Dang, Yiannis Nikolakopoulos, Marina Papatriantafilou, Philippas Tsigas |
| 2013 | A Theoretical Framework for Algorithm-Architecture Co-design. Kenneth Czechowski, Richard W. Vuduc |
| 2013 | A Transparent Collective I/O Implementation. Yongen Yu, Jingjin Wu, Zhiling Lan, Douglas H. Rudd, Nickolay Y. Gnedin, Andrey V. Kravtsov |
| 2013 | A Visual Network Analysis Method for Large-Scale Parallel I/O Systems. Carmen Sigovan, Chris Muelder, Kwan-Liu Ma, Jason Cope, Kamil Iskra, Robert B. Ross |
| 2013 | Acceleration of an Asynchronous Message Driven Programming Paradigm on IBM Blue Gene/Q. Sameer Kumar, Yanhua Sun, Laxmikant V. Kalé |
| 2013 | Adapting Particle Filter Algorithms to Many-Core Architectures. Mehdi Chitchian, Alexander S. van Amesfoort, Andrea Simonetto, Tamás Keviczky, Henk J. Sips |
| 2013 | Adaptive Cache Bypassing for Inclusive Last Level Caches. Saurabh Gupta, Hongliang Gao, Huiyang Zhou |
| 2013 | Adaptive Incremental Checkpointing via Delta Compression for Networked Multicore Systems. Itthichok Jangjaimon, Nian-Feng Tzeng |
| 2013 | Agreement via Symmetry Breaking: On the Structure of Weak Subconsensus Tasks. Armando Castañeda, Sergio Rajsbaum, Michel Raynal |
| 2013 | Algorithms for the Thermal Scheduling Problem. Koyel Mukherjee, Samir Khuller, Amol Deshpande |
| 2013 | An Analytical Performance Model for Partitioning Off-Chip Memory Bandwidth. Ruisheng Wang, Lizhong Chen, Timothy Mark Pinkston |
| 2013 | Analysis of Randomized Work Stealing with False Sharing. Richard Cole, Vijaya Ramachandran |
| 2013 | Automated Rapid Prototyping of Regular Grid-Based Numerical Applications Using Generalized Elemental Subroutines. Yingchong Situ, Ye Wang, Zhiyuan Li |
| 2013 | Big Data in 10 Years. Raghu Ramakrishnan |
| 2013 | Burstiness-aware Server Consolidation via Queuing Theory Approach in a Computing Cloud. Zhaoyi Luo, Zhuzhong Qian |
| 2013 | CASTED: Core-Adaptive Software Transient Error Detection for Tightly Coupled Cores. Konstantina Mitropoulou, Vasileios Porpodas, Marcelo Cintra |
| 2013 | Communication-Avoiding Algorithms for Linear Algebra and Beyond. James Demmel |
| 2013 | Communication-Based Mapping Using Shared Pages. Matthias Diener, Eduardo Henrique Molina da Cruz, Philippe Olivier Alexandre Navaux |
| 2013 | Communication-Optimal Parallel Recursive Rectangular Matrix Multiplication. James Demmel, David Eliahu, Armando Fox, Shoaib Kamil, Benjamin Lipshitz, Oded Schwartz, Omer Spillinger |
| 2013 | Composing Relaxed Transactions. Vincent Gramoli, Rachid Guerraoui, Mihai Letia |
| 2013 | Contention Resolution in a Non-synchronized Multiple Access Channel. Gianluca De Marco, Dariusz R. Kowalski |
| 2013 | Crowdsourcing under Real-Time Constraints. Ioannis Boutsis, Vana Kalogeraki |
| 2013 | Cura: A Cost-Optimized Model for MapReduce in a Cloud. Balaji Palanisamy, Aameek Singh, Ling Liu, Bryan Langston |
| 2013 | Cyclops Tensor Framework: Reducing Communication and Eliminating Load Imbalance in Massively Parallel Contractions. Edgar Solomonik, Devin Matthews, Jeff R. Hammond, James Demmel |
| 2013 | DLOOP: A Flash Translation Layer Exploiting Plane-Level Parallelism. Abdul Rahman Abdurrab, Tao Xie, Wei Wang |
| 2013 | DTN-FLOW: Inter-Landmark Data Flow for High-Throughput Routing in DTNs. Kang Chen, Haiying Shen |
| 2013 | Data-Driven Versus Topology-driven Irregular Computations on GPUs. Rupesh Nasre, Martin Burtscher, Keshav Pingali |
| 2013 | Deploying Graph Algorithms on GPUs: An Adaptive Solution. Da Li, Michela Becchi |
| 2013 | Design and Implementation of the Linpack Benchmark for Single and Multi-node Systems Based on Intel® Xeon Phi Coprocessor. Alexander Heinecke, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Alexander Kobotov, Roman Dubtsov, Greg Henry, Aniruddha G. Shet, George Chrysos, Pradeep Dubey |
| 2013 | Disk-Cache and Parallelism Aware I/O Scheduling to Improve Storage System Performance. Ramya Prabhakar, Mahmut T. Kandemir, Myoungsoo Jung |
| 2013 | Distributed Algorithms for Joint Routing and Frame Aggregation in 802.11n Wireless Mesh Networks. Dawei Gong, Yuanyuan Yang |
| 2013 | Distributed Algorithms for Scheduling on Line and Tree Networks with Non-uniform Bandwidths. Venkatesan T. Chakaravarthy, Anamitra R. Choudhury, Sambuddha Roy, Yogish Sabharwal |
| 2013 | Distributed Low-Latency Out-of-Order Event Processing for High Data Rate Sensor Streams. Christopher Mutschler, Michael Philippsen |
| 2013 | Early Experience on the Blue Gene/Q Supercomputing System. Vitali A. Morozov, Kalyan Kumaran, Venkatram Vishwanath, Jiayuan Meng, Michael E. Papka |
| 2013 | Efficient and Scalable Retrieval Techniques for Global File Properties. Dong H. Ahn, Michael J. Brim, Bronis R. de Supinski, Todd Gamblin, Gregory L. Lee, Matthew P. LeGendre, Barton P. Miller, Adam Moody, Martin Schulz |
| 2013 | Energy-Efficient Scheduling for Best-Effort Interactive Services to Achieve High Response Quality. Zhihui Du, Hongyang Sun, Yuxiong He, Yu He, David A. Bader, Huazhe Zhang |
| 2013 | Exascale Computing - A Fact or a Fiction? Shekhar Borkar |
| 2013 | Exploring SIMD for Molecular Dynamics, Using Intel® Xeon® Processors and Intel® Xeon Phi Coprocessors. Simon J. Pennycook, Christopher J. Hughes, Mikhail Smelyanskiy, Stephen A. Jarvis |
| 2013 | Exploring Traditional and Emerging Parallel Programming Models Using a Proxy Application. Ian Karlin, Abhinav Bhatele, Jeff Keasler, Bradford L. Chamberlain, Jonathan D. Cohen, Zachary DeVito, Riyaz Haque, Dan Laney, Edward Luke, Felix Wang, David F. Richards, Martin Schulz, Charles H. Still |
| 2013 | Extending OpenSHMEM for GPU Computing. Sreeram Potluri, Devendar Bureddy, Hao Wang, Hari Subramoni, Dhabaleswar K. Panda |
| 2013 | Extending the Generality of Molecular Dynamics Simulations on a Special-Purpose Machine. Daniele Paolo Scarpazza, Douglas J. Ierardi, Adam K. Lerer, Kenneth M. Mackenzie, Albert C. Pan, Joseph A. Bank, Edmond Chow, Ron O. Dror, J. P. Grossman, Daniel Killebrew, Mark A. Moraes, Cristian Predescu, John K. Salmon, David E. Shaw |
| 2013 | FlexIO: I/O Middleware for Location-Flexible Scientific Data Analytics. Fang Zheng, Hongbo Zou, Greg Eisenhauer, Karsten Schwan, Matthew Wolf, Jai Dayal, Tuan-Anh Nguyen, Jianting Cao, Hasan Abbasi, Scott Klasky, Norbert Podhorszki, Hongfeng Yu |
| 2013 | GPU-based Runtime Verification. Shay Berkovich, Borzoo Bonakdarpour, Sebastian Fischmeister |
| 2013 | Generalized Hierarchical All-to-All Exchange Patterns. Bogdan Prisacari, Germán Rodríguez, Cyriel Minkenberg |
| 2013 | Guided Region-Based GPU Scheduling: Utilizing Multi-thread Parallelism to Hide Memory Latency. Jianmin Chen, Xi Tao, Zhen Yang, Jih-Kwon Peir, Xiaoyuan Li, Shih-Lien Lu |
| 2013 | HPC Cloud Bad; HPC in the Cloud Good. Josh Simons |
| 2013 | HQL: A Scalable Synchronization Mechanism for GPUs. Ayse Yilmazer, David R. Kaeli |
| 2013 | Hardware-Accelerated Regular Expression Matching with Overlap Handling on IBM PowerEN Processor. Kubilay Atasu, Florian Dörfler, Jan van Lunteren, Christoph Hagleitner |
| 2013 | High Performance FFT Based Poisson Solver on a CPU-GPU Heterogeneous Platform. Jing Wu, Joseph F. JáJá |
| 2013 | High-Productivity and High-Performance Analysis of Filtered Semantic Graphs. Aydin Buluç, Erika Duriakova, Armando Fox, John R. Gilbert, Shoaib Kamil, Adam Lugowski, Leonid Oliker, Samuel Williams |
| 2013 | High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms. George Teodoro, Tony Pan, Tahsin M. Kurç, Jun Kong, Lee A. D. Cooper, Norbert Podhorszki, Scott Klasky, Joel H. Saltz |
| 2013 | Implementing a Blocked Aasen's Algorithm with a Dynamic Scheduler on Multicore Architectures. Grey Ballard, Dulceneia Becker, James Demmel, Jack J. Dongarra, Alex Druinsky, Inon Peled, Oded Schwartz, Sivan Toledo, Ichitaro Yamazaki |
| 2013 | Improving the Computing Efficiency of HPC Systems Using a Combination of Proactive and Preventive Checkpointing. Mohamed-Slim Bouguerra, Ana Gainaru, Leonardo Arturo Bautista-Gomez, Franck Cappello, Satoshi Matsuoka, Naoya Maruyama |
| 2013 | Improving the Performance of the Symmetric Sparse Matrix-Vector Multiplication in Multicore. Theodoros Gkountouvas, Vasileios Karakasis, Kornilios Kourtis, Georgios I. Goumas, Nectarios Koziris |
| 2013 | Integrating Asynchronous Task Parallelism with MPI. Sanjay Chatterjee, Sagnak Tasirlar, Zoran Budimlic, Vincent Cavé, Milind Chabbi, Max Grossman, Vivek Sarkar, Yonghong Yan |
| 2013 | Integrating Online Compression to Accelerate Large-Scale Data Analytics Applications. Tekin Bicer, Jian Yin, David Chiu, Gagan Agrawal, Karen Schuchardt |
| 2013 | JVM-Bypass for Efficient Hadoop Shuffling. Yandong Wang, Cong Xu, Xiaobing Li, Weikuan Yu |
| 2013 | Joint Host-Network Optimization for Energy-Efficient Data Center Networking. Hao Jin, Tosmate Cheocherngngarn, Dmita Levy, Alex Smith, Deng Pan, Jason Liu, Niki Pissinou |
| 2013 | Kernel Specialization for Improved Adaptability and Performance on Graphics Processing Units (GPUs). Nicholas Moore, Miriam Leeser, Laurie A. Smith King |
| 2013 | Locally Self-Adjusting Tree Networks. Chen Avin, Bernhard Haeupler, Zvi Lotker, Christian Scheideler, Stefan Schmid |
| 2013 | Lock-Free and Wait-Free Slot Scheduling Algorithms. Pooja Aggarwal, Smruti R. Sarangi |
| 2013 | Malleable Sorting. Patrick Flick, Peter Sanders, Jochen Speck |
| 2013 | Managing Asynchronous Operations in Coarray Fortran 2.0. Chaoran Yang, Karthik Murthy, John M. Mellor-Crummey |
| 2013 | Massively Parallel Model of Extended Memory Use in Evolutionary Game Dynamics. Amanda Peters Randles, David G. Rand, Christopher Lee, Greg Morrisett, Jayanta Sircar, Martin A. Nowak, Hanspeter Pfister |
| 2013 | Minimizing Communication in All-Pairs Shortest Paths. Edgar Solomonik, Aydin Buluç, James Demmel |
| 2013 | Multi-threaded Graph Partitioning. Dominique LaSalle, George Karypis |
| 2013 | Multi-vehicle Coordination for Wireless Energy Replenishment in Sensor Networks. Cong Wang, Ji Li, Fan Ye, Yuanyuan Yang |
| 2013 | Non Linear Divisible Loads: There is No Free Lunch. Olivier Beaumont, Hubert Larchevêque, Loris Marchal |
| 2013 | Novel Parallelization Schemes for Large-Scale Likelihood-based Phylogenetic Inference. Alexandros Stamatakis, Andre J. Aberer |
| 2013 | On Closed Nesting and Checkpointing in Fault-Tolerant Distributed Transactional Memory. Aditya Dhoke, Binoy Ravindran, Bo Zhang |
| 2013 | On Feasibility of Fingerprinting Wireless Sensor Nodes Using Physical Properties. Xiaowei Mei, Donggang Liu, Kun Sun, Dingbang Xu |
| 2013 | On Graphs, GPUs, and Blind Dating: A Workload to Processor Matchmaking Quest. Abdullah Gharaibeh, Lauro Beltrão Costa, Elizeu Santos-Neto, Matei Ripeanu |
| 2013 | Optimizations and Analysis of BSP Graph Processing Models on Public Clouds. Mark Redekopp, Yogesh Simmhan, Viktor K. Prasanna |
| 2013 | Optimizing Checkpoints Using NVM as Virtual Memory. Sudarsun Kannan, Ada Gavrilovska, Karsten Schwan, Dejan S. Milojicic |
| 2013 | Optimizing Resource allocation while handling SLA violations in Cloud Computing platforms. Lionel Eyraud-Dubois, Hubert Larchevêque |
| 2013 | Optimizing and Auto-Tuning Iterative Stencil Loops for GPUs with the In-Plane Method. Wai Teng Tang, Wen Jun Tan, Ratna Krishnamoorthy, Yi Wen Wong, Shyh-Hao Kuo, Rick Siow Mong Goh, Stephen John Turner, Weng-Fai Wong |
| 2013 | Oversubscription Bounded Multicast Scheduling in Fat-Tree Data Center Networks. Zhiyang Guo, Jun Duan, Yuanyuan Yang |
| 2013 | P-sync: A Photonically Enabled Architecture for Efficient Non-local Data Access. David Whelihan, Jeffrey J. Hughes, Scott M. Sawyer, Eric Robinson, Michael M. Wolf, Sanjeev Mohindra, Julie Mullen, Anna Klein, Michelle S. Beard, Nadya T. Bliss, Johnnie Chan, Robert Hendry, Keren Bergman, Luca P. Carloni |
| 2013 | Parallel Label-Setting Multi-objective Shortest Path Search. Peter Sanders, Lawrence Mandow |
| 2013 | Pattern-Direct and Layout-Aware Replication Scheme for Parallel I/O Systems. Yanlong Yin, Jibing Li, Jun He, Xian-He Sun, Rajeev Thakur |
| 2013 | Perfect Strong Scaling Using No Additional Energy. James Demmel, Andrew Gearhart, Benjamin Lipshitz, Oded Schwartz |
| 2013 | Performance Analysis of the Lattice Boltzmann Model Beyond Navier-Stokes. Amanda Peters Randles, Vivek Kale, Jeff R. Hammond, William Gropp, Efthimios Kaxiras |
| 2013 | Pluggable Watchdog: Transparent Failure Detection for MPI Programs. Keun Soo Yim, Zbigniew Kalbarczyk, Ravishankar K. Iyer |
| 2013 | Profit Aware Load Balancing for Distributed Cloud Data Centers. Shuo Liu, Shaolei Ren, Gang Quan, Ming Zhao, Shangping Ren |
| 2013 | Programmable and Scalable Reductions on Clusters. Jan Ciesko, Javier Bueno, Nikola Puzovic, Alex Ramírez, Rosa M. Badia, Jesús Labarta |
| 2013 | RAIR: Interference Reduction in Regionalized Networks-on-Chip. Lizhong Chen, Kai Hwang, Timothy Mark Pinkston |
| 2013 | Reliable Service Allocation in Clouds. Olivier Beaumont, Lionel Eyraud-Dubois, Hubert Larchevêque |
| 2013 | Replicate and Bundle (RnB) - A Mechanism for Relieving Bottlenecks in Data Centers. Shachar Raindel, Yitzhak Birk |
| 2013 | Replication-Based Load Balancing in Distributed Content-Based Publish/Subscribe. Weixiong Rao, Chao Chen, Pan Hui, Sasu Tarkoma |
| 2013 | Resource Management in VMware Powered Cloud: Concepts and Techniques. Pradeep Padala |
| 2013 | SIPMaP: A Tool for Modeling Irregular Parallel Computations in the Super Instruction Architecture. Nakul Jindal, Victor Lotrich, Erik Deumens, Beverly A. Sanders |
| 2013 | Scaling Techniques for Massive Scale-Free Graphs in Distributed (External) Memory. Roger A. Pearce, Maya B. Gokhale, Nancy M. Amato |
| 2013 | Scaling and Scheduling to Maximize Application Performance within Budget Constraints in Cloud Workflows. Ming Mao, Marty Humphrey |
| 2013 | Scheduling Tree-Shaped Task Graphs to Minimize Memory and Makespan. Loris Marchal, Oliver Sinnen, Frédéric Vivien |
| 2013 | Self-Adaptive OmpSs Tasks in Heterogeneous Environments. Judit Planas, Rosa M. Badia, Eduard Ayguadé, Jesús Labarta |
| 2013 | TM-dietlibc: A TM-aware Real-World System Library. Vesna Smiljkovic, Martin Nowack, Nebojsa Miletic, Tim Harris, Osman S. Ünsal, Adrián Cristal, Mateo Valero |
| 2013 | The Bounded Data Reuse Problem in Scientific Workflows. Mohsen Zohrevandi, Rida A. Bazzi |
| 2013 | Throughput Enhancement through Selective Time Sharing and Dynamic Grouping. Junliang Chen, Bing Bing Zhou, Chen Wang, Peng Lu, Penghao Wang, Albert Y. Zomaya |
| 2013 | Towards Scalable Checkpoint Restart: A Collective Inline Memory Contents Deduplication Proposal. Bogdan Nicolae |
| 2013 | V-Cache: Towards Flexible Resource Provisioning for Multi-tier Applications in IaaS Clouds. Yanfei Guo, Palden Lama, Jia Rao, Xiaobo Zhou |
| 2013 | Virtual Systolic Array for QR Decomposition. Jakub Kurzak, Piotr Luszczek, Mark Gates, Ichitaro Yamazaki, Jack J. Dongarra |
| 2013 | WHATSUP: A Decentralized Instant News Recommender. Antoine Boutet, Davide Frey, Rachid Guerraoui, Arnaud Jégou, Anne-Marie Kermarrec |
| 2013 | Wait-free Hyperobjects for Task-Parallel Programming Systems. Martin Wimmer |
| 2013 | XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures. Thierry Gautier, João V. F. Lima, Nicolas Maillard, Bruno Raffin |
| 2013 | ZHT: A Light-Weight Reliable Persistent Dynamic Scalable Zero-Hop Distributed Hash Table. Tonglin Li, Xiaobing Zhou, Kevin Brandstatter, Dongfang Zhao, Ke Wang, Anupam Rajendran, Zhao Zhang, Ioan Raicu |
| 2013 | iBridge: Improving Unaligned Parallel File Access with Solid-State Drives. Xuechen Zhang, Ke Liu, Kei Davis, Song Jiang |