| 2012 | 26th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2012, Shanghai, China, May 21-25, 2012 |
| 2012 | A Case Study of Designing Efficient Algorithm-based Fault Tolerant Application for Exascale Parallelism. Erlin Yao, Rui Wang, Mingyu Chen, Guangming Tan, Ninghui Sun |
| 2012 | A Comprehensive Study of Task Coalescing for Selecting Parallelism Granularity in a Two-Stage Bidiagonal Reduction. Azzam Haidar, Hatem Ltaief, Piotr Luszczek, Jack J. Dongarra |
| 2012 | A Highly Parallel Reuse Distance Analysis Algorithm on GPUs. Huimin Cui, Qing Yi, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng |
| 2012 | A Lower Bound on Proximity Preservation by Space Filling Curves. Pan Xu, Srikanta Tirthapura |
| 2012 | A Novel Sorting Algorithm for Many-core Architectures Based on Adaptive Bitonic Sort. Hagen Peters, Ole Schulz-Hildebrandt, Norbert Luttenberger |
| 2012 | A Parallel Algorithm for Spectrum-based Short Read Error Correction. Ankit Shah, Sriram P. Chockalingam, Srinivas Aluru |
| 2012 | A Parallel Tiled Solver for Dense Symmetric Indefinite Systems on Multicore Architectures. Marc Baboulin, Dulceneia Becker, Jack J. Dongarra |
| 2012 | A Predictive Model for Solving Small Linear Algebra Problems in GPU Registers. Michael J. Anderson, David Sheffield, Kurt Keutzer |
| 2012 | A Self-Stabilization Process for Small-World Networks. Sebastian Kniesburges, Andreas Koutsopoulos, Christian Scheideler |
| 2012 | A Self-tuning Failure Detection Scheme for Cloud Computing Service. Naixue Xiong, Athanasios V. Vasilakos, Jie Wu, Yang Richard Yang, Andy J. Rindos, Yuezhi Zhou, Wen-Zhan Song, Yi Pan |
| 2012 | A Source-aware Interrupt Scheduling for Modern Parallel I/O Systems. Hongbo Zou, Xian-He Sun, Siyuan Ma, Xi Duan |
| 2012 | A uGNI-based Asynchronous Message-driven Runtime System for Cray Supercomputers with Gemini Interconnect. Yanhua Sun, Gengbin Zheng, Laximant V. Kalé, Terry R. Jones, Ryan Olson |
| 2012 | Accelerating Large Scale Image Analyses on Parallel, CPU-GPU Equipped Systems. George Teodoro, Tahsin M. Kurç, Tony Pan, Lee A. D. Cooper, Jun Kong, Patrick M. Widener, Joel H. Saltz |
| 2012 | Accelerating Nearest Neighbor Search on Manycore Systems. Lawrence Cayton |
| 2012 | Advancing Large Scale Many-Body QMC Simulations on GPU Accelerated Multicore Systems. Andrés Tomás, Chia-Chen Chang, Richard Scalettar, Zhaojun Bai |
| 2012 | Algebraic Block Multi-Color Ordering Method for Parallel Multi-Threaded Sparse Triangular Solver in ICCG Method. Takeshi Iwashita, Hiroshi Nakashima, Yasuhito Takahashi |
| 2012 | An Accurate GPU Performance Model for Effective Control Flow Divergence Optimization. Zheng Cui, Yun Liang, Kyle Rupnow, Deming Chen |
| 2012 | An Efficient Framework for Multi-dimensional Tuning of High Performance Computing Applications. Guojing Cong, Hui-Fang Wen, I-Hsin Chung, David J. Klepacki, Hiroki Murata, Yasushi Negishi |
| 2012 | An SMT-Selection Metric to Improve Multithreaded Applications' Performance. Justin R. Funston, Kaoutar El Maghraoui, Joefon Jann, Pratap Pattnaik, Alexandra Fedorova |
| 2012 | Automated and Agile Server Parameter Tuning with Learning and Control. Yanfei Guo, Palden Lama, Xiaobo Zhou |
| 2012 | Automatic Resource Scheduling with Latency Hiding for Parallel Stencil Applications on GPGPU Clusters. Kumiko Maeda, Masana Murase, Munehiro Doi, Hideaki Komatsu, Shigeho Noda, Ryutaro Himeno |
| 2012 | BRISA: Combining Efficiency and Reliability in Epidemic Data Dissemination. Miguel Matos, Valerio Schiavoni, Pascal Felber, Rui Oliveira, Etienne Rivière |
| 2012 | Building billion-threads computer and elastic processor. Guo-Jie Li |
| 2012 | Competitive Cache Replacement Strategies for Shared Cache Environments. Anil Kumar Katti, Vijaya Ramachandran |
| 2012 | Consistency-aware Partitioning Algorithm in Multi-server Distributed Virtual Environments. Yusen Li, Wentong Cai |
| 2012 | Cross-layer Energy and Performance Evaluation of a Nanophotonic Manycore Processor System Using Real Application Workloads. George Kurian, Chen Sun, Chia-Hsin Owen Chen, Jason E. Miller, Jürgen Michel, Lan Wei, Dimitri A. Antoniadis, Li-Shiuan Peh, Lionel C. Kimerling, Vladimir Stojanovic, Anant Agarwal |
| 2012 | DCAF - A Directly Connected Arbitration-Free Photonic Crossbar for Energy-Efficient High Performance Computing. Christopher Nitta, Matthew K. Farrens, Venkatesh Akella |
| 2012 | Designing Non-blocking Allreduce with Collective Offload on InfiniBand Clusters: A Case Study with Conjugate Gradient Solvers. Krishna Chaitanya Kandalla, Ulrike Meier Yang, Jeff Keasler, Tzanio V. Kolev, Adam Moody, Hari Subramoni, Karen Tomko, Jérôme Vienne, Bronis R. de Supinski, Dhabaleswar K. Panda |
| 2012 | Distributed Demand and Response Algorithm for Optimizing Social-Welfare in Smart Grid. Qifen Dong, Li Yu, Wen-Zhan Song, Lang Tong, Shaojie Tang |
| 2012 | Distributed Transactional Memory for General Networks. Gokarna Sharma, Costas Busch, Srinivasagopalan Srivathsan |
| 2012 | Dynamic Message Ordering for Topic-Based Publish/Subscribe Systems. Roberto Baldoni, Silvia Bonomi, Marco Platania, Leonardo Querzoni |
| 2012 | Dynamic Operands Insertion for VLIW Architecture with a Reduced Bit-width Instruction Set. Jongwon Lee, Jonghee M. Youn, Jihoon Lee, Minwook Ahn, Yunheung Paek |
| 2012 | Efficient Quality Threshold Clustering for Parallel Architectures. Anthony Danalis, Collin McCurdy, Jeffrey S. Vetter |
| 2012 | Efficient Resource Oblivious Algorithms for Multicores with False Sharing. Richard Cole, Vijaya Ramachandran |
| 2012 | Enabling In-situ Execution of Coupled Scientific Workflow on Multi-core Platform. Fan Zhang, Ciprian Docan, Manish Parashar, Scott Klasky, Norbert Podhorszki, Hasan Abbasi |
| 2012 | Enhancing the Scalability of Consistency-based Progressive Multiple Sequences Alignment Applications. Miquel Orobitg, Fernando Cores, Fernando Guirado, Carsten Kemena, Cédric Notredame, Ana Ripoll |
| 2012 | Evaluating Mesh-based P2P Video-on-Demand Systems. Yingwu Zhu |
| 2012 | Evaluating the Impact of TLB Misses on Future HPC Systems. Alessandro Morari, Roberto Gioiosa, Robert W. Wisniewski, Bryan S. Rosenburg, Todd Inglett, Mateo Valero |
| 2012 | ExPERT: Pareto-Efficient Task Replication on Grids and a Cloud. Orna Agmon Ben-Yehuda, Assaf Schuster, Artyom Sharov, Mark Silberstein, Alexandru Iosup |
| 2012 | Exascale System Software for the Year of the Dragon. Pete Beckman |
| 2012 | Exploring the Scope of the InfiniBand Congestion Control Mechanism. Ernst Gunnar Gran, Sven-Arne Reinemo, Olav Lysne, Tor Skeie, Eitan Zahavi, Gilad Shainer |
| 2012 | Fast and Efficient Graph Traversal Algorithm for CPUs: Maximizing Single-Node Efficiency. Jatin Chhugani, Nadathur Satish, Changkyu Kim, Jason Sewall, Pradeep Dubey |
| 2012 | FractalMRC: Online Cache Miss Rate Curve Prediction on Commodity Systems. Lulu He, Zhibin Yu, Hai Jin |
| 2012 | GTI: A Generic Tools Infrastructure for Event-Based Tools in Parallel Systems. Tobias Hilbrich, Matthias S. Müller, Bronis R. de Supinski, Martin Schulz, Wolfgang E. Nagel |
| 2012 | Generating Device-specific GPU Code for Local Operators in Medical Imaging. Richard Membarth, Frank Hannig, Jürgen Teich, Mario Körner, Wieland Eckert |
| 2012 | Graph Partitioning for Reconfigurable Topology. Deepak Ajwani, Shoukat Ali, John P. Morrison |
| 2012 | Heterogeneous Task Scheduling for Accelerated OpenMP. Thomas Scogland, Barry Rountree, Wu-chun Feng, Bronis R. de Supinski |
| 2012 | HierKNEM: An Adaptive Framework for Kernel-Assisted and Topology-Aware Collective Communications on Many-core Clusters. Teng Ma, George Bosilca, Aurélien Bouteiller, Jack J. Dongarra |
| 2012 | Hierarchical Local Storage: Exploiting Flexible User-Data Sharing Between MPI Tasks. Marc Tchiboukdjian, Patrick Carribault, Marc Pérache |
| 2012 | Hierarchical QR Factorization Algorithms for Multi-core Cluster Systems. Jack J. Dongarra, Mathieu Faverge, Thomas Hérault, Julien Langou, Yves Robert |
| 2012 | High Performance Non-uniform FFT on Modern X86-based Multi-core Systems. Dhiraj D. Kalamkar, Joshua D. Trzasko, Srinivas Sridharan, Mikhail Smelyanskiy, Daehyun Kim, Armando Manduca, Yunhong Shu, Matt A. Bernstein, Bharat Kaul, Pradeep Dubey |
| 2012 | High-Performance Design of HBase with RDMA over InfiniBand. Jian Huang, Xiangyong Ouyang, Jithin Jose, Md. Wasi-ur-Rahman, Hao Wang, Miao Luo, Hari Subramoni, Chet Murthy, Dhabaleswar K. Panda |
| 2012 | High-Performance Interaction-Based Simulation of Gut Immunopathologies with ENteric Immunity Simulator (ENISI). Keith R. Bisset, Md. Maksudul Alam, Josep Bassaganya-Riera, Adria Carbo, Stephen G. Eubank, Raquel Hontecillas, Stefan Hoops, Yongguo Mei, Katherine V. Wendelsdorf, Dawen Xie, Jae-Seung Yeom, Madhav V. Marathe |
| 2012 | Highly Efficient Performance Portable Tracking of Evolving Surfaces. Wei Yu, Franz Franchetti, James C. Hoe, Tsuhan Chen |
| 2012 | Holistic Debugging of MPI Derived Datatypes. Joachim Protze, Tobias Hilbrich, Andreas Knüpfer, Bronis R. de Supinski, Matthias S. Müller |
| 2012 | Hybrid Static/dynamic Scheduling for Already Optimized Dense Matrix Factorization. Simplice Donfack, Laura Grigori, William D. Gropp, Vivek Kale |
| 2012 | Hybrid Transactions: Lock Allocation and Assignment for Irrevocability. Jaswanth Sreeram, Santosh Pande |
| 2012 | HydEE: Failure Containment without Event Logging for Large Scale Send-Deterministic MPI Applications. Amina Guermouche, Thomas Ropars, Marc Snir, Franck Cappello |
| 2012 | Identifying Opportunities for Byte-Addressable Non-Volatile Memory in Extreme-Scale Scientific Applications. Dong Li, Jeffrey S. Vetter, Gabriel Marin, Collin McCurdy, Cristian Cira, Zhuo Liu, Weikuan Yu |
| 2012 | Improved Bounds for Discrete Diffusive Load Balancing. Clemens P. J. Adolphs, Petra Berenbrink |
| 2012 | Improving Parallel IO Performance of Cell-based AMR Cosmology Applications. Yongen Yu, Douglas H. Rudd, Zhiling Lan, Nickolay Y. Gnedin, Andrey V. Kravtsov, Jingjin Wu |
| 2012 | Improving the Performance of Dynamical Simulations Via Multiple Right-Hand Sides. Xing Liu, Edmond Chow, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy |
| 2012 | Large-scale visual data analysis. Chris R. Johnson |
| 2012 | Load Balancing of Dynamical Nucleation Theory Monte Carlo Simulations through Resource Sharing Barriers. Humayun Arafat, P. Sadayappan, James Dinan, Sriram Krishnamoorthy, Theresa L. Windus |
| 2012 | Locality Principle Revisited: A Probability-Based Quantitative Approach. Saurabh Gupta, Ping Xiang, Yi Yang, Huiyang Zhou |
| 2012 | Low-Cost Parallel Algorithms for 2: 1 Octree Balance. Tobin Isaac, Carsten Burstedde, Omar Ghattas |
| 2012 | MATE-CG: A Map Reduce-Like Framework for Accelerating Data-Intensive Computations on Heterogeneous Clusters. Wei Jiang, Gagan Agrawal |
| 2012 | Mapping Dense LU Factorization on Multicore Supercomputer Nodes. Jonathan Lifflander, Phil Miller, Ramprasad Venkataraman, Anshu Arya, Laxmikant V. Kalé, Terry R. Jones |
| 2012 | Meteor Shower: A Reliable Stream Processing System for Commodity Data Centers. Huayong Wang, Li-Shiuan Peh, Emmanouil Koukoumidis, Shao Tao, Mun Choon Chan |
| 2012 | Minimizing Weighted Mean Completion Time for Malleable Tasks Scheduling. Olivier Beaumont, Nicolas Bonichon, Lionel Eyraud-Dubois, Loris Marchal |
| 2012 | Miss-Correlation Folding: Encoding Per-Block Miss Correlations in Compressed DRAM for Data Prefetching. Gang Liu, Jih-Kwon Peir, Victor W. Lee |
| 2012 | Modeling and Analyzing Key Performance Factors of Shared Memory MapReduce. Devesh Tiwari, Yan Solihin |
| 2012 | Multi-core Spanning Forest Algorithms using the Disjoint-set Data Structure. Md. Mostofa Ali Patwary, Peder Refsnes, Fredrik Manne |
| 2012 | Multi-level Layout Optimization for Efficient Spatio-temporal Queries on ISABELA-compressed Data. Zhenhuan Gong, Sriram Lakshminarasimhan, John Jenkins, Hemanth Kolla, Stéphane Ethier, Jackie Chen, Robert B. Ross, Scott Klasky, Nagiza F. Samatova |
| 2012 | Multithreaded Algorithms for Maxmum Matching in Bipartite Graphs. Ariful Azad, Mahantesh Halappanavar, Sivasankaran Rajamanickam, Erik G. Boman, Arif M. Khan, Alex Pothen |
| 2012 | Multithreaded Clustering for Multi-level Hypergraph Partitioning. Ümit V. Çatalyürek, Mehmet Deveci, Kamer Kaya, Bora Uçar |
| 2012 | NUMA Aware Iterative Stencil Computations on Many-Core Systems. Mohammed Shaheen, Robert Strzodka |
| 2012 | NVMalloc: Exposing an Aggregate SSD Store as a Memory Partition in Extreme-Scale Machines. Chao Wang, Sudharshan S. Vazhkudai, Xiaosong Ma, Fei Meng, Youngjae Kim, Christian Engelmann |
| 2012 | New Scheduling Strategies and Hybrid Programming for a Parallel Right-looking Sparse LU Factorization Algorithm on Multicore Cluster Systems. Ichitaro Yamazaki, Xiaoye S. Li |
| 2012 | On Nonblocking Multirate Multicast Fat-tree Data Center Networks with Server Redundancy. Zhiyang Guo, Yuanyuan Yang |
| 2012 | On the Role of NVRAM in Data-intensive Architectures: An Evaluation. Brian Van Essen, Roger A. Pearce, Sasha Ames, Maya B. Gokhale |
| 2012 | On λ-Alert Problem. Marek Klonowski, Dominik Pajak |
| 2012 | Opportunistic Data-driven Execution of Parallel Programs for Efficient I/O Services. Xuechen Zhang, Kei Davis, Song Jiang |
| 2012 | Optimal Algorithms and Approximation Algorithms for Replica Placement with Distance Constraints in Tree Networks. Anne Benoit, Hubert Larchevêque, Paul Renaud-Goud |
| 2012 | Optimal Resource Rental Planning for Elastic Applications in Cloud Market. Han Zhao, Miao Pan, Xinxin Liu, Xiaolin Li, Yuguang Fang |
| 2012 | Optimization of Parallel Discrete Event Simulator for Multi-core Systems. Deepak Jagtap, Nael B. Abu-Ghazaleh, Dmitry Ponomarev |
| 2012 | Optimizing Busy Time on Parallel Machines. George B. Mertzios, Mordechai Shalom, Ariella Voloshin, Prudence W. H. Wong, Shmuel Zaks |
| 2012 | Optimizing Large-scale Graph Analysis on Multithreaded, Multicore Platforms. Guojing Cong, Konstantin Makarychev |
| 2012 | PAMI: A Parallel Active Message Interface for the Blue Gene/Q Supercomputer. Sameer Kumar, Amith R. Mamidala, Daniel Faraj, Brian E. Smith, Michael Blocksome, Bob Cernohous, Douglas Miller, Jeff Parker, Joseph Ratterman, Philip Heidelberger, Dong Chen, Burkhard D. Steinmacher-Burow |
| 2012 | PARDA: A Fast Parallel Reuse Distance Analysis Algorithm. Qingpeng Niu, James Dinan, Qingda Lu, P. Sadayappan |
| 2012 | PGAS for Distributed Numerical Python Targeting Multi-core Clusters. Mads Ruben Burgdorff Kristensen, Yili Zheng, Brian Vinter |
| 2012 | Parametric Utilization Bounds for Fixed-Priority Multiprocessor Scheduling. Nan Guan, Martin Stigge, Wang Yi, Ge Yu |
| 2012 | Performance Portability with the Chapel Language. Albert Sidelnik, Saeed Maleki, Bradford L. Chamberlain, María Jesús Garzarán, David A. Padua |
| 2012 | Power-aware Manhattan Routing on Chip Multiprocessors. Anne Benoit, Rami G. Melhem, Paul Renaud-Goud, Yves Robert |
| 2012 | Predicting Potential Speedup of Serial Code via Lightweight Profiling and Emulations with Memory Performance Model. Minjang Kim, Pranith Kumar, Hyesoon Kim, Bevin Brett |
| 2012 | Productive Programming of GPU Clusters with OmpSs. Javier Bueno, Judit Planas, Alejandro Duran, Rosa M. Badia, Xavier Martorell, Eduard Ayguadé, Jesús Labarta |
| 2012 | Profiling-based Adaptive Contention Management for Software Transactional Memory. Zhengyu He, Xiao Yu, Bo Hong |
| 2012 | Query Optimization and Execution in a Parallel Analytics DBMS. Todd Eavis, Ahmad Taleb |
| 2012 | Radio Astronomy Beam Forming on Many-Core Architectures. Alessio Sclocco, Ana Lucia Varbanescu, Jan David Mol, Rob van Nieuwpoort |
| 2012 | Reducing Data Movement Costs: Scalable Seismic Imaging on Blue Gene. Michael Perrone, Lurng-Kuo Liu, Ligang Lu, Karen A. Magerlein, Changhoan Kim, Irina Fedulova, Artyom Semenikhin |
| 2012 | Robust SIMD: Dynamically Adapted SIMD Width and Multi-Threading Depth. Jiayuan Meng, Jeremy W. Sheaffer, Kevin Skadron |
| 2012 | SAHAD: Subgraph Analysis in Massive Networks Using Hadoop. Zhao Zhao, Guanying Wang, Ali Raza Butt, Maleq Khan, V. S. Anil Kumar, Madhav V. Marathe |
| 2012 | SEL-TM: Selective Eager-Lazy Management for Improved Concurrency in Transactional Memory. Lihang Zhao, Woojin Choi, Jeff Draper |
| 2012 | SUV: A Novel Single-Update Version-Management Scheme for Hardware Transactional Memory Systems. Zhichao Yan, Hong Jiang, Dan Feng, Lei Tian, Yujuan Tan |
| 2012 | ScalaBenchGen: Auto-Generation of Communication Benchmarks Traces. Xing Wu, Vivek Deshpande, Frank Mueller |
| 2012 | Scalable Critical-Path Based Performance Analysis. David Böhme, Felix Wolf, Bronis R. de Supinski, Martin Schulz, Markus Geimer |
| 2012 | Scalable Distributed Consensus to Support MPI Fault Tolerance. Darius Buntinas |
| 2012 | Scheduling Closed-Nested Transactions in Distributed Transactional Memory. Junwhan Kim, Binoy Ravindran |
| 2012 | Self-organizing Particle Systems. Maximilian Drees, Martina Hüllmann, Andreas Koutsopoulos, Christian Scheideler |
| 2012 | ShyLU: A Hybrid-Hybrid Solver for Multicore Platforms. Sivasankaran Rajamanickam, Erik G. Boman, Michael A. Heroux |
| 2012 | Supporting the Global Arrays PGAS Model Using MPI One-Sided Communication. James Dinan, Pavan Balaji, Jeff R. Hammond, Sriram Krishnamoorthy, Vinod Tipparaju |
| 2012 | Switching Optically-Connected Memories in a Large-Scale System. Abhirup Chakraborty, Eugen Schenfeld, Dilma Da Silva |
| 2012 | SyncChecker: Detecting Synchronization Errors between MPI Applications and Libraries. Zhezhe Chen, Xinyu Li, Jau-Yuan Chen, Hua Zhong, Feng Qin |
| 2012 | Taming of the Shrew: Modeling the Normal and Faulty Behaviour of Large-scale HPC Systems. Ana Gainaru, Franck Cappello, William Kramer |
| 2012 | The Parallel Computation of Morse-Smale Complexes. Attila Gyulassy, Valerio Pascucci, Tom Peterka, Robert B. Ross |
| 2012 | Understanding Cache Hierarchy Contention in CMPs to Improve Job Scheduling. Josué Feliu, Julio Sahuquillo, Salvador Petit, José Duato |
| 2012 | Using the Translation Lookaside Buffer to Map Threads in Parallel Applications Based on Shared Memory. Eduardo Henrique Molina da Cruz, Matthias Diener, Philippe Olivier Alexandre Navaux |
| 2012 | Virtual Machine Resource Allocation for Service Hosting on Heterogeneous Distributed Platforms. Mark Stillwell, Frédéric Vivien, Henri Casanova |
| 2012 | WATS: Workload-Aware Task Scheduling in Asymmetric Multi-core Architectures. Quan Chen, Yawen Chen, Zhiyi Huang, Minyi Guo |
| 2012 | iHarmonizer: Improving the Disk Efficiency of I/O-intensive Multithreaded Codes. Yizhe Wang, Kei Davis, Yuehai Xu, Song Jiang |
| 2012 | iTransformer: Using SSD to Improve Disk Scheduling for High-performance I/O. Xuechen Zhang, Kei Davis, Song Jiang |