IPDPS A

122 papers

YearTitle / Authors
201226th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2012, Shanghai, China, May 21-25, 2012
2012A Case Study of Designing Efficient Algorithm-based Fault Tolerant Application for Exascale Parallelism.
Erlin Yao, Rui Wang, Mingyu Chen, Guangming Tan, Ninghui Sun
2012A Comprehensive Study of Task Coalescing for Selecting Parallelism Granularity in a Two-Stage Bidiagonal Reduction.
Azzam Haidar, Hatem Ltaief, Piotr Luszczek, Jack J. Dongarra
2012A Highly Parallel Reuse Distance Analysis Algorithm on GPUs.
Huimin Cui, Qing Yi, Jingling Xue, Lei Wang, Yang Yang, Xiaobing Feng
2012A Lower Bound on Proximity Preservation by Space Filling Curves.
Pan Xu, Srikanta Tirthapura
2012A Novel Sorting Algorithm for Many-core Architectures Based on Adaptive Bitonic Sort.
Hagen Peters, Ole Schulz-Hildebrandt, Norbert Luttenberger
2012A Parallel Algorithm for Spectrum-based Short Read Error Correction.
Ankit Shah, Sriram P. Chockalingam, Srinivas Aluru
2012A Parallel Tiled Solver for Dense Symmetric Indefinite Systems on Multicore Architectures.
Marc Baboulin, Dulceneia Becker, Jack J. Dongarra
2012A Predictive Model for Solving Small Linear Algebra Problems in GPU Registers.
Michael J. Anderson, David Sheffield, Kurt Keutzer
2012A Self-Stabilization Process for Small-World Networks.
Sebastian Kniesburges, Andreas Koutsopoulos, Christian Scheideler
2012A Self-tuning Failure Detection Scheme for Cloud Computing Service.
Naixue Xiong, Athanasios V. Vasilakos, Jie Wu, Yang Richard Yang, Andy J. Rindos, Yuezhi Zhou, Wen-Zhan Song, Yi Pan
2012A Source-aware Interrupt Scheduling for Modern Parallel I/O Systems.
Hongbo Zou, Xian-He Sun, Siyuan Ma, Xi Duan
2012A uGNI-based Asynchronous Message-driven Runtime System for Cray Supercomputers with Gemini Interconnect.
Yanhua Sun, Gengbin Zheng, Laximant V. Kalé, Terry R. Jones, Ryan Olson
2012Accelerating Large Scale Image Analyses on Parallel, CPU-GPU Equipped Systems.
George Teodoro, Tahsin M. Kurç, Tony Pan, Lee A. D. Cooper, Jun Kong, Patrick M. Widener, Joel H. Saltz
2012Accelerating Nearest Neighbor Search on Manycore Systems.
Lawrence Cayton
2012Advancing Large Scale Many-Body QMC Simulations on GPU Accelerated Multicore Systems.
Andrés Tomás, Chia-Chen Chang, Richard Scalettar, Zhaojun Bai
2012Algebraic Block Multi-Color Ordering Method for Parallel Multi-Threaded Sparse Triangular Solver in ICCG Method.
Takeshi Iwashita, Hiroshi Nakashima, Yasuhito Takahashi
2012An Accurate GPU Performance Model for Effective Control Flow Divergence Optimization.
Zheng Cui, Yun Liang, Kyle Rupnow, Deming Chen
2012An Efficient Framework for Multi-dimensional Tuning of High Performance Computing Applications.
Guojing Cong, Hui-Fang Wen, I-Hsin Chung, David J. Klepacki, Hiroki Murata, Yasushi Negishi
2012An SMT-Selection Metric to Improve Multithreaded Applications' Performance.
Justin R. Funston, Kaoutar El Maghraoui, Joefon Jann, Pratap Pattnaik, Alexandra Fedorova
2012Automated and Agile Server Parameter Tuning with Learning and Control.
Yanfei Guo, Palden Lama, Xiaobo Zhou
2012Automatic Resource Scheduling with Latency Hiding for Parallel Stencil Applications on GPGPU Clusters.
Kumiko Maeda, Masana Murase, Munehiro Doi, Hideaki Komatsu, Shigeho Noda, Ryutaro Himeno
2012BRISA: Combining Efficiency and Reliability in Epidemic Data Dissemination.
Miguel Matos, Valerio Schiavoni, Pascal Felber, Rui Oliveira, Etienne Rivière
2012Building billion-threads computer and elastic processor.
Guo-Jie Li
2012Competitive Cache Replacement Strategies for Shared Cache Environments.
Anil Kumar Katti, Vijaya Ramachandran
2012Consistency-aware Partitioning Algorithm in Multi-server Distributed Virtual Environments.
Yusen Li, Wentong Cai
2012Cross-layer Energy and Performance Evaluation of a Nanophotonic Manycore Processor System Using Real Application Workloads.
George Kurian, Chen Sun, Chia-Hsin Owen Chen, Jason E. Miller, Jürgen Michel, Lan Wei, Dimitri A. Antoniadis, Li-Shiuan Peh, Lionel C. Kimerling, Vladimir Stojanovic, Anant Agarwal
2012DCAF - A Directly Connected Arbitration-Free Photonic Crossbar for Energy-Efficient High Performance Computing.
Christopher Nitta, Matthew K. Farrens, Venkatesh Akella
2012Designing Non-blocking Allreduce with Collective Offload on InfiniBand Clusters: A Case Study with Conjugate Gradient Solvers.
Krishna Chaitanya Kandalla, Ulrike Meier Yang, Jeff Keasler, Tzanio V. Kolev, Adam Moody, Hari Subramoni, Karen Tomko, Jérôme Vienne, Bronis R. de Supinski, Dhabaleswar K. Panda
2012Distributed Demand and Response Algorithm for Optimizing Social-Welfare in Smart Grid.
Qifen Dong, Li Yu, Wen-Zhan Song, Lang Tong, Shaojie Tang
2012Distributed Transactional Memory for General Networks.
Gokarna Sharma, Costas Busch, Srinivasagopalan Srivathsan
2012Dynamic Message Ordering for Topic-Based Publish/Subscribe Systems.
Roberto Baldoni, Silvia Bonomi, Marco Platania, Leonardo Querzoni
2012Dynamic Operands Insertion for VLIW Architecture with a Reduced Bit-width Instruction Set.
Jongwon Lee, Jonghee M. Youn, Jihoon Lee, Minwook Ahn, Yunheung Paek
2012Efficient Quality Threshold Clustering for Parallel Architectures.
Anthony Danalis, Collin McCurdy, Jeffrey S. Vetter
2012Efficient Resource Oblivious Algorithms for Multicores with False Sharing.
Richard Cole, Vijaya Ramachandran
2012Enabling In-situ Execution of Coupled Scientific Workflow on Multi-core Platform.
Fan Zhang, Ciprian Docan, Manish Parashar, Scott Klasky, Norbert Podhorszki, Hasan Abbasi
2012Enhancing the Scalability of Consistency-based Progressive Multiple Sequences Alignment Applications.
Miquel Orobitg, Fernando Cores, Fernando Guirado, Carsten Kemena, Cédric Notredame, Ana Ripoll
2012Evaluating Mesh-based P2P Video-on-Demand Systems.
Yingwu Zhu
2012Evaluating the Impact of TLB Misses on Future HPC Systems.
Alessandro Morari, Roberto Gioiosa, Robert W. Wisniewski, Bryan S. Rosenburg, Todd Inglett, Mateo Valero
2012ExPERT: Pareto-Efficient Task Replication on Grids and a Cloud.
Orna Agmon Ben-Yehuda, Assaf Schuster, Artyom Sharov, Mark Silberstein, Alexandru Iosup
2012Exascale System Software for the Year of the Dragon.
Pete Beckman
2012Exploring the Scope of the InfiniBand Congestion Control Mechanism.
Ernst Gunnar Gran, Sven-Arne Reinemo, Olav Lysne, Tor Skeie, Eitan Zahavi, Gilad Shainer
2012Fast and Efficient Graph Traversal Algorithm for CPUs: Maximizing Single-Node Efficiency.
Jatin Chhugani, Nadathur Satish, Changkyu Kim, Jason Sewall, Pradeep Dubey
2012FractalMRC: Online Cache Miss Rate Curve Prediction on Commodity Systems.
Lulu He, Zhibin Yu, Hai Jin
2012GTI: A Generic Tools Infrastructure for Event-Based Tools in Parallel Systems.
Tobias Hilbrich, Matthias S. Müller, Bronis R. de Supinski, Martin Schulz, Wolfgang E. Nagel
2012Generating Device-specific GPU Code for Local Operators in Medical Imaging.
Richard Membarth, Frank Hannig, Jürgen Teich, Mario Körner, Wieland Eckert
2012Graph Partitioning for Reconfigurable Topology.
Deepak Ajwani, Shoukat Ali, John P. Morrison
2012Heterogeneous Task Scheduling for Accelerated OpenMP.
Thomas Scogland, Barry Rountree, Wu-chun Feng, Bronis R. de Supinski
2012HierKNEM: An Adaptive Framework for Kernel-Assisted and Topology-Aware Collective Communications on Many-core Clusters.
Teng Ma, George Bosilca, Aurélien Bouteiller, Jack J. Dongarra
2012Hierarchical Local Storage: Exploiting Flexible User-Data Sharing Between MPI Tasks.
Marc Tchiboukdjian, Patrick Carribault, Marc Pérache
2012Hierarchical QR Factorization Algorithms for Multi-core Cluster Systems.
Jack J. Dongarra, Mathieu Faverge, Thomas Hérault, Julien Langou, Yves Robert
2012High Performance Non-uniform FFT on Modern X86-based Multi-core Systems.
Dhiraj D. Kalamkar, Joshua D. Trzasko, Srinivas Sridharan, Mikhail Smelyanskiy, Daehyun Kim, Armando Manduca, Yunhong Shu, Matt A. Bernstein, Bharat Kaul, Pradeep Dubey
2012High-Performance Design of HBase with RDMA over InfiniBand.
Jian Huang, Xiangyong Ouyang, Jithin Jose, Md. Wasi-ur-Rahman, Hao Wang, Miao Luo, Hari Subramoni, Chet Murthy, Dhabaleswar K. Panda
2012High-Performance Interaction-Based Simulation of Gut Immunopathologies with ENteric Immunity Simulator (ENISI).
Keith R. Bisset, Md. Maksudul Alam, Josep Bassaganya-Riera, Adria Carbo, Stephen G. Eubank, Raquel Hontecillas, Stefan Hoops, Yongguo Mei, Katherine V. Wendelsdorf, Dawen Xie, Jae-Seung Yeom, Madhav V. Marathe
2012Highly Efficient Performance Portable Tracking of Evolving Surfaces.
Wei Yu, Franz Franchetti, James C. Hoe, Tsuhan Chen
2012Holistic Debugging of MPI Derived Datatypes.
Joachim Protze, Tobias Hilbrich, Andreas Knüpfer, Bronis R. de Supinski, Matthias S. Müller
2012Hybrid Static/dynamic Scheduling for Already Optimized Dense Matrix Factorization.
Simplice Donfack, Laura Grigori, William D. Gropp, Vivek Kale
2012Hybrid Transactions: Lock Allocation and Assignment for Irrevocability.
Jaswanth Sreeram, Santosh Pande
2012HydEE: Failure Containment without Event Logging for Large Scale Send-Deterministic MPI Applications.
Amina Guermouche, Thomas Ropars, Marc Snir, Franck Cappello
2012Identifying Opportunities for Byte-Addressable Non-Volatile Memory in Extreme-Scale Scientific Applications.
Dong Li, Jeffrey S. Vetter, Gabriel Marin, Collin McCurdy, Cristian Cira, Zhuo Liu, Weikuan Yu
2012Improved Bounds for Discrete Diffusive Load Balancing.
Clemens P. J. Adolphs, Petra Berenbrink
2012Improving Parallel IO Performance of Cell-based AMR Cosmology Applications.
Yongen Yu, Douglas H. Rudd, Zhiling Lan, Nickolay Y. Gnedin, Andrey V. Kravtsov, Jingjin Wu
2012Improving the Performance of Dynamical Simulations Via Multiple Right-Hand Sides.
Xing Liu, Edmond Chow, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy
2012Large-scale visual data analysis.
Chris R. Johnson
2012Load Balancing of Dynamical Nucleation Theory Monte Carlo Simulations through Resource Sharing Barriers.
Humayun Arafat, P. Sadayappan, James Dinan, Sriram Krishnamoorthy, Theresa L. Windus
2012Locality Principle Revisited: A Probability-Based Quantitative Approach.
Saurabh Gupta, Ping Xiang, Yi Yang, Huiyang Zhou
2012Low-Cost Parallel Algorithms for 2: 1 Octree Balance.
Tobin Isaac, Carsten Burstedde, Omar Ghattas
2012MATE-CG: A Map Reduce-Like Framework for Accelerating Data-Intensive Computations on Heterogeneous Clusters.
Wei Jiang, Gagan Agrawal
2012Mapping Dense LU Factorization on Multicore Supercomputer Nodes.
Jonathan Lifflander, Phil Miller, Ramprasad Venkataraman, Anshu Arya, Laxmikant V. Kalé, Terry R. Jones
2012Meteor Shower: A Reliable Stream Processing System for Commodity Data Centers.
Huayong Wang, Li-Shiuan Peh, Emmanouil Koukoumidis, Shao Tao, Mun Choon Chan
2012Minimizing Weighted Mean Completion Time for Malleable Tasks Scheduling.
Olivier Beaumont, Nicolas Bonichon, Lionel Eyraud-Dubois, Loris Marchal
2012Miss-Correlation Folding: Encoding Per-Block Miss Correlations in Compressed DRAM for Data Prefetching.
Gang Liu, Jih-Kwon Peir, Victor W. Lee
2012Modeling and Analyzing Key Performance Factors of Shared Memory MapReduce.
Devesh Tiwari, Yan Solihin
2012Multi-core Spanning Forest Algorithms using the Disjoint-set Data Structure.
Md. Mostofa Ali Patwary, Peder Refsnes, Fredrik Manne
2012Multi-level Layout Optimization for Efficient Spatio-temporal Queries on ISABELA-compressed Data.
Zhenhuan Gong, Sriram Lakshminarasimhan, John Jenkins, Hemanth Kolla, Stéphane Ethier, Jackie Chen, Robert B. Ross, Scott Klasky, Nagiza F. Samatova
2012Multithreaded Algorithms for Maxmum Matching in Bipartite Graphs.
Ariful Azad, Mahantesh Halappanavar, Sivasankaran Rajamanickam, Erik G. Boman, Arif M. Khan, Alex Pothen
2012Multithreaded Clustering for Multi-level Hypergraph Partitioning.
Ümit V. Çatalyürek, Mehmet Deveci, Kamer Kaya, Bora Uçar
2012NUMA Aware Iterative Stencil Computations on Many-Core Systems.
Mohammed Shaheen, Robert Strzodka
2012NVMalloc: Exposing an Aggregate SSD Store as a Memory Partition in Extreme-Scale Machines.
Chao Wang, Sudharshan S. Vazhkudai, Xiaosong Ma, Fei Meng, Youngjae Kim, Christian Engelmann
2012New Scheduling Strategies and Hybrid Programming for a Parallel Right-looking Sparse LU Factorization Algorithm on Multicore Cluster Systems.
Ichitaro Yamazaki, Xiaoye S. Li
2012On Nonblocking Multirate Multicast Fat-tree Data Center Networks with Server Redundancy.
Zhiyang Guo, Yuanyuan Yang
2012On the Role of NVRAM in Data-intensive Architectures: An Evaluation.
Brian Van Essen, Roger A. Pearce, Sasha Ames, Maya B. Gokhale
2012On λ-Alert Problem.
Marek Klonowski, Dominik Pajak
2012Opportunistic Data-driven Execution of Parallel Programs for Efficient I/O Services.
Xuechen Zhang, Kei Davis, Song Jiang
2012Optimal Algorithms and Approximation Algorithms for Replica Placement with Distance Constraints in Tree Networks.
Anne Benoit, Hubert Larchevêque, Paul Renaud-Goud
2012Optimal Resource Rental Planning for Elastic Applications in Cloud Market.
Han Zhao, Miao Pan, Xinxin Liu, Xiaolin Li, Yuguang Fang
2012Optimization of Parallel Discrete Event Simulator for Multi-core Systems.
Deepak Jagtap, Nael B. Abu-Ghazaleh, Dmitry Ponomarev
2012Optimizing Busy Time on Parallel Machines.
George B. Mertzios, Mordechai Shalom, Ariella Voloshin, Prudence W. H. Wong, Shmuel Zaks
2012Optimizing Large-scale Graph Analysis on Multithreaded, Multicore Platforms.
Guojing Cong, Konstantin Makarychev
2012PAMI: A Parallel Active Message Interface for the Blue Gene/Q Supercomputer.
Sameer Kumar, Amith R. Mamidala, Daniel Faraj, Brian E. Smith, Michael Blocksome, Bob Cernohous, Douglas Miller, Jeff Parker, Joseph Ratterman, Philip Heidelberger, Dong Chen, Burkhard D. Steinmacher-Burow
2012PARDA: A Fast Parallel Reuse Distance Analysis Algorithm.
Qingpeng Niu, James Dinan, Qingda Lu, P. Sadayappan
2012PGAS for Distributed Numerical Python Targeting Multi-core Clusters.
Mads Ruben Burgdorff Kristensen, Yili Zheng, Brian Vinter
2012Parametric Utilization Bounds for Fixed-Priority Multiprocessor Scheduling.
Nan Guan, Martin Stigge, Wang Yi, Ge Yu
2012Performance Portability with the Chapel Language.
Albert Sidelnik, Saeed Maleki, Bradford L. Chamberlain, María Jesús Garzarán, David A. Padua
2012Power-aware Manhattan Routing on Chip Multiprocessors.
Anne Benoit, Rami G. Melhem, Paul Renaud-Goud, Yves Robert
2012Predicting Potential Speedup of Serial Code via Lightweight Profiling and Emulations with Memory Performance Model.
Minjang Kim, Pranith Kumar, Hyesoon Kim, Bevin Brett
2012Productive Programming of GPU Clusters with OmpSs.
Javier Bueno, Judit Planas, Alejandro Duran, Rosa M. Badia, Xavier Martorell, Eduard Ayguadé, Jesús Labarta
2012Profiling-based Adaptive Contention Management for Software Transactional Memory.
Zhengyu He, Xiao Yu, Bo Hong
2012Query Optimization and Execution in a Parallel Analytics DBMS.
Todd Eavis, Ahmad Taleb
2012Radio Astronomy Beam Forming on Many-Core Architectures.
Alessio Sclocco, Ana Lucia Varbanescu, Jan David Mol, Rob van Nieuwpoort
2012Reducing Data Movement Costs: Scalable Seismic Imaging on Blue Gene.
Michael Perrone, Lurng-Kuo Liu, Ligang Lu, Karen A. Magerlein, Changhoan Kim, Irina Fedulova, Artyom Semenikhin
2012Robust SIMD: Dynamically Adapted SIMD Width and Multi-Threading Depth.
Jiayuan Meng, Jeremy W. Sheaffer, Kevin Skadron
2012SAHAD: Subgraph Analysis in Massive Networks Using Hadoop.
Zhao Zhao, Guanying Wang, Ali Raza Butt, Maleq Khan, V. S. Anil Kumar, Madhav V. Marathe
2012SEL-TM: Selective Eager-Lazy Management for Improved Concurrency in Transactional Memory.
Lihang Zhao, Woojin Choi, Jeff Draper
2012SUV: A Novel Single-Update Version-Management Scheme for Hardware Transactional Memory Systems.
Zhichao Yan, Hong Jiang, Dan Feng, Lei Tian, Yujuan Tan
2012ScalaBenchGen: Auto-Generation of Communication Benchmarks Traces.
Xing Wu, Vivek Deshpande, Frank Mueller
2012Scalable Critical-Path Based Performance Analysis.
David Böhme, Felix Wolf, Bronis R. de Supinski, Martin Schulz, Markus Geimer
2012Scalable Distributed Consensus to Support MPI Fault Tolerance.
Darius Buntinas
2012Scheduling Closed-Nested Transactions in Distributed Transactional Memory.
Junwhan Kim, Binoy Ravindran
2012Self-organizing Particle Systems.
Maximilian Drees, Martina Hüllmann, Andreas Koutsopoulos, Christian Scheideler
2012ShyLU: A Hybrid-Hybrid Solver for Multicore Platforms.
Sivasankaran Rajamanickam, Erik G. Boman, Michael A. Heroux
2012Supporting the Global Arrays PGAS Model Using MPI One-Sided Communication.
James Dinan, Pavan Balaji, Jeff R. Hammond, Sriram Krishnamoorthy, Vinod Tipparaju
2012Switching Optically-Connected Memories in a Large-Scale System.
Abhirup Chakraborty, Eugen Schenfeld, Dilma Da Silva
2012SyncChecker: Detecting Synchronization Errors between MPI Applications and Libraries.
Zhezhe Chen, Xinyu Li, Jau-Yuan Chen, Hua Zhong, Feng Qin
2012Taming of the Shrew: Modeling the Normal and Faulty Behaviour of Large-scale HPC Systems.
Ana Gainaru, Franck Cappello, William Kramer
2012The Parallel Computation of Morse-Smale Complexes.
Attila Gyulassy, Valerio Pascucci, Tom Peterka, Robert B. Ross
2012Understanding Cache Hierarchy Contention in CMPs to Improve Job Scheduling.
Josué Feliu, Julio Sahuquillo, Salvador Petit, José Duato
2012Using the Translation Lookaside Buffer to Map Threads in Parallel Applications Based on Shared Memory.
Eduardo Henrique Molina da Cruz, Matthias Diener, Philippe Olivier Alexandre Navaux
2012Virtual Machine Resource Allocation for Service Hosting on Heterogeneous Distributed Platforms.
Mark Stillwell, Frédéric Vivien, Henri Casanova
2012WATS: Workload-Aware Task Scheduling in Asymmetric Multi-core Architectures.
Quan Chen, Yawen Chen, Zhiyi Huang, Minyi Guo
2012iHarmonizer: Improving the Disk Efficiency of I/O-intensive Multithreaded Codes.
Yizhe Wang, Kei Davis, Yuehai Xu, Song Jiang
2012iTransformer: Using SSD to Improve Disk Scheduling for High-performance I/O.
Xuechen Zhang, Kei Davis, Song Jiang