SC A

87 papers

YearTitle / Authors
201424.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs.
Jeroen Bédorf, Evghenii Gaburov, Michiko S. Fujii, Keigo Nitadori, Tomoaki Ishiyama, Simon Portegies Zwart
2014A Communication-Optimal Framework for Contracting Distributed Tensors.
Samyam Rajbhandari, Akshay Nikam, Pai-Wei Lai, Kevin Stock, Sriram Krishnamoorthy, P. Sadayappan
2014A Computation- and Communication-Optimal Parallel Direct 3-Body Algorithm.
Penporn Koanantakool, Katherine A. Yelick
2014A Study on Balancing Parallelism, Data Locality, and Recomputation in Existing PDE Solvers.
Catherine Mills Olschanowsky, Michelle Mills Strout, Stephen M. Guzik, John Loffeld, Jeffrey Hittinger
2014A System Software Approach to Proactive Memory-Error Avoidance.
Carlos H. A. Costa, Yoonho Park, Bryan S. Rosenburg, Chen-Yong Cher, Kyung Dong Ryu
2014A Unified Programming Model for Intra- and Inter-Node Offloading on Xeon Phi Clusters.
Matthias Noack, Florian Wende, Thomas Steinke, Frank Cordes
2014A User-Friendly Approach for Tuning Parallel File Operations.
Robert T. McLay, Doug James, Si Liu, John Cazes, William L. Barth
2014A Volume Integral Equation Stokes Solver for Problems with Variable Coefficients.
Dhairya Malhotra, Amir Gholami, George Biros
2014An Image-Based Approach to Extreme Scale in Situ Visualization and Analysis.
James P. Ahrens, Sébastien Jourdain, Patrick O'Leary, John Patchett, David H. Rogers, Mark R. Petersen
2014Anton 2: Raising the Bar for Performance and Programmability in a Special-Purpose Molecular Dynamics Supercomputer.
David E. Shaw, J. P. Grossman, Joseph A. Bank, Brannon Batson, J. Adam Butts, Jack C. Chao, Martin M. Deneroff, Ron O. Dror, Amos Even, Christopher H. Fenton, Anthony Forte, Joseph Gagliardo, Gennette Gill, Brian Greskamp, C. Richard Ho, Douglas J. Ierardi, Lev Iserovich, Jeffrey Kuskin, Richard H. Larson, Timothy Layman, Li-Siang Lee, Adam K. Lerer, Chester Li, Daniel Killebrew, Kenneth M. Mackenzie, Shark Yeuk-Hai Mok, Mark A. Moraes, Rolf Mueller, Lawrence J. Nociolo, Jon L. Peticolas, Terry Quan, Daniel Ramot, John K. Salmon, Daniele Paolo Scarpazza, U. Ben Schafer, Naseer Siddique, Christopher W. Snyder, Jochen Spengler, Ping Tak Peter Tang, Michael Theobald, Horia Toma, Brian Towles, Benjamin Vitale, Stanley C. Wang, Cliff Young
2014Application Centric Energy-Efficiency Study of Distributed Multi-Core and Hybrid CPU-GPU Systems.
Ben Cumming, Gilles Fourestey, Oliver Fuhrer, Tobias Gysi, Massimiliano Fatica, Thomas C. Schulthess
2014Best Practices and Lessons Learned from Deploying and Operating Large-Scale Data-Centric Parallel File Systems.
Sarp Oral, James Simmons, Jason Hill, Dustin Leverman, Feiyi Wang, Matthew A. Ezell, Ross G. Miller, Douglas Fuller, Raghul Gunasekaran, Youngjae Kim, Saurabh Gupta, Devesh Tiwari, Sudharshan S. Vazhkudai, James H. Rogers, David Dillow, Galen M. Shipman, Arthur S. Bland
2014CYPRESS: Combining Static and Dynamic Analysis for Top-Down Communication Trace Compression.
Jidong Zhai, Jianfei Hu, Xiongchao Tang, Xiaosong Ma, Wenguang Chen
2014Compiler Techniques for Massively Scalable Implicit Task Parallelism.
Timothy G. Armstrong, Justin M. Wozniak, Michael Wilde, Ian T. Foster
2014Correctness Field Testing of Production and Decommissioned High Performance Computing Platforms at Los Alamos National Laboratory.
Sarah Ellen Michalak, William N. Rust, John T. Daly, Rew J. Dubois, David H. DuBois
2014DISC: A Domain-Interaction Based Programming Model with Support for Heterogeneous Execution.
Mehmet Can Kurt, Gagan Agrawal
2014Dissecting On-Node Memory Access Performance: A Semantic Approach.
Alfredo Giménez, Todd Gamblin, Barry Rountree, Abhinav Bhatele, Ilir Jusufi, Peer-Timo Bremer, Bernd Hamann
2014Domain Decomposition Preconditioners for Communication-Avoiding Krylov Methods on a Hybrid CPU/GPU Cluster.
Ichitaro Yamazaki, Sivasankaran Rajamanickam, Erik G. Boman, Mark Hoemmen, Michael A. Heroux, Stanimire Tomov
2014ECC Parity: A Technique for Efficient Memory Error Resilience for Multi-Channel Memory Systems.
Xun Jian, Rakesh Kumar
2014Efficient I/O and Storage of Adaptive-Resolution Data.
Sidharth Kumar, John Edwards, Peer-Timo Bremer, Aaron Knoll, Cameron Christensen, Venkatram Vishwanath, Philip H. Carns, John A. Schmidt, Valerio Pascucci
2014Efficient Implementation of Many-Body Quantum Chemical Methods on the Intel® Xeon Phi Coprocessor.
Edoardo Aprà, Michael Klemm, Karol Kowalski
2014Efficient Shared-Memory Implementation of High-Performance Conjugate Gradient Benchmark and its Application to Unstructured Matrices.
Jongsoo Park, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Dhiraj D. Kalamkar, Xing Liu, Md. Mostofa Ali Patwary, Yutong Lu, Pradeep Dubey
2014Efficient Sparse Matrix-Vector Multiplication on GPUs Using the CSR Storage Format.
Joseph L. Greathouse, Mayank Daga
2014Enabling Efficient Multithreaded MPI Communication through a Library-Based Implementation of MPI Endpoints.
Srinivas Sridharan, James Dinan, Dhiraj D. Kalamkar
2014Exploring Automatic, Online Failure Recovery for Scientific Applications at Extreme Scales.
Marc Gamell, Daniel S. Katz, Hemanth Kolla, Jacqueline Chen, Scott Klasky, Manish Parashar
2014FAST: Near Real-Time Searchable Data Analytics for the Cloud.
Yu Hua, Hong Jiang, Dan Feng
2014Fail-in-Place Network Design: Interaction Between Topology, Routing Algorithm and Failures.
Jens Domke, Torsten Hoefler, Satoshi Matsuoka
2014Fast Iterative Graph Computation: A Path Centric Approach.
Pingpeng Yuan, Wenya Zhang, Changfeng Xie, Hai Jin, Ling Liu, Kisung Lee
2014Fast Parallel Computation of Longest Common Prefixes.
Julian Shun
2014Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications.
Arash Ashari, Naser Sedaghati, John Eisenlohr, Srinivasan Parthasarathy, P. Sadayappan
2014Faster Parallel Traversal of Scale Free Graphs at Extreme Scale with Vertex Delegates.
Roger A. Pearce, Maya B. Gokhale, Nancy M. Amato
2014Fault-Tolerant Dynamic Task Graph Scheduling.
Mehmet Can Kurt, Sriram Krishnamoorthy, Kunal Agrawal, Gagan Agrawal
2014Fence Scoping.
Changhui Lin, Vijay Nagarajan, Rajiv Gupta
2014Finding Constant from Change: Revisiting Network Performance Aware Optimizations on IaaS Clouds.
Yifan Gong, Bingsheng He, Dan Li
2014FlexSlot: Moving Hadoop Into the Cloud with Flexible Slot Management.
Yanfei Guo, Jia Rao, Changjun Jiang, Xiaobo Zhou
2014High-Performance Computation of Distributed-Memory Parallel 3D Voronoi and Delaunay Tessellation.
Tom Peterka, Dmitriy Morozov, Carolyn L. Phillips
2014High-Productivity Framework on GPU-Rich Supercomputers for Operational Weather Prediction Code ASUCA.
Takashi Shimokawabe, Takayuki Aoki, Naoyuki Onodera
2014In-Situ Feature Extraction of Large Scale Combustion Simulations Using Segmented Merge Trees.
Aaditya G. Landge, Valerio Pascucci, Attila Gyulassy, Janine Bennett, Hemanth Kolla, Jacqueline Chen, Peer-Timo Bremer
2014IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion.
Kai Ren, Qing Zheng, Swapnil Patil, Garth A. Gibson
2014International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2014, New Orleans, LA, USA, November 16-21, 2014
Trish Damkroger, Jack J. Dongarra
2014Lattice QCD with Domain Decomposition on Intel® Xeon Phi Co-Processors.
Simon Heybrock, Bálint Joó, Dhiraj D. Kalamkar, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Tilo Wettig, Pradeep Dubey
2014MC-Checker: Detecting Memory Consistency Errors in MPI One-Sided Applications.
Zhezhe Chen, James Dinan, Zhen Tang, Pavan Balaji, Hua Zhong, Jun Wei, Tao Huang, Feng Qin
2014MSL: A Synthesis Enabled Language for Distributed Implementations.
Zhilei Xu, Shoaib Kamil, Armando Solar-Lezama
2014Managing DRAM Latency Divergence in Irregular GPGPU Applications.
Niladrish Chatterjee, Mike O'Connor, Gabriel H. Loh, Nuwan Jayasena, Rajeev Balasubramonian
2014Mapping to Irregular Torus Topologies and Other Techniques for Petascale Biomolecular Simulation.
James C. Phillips, Yanhua Sun, Nikhil Jain, Eric J. Bohm, Laxmikant V. Kalé
2014Maximizing Throughput of Overprovisioned HPC Data Centers Under a Strict Power Budget.
Osman Sarood, Akhil Langer, Abhishek Gupta, Laxmikant V. Kalé
2014Maximizing Throughput on a Dragonfly Network.
Nikhil Jain, Abhinav Bhatele, Xiang Ni, Nicholas J. Wright, Laxmikant V. Kalé
2014Metascalable Quantum Molecular Dynamics Simulations of Hydrogen-on-Demand.
Ken-ichi Nomura, Rajiv K. Kalia, Aiichiro Nakano, Priya Vashishta, Kohei Shimamura, Fuyuki Shimojo, Manaschai Kunaseth, Paul C. Messina, Nichols A. Romero
2014Microbank: Architecting Through-Silicon Interposer-Based Main Memory Systems.
Young Hoon Son, Seongil O, Hyunggyun Yang, Daejin Jung, Jung Ho Ahn, John Kim, Jangwoo Kim, Jae W. Lee
2014NUMARCK: Machine Learning Algorithm for Resiliency and Checkpointing.
Zhengzhang Chen, Seung Woo Son, William Hendrix, Ankit Agrawal, Wei-keng Liao, Alok N. Choudhary
2014Nonblocking Epochs in MPI One-Sided Communication.
Judicael A. Zounmevo, Xin Zhao, Pavan Balaji, William Gropp, Ahmad Afsahi
2014Oil and Water Can Mix: An Integration of Polyhedral and AST-Based Transformations.
Jun Shirako, Louis-Noël Pouchet, Vivek Sarkar
2014Omnisc'IO: A Grammar-Based Approach to Spatial and Temporal I/O Patterns Prediction.
Matthieu Dorier, Shadi Ibrahim, Gabriel Antoniu, Robert B. Ross
2014Optimization of a Multilevel Checkpoint Model with Uncertain Execution Scales.
Sheng Di, Leonardo Arturo Bautista-Gomez, Franck Cappello
2014Optimized Scheduling Strategies for Hybrid Density Functional theory Electronic Structure Calculations.
William Dawson, François Gygi
2014Optimizing Data Locality for Fork/Join Programs Using Constrained Work Stealing.
Jonathan Lifflander, Sriram Krishnamoorthy, Laxmikant V. Kalé
2014Orion: Scaling Genomic Sequence Matching with Fine-Grained Parallelization.
Kanak Mahadik, Somali Chaterji, Bowen Zhou, Milind Kulkarni, Saurabh Bagchi
2014Parallel Bayesian Network Structure Learning for Genome-Scale Gene Networks.
Sanchit Misra, Md. Vasimuddin, Kiran Pamnany, Sriram P. Chockalingam, Yong Dong, Min Xie, Maneesha R. Aluru, Srinivas Aluru
2014Parallel De Bruijn Graph Construction and Traversal for De Novo Genome Assembly.
Evangelos Georganas, Aydin Buluç, Jarrod Chapman, Leonid Oliker, Daniel Rokhsar, Katherine A. Yelick
2014Parallel Deep Neural Network Training for Big Data on Blue Gene/Q.
I-Hsin Chung, Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Vernon Austel, Upendra V. Chaudhari, Brian Kingsbury
2014Parallel Programming with Migratable Objects: Charm++ in Practice.
Bilge Acun, Abhishek Gupta, Nikhil Jain, Akhil Langer, Harshitha Menon, Eric Mikida, Xiang Ni, Michael P. Robson, Yanhua Sun, Ehsan Totoni, Lukasz Wesolowski, Laxmikant V. Kalé
2014Parallelization of Reordering Algorithms for Bandwidth and Wavefront Reduction.
Konstantinos I. Karantasis, Andrew Lenharth, Donald Nguyen, María Jesús Garzarán, Keshav Pingali
2014Pardicle: Parallel Approximate Density-Based Clustering.
Md. Mostofa Ali Patwary, Nadathur Satish, Narayanan Sundaram, Fredrik Manne, Salman Habib, Pradeep Dubey
2014Petascale High Order Dynamic Rupture Earthquake Simulations on Heterogeneous Supercomputers.
Alexander Heinecke, Alexander Breuer, Sebastian Rettenberger, Michael Bader, Alice-Agnes Gabriel, Christian Pelties, Arndt Bode, William Barth, Xiangke Liao, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Pradeep Dubey
2014Physics-Based Urban Earthquake Simulation Enhanced by 10.7 BlnDOF × 30 K Time-Step Unstructured FE Non-Linear Seismic Wave Simulation.
Tsuyoshi Ichimura, Kohei Fujita, Seizo Tanaka, Muneo Hori, Wijerathne Maddegedara Lalith Lakshman, Yoshihisa Shizawa, Hiroshi Kobayashi
2014Pipelining Computational Stages of the Tomographic Reconstructor for Multi-Object Adaptive Optics on a Multi-GPU System.
Ali Charara, Hatem Ltaief, Damien Gratadour, David E. Keyes, Arnaud Sevin, Ahmad Abdelfattah, Eric Gendron, Carine Morel, Fabrice Vidal
2014Practical Symbolic Race Checking of GPU Programs.
Peng Li, Guodong Li, Ganesh Gopalakrishnan
2014Quantitatively Modeling Application Resilience with the Data Vulnerability Factor.
Li Yu, Dong Li, Sparsh Mittal, Jeffrey S. Vetter
2014RAHTM: Routing Algorithm Aware Hierarchical Task Mapping.
Ahmed H. Abdel-Gawad, Mithuna Thottethodi, Abhinav Bhatele
2014Real-Time Scalable Cortical Computing at 46 Giga-Synaptic OPS/Watt with ~100× Speedup in Time-to-Solution and ~100, 000× Reduction in Energy-to-Solution.
Andrew S. Cassidy, Rodrigo Alvarez-Icaza, Filipp Akopyan, Jun Sawada, John V. Arthur, Paul Merolla, Pallab Datta, Marc González Tallada, Brian Taba, Alexander Andreopoulos, Arnon Amir, Steven K. Esser, Jeff Kusnitz, Rathinakumar Appuswamy, Chuck Haymes, Bernard Brezzo, Roger Moussalli, Ralph Bellofatto, Christian W. Baks, Michael Mastro, Kai Schleupen, Charles E. Cox, Ken Inoue, Steven E. Millman, Nabil Imam, Emmett McQuinn, Yutaka Y. Nakamura, Ivan Vo, Chen Guok, Don Nguyen, Scott Lekuch, Sameh W. Asaad, Daniel J. Friedman, Bryan L. Jackson, Myron Flickner, William P. Risk, Rajit Manohar, Dharmendra S. Modha
2014Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds.
Haikun Liu, Bingsheng He
2014Recycled Error Bits: Energy-Efficient Architectural Support for Floating Point Accuracy.
Ralph Nathan, Bryan Anthonio, Shih-Lien Lu, Helia Naeimi, Daniel J. Sorin, Xiaobai Sun
2014Scalable Computation of Stream Surfaces on Large Scale Vector Fields.
Kewei Lu, Han-Wei Shen, Tom Peterka
2014Scalable Kernel Fusion for Memory-Bound GPU Applications.
Mohamed Wahib, Naoya Maruyama
2014Scalable and High Performance Betweenness Centrality on the GPU.
Adam McLaughlin, David A. Bader
2014Scaling MapReduce Vertically and Horizontally.
Ismail El-Helw, Rutger F. H. Hofman, Henri E. Bal
2014Scaling the Power Wall: A Path to Exascale.
Oreste Villa, Daniel R. Johnson, Mike O'Connor, Evgeny Bolotin, David W. Nellans, Justin Luitjens, Nikolai Sakharnykh, Peng Wang, Paulius Micikevicius, Anthony Scudiero, Stephen W. Keckler, William J. Dally
2014Scheduling Multi-tenant Cloud Workloads on Accelerator-Based Systems.
Dipanjan Sengupta, Anshuman Goswami, Karsten Schwan, Krishna Pallavi
2014Slim Fly: A Cost Effective Low-Diameter Network Topology.
Maciej Besta, Torsten Hoefler
2014Structure Slicing: Extending Logical Regions with Fields.
Michael Bauer, Sean Treichler, Elliott Slaughter, Alex Aiken
2014The DRIHM Project: A Flexible Approach to Integrate HPC, Grid and Cloud Resources for Hydro-Meteorological Research.
Daniele D'Agostino, Andrea Clematis, Antonella Galizia, Alfonso Quarati, Emanuele Danovaro, Luca Roverelli, Gabriele Zereik, Dieter Kranzlmüller, Michael Schiffers, Nils gentschen Felde, Christian Straube, Olivier Caumont, Evelyne Richard, Luis Garrote, Quillon K. Harpham, H. R. A. Jagers, Vladimir Dimitrijevic, Ljiljana Dekic, Elisabetta Fiori, Fabio Delogu, Antonio Parodi
2014The Lightweight Distributed Metric Service: A Scalable Infrastructure for Continuous Monitoring of Large Scale Computing Systems and Applications.
Anthony M. Agelastos, Benjamin A. Allan, Jim M. Brandt, Paul Cassella, Jeremy Enos, Joshi Fullop, Ann C. Gentile, Steve Monk, Nichamon Naksinehaboon, Jeff Ogden, Mahesh Rajan, Michael T. Showerman, Joel Stevenson, Narate Taerat, Thomas W. Tucker
2014Two-Choice Randomized Dynamic I/O Scheduler for Object Storage Systems.
Dong Dai, Yong Chen, Dries Kimpe, Robert B. Ross
2014Understanding Soft Error Resiliency of Blue Gene/Q Compute Chip through Hardware Proton Irradiation and Software Fault Injection.
Chen-Yong Cher, Meeta Sharma Gupta, Pradip Bose, K. Paul Muller
2014Understanding the Effects of Communication and Coordination on Checkpointing at Scale.
Kurt B. Ferreira, Patrick M. Widener, Scott Levy, Dorian C. Arnold, Torsten Hoefler
2014Using an Adaptive HPC Runtime System to Reconfigure the Cache Hierarchy.
Ehsan Totoni, Josep Torrellas, Laxmikant V. Kalé
2014pTatin3D: High-Performance Methods for Long-Term Lithospheric Dynamics.
Dave A. May, Jed Brown, Laetitia Le Pourhiet