| 2014 | 24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs. Jeroen Bédorf, Evghenii Gaburov, Michiko S. Fujii, Keigo Nitadori, Tomoaki Ishiyama, Simon Portegies Zwart |
| 2014 | A Communication-Optimal Framework for Contracting Distributed Tensors. Samyam Rajbhandari, Akshay Nikam, Pai-Wei Lai, Kevin Stock, Sriram Krishnamoorthy, P. Sadayappan |
| 2014 | A Computation- and Communication-Optimal Parallel Direct 3-Body Algorithm. Penporn Koanantakool, Katherine A. Yelick |
| 2014 | A Study on Balancing Parallelism, Data Locality, and Recomputation in Existing PDE Solvers. Catherine Mills Olschanowsky, Michelle Mills Strout, Stephen M. Guzik, John Loffeld, Jeffrey Hittinger |
| 2014 | A System Software Approach to Proactive Memory-Error Avoidance. Carlos H. A. Costa, Yoonho Park, Bryan S. Rosenburg, Chen-Yong Cher, Kyung Dong Ryu |
| 2014 | A Unified Programming Model for Intra- and Inter-Node Offloading on Xeon Phi Clusters. Matthias Noack, Florian Wende, Thomas Steinke, Frank Cordes |
| 2014 | A User-Friendly Approach for Tuning Parallel File Operations. Robert T. McLay, Doug James, Si Liu, John Cazes, William L. Barth |
| 2014 | A Volume Integral Equation Stokes Solver for Problems with Variable Coefficients. Dhairya Malhotra, Amir Gholami, George Biros |
| 2014 | An Image-Based Approach to Extreme Scale in Situ Visualization and Analysis. James P. Ahrens, Sébastien Jourdain, Patrick O'Leary, John Patchett, David H. Rogers, Mark R. Petersen |
| 2014 | Anton 2: Raising the Bar for Performance and Programmability in a Special-Purpose Molecular Dynamics Supercomputer. David E. Shaw, J. P. Grossman, Joseph A. Bank, Brannon Batson, J. Adam Butts, Jack C. Chao, Martin M. Deneroff, Ron O. Dror, Amos Even, Christopher H. Fenton, Anthony Forte, Joseph Gagliardo, Gennette Gill, Brian Greskamp, C. Richard Ho, Douglas J. Ierardi, Lev Iserovich, Jeffrey Kuskin, Richard H. Larson, Timothy Layman, Li-Siang Lee, Adam K. Lerer, Chester Li, Daniel Killebrew, Kenneth M. Mackenzie, Shark Yeuk-Hai Mok, Mark A. Moraes, Rolf Mueller, Lawrence J. Nociolo, Jon L. Peticolas, Terry Quan, Daniel Ramot, John K. Salmon, Daniele Paolo Scarpazza, U. Ben Schafer, Naseer Siddique, Christopher W. Snyder, Jochen Spengler, Ping Tak Peter Tang, Michael Theobald, Horia Toma, Brian Towles, Benjamin Vitale, Stanley C. Wang, Cliff Young |
| 2014 | Application Centric Energy-Efficiency Study of Distributed Multi-Core and Hybrid CPU-GPU Systems. Ben Cumming, Gilles Fourestey, Oliver Fuhrer, Tobias Gysi, Massimiliano Fatica, Thomas C. Schulthess |
| 2014 | Best Practices and Lessons Learned from Deploying and Operating Large-Scale Data-Centric Parallel File Systems. Sarp Oral, James Simmons, Jason Hill, Dustin Leverman, Feiyi Wang, Matthew A. Ezell, Ross G. Miller, Douglas Fuller, Raghul Gunasekaran, Youngjae Kim, Saurabh Gupta, Devesh Tiwari, Sudharshan S. Vazhkudai, James H. Rogers, David Dillow, Galen M. Shipman, Arthur S. Bland |
| 2014 | CYPRESS: Combining Static and Dynamic Analysis for Top-Down Communication Trace Compression. Jidong Zhai, Jianfei Hu, Xiongchao Tang, Xiaosong Ma, Wenguang Chen |
| 2014 | Compiler Techniques for Massively Scalable Implicit Task Parallelism. Timothy G. Armstrong, Justin M. Wozniak, Michael Wilde, Ian T. Foster |
| 2014 | Correctness Field Testing of Production and Decommissioned High Performance Computing Platforms at Los Alamos National Laboratory. Sarah Ellen Michalak, William N. Rust, John T. Daly, Rew J. Dubois, David H. DuBois |
| 2014 | DISC: A Domain-Interaction Based Programming Model with Support for Heterogeneous Execution. Mehmet Can Kurt, Gagan Agrawal |
| 2014 | Dissecting On-Node Memory Access Performance: A Semantic Approach. Alfredo Giménez, Todd Gamblin, Barry Rountree, Abhinav Bhatele, Ilir Jusufi, Peer-Timo Bremer, Bernd Hamann |
| 2014 | Domain Decomposition Preconditioners for Communication-Avoiding Krylov Methods on a Hybrid CPU/GPU Cluster. Ichitaro Yamazaki, Sivasankaran Rajamanickam, Erik G. Boman, Mark Hoemmen, Michael A. Heroux, Stanimire Tomov |
| 2014 | ECC Parity: A Technique for Efficient Memory Error Resilience for Multi-Channel Memory Systems. Xun Jian, Rakesh Kumar |
| 2014 | Efficient I/O and Storage of Adaptive-Resolution Data. Sidharth Kumar, John Edwards, Peer-Timo Bremer, Aaron Knoll, Cameron Christensen, Venkatram Vishwanath, Philip H. Carns, John A. Schmidt, Valerio Pascucci |
| 2014 | Efficient Implementation of Many-Body Quantum Chemical Methods on the Intel® Xeon Phi Coprocessor. Edoardo Aprà, Michael Klemm, Karol Kowalski |
| 2014 | Efficient Shared-Memory Implementation of High-Performance Conjugate Gradient Benchmark and its Application to Unstructured Matrices. Jongsoo Park, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Dhiraj D. Kalamkar, Xing Liu, Md. Mostofa Ali Patwary, Yutong Lu, Pradeep Dubey |
| 2014 | Efficient Sparse Matrix-Vector Multiplication on GPUs Using the CSR Storage Format. Joseph L. Greathouse, Mayank Daga |
| 2014 | Enabling Efficient Multithreaded MPI Communication through a Library-Based Implementation of MPI Endpoints. Srinivas Sridharan, James Dinan, Dhiraj D. Kalamkar |
| 2014 | Exploring Automatic, Online Failure Recovery for Scientific Applications at Extreme Scales. Marc Gamell, Daniel S. Katz, Hemanth Kolla, Jacqueline Chen, Scott Klasky, Manish Parashar |
| 2014 | FAST: Near Real-Time Searchable Data Analytics for the Cloud. Yu Hua, Hong Jiang, Dan Feng |
| 2014 | Fail-in-Place Network Design: Interaction Between Topology, Routing Algorithm and Failures. Jens Domke, Torsten Hoefler, Satoshi Matsuoka |
| 2014 | Fast Iterative Graph Computation: A Path Centric Approach. Pingpeng Yuan, Wenya Zhang, Changfeng Xie, Hai Jin, Ling Liu, Kisung Lee |
| 2014 | Fast Parallel Computation of Longest Common Prefixes. Julian Shun |
| 2014 | Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications. Arash Ashari, Naser Sedaghati, John Eisenlohr, Srinivasan Parthasarathy, P. Sadayappan |
| 2014 | Faster Parallel Traversal of Scale Free Graphs at Extreme Scale with Vertex Delegates. Roger A. Pearce, Maya B. Gokhale, Nancy M. Amato |
| 2014 | Fault-Tolerant Dynamic Task Graph Scheduling. Mehmet Can Kurt, Sriram Krishnamoorthy, Kunal Agrawal, Gagan Agrawal |
| 2014 | Fence Scoping. Changhui Lin, Vijay Nagarajan, Rajiv Gupta |
| 2014 | Finding Constant from Change: Revisiting Network Performance Aware Optimizations on IaaS Clouds. Yifan Gong, Bingsheng He, Dan Li |
| 2014 | FlexSlot: Moving Hadoop Into the Cloud with Flexible Slot Management. Yanfei Guo, Jia Rao, Changjun Jiang, Xiaobo Zhou |
| 2014 | High-Performance Computation of Distributed-Memory Parallel 3D Voronoi and Delaunay Tessellation. Tom Peterka, Dmitriy Morozov, Carolyn L. Phillips |
| 2014 | High-Productivity Framework on GPU-Rich Supercomputers for Operational Weather Prediction Code ASUCA. Takashi Shimokawabe, Takayuki Aoki, Naoyuki Onodera |
| 2014 | In-Situ Feature Extraction of Large Scale Combustion Simulations Using Segmented Merge Trees. Aaditya G. Landge, Valerio Pascucci, Attila Gyulassy, Janine Bennett, Hemanth Kolla, Jacqueline Chen, Peer-Timo Bremer |
| 2014 | IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion. Kai Ren, Qing Zheng, Swapnil Patil, Garth A. Gibson |
| 2014 | International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2014, New Orleans, LA, USA, November 16-21, 2014 Trish Damkroger, Jack J. Dongarra |
| 2014 | Lattice QCD with Domain Decomposition on Intel® Xeon Phi Co-Processors. Simon Heybrock, Bálint Joó, Dhiraj D. Kalamkar, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Tilo Wettig, Pradeep Dubey |
| 2014 | MC-Checker: Detecting Memory Consistency Errors in MPI One-Sided Applications. Zhezhe Chen, James Dinan, Zhen Tang, Pavan Balaji, Hua Zhong, Jun Wei, Tao Huang, Feng Qin |
| 2014 | MSL: A Synthesis Enabled Language for Distributed Implementations. Zhilei Xu, Shoaib Kamil, Armando Solar-Lezama |
| 2014 | Managing DRAM Latency Divergence in Irregular GPGPU Applications. Niladrish Chatterjee, Mike O'Connor, Gabriel H. Loh, Nuwan Jayasena, Rajeev Balasubramonian |
| 2014 | Mapping to Irregular Torus Topologies and Other Techniques for Petascale Biomolecular Simulation. James C. Phillips, Yanhua Sun, Nikhil Jain, Eric J. Bohm, Laxmikant V. Kalé |
| 2014 | Maximizing Throughput of Overprovisioned HPC Data Centers Under a Strict Power Budget. Osman Sarood, Akhil Langer, Abhishek Gupta, Laxmikant V. Kalé |
| 2014 | Maximizing Throughput on a Dragonfly Network. Nikhil Jain, Abhinav Bhatele, Xiang Ni, Nicholas J. Wright, Laxmikant V. Kalé |
| 2014 | Metascalable Quantum Molecular Dynamics Simulations of Hydrogen-on-Demand. Ken-ichi Nomura, Rajiv K. Kalia, Aiichiro Nakano, Priya Vashishta, Kohei Shimamura, Fuyuki Shimojo, Manaschai Kunaseth, Paul C. Messina, Nichols A. Romero |
| 2014 | Microbank: Architecting Through-Silicon Interposer-Based Main Memory Systems. Young Hoon Son, Seongil O, Hyunggyun Yang, Daejin Jung, Jung Ho Ahn, John Kim, Jangwoo Kim, Jae W. Lee |
| 2014 | NUMARCK: Machine Learning Algorithm for Resiliency and Checkpointing. Zhengzhang Chen, Seung Woo Son, William Hendrix, Ankit Agrawal, Wei-keng Liao, Alok N. Choudhary |
| 2014 | Nonblocking Epochs in MPI One-Sided Communication. Judicael A. Zounmevo, Xin Zhao, Pavan Balaji, William Gropp, Ahmad Afsahi |
| 2014 | Oil and Water Can Mix: An Integration of Polyhedral and AST-Based Transformations. Jun Shirako, Louis-Noël Pouchet, Vivek Sarkar |
| 2014 | Omnisc'IO: A Grammar-Based Approach to Spatial and Temporal I/O Patterns Prediction. Matthieu Dorier, Shadi Ibrahim, Gabriel Antoniu, Robert B. Ross |
| 2014 | Optimization of a Multilevel Checkpoint Model with Uncertain Execution Scales. Sheng Di, Leonardo Arturo Bautista-Gomez, Franck Cappello |
| 2014 | Optimized Scheduling Strategies for Hybrid Density Functional theory Electronic Structure Calculations. William Dawson, François Gygi |
| 2014 | Optimizing Data Locality for Fork/Join Programs Using Constrained Work Stealing. Jonathan Lifflander, Sriram Krishnamoorthy, Laxmikant V. Kalé |
| 2014 | Orion: Scaling Genomic Sequence Matching with Fine-Grained Parallelization. Kanak Mahadik, Somali Chaterji, Bowen Zhou, Milind Kulkarni, Saurabh Bagchi |
| 2014 | Parallel Bayesian Network Structure Learning for Genome-Scale Gene Networks. Sanchit Misra, Md. Vasimuddin, Kiran Pamnany, Sriram P. Chockalingam, Yong Dong, Min Xie, Maneesha R. Aluru, Srinivas Aluru |
| 2014 | Parallel De Bruijn Graph Construction and Traversal for De Novo Genome Assembly. Evangelos Georganas, Aydin Buluç, Jarrod Chapman, Leonid Oliker, Daniel Rokhsar, Katherine A. Yelick |
| 2014 | Parallel Deep Neural Network Training for Big Data on Blue Gene/Q. I-Hsin Chung, Tara N. Sainath, Bhuvana Ramabhadran, Michael Picheny, John A. Gunnels, Vernon Austel, Upendra V. Chaudhari, Brian Kingsbury |
| 2014 | Parallel Programming with Migratable Objects: Charm++ in Practice. Bilge Acun, Abhishek Gupta, Nikhil Jain, Akhil Langer, Harshitha Menon, Eric Mikida, Xiang Ni, Michael P. Robson, Yanhua Sun, Ehsan Totoni, Lukasz Wesolowski, Laxmikant V. Kalé |
| 2014 | Parallelization of Reordering Algorithms for Bandwidth and Wavefront Reduction. Konstantinos I. Karantasis, Andrew Lenharth, Donald Nguyen, María Jesús Garzarán, Keshav Pingali |
| 2014 | Pardicle: Parallel Approximate Density-Based Clustering. Md. Mostofa Ali Patwary, Nadathur Satish, Narayanan Sundaram, Fredrik Manne, Salman Habib, Pradeep Dubey |
| 2014 | Petascale High Order Dynamic Rupture Earthquake Simulations on Heterogeneous Supercomputers. Alexander Heinecke, Alexander Breuer, Sebastian Rettenberger, Michael Bader, Alice-Agnes Gabriel, Christian Pelties, Arndt Bode, William Barth, Xiangke Liao, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Pradeep Dubey |
| 2014 | Physics-Based Urban Earthquake Simulation Enhanced by 10.7 BlnDOF × 30 K Time-Step Unstructured FE Non-Linear Seismic Wave Simulation. Tsuyoshi Ichimura, Kohei Fujita, Seizo Tanaka, Muneo Hori, Wijerathne Maddegedara Lalith Lakshman, Yoshihisa Shizawa, Hiroshi Kobayashi |
| 2014 | Pipelining Computational Stages of the Tomographic Reconstructor for Multi-Object Adaptive Optics on a Multi-GPU System. Ali Charara, Hatem Ltaief, Damien Gratadour, David E. Keyes, Arnaud Sevin, Ahmad Abdelfattah, Eric Gendron, Carine Morel, Fabrice Vidal |
| 2014 | Practical Symbolic Race Checking of GPU Programs. Peng Li, Guodong Li, Ganesh Gopalakrishnan |
| 2014 | Quantitatively Modeling Application Resilience with the Data Vulnerability Factor. Li Yu, Dong Li, Sparsh Mittal, Jeffrey S. Vetter |
| 2014 | RAHTM: Routing Algorithm Aware Hierarchical Task Mapping. Ahmed H. Abdel-Gawad, Mithuna Thottethodi, Abhinav Bhatele |
| 2014 | Real-Time Scalable Cortical Computing at 46 Giga-Synaptic OPS/Watt with ~100× Speedup in Time-to-Solution and ~100, 000× Reduction in Energy-to-Solution. Andrew S. Cassidy, Rodrigo Alvarez-Icaza, Filipp Akopyan, Jun Sawada, John V. Arthur, Paul Merolla, Pallab Datta, Marc González Tallada, Brian Taba, Alexander Andreopoulos, Arnon Amir, Steven K. Esser, Jeff Kusnitz, Rathinakumar Appuswamy, Chuck Haymes, Bernard Brezzo, Roger Moussalli, Ralph Bellofatto, Christian W. Baks, Michael Mastro, Kai Schleupen, Charles E. Cox, Ken Inoue, Steven E. Millman, Nabil Imam, Emmett McQuinn, Yutaka Y. Nakamura, Ivan Vo, Chen Guok, Don Nguyen, Scott Lekuch, Sameh W. Asaad, Daniel J. Friedman, Bryan L. Jackson, Myron Flickner, William P. Risk, Rajit Manohar, Dharmendra S. Modha |
| 2014 | Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds. Haikun Liu, Bingsheng He |
| 2014 | Recycled Error Bits: Energy-Efficient Architectural Support for Floating Point Accuracy. Ralph Nathan, Bryan Anthonio, Shih-Lien Lu, Helia Naeimi, Daniel J. Sorin, Xiaobai Sun |
| 2014 | Scalable Computation of Stream Surfaces on Large Scale Vector Fields. Kewei Lu, Han-Wei Shen, Tom Peterka |
| 2014 | Scalable Kernel Fusion for Memory-Bound GPU Applications. Mohamed Wahib, Naoya Maruyama |
| 2014 | Scalable and High Performance Betweenness Centrality on the GPU. Adam McLaughlin, David A. Bader |
| 2014 | Scaling MapReduce Vertically and Horizontally. Ismail El-Helw, Rutger F. H. Hofman, Henri E. Bal |
| 2014 | Scaling the Power Wall: A Path to Exascale. Oreste Villa, Daniel R. Johnson, Mike O'Connor, Evgeny Bolotin, David W. Nellans, Justin Luitjens, Nikolai Sakharnykh, Peng Wang, Paulius Micikevicius, Anthony Scudiero, Stephen W. Keckler, William J. Dally |
| 2014 | Scheduling Multi-tenant Cloud Workloads on Accelerator-Based Systems. Dipanjan Sengupta, Anshuman Goswami, Karsten Schwan, Krishna Pallavi |
| 2014 | Slim Fly: A Cost Effective Low-Diameter Network Topology. Maciej Besta, Torsten Hoefler |
| 2014 | Structure Slicing: Extending Logical Regions with Fields. Michael Bauer, Sean Treichler, Elliott Slaughter, Alex Aiken |
| 2014 | The DRIHM Project: A Flexible Approach to Integrate HPC, Grid and Cloud Resources for Hydro-Meteorological Research. Daniele D'Agostino, Andrea Clematis, Antonella Galizia, Alfonso Quarati, Emanuele Danovaro, Luca Roverelli, Gabriele Zereik, Dieter Kranzlmüller, Michael Schiffers, Nils gentschen Felde, Christian Straube, Olivier Caumont, Evelyne Richard, Luis Garrote, Quillon K. Harpham, H. R. A. Jagers, Vladimir Dimitrijevic, Ljiljana Dekic, Elisabetta Fiori, Fabio Delogu, Antonio Parodi |
| 2014 | The Lightweight Distributed Metric Service: A Scalable Infrastructure for Continuous Monitoring of Large Scale Computing Systems and Applications. Anthony M. Agelastos, Benjamin A. Allan, Jim M. Brandt, Paul Cassella, Jeremy Enos, Joshi Fullop, Ann C. Gentile, Steve Monk, Nichamon Naksinehaboon, Jeff Ogden, Mahesh Rajan, Michael T. Showerman, Joel Stevenson, Narate Taerat, Thomas W. Tucker |
| 2014 | Two-Choice Randomized Dynamic I/O Scheduler for Object Storage Systems. Dong Dai, Yong Chen, Dries Kimpe, Robert B. Ross |
| 2014 | Understanding Soft Error Resiliency of Blue Gene/Q Compute Chip through Hardware Proton Irradiation and Software Fault Injection. Chen-Yong Cher, Meeta Sharma Gupta, Pradip Bose, K. Paul Muller |
| 2014 | Understanding the Effects of Communication and Coordination on Checkpointing at Scale. Kurt B. Ferreira, Patrick M. Widener, Scott Levy, Dorian C. Arnold, Torsten Hoefler |
| 2014 | Using an Adaptive HPC Runtime System to Reconfigure the Cache Hierarchy. Ehsan Totoni, Josep Torrellas, Laxmikant V. Kalé |
| 2014 | pTatin3D: High-Performance Methods for Long-Term Lithospheric Dynamics. Dave A. May, Jed Brown, Laetitia Le Pourhiet |