| 2013 | 11 PFLOP/s simulations of cloud cavitation collapse. Diego Rossinelli, Babak Hejazialhosseini, Panagiotis E. Hadjidoukas, Costas Bekas, Alessandro Curioni, Adam Bertsch, Scott Futral, Steffen J. Schmidt, Nikolaus A. Adams, Petros Koumoutsakos |
| 2013 | 20 petaflops simulation of proteins suspensions in crowding conditions. Massimo Bernaschi, Mauro Bisson, Massimiliano Fatica, Simone Melchionna |
| 2013 | 2HOT: an improved parallel hashed oct-tree n-body algorithm for cosmological simulation. Michael S. Warren |
| 2013 | A 'cool' way of improving the reliability of HPC machines. Osman Sarood, Esteban Meneses, Laxmikant V. Kalé |
| 2013 | A computationally efficient algorithm for the 2D covariance method. Oded Green, Yitzhak Birk |
| 2013 | A data-centric profiler for parallel programs. Xu Liu, John M. Mellor-Crummey |
| 2013 | A distributed dynamic load balancer for iterative applications. Harshitha Menon, Laxmikant V. Kalé |
| 2013 | A framework for hybrid parallel flow simulations with a trillion cells in complex geometries. Christian Godenschwager, Florian Schornbaum, Martin Bauer, Harald Köstler, Ulrich Rüde |
| 2013 | A framework for load balancing of tensor contraction expressions via dynamic task partitioning. Pai-Wei Lai, Kevin Stock, Samyam Rajbhandari, Sriram Krishnamoorthy, P. Sadayappan |
| 2013 | A large-scale cross-architecture evaluation of thread-coarsening. Alberto Magni, Christophe Dubach, Michael F. P. O'Boyle |
| 2013 | A new routing scheme for Jellyfish and its performance with HPC workloads. Xin Yuan, Santosh Mahapatra, Wickus Nienaber, Scott Pakin, Michael Lang |
| 2013 | A scalable parallel algorithm for dynamic range-limited Manaschai Kunaseth, Rajiv K. Kalia, Aiichiro Nakano, Ken-ichi Nomura, Priya Vashishta |
| 2013 | A scalable, efficient scheme for evaluation of stencil computations over unstructured meshes. James King, Robert M. Kirby |
| 2013 | ACIC: automatic cloud I/O configurator for HPC applications. Mingliang Liu, Ye Jin, Jidong Zhai, Yan Zhai, Qianqian Shi, Xiaosong Ma, Wenguang Chen |
| 2013 | ACR: automatic checkpoint/restart for soft and hard error protection. Xiang Ni, Esteban Meneses, Nikhil Jain, Laxmikant V. Kalé |
| 2013 | AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs. Qian Wang, Xianyi Zhang, Yunquan Zhang, Qing Yi |
| 2013 | Accelerating sparse matrix-vector multiplication on GPUs using bit-representation-optimized schemes. Wai Teng Tang, Wen Jun Tan, Rajarshi Ray, Yi Wen Wong, Weiguang Chen, Shyh-Hao Kuo, Rick Siow Mong Goh, Stephen John Turner, Weng-Fai Wong |
| 2013 | Algorithms for high-throughput disk-to-disk sorting. Hari Sundar, Dhairya Malhotra, Karl W. Schulz |
| 2013 | An early performance evaluation of many integrated core architecture based SGI rackable computing system. Subhash Saini, Haoqiang Jin, Dennis C. Jespersen, Huiyu Feng, M. Jahed Djomehri, William Arasin, Robert Hood, Piyush Mehrotra, Rupak Biswas |
| 2013 | An improved parallel singular value algorithm and its implementation for multicore hardware. Azzam Haidar, Jakub Kurzak, Piotr Luszczek |
| 2013 | Assessing the effects of data compression in simulations using physically motivated metrics. Daniel E. Laney, Steven Langer, Christopher Weber, Peter Lindstrom, Al Wegener |
| 2013 | COCA: online distributed resource management for cost minimization and carbon neutrality in data centers. Shaolei Ren, Yuxiong He |
| 2013 | Channel reservation protocol for over-subscribed channels and destinations. George Michelogiannakis, Nan Jiang, Daniel Becker, William J. Dally |
| 2013 | Characterization and modeling of PIDX parallel I/O for performance optimization. Sidharth Kumar, Avishek Saha, Venkatram Vishwanath, Philip H. Carns, John A. Schmidt, Giorgio Scorzelli, Hemanth Kolla, Ray W. Grout, Robert Latham, Robert B. Ross, Michael E. Papka, Jacqueline Chen, Valerio Pascucci |
| 2013 | Compiling affine loop nests for distributed-memory parallel architectures. Uday Bondhugula |
| 2013 | CooMR: cross-task coordination for efficient data management in MapReduce programs. Xiaobing Li, Yandong Wang, Yizheng Jiao, Cong Xu, Weikuan Yu |
| 2013 | Coordinated energy management in heterogeneous processors. Indrani Paul, Vignesh T. Ravi, Srilatha Manne, Manish Arora, Sudhakar Yalamanchili |
| 2013 | Cost-effective cloud HPC resource provisioning by building semi-elastic virtual clusters. Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Xiongchao Tang, Wenguang Chen |
| 2013 | Design and performance evaluation of NUMA-aware RDMA-based end-to-end data transfer systems. Yufei Ren, Tan Li, Dantong Yu, Shudong Jin, Thomas G. Robertazzi |
| 2013 | Detection of false sharing using machine learning. Sanath Jayasena, Saman P. Amarasinghe, Asanka Abeyweera, Gayashan Amarasinghe, Himeshi De Silva, Sunimal Rathnayake, Xiaoqiao Meng, Yanbin Liu |
| 2013 | Deterministic scale-free pipeline parallelism with hyperqueues. Hans Vandierendonck, Kallia Chronaki, Dimitrios S. Nikolopoulos |
| 2013 | Distributed wait state tracking for runtime MPI deadlock detection. Tobias Hilbrich, Bronis R. de Supinski, Wolfgang E. Nagel, Joachim Protze, Christel Baier, Matthias S. Müller |
| 2013 | Distributed-memory parallel algorithms for generating massive scale-free networks using preferential attachment model. Md. Maksudul Alam, Maleq Khan, Madhav V. Marathe |
| 2013 | Effective sampling-driven performance tools for GPU-accelerated supercomputers. Milind Chabbi, Karthik Murthy, Michael W. Fagan, John M. Mellor-Crummey |
| 2013 | Efficient data partitioning model for heterogeneous graphs in the cloud. Kisung Lee, Ling Liu |
| 2013 | Enabling comprehensive data-driven system management for large computational facilities. James C. Browne, Robert L. DeLeon, Charng-Da Lu, Matthew D. Jones, Steven M. Gallo, Amin Ghadersohi, Abani K. Patra, William L. Barth, John L. Hammond, Thomas R. Furlani, Robert T. McLay |
| 2013 | Enabling fair pricing on HPC systems with node sharing. Alex D. Breslow, Ananta Tiwari, Martin Schulz, Laura Carrington, Lingjia Tang, Jason Mars |
| 2013 | Enabling highly-scalable remote memory access programming with MPI-3 one sided. Robert Gerstenberger, Maciej Besta, Torsten Hoefler |
| 2013 | Exploiting application dynamism and cloud elasticity for continuous dataflows. Alok Gautam Kumbhare, Yogesh Simmhan, Viktor K. Prasanna |
| 2013 | Exploring DRAM organizations for energy-efficient and resilient exascale memories. Bharan Giridhar, Michael Cieslak, Deepankar Duggal, Ronald G. Dreslinski, Hsing Min Chen, Robert Patti, Betina Hold, Chaitali Chakrabarti, Trevor N. Mudge, David T. Blaauw |
| 2013 | Exploring portfolio scheduling for long-term execution of scientific workloads in IaaS clouds. Kefeng Deng, Junqiang Song, Kaijun Ren, Alexandru Iosup |
| 2013 | Exploring power behaviors and trade-offs of in-situ data analytics. Marc Gamell, Ivan Rodero, Manish Parashar, Janine Bennett, Hemanth Kolla, Jacqueline Chen, Peer-Timo Bremer, Aaditya G. Landge, Attila Gyulassy, Patrick S. McCormick, Scott Pakin, Valerio Pascucci, Scott Klasky |
| 2013 | Exploring the future of out-of-core computing with compute-local non-volatile memory. Myoungsoo Jung, Ellis Herbert Wilson, Wonil Choi, John Shalf, Hasan Metin Aktulga, Chao Yang, Erik Saule, Ümit V. Çatalyürek, Mahmut T. Kandemir |
| 2013 | Feng shui of supercomputer memory: positional effects in DRAM and SRAM faults. Vilas Sridharan, Jon Stearley, Nathan DeBardeleben, Sean Blanchard, Sudhanva Gurumurthi |
| 2013 | General transformations for GPU execution of tree traversals. Michael Goldfarb, Youngjoon Jo, Milind Kulkarni |
| 2013 | Globalizing selectively: shared-memory efficiency with address-space separation. Nilesh Mahajan, Uday Pitambare, Arun Chauhan |
| 2013 | GoldRush: resource efficient in situ scientific data analytics using fine-grained interference aware execution. Fang Zheng, Hongfeng Yu, Can Hantas, Matthew Wolf, Greg Eisenhauer, Karsten Schwan, Hasan Abbasi, Scott Klasky |
| 2013 | Guide-copy: fast and silent migration of virtual machine for datacenters. Jihun Kim, Dongju Chae, Jangwoo Kim, Jong Kim |
| 2013 | HACC: extreme scaling and performance across diverse architectures. Salman Habib, Vitali A. Morozov, Nicholas Frontiere, Hal Finkel, Adrian Pope, Katrin Heitmann |
| 2013 | Hybrid MPI: efficient message passing for multi-core systems. Andrew Friedley, Greg Bronevetsky, Torsten Hoefler, Andrew Lumsdaine |
| 2013 | Insights for exascale IO APIs from building a petascale IO API. Jay F. Lofstead, Robert Ross |
| 2013 | Integrating dynamic pricing of electricity into energy aware scheduling for HPC systems. Xu Yang, Zhou Zhou, Sean Wallace, Zhiling Lan, Wei Tang, Susan Coghlan, Michael E. Papka |
| 2013 | International Conference for High Performance Computing, Networking, Storage and Analysis, SC'13, Denver, CO, USA - November 17 - 21, 2013 William Gropp, Satoshi Matsuoka |
| 2013 | Investigating applications portability with the Uintah DAG-based runtime system on PetaScale supercomputers. Qingyu Meng, Alan Humphrey, John A. Schmidt, Martin Berzins |
| 2013 | Kinetic turbulence simulations at extreme scale on leadership-class systems. Bei Wang, Stéphane Ethier, William M. Tang, Timothy J. Williams, Khaled Z. Ibrahim, Kamesh Madduri, Samuel Williams, Leonid Oliker |
| 2013 | Load-balanced pipeline parallelism. Md. Kamruzzaman, Steven Swanson, Dean M. Tullsen |
| 2013 | Location-aware cache management for many-core processors with deep cache hierarchy. Jongsoo Park, Richard M. Yoo, Daya Shanker Khudia, Christopher J. Hughes, Daehyun Kim |
| 2013 | Low-power, low-storage-overhead chipkill correct via multi-line error correction. Xun Jian, Henry Duwe, John Sartori, Vilas Sridharan, Rakesh Kumar |
| 2013 | MVAPICH-PRISM: a proxy-based communication framework using InfiniBand and SCIF for intel MIC clusters. Sreeram Potluri, Devendar Bureddy, Khaled Hamidouche, Akshay Venkatesh, Krishna Chaitanya Kandalla, Hari Subramoni, Dhabaleswar K. Panda |
| 2013 | Mr. Scan: extreme scale density-based clustering using a tree-based network of GPGPU nodes. Benjamin Welton, Evan Samanas, Barton P. Miller |
| 2013 | On fast parallel detection of strongly connected components (SCC) in small-world graphs. Sungpack Hong, Nicole C. Rodia, Kunle Olukotun |
| 2013 | On the usefulness of object tracking techniques in performance analysis. Germán Llort, Harald Servat, Juan Gonzalez, Judit Giménez, Jesús Labarta |
| 2013 | Optimization of cloud task processing with checkpoint-restart mechanism. Sheng Di, Yves Robert, Frédéric Vivien, Derrick Kondo, Cho-Li Wang, Franck Cappello |
| 2013 | Parallel design and performance of nested filtering factorization preconditioner. Long Qu, Laura Grigori, Frédéric Nataf |
| 2013 | Parallel reduction to hessenberg form with algorithm-based fault tolerance. Yulu Jia, George Bosilca, Piotr Luszczek, Jack J. Dongarra |
| 2013 | Parallelizing the execution of sequential scripts. Zhao Zhang, Daniel S. Katz, Timothy G. Armstrong, Justin M. Wozniak, Ian T. Foster |
| 2013 | Performance evaluation of Intel® transactional synchronization extensions for high-performance computing. Richard M. Yoo, Christopher J. Hughes, Konrad Lai, Ravi Rajwar |
| 2013 | Petascale WRF simulation of hurricane Sandy deployment of NCSA's cray XE6 blue waters. Peter Johnsen, Mark Straka, Melvyn Shapiro, Alan Norton, Thomas Galarneau |
| 2013 | Petascale direct numerical simulation of turbulent channel flow on up to 786K cores. Myoungkyu Lee, Nicholas Malaya, Robert D. Moser |
| 2013 | Physics-based seismic hazard analysis on petascale heterogeneous supercomputers. Yifeng Cui, Efecan Poyraz, Kim B. Olsen, Jun Zhou, Kyle Withers, Scott Callaghan, Jeff Larkin, Clark C. Guest, Dong Ju Choi, Amit Chourasia, Zheqiang Shi, Steven M. Day, Philip Maechling, Thomas H. Jordan |
| 2013 | Practical nonvolatile multilevel-cell phase change memory. Doe Hyun Yoon, Jichuan Chang, Robert S. Schreiber, Norman P. Jouppi |
| 2013 | Precimonious: tuning assistant for floating-point precision. Cindy Rubio-González, Cuong Nguyen, Hong Diep Nguyen, James Demmel, William Kahan, Koushik Sen, David H. Bailey, Costin Iancu, David Hough |
| 2013 | Predicting application performance using supervised learning on communication features. Nikhil Jain, Abhinav Bhatele, Michael P. Robson, Todd Gamblin, Laxmikant V. Kalé |
| 2013 | Radiative signatures of the relativistic Kelvin-Helmholtz instability. Michael Bussmann, Heiko Burau, Thomas E. Cowan, Alexander Debus, Axel Huebl, Guido Juckeland, Thomas Kluge, Wolfgang E. Nagel, Richard Pausch, Felix Schmitt, Ulrich Schramm, Joseph Schuchart, René Widera |
| 2013 | Rethinking algorithm-based fault tolerance with a cooperative software-hardware approach. Dong Li, Zizhong Chen, Panruo Wu, Jeffrey S. Vetter |
| 2013 | SDQuery DSI: integrating data management support with a wide area data transfer protocol. Yu Su, Yi Wang, Gagan Agrawal, Rajkumar Kettimuthu |
| 2013 | SIDR: structure-aware intelligent data routing in Hadoop. Joe B. Buck, Noah Watkins, Greg Levin, Adam Crume, Kleoni Ioannidou, Scott A. Brandt, Carlos Maltzahn, Neoklis Polyzotis, Aaron Torres |
| 2013 | SPBC: leveraging the characteristics of MPI HPC applications for scalable checkpointing. Thomas Ropars, Tatiana V. Martsinkevich, Amina Guermouche, André Schiper, Franck Cappello |
| 2013 | Scalable domain decomposition preconditioners for heterogeneous elliptic problems. Pierre Jolivet, Frédéric Hecht, Frédéric Nataf, Christophe Prud'Homme |
| 2013 | Scalable matrix computations on large scale-free graphs using 2D graph partitioning. Erik G. Boman, Karen D. Devine, Sivasankaran Rajamanickam |
| 2013 | Scalable parallel OPTICS data clustering using graph algorithmic techniques. Md. Mostofa Ali Patwary, Diana Palsetia, Ankit Agrawal, Wei-keng Liao, Fredrik Manne, Alok N. Choudhary |
| 2013 | Scalable parallel graph partitioning. Shad Kirmani, Padma Raghavan |
| 2013 | Scalable virtual machine deployment using VM image caches. Kaveh Razavi, Thilo Kielmann |
| 2013 | Semi-automatic restructuring of offloadable tasks for many-core accelerators. Nishkam Ravi, Yi Yang, Tao Bao, Srimat T. Chakradhar |
| 2013 | Solving the compressible navier-stokes equations on up to 1.97 million cores and 4.1 trillion grid points. Iván Bermejo-Moreno, Julien Bodart, Johan Larsson, Blaise M. Barney, Joseph W. Nichols, Steve Jones |
| 2013 | Supercomputing with commodity CPUs: are mobile SoCs ready for HPC? Nikola Rajovic, Paul M. Carpenter, Isaac Gelado, Nikola Puzovic, Alex Ramírez, Mateo Valero |
| 2013 | Swendsen-Wang multi-cluster algorithm for the 2D/3D Ising model on Xeon Phi and GPU. Florian Wende, Thomas Steinke |
| 2013 | Taking a quantum leap in time to solution for simulations of high-Tc superconductors. Peter W. J. Staar, Thomas A. Maier, Michael S. Summers, Gilles Fourestey, Raffaele Solcà, Thomas C. Schulthess |
| 2013 | Taming parallel I/O complexity with auto-tuning. Babak Behzad, Huong Vu Thanh Luu, Joseph Huchette, Surendra Byna, Prabhat, Ruth A. Aydt, Quincey Koziol, Marc Snir |
| 2013 | Tera-scale 1D FFT with low-communication algorithm and Intel® Xeon Phi™ coprocessors. Jongsoo Park, Ganesh Bikshandi, Karthikeyan Vaidyanathan, Ping Tak Peter Tang, Pradeep Dubey, Daehyun Kim |
| 2013 | The Science DMZ: a network design pattern for data-intensive science. Eli Dart, Lauren Rotman, Brian Tierney, Mary Hester, Jason Zurawski |
| 2013 | The origin of mass. Peter A. Boyle, Michael I. Buchoff, Norman H. Christ, Taku Izubuchi, Chulwoo Jung, Thomas C. Luu, Robert D. Mawhinney, Chris Schroeder, Ron Soltz, Pavlos Vranas, Joseph Wasem |
| 2013 | There goes the neighborhood: performance degradation due to nearby jobs. Abhinav Bhatele, Kathryn M. Mohror, Steve H. Langer, Katherine E. Isaacs |
| 2013 | Toward millions of file system IOPS on low-cost, commodity hardware. Da Zheng, Randal C. Burns, Alexander S. Szalay |
| 2013 | Using automated performance modeling to find scalability bugs in complex codes. Alexandru Calotoiu, Torsten Hoefler, Marius Poke, Felix Wolf |
| 2013 | Using cross-layer adaptations for dynamic data management in large scale coupled scientific workflows. Tong Jin, Fan Zhang, Qian Sun, Hoang Bui, Manish Parashar, Hongfeng Yu, Scott Klasky, Norbert Podhorszki, Hasan Abbasi |
| 2013 | Using simulation to explore distributed key-value stores for extreme-scale system services. Ke Wang, Abhishek Kulkarni, Michael Lang, Dorian C. Arnold, Ioan Raicu |