SC A

97 papers

Year Title / Authors
2013 11 PFLOP/s simulations of cloud cavitation collapse.
Diego Rossinelli, Babak Hejazialhosseini, Panagiotis E. Hadjidoukas, Costas Bekas, Alessandro Curioni, Adam Bertsch, Scott Futral, Steffen J. Schmidt, Nikolaus A. Adams, Petros Koumoutsakos
2013 20 petaflops simulation of proteins suspensions in crowding conditions.
Massimo Bernaschi, Mauro Bisson, Massimiliano Fatica, Simone Melchionna
2013 2HOT: an improved parallel hashed oct-tree n-body algorithm for cosmological simulation.
Michael S. Warren
2013 A 'cool' way of improving the reliability of HPC machines.
Osman Sarood, Esteban Meneses, Laxmikant V. Kalé
2013 A computationally efficient algorithm for the 2D covariance method.
Oded Green, Yitzhak Birk
2013 A data-centric profiler for parallel programs.
Xu Liu, John M. Mellor-Crummey
2013 A distributed dynamic load balancer for iterative applications.
Harshitha Menon, Laxmikant V. Kalé
2013 A framework for hybrid parallel flow simulations with a trillion cells in complex geometries.
Christian Godenschwager, Florian Schornbaum, Martin Bauer, Harald Köstler, Ulrich Rüde
2013 A framework for load balancing of tensor contraction expressions via dynamic task partitioning.
Pai-Wei Lai, Kevin Stock, Samyam Rajbhandari, Sriram Krishnamoorthy, P. Sadayappan
2013 A large-scale cross-architecture evaluation of thread-coarsening.
Alberto Magni, Christophe Dubach, Michael F. P. O'Boyle
2013 A new routing scheme for Jellyfish and its performance with HPC workloads.
Xin Yuan, Santosh Mahapatra, Wickus Nienaber, Scott Pakin, Michael Lang
2013 A scalable parallel algorithm for dynamic range-limited
Manaschai Kunaseth, Rajiv K. Kalia, Aiichiro Nakano, Ken-ichi Nomura, Priya Vashishta
2013 A scalable, efficient scheme for evaluation of stencil computations over unstructured meshes.
James King, Robert M. Kirby
2013 ACIC: automatic cloud I/O configurator for HPC applications.
Mingliang Liu, Ye Jin, Jidong Zhai, Yan Zhai, Qianqian Shi, Xiaosong Ma, Wenguang Chen
2013 ACR: automatic checkpoint/restart for soft and hard error protection.
Xiang Ni, Esteban Meneses, Nikhil Jain, Laxmikant V. Kalé
2013 AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs.
Qian Wang, Xianyi Zhang, Yunquan Zhang, Qing Yi
2013 Accelerating sparse matrix-vector multiplication on GPUs using bit-representation-optimized schemes.
Wai Teng Tang, Wen Jun Tan, Rajarshi Ray, Yi Wen Wong, Weiguang Chen, Shyh-Hao Kuo, Rick Siow Mong Goh, Stephen John Turner, Weng-Fai Wong
2013 Algorithms for high-throughput disk-to-disk sorting.
Hari Sundar, Dhairya Malhotra, Karl W. Schulz
2013 An early performance evaluation of many integrated core architecture based SGI rackable computing system.
Subhash Saini, Haoqiang Jin, Dennis C. Jespersen, Huiyu Feng, M. Jahed Djomehri, William Arasin, Robert Hood, Piyush Mehrotra, Rupak Biswas
2013 An improved parallel singular value algorithm and its implementation for multicore hardware.
Azzam Haidar, Jakub Kurzak, Piotr Luszczek
2013 Assessing the effects of data compression in simulations using physically motivated metrics.
Daniel E. Laney, Steven Langer, Christopher Weber, Peter Lindstrom, Al Wegener
2013 COCA: online distributed resource management for cost minimization and carbon neutrality in data centers.
Shaolei Ren, Yuxiong He
2013 Channel reservation protocol for over-subscribed channels and destinations.
George Michelogiannakis, Nan Jiang, Daniel Becker, William J. Dally
2013 Characterization and modeling of PIDX parallel I/O for performance optimization.
Sidharth Kumar, Avishek Saha, Venkatram Vishwanath, Philip H. Carns, John A. Schmidt, Giorgio Scorzelli, Hemanth Kolla, Ray W. Grout, Robert Latham, Robert B. Ross, Michael E. Papka, Jacqueline Chen, Valerio Pascucci
2013 Compiling affine loop nests for distributed-memory parallel architectures.
Uday Bondhugula
2013 CooMR: cross-task coordination for efficient data management in MapReduce programs.
Xiaobing Li, Yandong Wang, Yizheng Jiao, Cong Xu, Weikuan Yu
2013 Coordinated energy management in heterogeneous processors.
Indrani Paul, Vignesh T. Ravi, Srilatha Manne, Manish Arora, Sudhakar Yalamanchili
2013 Cost-effective cloud HPC resource provisioning by building semi-elastic virtual clusters.
Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Xiongchao Tang, Wenguang Chen
2013 Design and performance evaluation of NUMA-aware RDMA-based end-to-end data transfer systems.
Yufei Ren, Tan Li, Dantong Yu, Shudong Jin, Thomas G. Robertazzi
2013 Detection of false sharing using machine learning.
Sanath Jayasena, Saman P. Amarasinghe, Asanka Abeyweera, Gayashan Amarasinghe, Himeshi De Silva, Sunimal Rathnayake, Xiaoqiao Meng, Yanbin Liu
2013 Deterministic scale-free pipeline parallelism with hyperqueues.
Hans Vandierendonck, Kallia Chronaki, Dimitrios S. Nikolopoulos
2013 Distributed wait state tracking for runtime MPI deadlock detection.
Tobias Hilbrich, Bronis R. de Supinski, Wolfgang E. Nagel, Joachim Protze, Christel Baier, Matthias S. Müller
2013 Distributed-memory parallel algorithms for generating massive scale-free networks using preferential attachment model.
Md. Maksudul Alam, Maleq Khan, Madhav V. Marathe
2013 Effective sampling-driven performance tools for GPU-accelerated supercomputers.
Milind Chabbi, Karthik Murthy, Michael W. Fagan, John M. Mellor-Crummey
2013 Efficient data partitioning model for heterogeneous graphs in the cloud.
Kisung Lee, Ling Liu
2013 Enabling comprehensive data-driven system management for large computational facilities.
James C. Browne, Robert L. DeLeon, Charng-Da Lu, Matthew D. Jones, Steven M. Gallo, Amin Ghadersohi, Abani K. Patra, William L. Barth, John L. Hammond, Thomas R. Furlani, Robert T. McLay
2013 Enabling fair pricing on HPC systems with node sharing.
Alex D. Breslow, Ananta Tiwari, Martin Schulz, Laura Carrington, Lingjia Tang, Jason Mars
2013 Enabling highly-scalable remote memory access programming with MPI-3 one sided.
Robert Gerstenberger, Maciej Besta, Torsten Hoefler
2013 Exploiting application dynamism and cloud elasticity for continuous dataflows.
Alok Gautam Kumbhare, Yogesh Simmhan, Viktor K. Prasanna
2013 Exploring DRAM organizations for energy-efficient and resilient exascale memories.
Bharan Giridhar, Michael Cieslak, Deepankar Duggal, Ronald G. Dreslinski, Hsing Min Chen, Robert Patti, Betina Hold, Chaitali Chakrabarti, Trevor N. Mudge, David T. Blaauw
2013 Exploring portfolio scheduling for long-term execution of scientific workloads in IaaS clouds.
Kefeng Deng, Junqiang Song, Kaijun Ren, Alexandru Iosup
2013 Exploring power behaviors and trade-offs of in-situ data analytics.
Marc Gamell, Ivan Rodero, Manish Parashar, Janine Bennett, Hemanth Kolla, Jacqueline Chen, Peer-Timo Bremer, Aaditya G. Landge, Attila Gyulassy, Patrick S. McCormick, Scott Pakin, Valerio Pascucci, Scott Klasky
2013 Exploring the future of out-of-core computing with compute-local non-volatile memory.
Myoungsoo Jung, Ellis Herbert Wilson, Wonil Choi, John Shalf, Hasan Metin Aktulga, Chao Yang, Erik Saule, Ümit V. Çatalyürek, Mahmut T. Kandemir
2013 Feng shui of supercomputer memory: positional effects in DRAM and SRAM faults.
Vilas Sridharan, Jon Stearley, Nathan DeBardeleben, Sean Blanchard, Sudhanva Gurumurthi
2013 General transformations for GPU execution of tree traversals.
Michael Goldfarb, Youngjoon Jo, Milind Kulkarni
2013 Globalizing selectively: shared-memory efficiency with address-space separation.
Nilesh Mahajan, Uday Pitambare, Arun Chauhan
2013 GoldRush: resource efficient in situ scientific data analytics using fine-grained interference aware execution.
Fang Zheng, Hongfeng Yu, Can Hantas, Matthew Wolf, Greg Eisenhauer, Karsten Schwan, Hasan Abbasi, Scott Klasky
2013 Guide-copy: fast and silent migration of virtual machine for datacenters.
Jihun Kim, Dongju Chae, Jangwoo Kim, Jong Kim
2013 HACC: extreme scaling and performance across diverse architectures.
Salman Habib, Vitali A. Morozov, Nicholas Frontiere, Hal Finkel, Adrian Pope, Katrin Heitmann
2013 Hybrid MPI: efficient message passing for multi-core systems.
Andrew Friedley, Greg Bronevetsky, Torsten Hoefler, Andrew Lumsdaine
2013 Insights for exascale IO APIs from building a petascale IO API.
Jay F. Lofstead, Robert Ross
2013 Integrating dynamic pricing of electricity into energy aware scheduling for HPC systems.
Xu Yang, Zhou Zhou, Sean Wallace, Zhiling Lan, Wei Tang, Susan Coghlan, Michael E. Papka
2013 International Conference for High Performance Computing, Networking, Storage and Analysis, SC'13, Denver, CO, USA - November 17 - 21, 2013
William Gropp, Satoshi Matsuoka
2013 Investigating applications portability with the Uintah DAG-based runtime system on PetaScale supercomputers.
Qingyu Meng, Alan Humphrey, John A. Schmidt, Martin Berzins
2013 Kinetic turbulence simulations at extreme scale on leadership-class systems.
Bei Wang, Stéphane Ethier, William M. Tang, Timothy J. Williams, Khaled Z. Ibrahim, Kamesh Madduri, Samuel Williams, Leonid Oliker
2013 Load-balanced pipeline parallelism.
Md. Kamruzzaman, Steven Swanson, Dean M. Tullsen
2013 Location-aware cache management for many-core processors with deep cache hierarchy.
Jongsoo Park, Richard M. Yoo, Daya Shanker Khudia, Christopher J. Hughes, Daehyun Kim
2013 Low-power, low-storage-overhead chipkill correct via multi-line error correction.
Xun Jian, Henry Duwe, John Sartori, Vilas Sridharan, Rakesh Kumar
2013 MVAPICH-PRISM: a proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters.
Sreeram Potluri, Devendar Bureddy, Khaled Hamidouche, Akshay Venkatesh, Krishna Chaitanya Kandalla, Hari Subramoni, Dhabaleswar K. Panda
2013 Mr. Scan: extreme scale density-based clustering using a tree-based network of GPGPU nodes.
Benjamin Welton, Evan Samanas, Barton P. Miller
2013 On fast parallel detection of strongly connected components (SCC) in small-world graphs.
Sungpack Hong, Nicole C. Rodia, Kunle Olukotun
2013 On the usefulness of object tracking techniques in performance analysis.
Germán Llort, Harald Servat, Juan Gonzalez, Judit Giménez, Jesús Labarta
2013 Optimization of cloud task processing with checkpoint-restart mechanism.
Sheng Di, Yves Robert, Frédéric Vivien, Derrick Kondo, Cho-Li Wang, Franck Cappello
2013 Parallel design and performance of nested filtering factorization preconditioner.
Long Qu, Laura Grigori, Frédéric Nataf
2013 Parallel reduction to Hessenberg form with algorithm-based fault tolerance.
Yulu Jia, George Bosilca, Piotr Luszczek, Jack J. Dongarra
2013 Parallelizing the execution of sequential scripts.
Zhao Zhang, Daniel S. Katz, Timothy G. Armstrong, Justin M. Wozniak, Ian T. Foster
2013 Performance evaluation of Intel® transactional synchronization extensions for high-performance computing.
Richard M. Yoo, Christopher J. Hughes, Konrad Lai, Ravi Rajwar
2013 Petascale WRF simulation of Hurricane Sandy: deployment of NCSA's Cray XE6 Blue Waters.
Peter Johnsen, Mark Straka, Melvyn Shapiro, Alan Norton, Thomas Galarneau
2013 Petascale direct numerical simulation of turbulent channel flow on up to 786K cores.
Myoungkyu Lee, Nicholas Malaya, Robert D. Moser
2013 Physics-based seismic hazard analysis on petascale heterogeneous supercomputers.
Yifeng Cui, Efecan Poyraz, Kim B. Olsen, Jun Zhou, Kyle Withers, Scott Callaghan, Jeff Larkin, Clark C. Guest, Dong Ju Choi, Amit Chourasia, Zheqiang Shi, Steven M. Day, Philip Maechling, Thomas H. Jordan
2013 Practical nonvolatile multilevel-cell phase change memory.
Doe Hyun Yoon, Jichuan Chang, Robert S. Schreiber, Norman P. Jouppi
2013 Precimonious: tuning assistant for floating-point precision.
Cindy Rubio-González, Cuong Nguyen, Hong Diep Nguyen, James Demmel, William Kahan, Koushik Sen, David H. Bailey, Costin Iancu, David Hough
2013 Predicting application performance using supervised learning on communication features.
Nikhil Jain, Abhinav Bhatele, Michael P. Robson, Todd Gamblin, Laxmikant V. Kalé
2013 Radiative signatures of the relativistic Kelvin-Helmholtz instability.
Michael Bussmann, Heiko Burau, Thomas E. Cowan, Alexander Debus, Axel Huebl, Guido Juckeland, Thomas Kluge, Wolfgang E. Nagel, Richard Pausch, Felix Schmitt, Ulrich Schramm, Joseph Schuchart, René Widera
2013 Rethinking algorithm-based fault tolerance with a cooperative software-hardware approach.
Dong Li, Zizhong Chen, Panruo Wu, Jeffrey S. Vetter
2013 SDQuery DSI: integrating data management support with a wide area data transfer protocol.
Yu Su, Yi Wang, Gagan Agrawal, Rajkumar Kettimuthu
2013 SIDR: structure-aware intelligent data routing in Hadoop.
Joe B. Buck, Noah Watkins, Greg Levin, Adam Crume, Kleoni Ioannidou, Scott A. Brandt, Carlos Maltzahn, Neoklis Polyzotis, Aaron Torres
2013 SPBC: leveraging the characteristics of MPI HPC applications for scalable checkpointing.
Thomas Ropars, Tatiana V. Martsinkevich, Amina Guermouche, André Schiper, Franck Cappello
2013 Scalable domain decomposition preconditioners for heterogeneous elliptic problems.
Pierre Jolivet, Frédéric Hecht, Frédéric Nataf, Christophe Prud'Homme
2013 Scalable matrix computations on large scale-free graphs using 2D graph partitioning.
Erik G. Boman, Karen D. Devine, Sivasankaran Rajamanickam
2013 Scalable parallel OPTICS data clustering using graph algorithmic techniques.
Md. Mostofa Ali Patwary, Diana Palsetia, Ankit Agrawal, Wei-keng Liao, Fredrik Manne, Alok N. Choudhary
2013 Scalable parallel graph partitioning.
Shad Kirmani, Padma Raghavan
2013 Scalable virtual machine deployment using VM image caches.
Kaveh Razavi, Thilo Kielmann
2013 Semi-automatic restructuring of offloadable tasks for many-core accelerators.
Nishkam Ravi, Yi Yang, Tao Bao, Srimat T. Chakradhar
2013 Solving the compressible Navier-Stokes equations on up to 1.97 million cores and 4.1 trillion grid points.
Iván Bermejo-Moreno, Julien Bodart, Johan Larsson, Blaise M. Barney, Joseph W. Nichols, Steve Jones
2013 Supercomputing with commodity CPUs: are mobile SoCs ready for HPC?
Nikola Rajovic, Paul M. Carpenter, Isaac Gelado, Nikola Puzovic, Alex Ramírez, Mateo Valero
2013 Swendsen-Wang multi-cluster algorithm for the 2D/3D Ising model on Xeon Phi and GPU.
Florian Wende, Thomas Steinke
2013 Taking a quantum leap in time to solution for simulations of high-Tc superconductors.
Peter W. J. Staar, Thomas A. Maier, Michael S. Summers, Gilles Fourestey, Raffaele Solcà, Thomas C. Schulthess
2013 Taming parallel I/O complexity with auto-tuning.
Babak Behzad, Huong Vu Thanh Luu, Joseph Huchette, Surendra Byna, Prabhat, Ruth A. Aydt, Quincey Koziol, Marc Snir
2013 Tera-scale 1D FFT with low-communication algorithm and Intel® Xeon Phi™ coprocessors.
Jongsoo Park, Ganesh Bikshandi, Karthikeyan Vaidyanathan, Ping Tak Peter Tang, Pradeep Dubey, Daehyun Kim
2013 The Science DMZ: a network design pattern for data-intensive science.
Eli Dart, Lauren Rotman, Brian Tierney, Mary Hester, Jason Zurawski
2013 The origin of mass.
Peter A. Boyle, Michael I. Buchoff, Norman H. Christ, Taku Izubuchi, Chulwoo Jung, Thomas C. Luu, Robert D. Mawhinney, Chris Schroeder, Ron Soltz, Pavlos Vranas, Joseph Wasem
2013 There goes the neighborhood: performance degradation due to nearby jobs.
Abhinav Bhatele, Kathryn M. Mohror, Steve H. Langer, Katherine E. Isaacs
2013 Toward millions of file system IOPS on low-cost, commodity hardware.
Da Zheng, Randal C. Burns, Alexander S. Szalay
2013 Using automated performance modeling to find scalability bugs in complex codes.
Alexandru Calotoiu, Torsten Hoefler, Marius Poke, Felix Wolf
2013 Using cross-layer adaptations for dynamic data management in large scale coupled scientific workflows.
Tong Jin, Fan Zhang, Qian Sun, Hoang Bui, Manish Parashar, Hongfeng Yu, Scott Klasky, Norbert Podhorszki, Hasan Abbasi
2013 Using simulation to explore distributed key-value stores for extreme-scale system services.
Ke Wang, Abhishek Kulkarni, Michael Lang, Dorian C. Arnold, Ioan Raicu