SC - RankMe – RankMe

97 papers

Year	Title / Authors
2013	11 PFLOP/s simulations of cloud cavitation collapse. Diego Rossinelli, Babak Hejazialhosseini, Panagiotis E. Hadjidoukas, Costas Bekas, Alessandro Curioni, Adam Bertsch, Scott Futral, Steffen J. Schmidt, Nikolaus A. Adams, Petros Koumoutsakos
2013	20 petaflops simulation of proteins suspensions in crowding conditions. Massimo Bernaschi, Mauro Bisson, Massimiliano Fatica, Simone Melchionna
2013	2HOT: an improved parallel hashed oct-tree n-body algorithm for cosmological simulation. Michael S. Warren
2013	A 'cool' way of improving the reliability of HPC machines. Osman Sarood, Esteban Meneses, Laxmikant V. Kalé
2013	A computationally efficient algorithm for the 2D covariance method. Oded Green, Yitzhak Birk
2013	A data-centric profiler for parallel programs. Xu Liu, John M. Mellor-Crummey
2013	A distributed dynamic load balancer for iterative applications. Harshitha Menon, Laxmikant V. Kalé
2013	A framework for hybrid parallel flow simulations with a trillion cells in complex geometries. Christian Godenschwager, Florian Schornbaum, Martin Bauer, Harald Köstler, Ulrich Rüde
2013	A framework for load balancing of tensor contraction expressions via dynamic task partitioning. Pai-Wei Lai, Kevin Stock, Samyam Rajbhandari, Sriram Krishnamoorthy, P. Sadayappan
2013	A large-scale cross-architecture evaluation of thread-coarsening. Alberto Magni, Christophe Dubach, Michael F. P. O'Boyle
2013	A new routing scheme for Jellyfish and its performance with HPC workloads. Xin Yuan, Santosh Mahapatra, Wickus Nienaber, Scott Pakin, Michael Lang
2013	A scalable parallel algorithm for dynamic range-limited Manaschai Kunaseth, Rajiv K. Kalia, Aiichiro Nakano, Ken-ichi Nomura, Priya Vashishta
2013	A scalable, efficient scheme for evaluation of stencil computations over unstructured meshes. James King, Robert M. Kirby
2013	ACIC: automatic cloud I/O configurator for HPC applications. Mingliang Liu, Ye Jin, Jidong Zhai, Yan Zhai, Qianqian Shi, Xiaosong Ma, Wenguang Chen
2013	ACR: automatic checkpoint/restart for soft and hard error protection. Xiang Ni, Esteban Meneses, Nikhil Jain, Laxmikant V. Kalé
2013	AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs. Qian Wang, Xianyi Zhang, Yunquan Zhang, Qing Yi
2013	Accelerating sparse matrix-vector multiplication on GPUs using bit-representation-optimized schemes. Wai Teng Tang, Wen Jun Tan, Rajarshi Ray, Yi Wen Wong, Weiguang Chen, Shyh-Hao Kuo, Rick Siow Mong Goh, Stephen John Turner, Weng-Fai Wong
2013	Algorithms for high-throughput disk-to-disk sorting. Hari Sundar, Dhairya Malhotra, Karl W. Schulz
2013	An early performance evaluation of many integrated core architecture based SGI rackable computing system. Subhash Saini, Haoqiang Jin, Dennis C. Jespersen, Huiyu Feng, M. Jahed Djomehri, William Arasin, Robert Hood, Piyush Mehrotra, Rupak Biswas
2013	An improved parallel singular value algorithm and its implementation for multicore hardware. Azzam Haidar, Jakub Kurzak, Piotr Luszczek
2013	Assessing the effects of data compression in simulations using physically motivated metrics. Daniel E. Laney, Steven Langer, Christopher Weber, Peter Lindstrom, Al Wegener
2013	COCA: online distributed resource management for cost minimization and carbon neutrality in data centers. Shaolei Ren, Yuxiong He
2013	Channel reservation protocol for over-subscribed channels and destinations. George Michelogiannakis, Nan Jiang, Daniel Becker, William J. Dally
2013	Characterization and modeling of PIDX parallel I/O for performance optimization. Sidharth Kumar, Avishek Saha, Venkatram Vishwanath, Philip H. Carns, John A. Schmidt, Giorgio Scorzelli, Hemanth Kolla, Ray W. Grout, Robert Latham, Robert B. Ross, Michael E. Papka, Jacqueline Chen, Valerio Pascucci
2013	Compiling affine loop nests for distributed-memory parallel architectures. Uday Bondhugula
2013	CooMR: cross-task coordination for efficient data management in MapReduce programs. Xiaobing Li, Yandong Wang, Yizheng Jiao, Cong Xu, Weikuan Yu
2013	Coordinated energy management in heterogeneous processors. Indrani Paul, Vignesh T. Ravi, Srilatha Manne, Manish Arora, Sudhakar Yalamanchili
2013	Cost-effective cloud HPC resource provisioning by building semi-elastic virtual clusters. Shuangcheng Niu, Jidong Zhai, Xiaosong Ma, Xiongchao Tang, Wenguang Chen
2013	Design and performance evaluation of NUMA-aware RDMA-based end-to-end data transfer systems. Yufei Ren, Tan Li, Dantong Yu, Shudong Jin, Thomas G. Robertazzi
2013	Detection of false sharing using machine learning. Sanath Jayasena, Saman P. Amarasinghe, Asanka Abeyweera, Gayashan Amarasinghe, Himeshi De Silva, Sunimal Rathnayake, Xiaoqiao Meng, Yanbin Liu
2013	Deterministic scale-free pipeline parallelism with hyperqueues. Hans Vandierendonck, Kallia Chronaki, Dimitrios S. Nikolopoulos
2013	Distributed wait state tracking for runtime MPI deadlock detection. Tobias Hilbrich, Bronis R. de Supinski, Wolfgang E. Nagel, Joachim Protze, Christel Baier, Matthias S. Müller
2013	Distributed-memory parallel algorithms for generating massive scale-free networks using preferential attachment model. Md. Maksudul Alam, Maleq Khan, Madhav V. Marathe
2013	Effective sampling-driven performance tools for GPU-accelerated supercomputers. Milind Chabbi, Karthik Murthy, Michael W. Fagan, John M. Mellor-Crummey
2013	Efficient data partitioning model for heterogeneous graphs in the cloud. Kisung Lee, Ling Liu
2013	Enabling comprehensive data-driven system management for large computational facilities. James C. Browne, Robert L. DeLeon, Charng-Da Lu, Matthew D. Jones, Steven M. Gallo, Amin Ghadersohi, Abani K. Patra, William L. Barth, John L. Hammond, Thomas R. Furlani, Robert T. McLay
2013	Enabling fair pricing on HPC systems with node sharing. Alex D. Breslow, Ananta Tiwari, Martin Schulz, Laura Carrington, Lingjia Tang, Jason Mars
2013	Enabling highly-scalable remote memory access programming with MPI-3 one sided. Robert Gerstenberger, Maciej Besta, Torsten Hoefler
2013	Exploiting application dynamism and cloud elasticity for continuous dataflows. Alok Gautam Kumbhare, Yogesh Simmhan, Viktor K. Prasanna
2013	Exploring DRAM organizations for energy-efficient and resilient exascale memories. Bharan Giridhar, Michael Cieslak, Deepankar Duggal, Ronald G. Dreslinski, Hsing Min Chen, Robert Patti, Betina Hold, Chaitali Chakrabarti, Trevor N. Mudge, David T. Blaauw
2013	Exploring portfolio scheduling for long-term execution of scientific workloads in IaaS clouds. Kefeng Deng, Junqiang Song, Kaijun Ren, Alexandru Iosup
2013	Exploring power behaviors and trade-offs of in-situ data analytics. Marc Gamell, Ivan Rodero, Manish Parashar, Janine Bennett, Hemanth Kolla, Jacqueline Chen, Peer-Timo Bremer, Aaditya G. Landge, Attila Gyulassy, Patrick S. McCormick, Scott Pakin, Valerio Pascucci, Scott Klasky
2013	Exploring the future of out-of-core computing with compute-local non-volatile memory. Myoungsoo Jung, Ellis Herbert Wilson, Wonil Choi, John Shalf, Hasan Metin Aktulga, Chao Yang, Erik Saule, Ümit V. Çatalyürek, Mahmut T. Kandemir
2013	Feng shui of supercomputer memory: positional effects in DRAM and SRAM faults. Vilas Sridharan, Jon Stearley, Nathan DeBardeleben, Sean Blanchard, Sudhanva Gurumurthi
2013	General transformations for GPU execution of tree traversals. Michael Goldfarb, Youngjoon Jo, Milind Kulkarni
2013	Globalizing selectively: shared-memory efficiency with address-space separation. Nilesh Mahajan, Uday Pitambare, Arun Chauhan
2013	GoldRush: resource efficient in situ scientific data analytics using fine-grained interference aware execution. Fang Zheng, Hongfeng Yu, Can Hantas, Matthew Wolf, Greg Eisenhauer, Karsten Schwan, Hasan Abbasi, Scott Klasky
2013	Guide-copy: fast and silent migration of virtual machine for datacenters. Jihun Kim, Dongju Chae, Jangwoo Kim, Jong Kim
2013	HACC: extreme scaling and performance across diverse architectures. Salman Habib, Vitali A. Morozov, Nicholas Frontiere, Hal Finkel, Adrian Pope, Katrin Heitmann
2013	Hybrid MPI: efficient message passing for multi-core systems. Andrew Friedley, Greg Bronevetsky, Torsten Hoefler, Andrew Lumsdaine
2013	Insights for exascale IO APIs from building a petascale IO API. Jay F. Lofstead, Robert Ross
2013	Integrating dynamic pricing of electricity into energy aware scheduling for HPC systems. Xu Yang, Zhou Zhou, Sean Wallace, Zhiling Lan, Wei Tang, Susan Coghlan, Michael E. Papka
2013	International Conference for High Performance Computing, Networking, Storage and Analysis, SC'13, Denver, CO, USA - November 17 - 21, 2013 William Gropp, Satoshi Matsuoka
2013	Investigating applications portability with the Uintah DAG-based runtime system on PetaScale supercomputers. Qingyu Meng, Alan Humphrey, John A. Schmidt, Martin Berzins
2013	Kinetic turbulence simulations at extreme scale on leadership-class systems. Bei Wang, Stéphane Ethier, William M. Tang, Timothy J. Williams, Khaled Z. Ibrahim, Kamesh Madduri, Samuel Williams, Leonid Oliker
2013	Load-balanced pipeline parallelism. Md. Kamruzzaman, Steven Swanson, Dean M. Tullsen
2013	Location-aware cache management for many-core processors with deep cache hierarchy. Jongsoo Park, Richard M. Yoo, Daya Shanker Khudia, Christopher J. Hughes, Daehyun Kim
2013	Low-power, low-storage-overhead chipkill correct via multi-line error correction. Xun Jian, Henry Duwe, John Sartori, Vilas Sridharan, Rakesh Kumar
2013	MVAPICH-PRISM: a proxy-based communication framework using InfiniBand and SCIF for intel MIC clusters. Sreeram Potluri, Devendar Bureddy, Khaled Hamidouche, Akshay Venkatesh, Krishna Chaitanya Kandalla, Hari Subramoni, Dhabaleswar K. Panda
2013	Mr. Scan: extreme scale density-based clustering using a tree-based network of GPGPU nodes. Benjamin Welton, Evan Samanas, Barton P. Miller
2013	On fast parallel detection of strongly connected components (SCC) in small-world graphs. Sungpack Hong, Nicole C. Rodia, Kunle Olukotun
2013	On the usefulness of object tracking techniques in performance analysis. Germán Llort, Harald Servat, Juan Gonzalez, Judit Giménez, Jesús Labarta
2013	Optimization of cloud task processing with checkpoint-restart mechanism. Sheng Di, Yves Robert, Frédéric Vivien, Derrick Kondo, Cho-Li Wang, Franck Cappello
2013	Parallel design and performance of nested filtering factorization preconditioner. Long Qu, Laura Grigori, Frédéric Nataf
2013	Parallel reduction to hessenberg form with algorithm-based fault tolerance. Yulu Jia, George Bosilca, Piotr Luszczek, Jack J. Dongarra
2013	Parallelizing the execution of sequential scripts. Zhao Zhang, Daniel S. Katz, Timothy G. Armstrong, Justin M. Wozniak, Ian T. Foster
2013	Performance evaluation of Intel® transactional synchronization extensions for high-performance computing. Richard M. Yoo, Christopher J. Hughes, Konrad Lai, Ravi Rajwar
2013	Petascale WRF simulation of hurricane Sandy deployment of NCSA's cray XE6 blue waters. Peter Johnsen, Mark Straka, Melvyn Shapiro, Alan Norton, Thomas Galarneau
2013	Petascale direct numerical simulation of turbulent channel flow on up to 786K cores. Myoungkyu Lee, Nicholas Malaya, Robert D. Moser
2013	Physics-based seismic hazard analysis on petascale heterogeneous supercomputers. Yifeng Cui, Efecan Poyraz, Kim B. Olsen, Jun Zhou, Kyle Withers, Scott Callaghan, Jeff Larkin, Clark C. Guest, Dong Ju Choi, Amit Chourasia, Zheqiang Shi, Steven M. Day, Philip Maechling, Thomas H. Jordan
2013	Practical nonvolatile multilevel-cell phase change memory. Doe Hyun Yoon, Jichuan Chang, Robert S. Schreiber, Norman P. Jouppi
2013	Precimonious: tuning assistant for floating-point precision. Cindy Rubio-González, Cuong Nguyen, Hong Diep Nguyen, James Demmel, William Kahan, Koushik Sen, David H. Bailey, Costin Iancu, David Hough
2013	Predicting application performance using supervised learning on communication features. Nikhil Jain, Abhinav Bhatele, Michael P. Robson, Todd Gamblin, Laxmikant V. Kalé
2013	Radiative signatures of the relativistic Kelvin-Helmholtz instability. Michael Bussmann, Heiko Burau, Thomas E. Cowan, Alexander Debus, Axel Huebl, Guido Juckeland, Thomas Kluge, Wolfgang E. Nagel, Richard Pausch, Felix Schmitt, Ulrich Schramm, Joseph Schuchart, René Widera
2013	Rethinking algorithm-based fault tolerance with a cooperative software-hardware approach. Dong Li, Zizhong Chen, Panruo Wu, Jeffrey S. Vetter
2013	SDQuery DSI: integrating data management support with a wide area data transfer protocol. Yu Su, Yi Wang, Gagan Agrawal, Rajkumar Kettimuthu
2013	SIDR: structure-aware intelligent data routing in Hadoop. Joe B. Buck, Noah Watkins, Greg Levin, Adam Crume, Kleoni Ioannidou, Scott A. Brandt, Carlos Maltzahn, Neoklis Polyzotis, Aaron Torres
2013	SPBC: leveraging the characteristics of MPI HPC applications for scalable checkpointing. Thomas Ropars, Tatiana V. Martsinkevich, Amina Guermouche, André Schiper, Franck Cappello
2013	Scalable domain decomposition preconditioners for heterogeneous elliptic problems. Pierre Jolivet, Frédéric Hecht, Frédéric Nataf, Christophe Prud'Homme
2013	Scalable matrix computations on large scale-free graphs using 2D graph partitioning. Erik G. Boman, Karen D. Devine, Sivasankaran Rajamanickam
2013	Scalable parallel OPTICS data clustering using graph algorithmic techniques. Md. Mostofa Ali Patwary, Diana Palsetia, Ankit Agrawal, Wei-keng Liao, Fredrik Manne, Alok N. Choudhary
2013	Scalable parallel graph partitioning. Shad Kirmani, Padma Raghavan
2013	Scalable virtual machine deployment using VM image caches. Kaveh Razavi, Thilo Kielmann
2013	Semi-automatic restructuring of offloadable tasks for many-core accelerators. Nishkam Ravi, Yi Yang, Tao Bao, Srimat T. Chakradhar
2013	Solving the compressible navier-stokes equations on up to 1.97 million cores and 4.1 trillion grid points. Iván Bermejo-Moreno, Julien Bodart, Johan Larsson, Blaise M. Barney, Joseph W. Nichols, Steve Jones
2013	Supercomputing with commodity CPUs: are mobile SoCs ready for HPC? Nikola Rajovic, Paul M. Carpenter, Isaac Gelado, Nikola Puzovic, Alex Ramírez, Mateo Valero
2013	Swendsen-Wang multi-cluster algorithm for the 2D/3D Ising model on Xeon Phi and GPU. Florian Wende, Thomas Steinke
2013	Taking a quantum leap in time to solution for simulations of high-Tc superconductors. Peter W. J. Staar, Thomas A. Maier, Michael S. Summers, Gilles Fourestey, Raffaele Solcà, Thomas C. Schulthess
2013	Taming parallel I/O complexity with auto-tuning. Babak Behzad, Huong Vu Thanh Luu, Joseph Huchette, Surendra Byna, Prabhat, Ruth A. Aydt, Quincey Koziol, Marc Snir
2013	Tera-scale 1D FFT with low-communication algorithm and Intel® Xeon Phi™ coprocessors. Jongsoo Park, Ganesh Bikshandi, Karthikeyan Vaidyanathan, Ping Tak Peter Tang, Pradeep Dubey, Daehyun Kim
2013	The Science DMZ: a network design pattern for data-intensive science. Eli Dart, Lauren Rotman, Brian Tierney, Mary Hester, Jason Zurawski
2013	The origin of mass. Peter A. Boyle, Michael I. Buchoff, Norman H. Christ, Taku Izubuchi, Chulwoo Jung, Thomas C. Luu, Robert D. Mawhinney, Chris Schroeder, Ron Soltz, Pavlos Vranas, Joseph Wasem
2013	There goes the neighborhood: performance degradation due to nearby jobs. Abhinav Bhatele, Kathryn M. Mohror, Steve H. Langer, Katherine E. Isaacs
2013	Toward millions of file system IOPS on low-cost, commodity hardware. Da Zheng, Randal C. Burns, Alexander S. Szalay
2013	Using automated performance modeling to find scalability bugs in complex codes. Alexandru Calotoiu, Torsten Hoefler, Marius Poke, Felix Wolf
2013	Using cross-layer adaptations for dynamic data management in large scale coupled scientific workflows. Tong Jin, Fan Zhang, Qian Sun, Hoang Bui, Manish Parashar, Hongfeng Yu, Scott Klasky, Norbert Podhorszki, Hasan Abbasi
2013	Using simulation to explore distributed key-value stores for extreme-scale system services. Ke Wang, Abhishek Kulkarni, Michael Lang, Dorian C. Arnold, Ioan Raicu