SC A

78 papers

YearTitle / Authors
2011A 'cool' load balancer for parallel applications.
Osman Sarood, Laxmikant V. Kalé
2011A distributed look-up architecture for text mining applications using MapReduce.
Atilla Soner Balkir, Ian T. Foster, Andrey Rzhetsky
2011A fast solver for modeling the evolution of virus populations.
Gerhard Niederbrucker, Wilfried N. Gansterer
2011A new computational paradigm in multiscale simulations: application to brain blood flow.
Leopold Grinberg, Joseph A. Insley, Vitali A. Morozov, Michael E. Papka, George E. Karniadakis, Dmitry A. Fedosov, Kalyan Kumaran
2011A scalable eigensolver for large scale-free graphs using 2D graph partitioning.
Andy Yoo, Allison H. Baker, Roger A. Pearce, Van Emden Henson
2011A similarity measure for time, frequency, and dependencies in large-scale workloads.
Mario Lassnig, Thomas Fahringer, Vincent Garonne, Angelos Molfetas, Martin Barisits
2011An early performance analysis of POWER7-IH HPC systems.
Kevin J. Barker, Adolfy Hoisie, Darren J. Kerbyson
2011An image compositing solution at scale.
Kenneth Moreland, Wesley Kendall, Tom Peterka, Jian Huang
2011Atomistic nanoelectronic device engineering with sustained performances up to 1.44 PFlop/s.
Mathieu Luisier, Timothy B. Boykin, Gerhard Klimeck, Wolfgang Fichtner
2011Auto-scaling to minimize cost and meet application deadlines in cloud workflows.
Ming Mao, Marty Humphrey
2011Avoiding hot-spots on two-level direct networks.
Abhinav Bhatele, Nikhil Jain, William D. Gropp, Laxmikant V. Kalé
2011BlobCR: efficient checkpoint-restart for HPC applications on IaaS clouds using virtual disk image snapshots.
Bogdan Nicolae, Franck Cappello
2011Checkpointing strategies for parallel jobs.
Marin Bougeret, Henri Casanova, Mikaël Rabie, Yves Robert, Frédéric Vivien
2011Conference on High Performance Computing Networking, Storage and Analysis, SC 2011, Seattle, WA, USA, November 12-18, 2011
Scott Lathrop, Jim Costa, William Kramer
2011Copernicus: a new paradigm for parallel adaptive molecular dynamics.
Sander Pronk, Per Larsson, Iman Pouya, Gregory R. Bowman, Imran S. Haque, Kyle Beauchamp, Berk Hess, Vijay S. Pande, Peter M. Kasson, Erik Lindahl
2011CudaDMA: optimizing GPU memory bandwidth via warp specialization.
Michael Bauer, Henry Cook, Brucek Khailany
2011Dymaxion: optimizing memory access patterns for heterogeneous systems.
Shuai Che, Jeremy W. Sheaffer, Kevin Skadron
2011Efficient data race detection for distributed memory parallel programs.
Chang-Seo Park, Koushik Sen, Paul Hargrove, Costin Iancu
2011Enabling and scaling biomolecular simulations of 100 million atoms on petascale machines with a multicore-optimized message-driven runtime.
Chao Mei, Yanhua Sun, Gengbin Zheng, Eric J. Bohm, Laxmikant V. Kalé, James C. Phillips, Chris Harrison
2011End-to-end network QoS via scheduling of flexible resource reservation requests.
Sushant Sharma, Dimitrios Katramatos, Dantong Yu
2011Evaluating the viability of process replication reliability for exascale systems.
Kurt B. Ferreira, Jon Stearley, James H. Laros III, Ron A. Oldfield, Kevin T. Pedretti, Ron Brightwell, Rolf Riesen, Patrick G. Bridges, Dorian C. Arnold
2011Extracting ultra-scale Lattice Boltzmann performance via hierarchical and distributed auto-tuning.
Samuel Williams, Leonid Oliker, Jonathan Carter, John Shalf
2011FTI: high performance fault tolerance interface for hybrid systems.
Leonardo Arturo Bautista-Gomez, Seiji Tsuboi, Dimitri Komatitsch, Franck Cappello, Naoya Maruyama, Satoshi Matsuoka
2011Fast implementation of DGEMM on Fermi GPU.
Guangming Tan, Linchuan Li, Sean Triechle, Everett H. Phillips, Yungang Bao, Ninghui Sun
2011First-principles calculations of electron states of a silicon nanowire with 100, 000 atoms on the K computer.
Yukihiro Hasegawa, Jun-ichi Iwata, Miwako Tsuji, Daisuke Takahashi, Atsushi Oshiyama, Kazuo Minami, Taisuke Boku, Fumiyoshi Shoji, Atsuya Uno, Motoyoshi Kurokawa, Hikaru Inoue, Ikuo Miyoshi, Mitsuo Yokokawa
2011Flexible resource allocation for reliable virtual cluster computing systems.
Thomas J. Hacker, Kanak Mahadik
2011GROPHECY: GPU performance projection from CPU code skeletons.
Jiayuan Meng, Vitali A. Morozov, Kalyan Kumaran, Venkatram Vishwanath, Thomas D. Uram
2011GreenSlot: scheduling energy consumption in green datacenters.
Iñigo Goiri, Ryan Beauchea, Kien Le, Thu D. Nguyen, Md. Enamul Haque, Jordi Guitart, Jordi Torres, Ricardo Bianchini
2011Gyrokinetic toroidal simulations on leading multi- and manycore HPC systems.
Kamesh Madduri, Khaled Z. Ibrahim, Samuel Williams, Eun-Jin Im, Stéphane Ethier, John Shalf, Leonid Oliker
2011Hadoop acceleration through network levitated merge.
Yandong Wang, Xinyu Que, Weikuan Yu, Dror Goldenberg, Dhiraj Sehgal
2011Hardware/software co-design for energy-efficient seismic modeling.
Jens Krueger, David Donofrio, John Shalf, Marghoob Mohiyuddin, Samuel Williams, Leonid Oliker, Franz-Josef Pfreundt
2011High-efficiency server design.
Eitan Frachtenberg, Ali Heydari, Harry Li, Amir Michael, Jacob Na, Avery Nisbet, Pierluigi Sarti
2011High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach.
Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Jee W. Choi, Bálint Joó, Jatin Chhugani, Michael A. Clark, Pradeep Dubey
2011Highly scalable
Benoit Marchand, Vladimir B. Bajic, Dinesh K. Kaushik
2011I/O streaming evaluation of batch queries for data-intensive computational turbulence.
Kalin Kanov, Eric A. Perlman, Randal C. Burns, Yanif Ahmad, Alexander S. Szalay
2011ISABELA-QA: query-driven analytics with ISABELA-compressed extreme-scale scientific data.
Sriram Lakshminarasimhan, John Jenkins, Isha Arkatkar, Zhenhuan Gong, Hemanth Kolla, Seung-Hoe Ku, Stéphane Ethier, Jackie Chen, Choong-Seock Chang, Scott Klasky, Robert Latham, Robert B. Ross, Nagiza F. Samatova
2011Improving communication performance in dense linear algebra via topology aware collectives.
Edgar Solomonik, Abhinav Bhatele, James Demmel
2011Large scale debugging of parallel tasks with AutomaDeD.
Ignacio Laguna, Todd Gamblin, Bronis R. de Supinski, Saurabh Bagchi, Greg Bronevetsky, Dong H. Ahn, Martin Schulz, Barry Rountree
2011Large scale plane wave pseudopotential density functional theory calculations on GPU clusters.
Long Wang, Yue Wu, Weile Jia, Weiguo Gao, Xuebin Chi, Lin-Wang Wang
2011Liszt: a domain specific language for building portable mesh-based PDE solvers.
Zach DeVito, Niels Joubert, Francisco Palacios, Stephen Oakley, Montserrat Medina, Mike Barrientos, Erich Elsen, Frank Ham, Alex Aiken, Karthik Duraisamy, Eric Darve, Juan J. Alonso, Pat Hanrahan
2011MAximum Multicore POwer (MAMPO): an automatic multithreaded synthetic power virus generation framework for multicore systems.
Karthik Ganesan, Lizy K. John
2011Modeling and tolerating heterogeneous failures in large parallel systems.
Eric Martin Heien, Derrick Kondo, Ana Gainaru, Dan Lapine, Bill Kramer, Franck Cappello
2011Multi-science applications with single codebase - GAMER - for massively parallel architectures.
Hemant Shukla, Hsi-Yu Schive, Tak-Pong Woo, Tzihong Chiueh
2011Multithreaded global address space communication techniques for gyrokinetic fusion applications on ultra-scale platforms.
Robert Preissl, Nathan Wichmann, Bill Long, John Shalf, Stéphane Ethier, Alice E. Koniges
2011On the duality of data-intensive file system design: reconciling HDFS and PVFS.
Wittawat Tantisiriroj, Seung Woo Son, Swapnil Patil, Samuel Lang, Garth Gibson, Robert B. Ross
2011Optimized pre-copy live migration for memory intensive applications.
Khaled Z. Ibrahim, Steven A. Hofmeyr, Costin Iancu, Eric Roman
2011Optimizing symmetric dense matrix-vector multiplication on GPUs.
Rajib Nath, Stanimire Tomov, Tingxing Dong, Jack J. Dongarra
2011Optimizing the Barnes-Hut algorithm in UPC.
Junchao Zhang, Babak Behzad, Marc Snir
2011Parallel breadth-first search on distributed memory systems.
Aydin Buluç, Kamesh Madduri
2011Parallel index and query for large scale data analysis.
Jerry Chi-Yuan Chou, Mark Howison, Brian Austin, Kesheng Wu, Ji Qiang, E. Wes Bethel, Arie Shoshani, Oliver Rübel, Prabhat, Robert D. Ryne
2011Parallel random numbers: as easy as 1, 2, 3.
John K. Salmon, Mark A. Moraes, Ron O. Dror, David E. Shaw
2011Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels.
Azzam Haidar, Hatem Ltaief, Jack J. Dongarra
2011Parallelization design on multi-core platforms in density matrix renormalization group toward 2-D quantum strongly-correlated systems.
Susumu Yamada, Toshiyuki Imamura, Masahiko Machida
2011Performance of the community earth system model.
Patrick H. Worley, Arthur A. Mirin, Anthony P. Craig, Mark A. Taylor, John M. Dennis, Mariana Vertenstein
2011Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer.
Takashi Shimokawabe, Takayuki Aoki, Tomohiro Takaki, Toshio Endo, Akinori Yamanaka, Naoya Maruyama, Akira Nukada, Satoshi Matsuoka
2011Petaflop biofluidics simulations on a two million-core system.
Massimo Bernaschi, Mauro Bisson, Toshio Endo, Satoshi Matsuoka, Massimiliano Fatica, Simone Melchionna
2011Physis: an implicitly parallel programming model for stencil computations on large-scale GPU-accelerated supercomputers.
Naoya Maruyama, Tatsuo Nomura, Kento Sato, Satoshi Matsuoka
2011Purlieus: locality-aware resource allocation for MapReduce in a cloud.
Balaji Palanisamy, Aameek Singh, Ling Liu, Bhushan Jain
2011QoS support for end users of I/O-intensive applications using shared storage systems.
Xuechen Zhang, Kei Davis, Song Jiang
2011Reducing electricity cost through virtual machine placement in high performance computing clouds.
Kien Le, Ricardo Bianchini, Jingru Zhang, Yogesh Jaluria, Jiandong Meng, Thu D. Nguyen
2011SCMFS: a file system for storage class memory.
Xiaojian Wu, A. L. Narasimha Reddy
2011Scalable fast multipole methods on distributed heterogeneous architectures.
Qi Hu, Nail A. Gumerov, Ramani Duraiswami
2011Scalable hashing for shared memory supercomputers.
Eric L. Goodman, M. Nicole Lemaster, Edward Jimenez
2011Scalable implementations of accurate excited-state coupled cluster theories: application of high-level methods to porphyrin-based systems.
Karol Kowalski, Sriram Krishnamoorthy, Ryan M. Olson, Vinod Tipparaju, Edoardo Aprà
2011Scalable stochastic optimization of complex energy systems.
Miles Lubin, Cosmin G. Petra, Mihai Anitescu, Victor M. Zavala
2011Scaling lattice QCD beyond 100 GPUs.
Ronald Babich, Michael A. Clark, Bálint Joó, Guochun Shi, Richard C. Brower, Steven A. Gottlieb
2011SciHadoop: array-based query processing in Hadoop.
Joe B. Buck, Noah Watkins, Jeff LeFevre, Kleoni Ioannidou, Carlos Maltzahn, Neoklis Polyzotis, Scott A. Brandt
2011Server-side I/O coordination for parallel file systems.
Huaiming Song, Yanlong Yin, Xian-He Sun, Rajeev Thakur, Samuel Lang
2011Simplified parallel domain traversal.
Wesley Kendall, Jingyuan Wang, Melissa R. Allen, Tom Peterka, Jian Huang, David Erickson
2011Sniper: exploring the level of abstraction for scalable and accurate parallel multi-core simulation.
Trevor E. Carlson, Wim Heirman, Lieven Eeckhout
2011System implications of memory reliability in exascale computing.
Sheng Li, Ke Chen, Ming-yu Hsieh, Naveen Muralimanohar, Chad D. Kersey, Jay B. Brockman, Arun F. Rodrigues, Norman P. Jouppi
2011TRACON: interference-aware scheduling for data-intensive applications in virtualized environments.
Ron Chi-Lung Chiang, H. Howie Huang
2011The IBM Blue Gene/Q interconnection network and message unit.
Dong Chen, Noel Eisley, Philip Heidelberger, Robert M. Senger, Yutaka Sugawara, Sameer Kumar, Valentina Salapura, David L. Satterfield, Burkhard D. Steinmacher-Burow, Jeffrey J. Parker
2011Tiled QR factorization algorithms.
Henricus Bouwmeester, Mathias Jacquelin, Julien Langou, Yves Robert
2011Topology-aware data movement and staging for I/O acceleration on Blue Gene/P supercomputing systems.
Venkatram Vishwanath, Mark Hereld, Vitali A. Morozov, Michael E. Papka
2011Unitary qubit lattice simulations of multiscale phenomena in quantum turbulence.
George Vahala, Min Soe, Bo Zhang, Jeffrey Yepez, Linda Vahala, Jonathan Carter, Sean Ziegeler
2011Using the TOP500 to trace and project technology and architecture trends.
Peter M. Kogge, Timothy J. Dysart
2011Virtual I/O caching: dynamic storage cache management for concurrent workloads.
Michael R. Frasca, Ramya Prabhakar, Padma Raghavan, Mahmut T. Kandemir