| 2011 | A 'cool' load balancer for parallel applications. Osman Sarood, Laxmikant V. Kalé |
| 2011 | A distributed look-up architecture for text mining applications using MapReduce. Atilla Soner Balkir, Ian T. Foster, Andrey Rzhetsky |
| 2011 | A fast solver for modeling the evolution of virus populations. Gerhard Niederbrucker, Wilfried N. Gansterer |
| 2011 | A new computational paradigm in multiscale simulations: application to brain blood flow. Leopold Grinberg, Joseph A. Insley, Vitali A. Morozov, Michael E. Papka, George E. Karniadakis, Dmitry A. Fedosov, Kalyan Kumaran |
| 2011 | A scalable eigensolver for large scale-free graphs using 2D graph partitioning. Andy Yoo, Allison H. Baker, Roger A. Pearce, Van Emden Henson |
| 2011 | A similarity measure for time, frequency, and dependencies in large-scale workloads. Mario Lassnig, Thomas Fahringer, Vincent Garonne, Angelos Molfetas, Martin Barisits |
| 2011 | An early performance analysis of POWER7-IH HPC systems. Kevin J. Barker, Adolfy Hoisie, Darren J. Kerbyson |
| 2011 | An image compositing solution at scale. Kenneth Moreland, Wesley Kendall, Tom Peterka, Jian Huang |
| 2011 | Atomistic nanoelectronic device engineering with sustained performances up to 1.44 PFlop/s. Mathieu Luisier, Timothy B. Boykin, Gerhard Klimeck, Wolfgang Fichtner |
| 2011 | Auto-scaling to minimize cost and meet application deadlines in cloud workflows. Ming Mao, Marty Humphrey |
| 2011 | Avoiding hot-spots on two-level direct networks. Abhinav Bhatele, Nikhil Jain, William D. Gropp, Laxmikant V. Kalé |
| 2011 | BlobCR: efficient checkpoint-restart for HPC applications on IaaS clouds using virtual disk image snapshots. Bogdan Nicolae, Franck Cappello |
| 2011 | Checkpointing strategies for parallel jobs. Marin Bougeret, Henri Casanova, Mikaël Rabie, Yves Robert, Frédéric Vivien |
| 2011 | Conference on High Performance Computing Networking, Storage and Analysis, SC 2011, Seattle, WA, USA, November 12-18, 2011. Scott Lathrop, Jim Costa, William Kramer |
| 2011 | Copernicus: a new paradigm for parallel adaptive molecular dynamics. Sander Pronk, Per Larsson, Iman Pouya, Gregory R. Bowman, Imran S. Haque, Kyle Beauchamp, Berk Hess, Vijay S. Pande, Peter M. Kasson, Erik Lindahl |
| 2011 | CudaDMA: optimizing GPU memory bandwidth via warp specialization. Michael Bauer, Henry Cook, Brucek Khailany |
| 2011 | Dymaxion: optimizing memory access patterns for heterogeneous systems. Shuai Che, Jeremy W. Sheaffer, Kevin Skadron |
| 2011 | Efficient data race detection for distributed memory parallel programs. Chang-Seo Park, Koushik Sen, Paul Hargrove, Costin Iancu |
| 2011 | Enabling and scaling biomolecular simulations of 100 million atoms on petascale machines with a multicore-optimized message-driven runtime. Chao Mei, Yanhua Sun, Gengbin Zheng, Eric J. Bohm, Laxmikant V. Kalé, James C. Phillips, Chris Harrison |
| 2011 | End-to-end network QoS via scheduling of flexible resource reservation requests. Sushant Sharma, Dimitrios Katramatos, Dantong Yu |
| 2011 | Evaluating the viability of process replication reliability for exascale systems. Kurt B. Ferreira, Jon Stearley, James H. Laros III, Ron A. Oldfield, Kevin T. Pedretti, Ron Brightwell, Rolf Riesen, Patrick G. Bridges, Dorian C. Arnold |
| 2011 | Extracting ultra-scale Lattice Boltzmann performance via hierarchical and distributed auto-tuning. Samuel Williams, Leonid Oliker, Jonathan Carter, John Shalf |
| 2011 | FTI: high performance fault tolerance interface for hybrid systems. Leonardo Arturo Bautista-Gomez, Seiji Tsuboi, Dimitri Komatitsch, Franck Cappello, Naoya Maruyama, Satoshi Matsuoka |
| 2011 | Fast implementation of DGEMM on Fermi GPU. Guangming Tan, Linchuan Li, Sean Triechle, Everett H. Phillips, Yungang Bao, Ninghui Sun |
| 2011 | First-principles calculations of electron states of a silicon nanowire with 100,000 atoms on the K computer. Yukihiro Hasegawa, Jun-ichi Iwata, Miwako Tsuji, Daisuke Takahashi, Atsushi Oshiyama, Kazuo Minami, Taisuke Boku, Fumiyoshi Shoji, Atsuya Uno, Motoyoshi Kurokawa, Hikaru Inoue, Ikuo Miyoshi, Mitsuo Yokokawa |
| 2011 | Flexible resource allocation for reliable virtual cluster computing systems. Thomas J. Hacker, Kanak Mahadik |
| 2011 | GROPHECY: GPU performance projection from CPU code skeletons. Jiayuan Meng, Vitali A. Morozov, Kalyan Kumaran, Venkatram Vishwanath, Thomas D. Uram |
| 2011 | GreenSlot: scheduling energy consumption in green datacenters. Iñigo Goiri, Ryan Beauchea, Kien Le, Thu D. Nguyen, Md. Enamul Haque, Jordi Guitart, Jordi Torres, Ricardo Bianchini |
| 2011 | Gyrokinetic toroidal simulations on leading multi- and manycore HPC systems. Kamesh Madduri, Khaled Z. Ibrahim, Samuel Williams, Eun-Jin Im, Stéphane Ethier, John Shalf, Leonid Oliker |
| 2011 | Hadoop acceleration through network levitated merge. Yandong Wang, Xinyu Que, Weikuan Yu, Dror Goldenberg, Dhiraj Sehgal |
| 2011 | Hardware/software co-design for energy-efficient seismic modeling. Jens Krueger, David Donofrio, John Shalf, Marghoob Mohiyuddin, Samuel Williams, Leonid Oliker, Franz-Josef Pfreundt |
| 2011 | High-efficiency server design. Eitan Frachtenberg, Ali Heydari, Harry Li, Amir Michael, Jacob Na, Avery Nisbet, Pierluigi Sarti |
| 2011 | High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach. Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Jee W. Choi, Bálint Joó, Jatin Chhugani, Michael A. Clark, Pradeep Dubey |
| 2011 | Highly scalable Benoit Marchand, Vladimir B. Bajic, Dinesh K. Kaushik |
| 2011 | I/O streaming evaluation of batch queries for data-intensive computational turbulence. Kalin Kanov, Eric A. Perlman, Randal C. Burns, Yanif Ahmad, Alexander S. Szalay |
| 2011 | ISABELA-QA: query-driven analytics with ISABELA-compressed extreme-scale scientific data. Sriram Lakshminarasimhan, John Jenkins, Isha Arkatkar, Zhenhuan Gong, Hemanth Kolla, Seung-Hoe Ku, Stéphane Ethier, Jackie Chen, Choong-Seock Chang, Scott Klasky, Robert Latham, Robert B. Ross, Nagiza F. Samatova |
| 2011 | Improving communication performance in dense linear algebra via topology aware collectives. Edgar Solomonik, Abhinav Bhatele, James Demmel |
| 2011 | Large scale debugging of parallel tasks with AutomaDeD. Ignacio Laguna, Todd Gamblin, Bronis R. de Supinski, Saurabh Bagchi, Greg Bronevetsky, Dong H. Ahn, Martin Schulz, Barry Rountree |
| 2011 | Large scale plane wave pseudopotential density functional theory calculations on GPU clusters. Long Wang, Yue Wu, Weile Jia, Weiguo Gao, Xuebin Chi, Lin-Wang Wang |
| 2011 | Liszt: a domain specific language for building portable mesh-based PDE solvers. Zach DeVito, Niels Joubert, Francisco Palacios, Stephen Oakley, Montserrat Medina, Mike Barrientos, Erich Elsen, Frank Ham, Alex Aiken, Karthik Duraisamy, Eric Darve, Juan J. Alonso, Pat Hanrahan |
| 2011 | MAximum Multicore POwer (MAMPO): an automatic multithreaded synthetic power virus generation framework for multicore systems. Karthik Ganesan, Lizy K. John |
| 2011 | Modeling and tolerating heterogeneous failures in large parallel systems. Eric Martin Heien, Derrick Kondo, Ana Gainaru, Dan Lapine, Bill Kramer, Franck Cappello |
| 2011 | Multi-science applications with single codebase - GAMER - for massively parallel architectures. Hemant Shukla, Hsi-Yu Schive, Tak-Pong Woo, Tzihong Chiueh |
| 2011 | Multithreaded global address space communication techniques for gyrokinetic fusion applications on ultra-scale platforms. Robert Preissl, Nathan Wichmann, Bill Long, John Shalf, Stéphane Ethier, Alice E. Koniges |
| 2011 | On the duality of data-intensive file system design: reconciling HDFS and PVFS. Wittawat Tantisiriroj, Seung Woo Son, Swapnil Patil, Samuel Lang, Garth Gibson, Robert B. Ross |
| 2011 | Optimized pre-copy live migration for memory intensive applications. Khaled Z. Ibrahim, Steven A. Hofmeyr, Costin Iancu, Eric Roman |
| 2011 | Optimizing symmetric dense matrix-vector multiplication on GPUs. Rajib Nath, Stanimire Tomov, Tingxing Dong, Jack J. Dongarra |
| 2011 | Optimizing the Barnes-Hut algorithm in UPC. Junchao Zhang, Babak Behzad, Marc Snir |
| 2011 | Parallel breadth-first search on distributed memory systems. Aydin Buluç, Kamesh Madduri |
| 2011 | Parallel index and query for large scale data analysis. Jerry Chi-Yuan Chou, Mark Howison, Brian Austin, Kesheng Wu, Ji Qiang, E. Wes Bethel, Arie Shoshani, Oliver Rübel, Prabhat, Robert D. Ryne |
| 2011 | Parallel random numbers: as easy as 1, 2, 3. John K. Salmon, Mark A. Moraes, Ron O. Dror, David E. Shaw |
| 2011 | Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels. Azzam Haidar, Hatem Ltaief, Jack J. Dongarra |
| 2011 | Parallelization design on multi-core platforms in density matrix renormalization group toward 2-D quantum strongly-correlated systems. Susumu Yamada, Toshiyuki Imamura, Masahiko Machida |
| 2011 | Performance of the community earth system model. Patrick H. Worley, Arthur A. Mirin, Anthony P. Craig, Mark A. Taylor, John M. Dennis, Mariana Vertenstein |
| 2011 | Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer. Takashi Shimokawabe, Takayuki Aoki, Tomohiro Takaki, Toshio Endo, Akinori Yamanaka, Naoya Maruyama, Akira Nukada, Satoshi Matsuoka |
| 2011 | Petaflop biofluidics simulations on a two million-core system. Massimo Bernaschi, Mauro Bisson, Toshio Endo, Satoshi Matsuoka, Massimiliano Fatica, Simone Melchionna |
| 2011 | Physis: an implicitly parallel programming model for stencil computations on large-scale GPU-accelerated supercomputers. Naoya Maruyama, Tatsuo Nomura, Kento Sato, Satoshi Matsuoka |
| 2011 | Purlieus: locality-aware resource allocation for MapReduce in a cloud. Balaji Palanisamy, Aameek Singh, Ling Liu, Bhushan Jain |
| 2011 | QoS support for end users of I/O-intensive applications using shared storage systems. Xuechen Zhang, Kei Davis, Song Jiang |
| 2011 | Reducing electricity cost through virtual machine placement in high performance computing clouds. Kien Le, Ricardo Bianchini, Jingru Zhang, Yogesh Jaluria, Jiandong Meng, Thu D. Nguyen |
| 2011 | SCMFS: a file system for storage class memory. Xiaojian Wu, A. L. Narasimha Reddy |
| 2011 | Scalable fast multipole methods on distributed heterogeneous architectures. Qi Hu, Nail A. Gumerov, Ramani Duraiswami |
| 2011 | Scalable hashing for shared memory supercomputers. Eric L. Goodman, M. Nicole Lemaster, Edward Jimenez |
| 2011 | Scalable implementations of accurate excited-state coupled cluster theories: application of high-level methods to porphyrin-based systems. Karol Kowalski, Sriram Krishnamoorthy, Ryan M. Olson, Vinod Tipparaju, Edoardo Aprà |
| 2011 | Scalable stochastic optimization of complex energy systems. Miles Lubin, Cosmin G. Petra, Mihai Anitescu, Victor M. Zavala |
| 2011 | Scaling lattice QCD beyond 100 GPUs. Ronald Babich, Michael A. Clark, Bálint Joó, Guochun Shi, Richard C. Brower, Steven A. Gottlieb |
| 2011 | SciHadoop: array-based query processing in Hadoop. Joe B. Buck, Noah Watkins, Jeff LeFevre, Kleoni Ioannidou, Carlos Maltzahn, Neoklis Polyzotis, Scott A. Brandt |
| 2011 | Server-side I/O coordination for parallel file systems. Huaiming Song, Yanlong Yin, Xian-He Sun, Rajeev Thakur, Samuel Lang |
| 2011 | Simplified parallel domain traversal. Wesley Kendall, Jingyuan Wang, Melissa R. Allen, Tom Peterka, Jian Huang, David Erickson |
| 2011 | Sniper: exploring the level of abstraction for scalable and accurate parallel multi-core simulation. Trevor E. Carlson, Wim Heirman, Lieven Eeckhout |
| 2011 | System implications of memory reliability in exascale computing. Sheng Li, Ke Chen, Ming-yu Hsieh, Naveen Muralimanohar, Chad D. Kersey, Jay B. Brockman, Arun F. Rodrigues, Norman P. Jouppi |
| 2011 | TRACON: interference-aware scheduling for data-intensive applications in virtualized environments. Ron Chi-Lung Chiang, H. Howie Huang |
| 2011 | The IBM Blue Gene/Q interconnection network and message unit. Dong Chen, Noel Eisley, Philip Heidelberger, Robert M. Senger, Yutaka Sugawara, Sameer Kumar, Valentina Salapura, David L. Satterfield, Burkhard D. Steinmacher-Burow, Jeffrey J. Parker |
| 2011 | Tiled QR factorization algorithms. Henricus Bouwmeester, Mathias Jacquelin, Julien Langou, Yves Robert |
| 2011 | Topology-aware data movement and staging for I/O acceleration on Blue Gene/P supercomputing systems. Venkatram Vishwanath, Mark Hereld, Vitali A. Morozov, Michael E. Papka |
| 2011 | Unitary qubit lattice simulations of multiscale phenomena in quantum turbulence. George Vahala, Min Soe, Bo Zhang, Jeffrey Yepez, Linda Vahala, Jonathan Carter, Sean Ziegeler |
| 2011 | Using the TOP500 to trace and project technology and architecture trends. Peter M. Kogge, Timothy J. Dysart |
| 2011 | Virtual I/O caching: dynamic storage cache management for concurrent workloads. Michael R. Frasca, Ramya Prabhakar, Padma Raghavan, Mahmut T. Kandemir |