| 2011 | A QHD-capable parallel H.264 decoder. Chi Ching Chi, Ben H. H. Juurlink |
| 2011 | A composite and scalable cache coherence protocol for large scale CMPs. Yi Xu, Yu Du, Youtao Zhang, Jun Yang |
| 2011 | Active pebbles: parallel programming for data-driven applications. Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine |
| 2011 | An execution strategy and optimized runtime support for parallelizing irregular reductions on modern GPUs. Xin Huo, Vignesh T. Ravi, Wenjing Ma, Gagan Agrawal |
| 2011 | An idiom-finding tool for increasing productivity of accelerators. Laura Carrington, Mustafa M. Tikir, Catherine Olschanowsky, Michael Laurenzano, Joshua Peraza, Allan Snavely, Stephen Poole |
| 2011 | Automatic SIMD vectorization of fast fourier transforms for the larrabee and AVX instruction sets. Daniel S. McFarlin, Volodymyr Arbatov, Franz Franchetti, Markus Püschel |
| 2011 | Automatic generation of executable communication specifications from parallel applications. Xing Wu, Frank Mueller, Scott Pakin |
| 2011 | Automating GPU computing in MATLAB. Chun-Yu Shei, Pushkar Ratnalikar, Arun Chauhan |
| 2011 | Challenges and opportunities in renewable energy and energy efficiency. Steven W. Hammond |
| 2011 | Characterizing the impact of soft errors on iterative methods in scientific computing. Manu Shantharam, Sowmyalatha Srinivasmurthy, Padma Raghavan |
| 2011 | Controlling cache utilization of HPC applications. Swann Perarnau, Marc Tchiboukdjian, Guillaume Huard |
| 2011 | Coordinating processor and main memory for efficientserver power control. Ming Chen, Xiaorui Wang, Xue Li |
| 2011 | Cosmic microwave background map-making at the petascale and beyond. Rajesh Sudarsan, Julian Borrill, Christopher Cantalupo, Theodore Kisner, Kamesh Madduri, Leonid Oliker, Yili Zheng, Horst D. Simon |
| 2011 | Cost-effectively offering private buffers in SoCs and CMPs. Zhen Fang, Li Zhao, Ravishankar R. Iyer, Carlos Flores Fajardo, German Fabila Garcia, Seung Eun Lee, Bin Li, Steve R. King, Xiaowei Jiang, Srihari Makineni |
| 2011 | F Jin Ouyang, Chuan Yang, Dimin Niu, Yuan Xie, Zhiwen Liu |
| 2011 | Generic topology mapping strategies for large-scale parallel architectures. Torsten Hoefler, Marc Snir |
| 2011 | High performance linpack benchmark: a fault tolerant implementation without checkpointing. Teresa Davies, Christer Karlsson, Hui Liu, Chong Ding, Zizhong Chen |
| 2011 | Hystor: making the best use of solid state drives in high performance storage systems. Feng Chen, David A. Koufaty, Xiaodong Zhang |
| 2011 | Karma: scalable deterministic record-replay. Arkaprava Basu, Jayaram Bobba, Mark D. Hill |
| 2011 | MDR: performance model driven runtime for heterogeneous parallel platforms. Jacques A. Pienaar, Anand Raghunathan, Srimat T. Chakradhar |
| 2011 | MP-PIPE: a massively parallel protein-protein interaction prediction engine. Andrew Schoenrock, Frank K. H. A. Dehne, James R. Green, Ashkan Golshani, Sylvain Pitre |
| 2011 | Mint: realizing CUDA performance in 3D stencil methods with annotated C. Didem Unat, Xing Cai, Scott B. Baden |
| 2011 | Modeling the performance of an algebraic multigrid cycle on HPC platforms. Hormozd Gahvari, Allison H. Baker, Martin Schulz, Ulrike Meier Yang, Kirk E. Jordan, William Gropp |
| 2011 | Multiset signatures for transactional memory. Ricardo Quislant, Eladio Gutiérrez, Oscar G. Plata, Emilio L. Zapata |
| 2011 | Optimizing the datacenter for data-centric workloads. Stijn Polfliet, Frederick Ryckbosch, Lieven Eeckhout |
| 2011 | Optimizing throughput/power trade-offs in hardware transactional memory using DVFS and intelligent scheduling. Clay Hughes, Tao Li |
| 2011 | Page placement in hybrid memory systems. Luiz E. Ramos, Eugene Gorbatov, Ricardo Bianchini |
| 2011 | Performance impact and interplay of SSD parallelism through advanced commands, allocation strategy and data granularity. Yang Hu, Hong Jiang, Dan Feng, Lei Tian, Hao Luo, Shu Ping Zhang |
| 2011 | Performance modeling as the key to extreme scale computing. William D. Gropp |
| 2011 | Poster: DVFS management in real-processors. Vasileios Spiliopoulos, Georgios Keramidas, Stefanos Kaxiras, Konstantinos Efstathiou |
| 2011 | Poster: implications of merging phases on scalability of multi-core architectures. Madhavan Manivannan, Ben H. H. Juurlink, Per Stenström |
| 2011 | Poster: programming clusters of GPUs with OMPSs. Javier Bueno, Alejandro Duran, Xavier Martorell, Eduard Ayguadé, Rosa M. Badia, Jesús Labarta |
| 2011 | Poster: revisiting virtual channel memory for performance and fairness on multi-core architecture. Licheng Chen, Yongbing Huang, Yungang Bao, Onur Mutlu, Guangming Tan, Mingyu Chen |
| 2011 | Predictive coordination of multiple on-chip resources for chip multiprocessors. Jian Chen, Lizy Kurian John |
| 2011 | Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31 - June 04, 2011 David K. Lowenthal, Bronis R. de Supinski, Sally A. McKee |
| 2011 | Processing data streams with hard real-time constraints on heterogeneous systems. Uri Verner, Assaf Schuster, Mark Silberstein |
| 2011 | Rethinking shared-memory languages and hardware. Sarita V. Adve |
| 2011 | SRC: Damaris - using dedicated i/o cores for scalable post-petascale HPC simulations. Matthieu Dorier |
| 2011 | SRC: FenixOS - a research operating system focused on high scalability and reliability. Stavros Passas, Sven Karlsson |
| 2011 | SRC: OpenSHMEM library development. Swaroop Suhas Pophale |
| 2011 | SRC: an automatic code overlaying technique for multicores with explicitly-managed memory hierarchies. Choonki Jang |
| 2011 | SRC: automatic extraction of SST/macro skeleton models. Amruth Rudraiah Dakshinamurthy |
| 2011 | SRC: enabling petascale data analysis for scientific applications through data reorganization. Yuan Tian |
| 2011 | SRC: facilitating efficient parallelization of information storage and retrieval on large data sets. Steven Feldman |
| 2011 | SRC: information retrieval as a persistent parallel service on supercomputer infrastructure. Tobias Berka, Marián Vajtersic |
| 2011 | SRC: soft error detection and recovery for high performance linpack. Teresa Davies, Zizhong Chen |
| 2011 | SRC: virtual i/o caching: dynamic storage cache management for concurrent workloads. Michael R. Frasca, Ramya Prabhakar |
| 2011 | Scalable fine-grained call path tracing. Nathan R. Tallent, John M. Mellor-Crummey, Michael Franco, Reed Landrum, Laksono Adhianto |
| 2011 | SecureME: a hardware-software approach to full system security. Siddhartha Chhabra, Brian Rogers, Yan Solihin, Milos Prvulovic |
| 2011 | The elephant and the mice: the role of non-strict fine-grain synchronization for modern many-core architectures. Juergen Ributzka, Yuhei Hayashi, Joseph B. Manzano, Guang R. Gao |
| 2011 | Transactional conflict decoupling and value prediction. Fuad Tabba, Andrew W. Hay, James R. Goodman |
| 2011 | Using GPUs to compute large out-of-card FFTs. Liang Gu, Jakob Siegel, Xiaoming Li |
| 2011 | ZEBRA: a data-centric, hybrid-policy hardware transactional memory design. J. Rubén Titos Gil, Anurag Negi, Manuel E. Acacio, José M. García, Per Stenström |