| 2005 | A NUCA substrate for flexible CMP cache sharing. Jaehyuk Huh, Changkyu Kim, Hazim Shafi, Lixin Zhang, Doug Burger, Stephen W. Keckler |
| 2005 | A heterogeneously segmented cache architecture for a packet forwarding engine. Kaushik Rajan, Ramaswamy Govindarajan |
| 2005 | A hybrid hardware/software approach to efficiently determine cache coherence bottlenecks. Jaydeep Marathe, Frank Mueller, Bronis R. de Supinski |
| 2005 | A performance-conserving approach for reducing peak power consumption in server systems. Wesley M. Felter, Karthick Rajamani, Tom W. Keller, Cosmin Rusu |
| 2005 | An asymmetric clustered processor based on value content. Rubén González, Adrián Cristal, Miquel Pericàs, Mateo Valero, Alexander V. Veidenbaum |
| 2005 | An integrated simdization framework using virtual vectors. Peng Wu, Alexandre E. Eichenberger, Amy Wang, Peng Zhao |
| 2005 | Another approach to backfilled jobs: applying virtual malleability to expired windows. Gladys Utrera, Julita Corbalán, Jesús Labarta |
| 2005 | Automatic generation and tuning of MPI collective communication routines. Ahmad Faraj, Xin Yuan |
| 2005 | Automatic thread distribution for nested parallelism in OpenMP. Alejandro Duran, Marc González, Julita Corbalán |
| 2005 | Cache oblivious stencil computations. Matteo Frigo, Volker Strumpen |
| 2005 | Characterization of L3 cache behavior of SPECjAppServer2002 and TPC-C. Eriko Nurvitadhi, Nirut Chalainanont, Shih-Lien Lu |
| 2005 | Continuous replica placement schemes in distributed systems. Thanasis Loukopoulos, Petros Lampsas, Ishfaq Ahmad |
| 2005 | Design of a next generation sampling service for large scale data analysis applications. Huai Wang, Srinivasan Parthasarathy, Amol Ghoting, Shirish Tatikonda, Gregory Buehrer, Tahsin M. Kurç, Joel H. Saltz |
| 2005 | Disk layout optimization for reducing energy consumption. Seung Woo Son, Guangyu Chen, Mahmut T. Kandemir |
| 2005 | Facilitating the search for compositions of program transformations. Albert Cohen, Marc Sigler, Sylvain Girbal, Olivier Temam, David Parello, Nicolas Vasilache |
| 2005 | Fast branch misprediction recovery in out-of-order superscalar processors. Peng Zhou, Soner Önder, Steve Carr |
| 2005 | Generating new general compiler optimization settings. Masayo Haneda, Peter M. W. Knijnenburg, Harry A. G. Wijshoff |
| 2005 | High performance support of parallel virtual file system (PVFS2) over Quadrics. Weikuan Yu, Shuang Liang, Dhabaleswar K. Panda |
| 2005 | Improved automatic testcase synthesis for performance model validation. Robert H. Bell Jr., Lizy Kurian John |
| 2005 | Improving the computational intensity of unstructured mesh applications. Brian S. White, Sally A. McKee, Bronis R. de Supinski, Brian Miller, Daniel J. Quinlan, Martin Schulz |
| 2005 | Lightweight reference affinity analysis. Xipeng Shen, Yaoqing Gao, Chen Ding, Roch Archambault |
| 2005 | Low-overhead call path profiling of unmodified, optimized code. Nathan Froyd, John M. Mellor-Crummey, Robert J. Fowler |
| 2005 | Low-power, low-complexity instruction issue using compiler assistance. Madhavi Gopal Valluri, Lizy Kurian John, Kathryn S. McKinley |
| 2005 | Multigrain parallel Delaunay mesh generation: challenges and opportunities for multithreaded architectures. Christos D. Antonopoulos, Xiaoning Ding, Andrey N. Chernikov, Filip Blagojevic, Dimitrios S. Nikolopoulos, Nikos Chrisochoides |
| 2005 | Online performance analysis by statistical sampling of microprocessor performance counters. Reza Azimi, Michael Stumm, Robert W. Wisniewski |
| 2005 | Optimization of MPI collective communication on BlueGene/L systems. George Almási, Philip Heidelberger, Charles Archer, Xavier Martorell, C. Christopher Erway, José E. Moreira, Burkhard D. Steinmacher-Burow, Yili Zheng |
| 2005 | Parallel sparse LU factorization on second-class message passing platforms. Kai Shen |
| 2005 | Power-aware resource allocation in high-end systems via online simulation. Barry Lawson, Evgenia Smirni |
| 2005 | Proceedings of the 19th Annual International Conference on Supercomputing, ICS 2005, Cambridge, Massachusetts, USA, June 20-22, 2005. Arvind, Larry Rudolph |
| 2005 | Reducing latencies of pipelined cache accesses through set prediction. Aneesh Aggarwal |
| 2005 | Scaling physics and material science applications on a massively parallel Blue Gene/L system. George Almási, Gyan Bhanot, Alan Gara, Manish Gupta, James C. Sexton, Robert Walkup, Vasily V. Bulatov, Andrew W. Cook, Bronis R. de Supinski, James N. Glosli, Jeffrey A. Greenough, François Gygi, Alison Kubota, Steve Louis, Thomas E. Spelce, Frederick H. Streitz, Peter L. Williams, Robert K. Yates, Charles Archer, José E. Moreira, Charles A. Rendleman |
| 2005 | System noise, OS clock ticks, and fine-grained parallel applications. Dan Tsafrir, Yoav Etsion, Dror G. Feitelson, Scott Kirkpatrick |
| 2005 | TAPE: a transactional application profiling environment. Hassan Chafi, Chi Cao Minh, Austen McDonald, Brian D. Carlstrom, JaeWoong Chung, Lance Hammond, Christos Kozyrakis, Kunle Olukotun |
| 2005 | Tasking with out-of-order spawn in TLS chip multiprocessors: microarchitecture and compilation. Jose Renau, James Tuck, Wei Liu, Luis Ceze, Karin Strauss, Josep Torrellas |
| 2005 | The architecture of the HP Superdome shared-memory multiprocessor. Gary Gostin, Jean-Francois Collard, Kirby Collins |
| 2005 | The implications of working set analysis on supercomputing memory hierarchy design. Richard C. Murphy, Arun Rodrigues, Peter M. Kogge, Keith D. Underwood |
| 2005 | Think globally, search locally. Kamen Yotov, Keshav Pingali, Paul Stodghill |
| 2005 | Thread-Level Speculation on a CMP can be energy efficient. Jose Renau, Karin Strauss, Luis Ceze, Wei Liu, Smruti R. Sarangi, James Tuck, Josep Torrellas |
| 2005 | Tornado warning: the perils of selective replay in multithreaded processors. Yongxiang Liu, Anahita Shayesteh, Gokhan Memik, Glenn Reinman |
| 2005 | Towards automatic translation of OpenMP to MPI. Ayon Basumallik, Rudolf Eigenmann |
| 2005 | Transparent caching with strong consistency in dynamic content web sites. Cristiana Amza, Gokul Soundararajan, Emmanuel Cecchet |
| 2005 | What is worth learning from parallel workloads?: a user and session based analysis. Julia Zilber, Ofer Amit, David Talby |
| 2005 | affinity-on-next-touch: increasing the performance of an industrial PDE solver on a cc-NUMA system. Henrik Löf, Sverker Holmgren |