| 2007 | A low-cost mixed-mode parallel processor architecture for embedded systems. Shorin Kyo, Takuya Koga, Hanno Lieske, Shouhei Nomoto, Shin'ichiro Okazaki |
| 2007 | A study of process arrival patterns for MPI collective operations. Ahmad Faraj, Pitch Patarasuk, Xin Yuan |
| 2007 | A symmetric transformation for 3-body potential molecular dynamics using force-decomposition in a heterogeneous distributed environment. J. V. Sumanth, David R. Swanson, Hong Jiang |
| 2007 | Active memory operations. Zhen Fang, Lixin Zhang, John B. Carter, Ali Ibrahim, Michael A. Parker |
| 2007 | Adaptive Strassen's matrix multiplication. Paolo D'Alberto, Alexandru Nicolau |
| 2007 | Adaptive performance control for distributed scientific coupled models. Mohamed Khamiss Hussein, Kenneth R. Mayes, Mikel Luján, John R. Gurd |
| 2007 | An L2-miss-driven early register deallocation for SMT processors. Joseph J. Sharkey, Dmitry V. Ponomarev |
| 2007 | An operation stacking framework for large ensemble computations. Mehmet Belgin, Calvin J. Ribbens, Godmar Back |
| 2007 | Automatic nonblocking communication for partitioned global address space programs. Wei-Yu Chen, Dan Bonachea, Costin Iancu, Katherine A. Yelick |
| 2007 | Characteristics of workloads used in high performance and technical computing. Razvan Cheveresan, Matthew Ramsay, Chris Feucht, Ilya Sharapov |
| 2007 | Compression in cache design. Ali-Reza Adl-Tabatabai, Anwar M. Ghuloum, Shobhit O. Kanaujia |
| 2007 | Cooperative cache partitioning for chip multiprocessors. Jichuan Chang, Gurindar S. Sohi |
| 2007 | Current trends in computer architectures: multi-cores, many-cores and special-cores. Avi Mendelson |
| 2007 | Executing irregular scientific applications on stream architectures. Mattan Erez, Jung Ho Ahn, Jayanth Gummaraju, Mendel Rosenblum, William J. Dally |
| 2007 | GridRod: a dynamic runtime scheduler for grid workflows. Shahaan Ayyub, David Abramson |
| 2007 | Harnessing massive parallelism in the era of parallelism for the masses. Craig B. Stunkel |
| 2007 | High performance MPI design using unreliable datagram for ultra-scale InfiniBand clusters. Matthew J. Koop, Sayantan Sur, Qi Gao, Dhabaleswar K. Panda |
| 2007 | Increasing cache capacity through word filtering. Prateek Pujara, Aneesh Aggarwal |
| 2007 | Locality of sampling and diversity in parallel system workloads. Dror G. Feitelson |
| 2007 | Modeling correlated workloads by combining model based clustering and a localized sampling algorithm. Hui Li, Michael Muskulus, Lex Wolters |
| 2007 | Optimization and bottleneck analysis of network block I/O in commodity storage systems. Manolis Marazakis, Vassilis Papaefstathiou, Angelos Bilas |
| 2007 | Optimization of data prefetch helper threads with path-expression based statistical modeling. Tor M. Aamodt, Paul Chow |
| 2007 | Performance driven data cache prefetching in a dynamic software optimization system. Jean Christophe Beyler, Philippe Clauss |
| 2007 | Proactive fault tolerance for HPC with Xen virtualization. Arun Babu Nagarajan, Frank Mueller, Christian Engelmann, Stephen L. Scott |
| 2007 | Proceedings of the 21th Annual International Conference on Supercomputing, ICS 2007, Seattle, Washington, USA, June 17-21, 2007 Burton J. Smith |
| 2007 | Representation-transparent matrix algorithms with scalable performance. Peter Gottschling, David S. Wise, Michael D. Adams |
| 2007 | Scalability analysis of SPMD codes using expectations. Cristian Coarfa, John M. Mellor-Crummey, Nathan Froyd, Yuri Dotsenko |
| 2007 | Scalability of the Nutch search engine. José E. Moreira, Maged M. Michael, Dilma Da Silva, Doron Shiloach, Parijat Dube, Li Zhang |
| 2007 | Scheduling FFT computation on SMP and multicore systems. Ayaz Ali, S. Lennart Johnsson, Jaspal Subhlok |
| 2007 | Sensitivity analysis for automatic parallelization on multi-cores. Silvius Vasile Rus, Maikel Pennings, Lawrence Rauchwerger |
| 2007 | Sequencer virtualization. Perry H. Wang, Jamison D. Collins, Gautham N. Chinya, Bernard Lint, Asit Mallick, Koichi Yamada, Hong Wang |
| 2007 | Tradeoff between data-, instruction-, and thread-level parallelism in stream processors. Jung Ho Ahn, Mattan Erez, William J. Dally |