| 1996 | A Quantitative Analysis of Loop Nest Locality. Kathryn S. McKinley, Olivier Temam |
| 1996 | ASPLOS-VII Proceedings - Seventh International Conference on Architectural Support for Programming Languages and Operating Systems, Cambridge, Massachusetts, USA, October 1-5, 1996. Bill Dally, Susan J. Eggers |
| 1996 | Adapting to Network and Client Variability via On-Demand Dynamic Distillation. Armando Fox, Steven D. Gribble, Eric A. Brewer, Elan Amir |
| 1996 | An Evaluation of Memory Consistency Models for Shared-Memory Systems with ILP Processors. Vijay S. Pai, Parthasarathy Ranganathan, Sarita V. Adve, Tracy Harton |
| 1996 | An Integrated Compile-Time/Run-Time Software Distributed Shared Memory System. Sandhya Dwarkadas, Alan L. Cox, Willy Zwaenepoel |
| 1996 | Analysis of Branch Prediction Via Data Compression. I-Cheng K. Chen, John T. Coffey, Trevor N. Mudge |
| 1996 | Compiler-Based Prefetching for Recursive Data Structures. Chi-Keung Luk, Todd C. Mowry |
| 1996 | Compiler-Directed Page Coloring for Multiprocessors. Edouard Bugnion, Jennifer-Ann M. Anderson, Todd C. Mowry, Mendel Rosenblum, Monica S. Lam |
| 1996 | Evaluation of Architectural Support for Global Address-Based Communication in Large-Scale Parallel Machines. Arvind Krishnamurthy, Klaus E. Schauser, Chris J. Scheiman, Randolph Y. Wang, David E. Culler, Katherine A. Yelick |
| 1996 | Exploiting Dual Data-Memory Banks in Digital Signal Processors. Mazen A. R. Saghir, Paul Chow, Corinna G. Lee |
| 1996 | Hiding Communication Latency and Coherence Overhead in Software DSMs. Ricardo Bianchini, Leonidas I. Kontothanassis, Raquel Pinto, M. De Maria, M. Abud, Claudio Luis de Amorim |
| 1996 | Improving Cache Performance with Balanced Tag and Data Paths. Jih-Kwon Peir, Windsor W. Hsu, Honesty C. Young, Shauchi Ong |
| 1996 | Multiple-Block Ahead Branch Predictors. André Seznec, Stéphan Jourdan, Pascal Sainrat, Pierre Michaud |
| 1996 | Operating System Support for Improving Data Locality on CC-NUMA Compute Servers. Ben Verghese, Scott Devine, Anoop Gupta, Mendel Rosenblum |
| 1996 | Petal: Distributed Virtual Disks. Edward K. Lee, Chandramohan A. Thekkath |
| 1996 | Reducing Network Latency Using Subpages in a Global Memory Environment. Hervé A. Jamrozik, Michael J. Feeley, Geoffrey M. Voelker, James Evans II, Anna R. Karlin, Henry M. Levy, Mary K. Vernon |
| 1996 | Shasta: A Low Overhead, Software-Only Approach for Supporting Fine-Grain Shared Memory. Daniel J. Scales, Kourosh Gharachorloo, Chandramohan A. Thekkath |
| 1996 | SoftFLASH: Analyzing the Performance of Clustered Distributed Virtual Shared Memory. Andrew Erlichson, Neal Nuckolls, Greg Chesson, John L. Hennessy |
| 1996 | Synchronization and Communication in the T3E Multiprocessor. Steven L. Scott |
| 1996 | The Case for a Single-Chip Multiprocessor. Kunle Olukotun, Basem A. Nayfeh, Lance Hammond, Kenneth G. Wilson, Kunyung Chang |
| 1996 | The Intrinsic Bandwidth Requirements of Ordinary Programs. Andrew S. Huang, John Paul Shen |
| 1996 | The Rio File Cache: Surviving Operating System Crashes. Peter M. Chen, Wee Teck Ng, Subhachandra Chandra, Christopher M. Aycock, Gurushankar Rajamani, David E. Lowell |
| 1996 | The Structure and Performance of Interpreters. Theodore H. Romer, Dennis Lee, Geoffrey M. Voelker, Alec Wolman, Wayne A. Wong, Jean-Loup Baer, Brian N. Bershad, Henry M. Levy |
| 1996 | Thread Scheduling for Cache Locality. James Philbin, Jan Edler, Otto J. Anshus, Craig C. Douglas, Kai Li |
| 1996 | Value Locality and Load Value Prediction. Mikko H. Lipasti, Christopher B. Wilkerson, John Paul Shen |
| 1996 | Whole-Program Optimization for Time and Space Efficient Threads. Dirk Grunwald, Richard Neves |