| 1996 | A Cost-Comparison Approach for Adaptive Distributed Shared Memory. Jai-Hoon Kim, Nitin H. Vaidya |
| 1996 | A MATLAB to Fortran 90 Translator and Its Effectiveness. Luiz De Rose, David A. Padua |
| 1996 | A New Guaranteed Heuristic for the Software Pipelining Problem. Pierre-Yves Calland, Alain Darte, Yves Robert |
| 1996 | A Performance Study of Cosmological Simulations on Message-Passing and Shared-Memory Multiprocessors. Marios D. Dikaiakos, Joachim Stadel |
| 1996 | A Register Allocation Technique Using Guarded PDG. Akira Koseki, Hideaki Komatsu, Yoshiaki Fukazawa |
| 1996 | A Template for Non-Uniform Parallel Loops Based on Dynamic Scheduling and Prefetching Techniques. Salvatore Orlando, Raffaele Perego |
| 1996 | Amon2: A Parallel Wire Routing Algorithm on a Torus Network Parallel Computer. Hesham Keshk, Shin-ichiro Mori, Hiroshi Nakashima, Shinji Tomita |
| 1996 | An Efficient Steepest-Edge Simplex Algorithm for SIMD Computers. Michael E. Thomadakis, Jyh-Charn Liu |
| 1996 | An Interprocedural Framework for Placement of Asynchronous I/O Operations. Gagan Agrawal, Anurag Acharya, Joel H. Saltz |
| 1996 | Analysis of Local Enumeration and Storage Schemes in HPF. Henk J. Sips, Kees van Reeuwijk, Will Denissen |
| 1996 | Are There Advantages to High-Dimension Architectures? Analysis of Shantanu Dutt, Nam Trinh |
| 1996 | Automatic Optimization of Communication in Compiling Out-of-Core Stencil Codes. Rajesh Bordawekar, Alok N. Choudhary, J. Ramanujam |
| 1996 | Automatic Partitioning Techniques for Solving Partial Differential Equations on Irregular Adaptive Meshes. Jérôme Galtier |
| 1996 | Automating Parallel Runtime Optimizations Using Post-Mortem Analysis. Sanjeev Krishnan, Laxmikant V. Kalé |
| 1996 | Benchmark Tests on the Digital Equipment Corporation Alpha AXP 21164-based AlphaServer 8400, Including a Comparison of Optimized Vector and Superscalar Processing. Harvey J. Wasserman |
| 1996 | Block Algorithms for Sparse Matrix Computations on High Performance Workstations. Juan J. Navarro, Elena García-Diego, Josep Lluís Larriba-Pey, Toni Juan |
| 1996 | CTADEL: A Generator of Multi-Platform High Performance Codes for PDE-Based Scientific Applications. Robert van Engelen, Lex Wolters, Gerard Cats |
| 1996 | Compiler Support for Hybrid Irregular Accesses on Multicomputers. Antonio Lain, Prithviraj Banerjee |
| 1996 | Counting Solutions to Linear and Nonlinear Constraints Through Ehrhart Polynomials: Applications to Analyze and Transform Scientific Programs. Philippe Clauss |
| 1996 | Data Prefetching and Multilevel Blocking for Linear Algebra Operations. Juan J. Navarro, Elena García-Diego, José R. Herrero |
| 1996 | Data-Localization for Fortran Macro-Dataflow Computation Using Partial Static Task Assignment. Akimasa Yoshida, Kenichi Koshizuka, Hironori Kasahara |
| 1996 | Design and Evaluation of Dynamic Access Ordering Hardware. Sally A. McKee, Assaji Aluwihare, Benjamin H. Clark, Robert H. Klenke, Trevor C. Landon, Christopher W. Oliver, Maximo H. Salinas, Adam E. Szymkowiak, Kenneth L. Wright, William A. Wulf, James H. Aylor |
| 1996 | Detection and Global Optimization of Reduction Operations for Distributed Parallel Machines. Toshio Suganuma, Hideaki Komatsu, Toshio Nakatani |
| 1996 | Eliminating Redundant Barrier Synchronizations in Rule-Based Programs. Anurag Acharya |
| 1996 | Evaluating Virtual Channels for Cache-Coherent Shared-Memory Multiprocessors. Akhilesh Kumar, Laxmi N. Bhuyan |
| 1996 | Evaluating the Limits of Message Passing via the Shared Attraction Memory on CC-COMA Machines: Experiences with TCGMSG and PVM. Kaushik Ghosh, Stephen R. Breit |
| 1996 | Examination of a Memory Access Classification Scheme for Pointer-Intensive and Numeric Programs. Luddy Harrison |
| 1996 | Experimental Evaluation of Efficient Sparse Matrix Distributions. Manuel Ujaldon, Shamik D. Sharma, Emilio L. Zapata, Joel H. Saltz |
| 1996 | Fine Grain Parallel Communication on General Purpose LANs. Todd W. Mummert, Corey Kosak, Peter Steenkiste, Allan Fisher |
| 1996 | Hybrid Algorithms for Complete Exchange in 2D Meshes. N. S. Sundar, Doddaballapur Narasimha-Murthy Jayasimha, Dhabaleswar K. Panda, P. Sadayappan |
| 1996 | Improving Single-Process Performance with Multithreaded Processors. Alexandre Farcy, Olivier Temam |
| 1996 | Integrating Task and Data Parallelism Using Shared Objects. Saniya Ben Hassen, Henri E. Bal |
| 1996 | Mapping Performance Data for High-Level and Data Views of Parallel Program Performance. R. Bruce Irvin, Barton P. Miller |
| 1996 | Memory Organization in Multi-Channel Optical Networks: NUMA and COMA Revisited. Yan Yang Xiao, John K. Bennett |
| 1996 | Minimizing Communication While Preserving Parallelism. Wayne Kelly, William W. Pugh |
| 1996 | Optimizing Primary Data Caches for Parallel Scientific Applications: The Pool Buffer Approach. Liuxi Yang, Josep Torrellas |
| 1996 | ParInt: A Software Package for Parallel Integration. Elise de Doncker, Ajay Gupta, Jay Ball, Patricia Ealy, Alan Genz |
| 1996 | Parallel Additive Lagged Fibonacci Random Number Generators. Srinivas Aluru |
| 1996 | Parallel Construction of Multidimensional Binary Search Trees. Ibraheem Al-Furaih, Srinivas Aluru, Sanjay Goil, Sanjay Ranka |
| 1996 | Parallel Implementation of the Lanczos Method for Sparse Matrices: Analysis of Data Distributions. Ester M. Garzón, Inmaculada García |
| 1996 | Performance of the Vectorial Processor VEC-SM2 Using Serial Multiport Memory. Jacques Jorda, Abdelaziz Mzoughi, O. Lafontaine, Daniel Litaize |
| 1996 | Proceedings of the 10th international conference on Supercomputing, ICS 1996, Philadelphia, PA, USA, May 25-28, 1996 Pen-Chung Yew |
| 1996 | Profile Driven Weighted Decomposition. Karen A. Tomko, Edward S. Davidson |
| 1996 | Reducing Inter-Vector-Conflicts in Complex Memory Systems. Anna M. del Corral, José M. Llabería |
| 1996 | Run-Time Compilation for Parallel Sparse Matrix Computations. Cong Fu, Tao Yang |
| 1996 | Runtime Coupling of Data-Parallel Programs. M. Ranganathan, Anurag Acharya, Guy Edjlali, Alan Sussman, Joel H. Saltz |
| 1996 | Satisfiability Test with Synchronous Simulated Annealing on the Fujitsu AP1000 Massively-Parallel Multiprocessor. Andrew Sohn, Rupak Biswas |
| 1996 | Synchronization Hardware for Networks of Workstations: Performance vs. Cost. Rahmat S. Hyder, David A. Wood |
| 1996 | The Effect of Interrupts on Software Pipeline Execution on Message-Passing Architectures. Rob F. Van der Wijngaart, Sekhar R. Sarukkai, Pankaj Mehra |
| 1996 | The GLOW Cache Coherence Protocol Extensions for Widely Shared Data. Stefanos Kaxiras, James R. Goodman |
| 1996 | The Galley Parallel File System. Nils Nieuwejaar, David Kotz |