| 1995 | A compiler algorithm that reduces read latency in ownership-based cache coherence protocols. Jonas Skeppstedt, Per Stenström |
| 1995 | A design study of the EARTH multiprocessor. Herbert H. J. Hum, Olivier Maquelin, Kevin B. Theobald, Xinmin Tian, Xinan Tang, Guang R. Gao, Phil Cupryk, Nasser Elmasri, Laurie J. Hendren, Alberto Jimenez, Shoba Krishnan, Andres Marquez, Shamir Merali, Shashank S. Nemawarkar, Prakash Panangaden, Xun Xue, Yingchun Zhu |
| 1995 | A loop parallelization technique for linear dependence vector. Teruaki Kitasuka, Kazuki Joe, Dale Schouten, Akira Fukuda, Keijiro Araki |
| 1995 | A partitioning-independent paradigm for nested data parallelism. Dean Engelhardt, Andrew L. Wendelborn |
| 1995 | A proposal of self-cleanup cache. Shin-ichiro Mori, Masahiro Goshima, Hiroshi Nakashima, Shinji Tomita |
| 1995 | A simple algorithm for the generation of efficient loop structures. Michel Cosnard, Michel Loi |
| 1995 | Allocating registers in multiple instruction-issuing processors. Christine Eisenbeis, Franco Gasperoni, Uwe Schwiegelshohn |
| 1995 | An analytical model of high performance superscalar-based multiprocessors. David H. Albonesi, Israel Koren |
| 1995 | An empirical evaluation of the Convex SPP-1000 hierarchical shared memory system. Thomas L. Sterling, Daniel Savarese, Phillip Merkey, Kevin Olson |
| 1995 | Analysis of communications and overhead reduction in multithreaded execution. Lucas Roh, Walid A. Najjar |
| 1995 | Automatic generation of loop scheduling for VLIW. Cristina Barrado, Jesús Labarta, Eduard Ayguadé, Mateo Valero |
| 1995 | CRAIG: a practical framework for combining instruction scheduling and register assignment. Thomas S. Brasier, Philip H. Sweany, Steven J. Beaty, Steve Carr |
| 1995 | Compiler techniques for data prefetching on the PowerPC. David Bernstein, Doron Cohen, Ari Freund |
| 1995 | Control of loop parallelism in multithreaded code. Bhanu Shankar, Lucas Roh, A. P. Wim Böhm, Walid A. Najjar |
| 1995 | Data flow analysis of parallel programs. Jürgen Vollmer |
| 1995 | Decomposed software pipelining with reduced register requirement. Jian Wang, Andreas Krall, M. Anton Ertl |
| 1995 | Direct-mapped versus set-associative pipelined caches. Nathalie Drach, André Seznec, Daniel Windheiser |
| 1995 | Effects of data bundling in non-strict data structures. Eunha Rho, Sang Yong Han, Heunghwan Kim, Daejoon Hwang |
| 1995 | Evaluating the impact of advanced memory systems on compiler-parallelized codes. Evan Torrie, Chau-Wen Tseng, Margaret Martonosi, Mary W. Hall |
| 1995 | From functional equations to Occam programs: systolizing compilation. Elena Trichina |
| 1995 | Handling block-cyclic distributed arrays in Vienna Fortran 90. Siegfried Benkner |
| 1995 | IPF for real-time image processing on massively parallel architectures. Y. Robin |
| 1995 | Increasing cache bandwidth using multi-port caches for exploiting ILP in non-numerical code. Soo-Mook Moon |
| 1995 | Increasing superscalar performance through multistreaming. Wayne Yamamoto, Mario Nemirovsky |
| 1995 | Mappings for communication minimization using distribution and alignment. Catherine Mongenet |
| 1995 | Multithreading with the EM-4 distributed-memory multiprocessor. Andrew Sohn, Chinhyun Kim, Mitsuhisa Sato |
| 1995 | Ordered multithreading: a novel technique for exploiting thread-level parallelism. Masato Motomura, Toshiaki Inoue, Sunao Torii, Akihiko Konagaya |
| 1995 | Performance impact of architectural features during binary to binary translation. Bryce Cogswell, Zary Segall |
| 1995 | Practical approach to single assignment code. Patricia Prather Pineo, Mary Lou Soffa |
| 1995 | Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques, PACT '95, Limassol, Cyprus, June 27-29, 1995 Lubomir Bic, Paraskevas Evripidou, A. P. Wim Böhm, Jean-Luc Gaudiot |
| 1995 | Register allocation sensitive region scheduling. Cindy Norris, Lori L. Pollock |
| 1995 | Scheduling optimization through iterative refinement. Mayez Al-Mouhamed, Adel Al-Maasarani |
| 1995 | Self-parallelization of sequential object codes. Rudolph N. Rechtschaffen, Kattamuri Ekanadham |
| 1995 | Single-program speculative multithreading (SPSM) architecture: compiler-assisted fine-grained multithreading. Pradeep K. Dubey, Kevin O'Brien, Kathryn M. O'Brien, Charles Barton |
| 1995 | The influence of branch prediction table interference on branch prediction scheme performance. Adam R. Talcott, Mario Nemirovsky, Roger C. Wood |
| 1995 | The meeting graph: a new model for loop cyclic register allocation. Christine Eisenbeis, Sylvain Lelait, Bruno Marmol |
| 1995 | Transformation of functional specifications of finite difference methods to parallel distributed codes. Kanad Roy, Carl McCrosky |
| 1995 | Translation of serial recursive codes to parallel SIMD codes. Abdou Youssef |
| 1995 | Using compilers for heterogeneous system design. Rainer Leupers, Peter Marwedel |
| 1995 | Using predicated execution to improve the performance of a dynamically scheduled machine with speculative execution. Po-Yung Chang, Eric Hao, Yale N. Patt, Pohua P. Chang |