| 1999 | A comparative analysis of four parallelisation schemes. Nandini Mukherjee, John R. Gurd |
| 1999 | A comparison of MPI, SHMEM and cache-coherent shared address space programming models on the SGI Origin2000. Hongzhang Shan, Jaswinder Pal Singh |
| 1999 | A comparison of two approaches for independent scaling up of processing and communication capacities in multicomputer networks. A. Ferre-Vilaplana, José M. Bernabéu-Aubán |
| 1999 | A design analysis of a hybrid technology multithreaded architecture for petaflops scale computation. Thomas L. Sterling, Larry A. Bergman |
| 1999 | A graphic parallelizing environment for user-compiler interaction. Claudia Roberta Calidonna, Maurizio Giordano, Mario Mango Furnari |
| 1999 | A locality sensitive multi-module cache with explicit management. F. Jesús Sánchez, Antonio González |
| 1999 | A new "quad-tree-based" sub-system allocation technique for mesh-connected parallel machines. Jeeraporn Srisawat, Nikitas A. Alexandridis |
| 1999 | A new method to make communication latency uniform: distributed routing balancing. Daniel Franco, Indhira Garcés, Emilio Luque |
| 1999 | A quantitative architectural evaluation of synchronization algorithms and disciplines on ccNUMA systems: the case of the SGI Origin2000. Dimitrios S. Nikolopoulos, Theodore S. Papatheodorou |
| 1999 | A tile selection algorithm for data locality and cache interference. Jacqueline Chame, Sungdo Moon |
| 1999 | Adapting cache line size to application behavior. Alexander V. Veidenbaum, Weiyu Tang, Rajesh K. Gupta, Alexandru Nicolau, Xiaomei Ji |
| 1999 | Adding a vector unit to a superscalar processor. Francisca Quintana, Jesús Corbal, Roger Espasa, Mateo Valero |
| 1999 | An affine partitioning algorithm to maximize parallelism and minimize communication. Amy W. Lim, Gerald I. Cheong, Monica S. Lam |
| 1999 | An experimental evaluation of tiling and shackling for memory hierarchy management. Induprakas Kodukula, Keshav Pingali, Robert Cox, Dror E. Maydan |
| 1999 | An integer linear programming approach for optimizing cache locality. Mahmut T. Kandemir, Prithviraj Banerjee, Alok N. Choudhary, J. Ramanujam, Eduard Ayguadé |
| 1999 | Application scaling under shared virtual memory on a cluster of SMPs. Dongming Jiang, Brian O'Kelley, Xiang Yu, Sanjeev Kumar, Angelos Bilas, Jaswinder Pal Singh |
| 1999 | CACHET: an adaptive cache coherence protocol for distributed shared-memory systems. Xiaowei Shen, Arvind, Larry Rudolph |
| 1999 | Classifying load and store instructions for memory renaming. Glenn Reinman, Brad Calder, Dean M. Tullsen, Gary S. Tyson, Todd M. Austin |
| 1999 | Clustered speculative multithreaded processors. Pedro Marcuello, Antonio González |
| 1999 | Communication conscious radix sort. Daniel Jiménez-González, Josep Lluís Larriba-Pey, Juan J. Navarro |
| 1999 | Comparing the memory system performance of the HP V-class and SGI Origin 2000 multiprocessors using microbenchmarks and scientific applications. Ravi R. Iyer, Nancy M. Amato, Lawrence Rauchwerger, Laxmi N. Bhuyan |
| 1999 | Cyclic dependence based data reference prediction. Chi-Hung Chi, Jun-Li Yuan, Chin-Ming Cheung |
| 1999 | Dynamic remote memory acquisition for parallel data mining on ATM-connected PC cluster. Masato Oguchi, Masaru Kitsuregawa |
| 1999 | Dynamic removal of redundant computations. Carlos Molina, Antonio González, Jordi Tubella |
| 1999 | Efficient management of memory hierarchies in embedded DRAM systems. Ashley Saulsbury, Su-Jaen Huang, Fredrik Dahlgren |
| 1999 | Eliminating synchronization bottlenecks in object-based programs using adaptive replication. Martin C. Rinard, Pedro C. Diniz |
| 1999 | Exploiting SIMD parallelism in DSP and multimedia algorithms using the AltiVec technology. Huy Nguyen, Lizy Kurian John |
| 1999 | Fast cluster failover using virtual memory-mapped communication. Yuanyuan Zhou, Peter M. Chen, Kai Li |
| 1999 | High-level semantic optimization of numerical codes. Vijay Menon, Keshav Pingali |
| 1999 | Improving memory hierarchy performance for irregular applications. John M. Mellor-Crummey, David B. Whalley, Ken Kennedy |
| 1999 | Improving the performance of bristled CC-NUMA systems using virtual channels and adaptivity. José F. Martínez, Josep Torrellas, José Duato |
| 1999 | Improving the performance of speculatively parallel applications on the Hydra CMP. Kunle Olukotun, Lance Hammond, Mark Willey |
| 1999 | Improving virtual function call target prediction via dependence-based pre-computation. Amir Roth, Andreas Moshovos, Gurindar S. Sohi |
| 1999 | Increasing effective IPC by exploiting distant parallelism. Ivan Martel, Daniel Ortega, Eduard Ayguadé, Mateo Valero |
| 1999 | Low-level router design and its impact on supercomputer system performance. Valentin Puente, José A. Gregorio, Cruz Izu, Ramón Beivide, Fernando Vallejo |
| 1999 | Mechanisms and policies for supporting fine-grained cycle stealing. Kyung Dong Ryu, Jeffrey K. Hollingsworth, Peter J. Keleher |
| 1999 | Microservers: a new memory semantics for massively parallel computing. Jay B. Brockman, Peter M. Kogge, Thomas L. Sterling, Vincent W. Freeh, Shannon K. Kuntz |
| 1999 | New shape analysis techniques for automatic parallelization of C codes. Francisco Corbera, Rafael Asenjo, Emilio L. Zapata |
| 1999 | Nonlinear array layouts for hierarchical memory systems. Siddhartha Chatterjee, Vibhor V. Jain, Alvin R. Lebeck, Shyam Mundhra, Mithuna Thottethodi |
| 1999 | On the complexity of list scheduling algorithms for distributed-memory systems. Andrei Radulescu, Arjan J. C. van Gemund |
| 1999 | Parallel I/O for scientific applications on heterogeneous clusters: a resource-utilization approach. Yong E. Cho, Marianne Winslett, Szu-Wen Kuo, Jonghyun Lee, Ying Chen |
| 1999 | Performance impact of proxies in data intensive client-server applications. Michael D. Beynon, Alan Sussman, Joel H. Saltz |
| 1999 | Problem space promotion and its evaluation as a technique for efficient parallel computation. Bradford L. Chamberlain, E. Christopher Lewis, Lawrence Snyder |
| 1999 | Proceedings of the 13th international conference on Supercomputing, ICS 1999, Rhodes, Greece, June 20-25, 1999. Theodore S. Papatheodorou, Mateo Valero, Constantine D. Polychronopoulos, Yoichi Muraoka, Jesús Labarta |
| 1999 | Realizing the performance potential of the virtual interface architecture. Evan Speight, Hazim Abdel-Shafi, John K. Bennett |
| 1999 | Reducing branch misprediction penalties via dynamic control independence detection. Yuan C. Chou, Jason Fung, John Paul Shen |
| 1999 | Reducing cache misses using hardware and software page placement. Timothy Sherwood, Brad Calder, Joel S. Emer |
| 1999 | Reorganizing global schedules for register allocation. Gang Chen, Michael D. Smith |
| 1999 | Resource usage models for instruction scheduling: two new models and a classification. V. Janaki Ramanan, Ramaswamy Govindarajan |
| 1999 | Responsiveness without interrupts. Dejan Perkovic, Peter J. Keleher |
| 1999 | SMARTS: exploiting temporal locality and parallelism through vertical execution. Suvas Vajracharya, Steve Karmesin, Peter H. Beckman, James Crotinger, Allen D. Malony, Sameer Shende, R. R. Oldehoeft, Stephen Smith |
| 1999 | Shared virtual memory with automatic update support. Liviu Iftode, Matthias A. Blumrich, Cezary Dubnicki, David L. Oppenheimer, Jaswinder Pal Singh, Kai Li |
| 1999 | Software trace cache. Alex Ramírez, Josep Lluís Larriba-Pey, Carlos Navarro, Josep Torrellas, Mateo Valero |
| 1999 | Symmetry and performance in consistency protocols. Peter J. Keleher |
| 1999 | The design and evaluation of high performance communication using a Gigabit Ethernet. Shinji Sumimoto, Hiroshi Tezuka, Atsushi Hori, Hiroshi Harada, Toshiyuki Takahashi, Yutaka Ishikawa |
| 1999 | The pool of subsectors cache design. Jeffrey B. Rothman, Alan Jay Smith |
| 1999 | The scalability of multigrain systems. Donald Yeung |
| 1999 | Thread fork/join techniques for multi-level parallelism exploitation in NUMA multiprocessors. Xavier Martorell, Eduard Ayguadé, Nacho Navarro, Julita Corbalán, Marc González, Jesús Labarta |