MICRO A*

38 papers

YearTitle / Authors
1995A limit study of local memory requirements using value reuse profiles.
Andrew S. Huang, John Paul Shen
1995A modified approach to data cache management.
Gary S. Tyson, Matthew K. Farrens, John Matthews, Andrew R. Pleszkun
1995A system level perspective on branch architecture performance.
Brad Calder, Dirk Grunwald, Joel S. Emer
1995Alternative implementations of hybrid branch predictors.
Po-Yung Chang, Eric Hao, Yale N. Patt
1995An effective programmable prefetch engine for on-chip caches.
Tien-Fu Chen
1995An experimental study of several cooperative register allocation and instruction scheduling strategies.
Cindy Norris, Lori L. Pollock
1995An investigation of the performance of various instruction-issue buffer topologies.
Stéphan Jourdan, Pascal Sainrat, Daniel Litaize
1995Cache miss heuristics and preloading techniques for general-purpose programs.
Toshihiro Ozawa, Yasunori Kimura, Shin'ichiro Nishizaki
1995Control flow prediction with tree-like subgraphs for superscalar processors.
Simonjit Dutta, Manoj Franklin
1995Critical path reduction for scalar programs.
Michael S. Schlansker, Vinod Kathail
1995Decoupling integer execution in superscalar processors.
Subbarao Palacharla, James E. Smith
1995Design of storage hierarchy in multithreaded architectures.
Lucas Roh, Walid A. Najjar
1995Disjoint eager execution: an optimal form of speculative execution.
Augustus K. Uht, Vijay Sindagi, Kelley Hall
1995Dynamic path-based branch correlation.
Ravi Nair
1995Dynamic rescheduling: a technique for object code compatibility in VLIW architectures.
Thomas M. Conte, Sumedh W. Sathaye
1995Efficient instruction scheduling using finite state automata.
Vasanth Bala, Norman Rubin
1995Exploiting short-lived variables in superscalar processors.
Luis A. Lozano, Guang R. Gao
1995Hypernode reduction modulo scheduling.
Josep Llosa, Mateo Valero, Eduard Ayguadé, Antonio González
1995Improving CISC instruction decoding performance using a fill unit.
Mark Smotherman, Manoj Franklin
1995Improving instruction-level parallelism by loop unrolling and dynamic memory disambiguation.
Jack W. Davidson, Sanjay Jinturkar
1995Modulo scheduling with multiple initiation intervals.
Nancy J. Warter-Perez, Noubar Partamian
1995Partial resolution in branch target buffers.
Barry S. Fagin, Kathryn Russell
1995Partitioned register file for TTAs.
Johan Janssen, Henk Corporaal
1995Performance issues in correlated branch prediction schemes.
Nicholas C. Gloy, Michael D. Smith, Cliff Young
1995Petri net versus modulo scheduling for software pipelining.
Vicki H. Allan, U. R. Shah, K. M. Reddy
1995Proceedings of the 28th Annual International Symposium on Microarchitecture, Ann Arbor, Michigan, USA, November 29 - December 1, 1995
Trevor N. Mudge, Kemal Ebcioglu
1995Region-based compilation: an introduction and motivation.
Richard E. Hank, Wen-mei W. Hwu, B. Ramakrishna Rau
1995Register allocation for predicated code.
Alexandre E. Eichenberger, Edward S. Davidson
1995SPAID: software prefetching in pointer- and call-intensive environments.
Mikko H. Lipasti, William J. Schmidt, Steven R. Kunkel, Robert R. Roediger
1995Self-regulation of workload in the Manchester Data-Flow computer.
John R. Gurd, David F. Snelling
1995Spill-free parallel scheduling of basic blocks.
B. Natarajan, Michael S. Schlansker
1995Stage scheduling: a technique to reduce the register requirements of a modulo schedule.
Alexandre E. Eichenberger, Edward S. Davidson
1995The M-Machine multicomputer.
Marco Fillo, Stephen W. Keckler, William J. Dally, Nicholas P. Carter, Andrew Chang, Yevgeny Gurevich, Whay Sing Lee
1995The performance impact of incomplete bypassing in processor pipelines.
Pritpal S. Ahuja, Douglas W. Clark, Anne Rogers
1995The predictability of branches in libraries.
Brad Calder, Dirk Grunwald, Amitabh Srivastava
1995The role of adaptivity in two-level adaptive branch prediction.
Stuart Sechrest, Chih-Chieh Lee, Trevor N. Mudge
1995Unrolling-based optimizations for modulo scheduling.
Daniel M. Lavery, Wen-mei W. Hwu
1995Zero-cycle loads: microarchitecture support for reducing load latency.
Todd M. Austin, Gurindar S. Sohi