ICS A

32 papers

YearTitle / Authors
2007A low-cost mixed-mode parallel processor architecture for embedded systems.
Shorin Kyo, Takuya Koga, Hanno Lieske, Shouhei Nomoto, Shin'ichiro Okazaki
2007A study of process arrival patterns for MPI collective operations.
Ahmad Faraj, Pitch Patarasuk, Xin Yuan
2007A symmetric transformation for 3-body potential molecular dynamics using force-decomposition in a heterogeneous distributed environment.
J. V. Sumanth, David R. Swanson, Hong Jiang
2007Active memory operations.
Zhen Fang, Lixin Zhang, John B. Carter, Ali Ibrahim, Michael A. Parker
2007Adaptive Strassen's matrix multiplication.
Paolo D'Alberto, Alexandru Nicolau
2007Adaptive performance control for distributed scientific coupled models.
Mohamed Khamiss Hussein, Kenneth R. Mayes, Mikel Luján, John R. Gurd
2007An L2-miss-driven early register deallocation for SMT processors.
Joseph J. Sharkey, Dmitry V. Ponomarev
2007An operation stacking framework for large ensemble computations.
Mehmet Belgin, Calvin J. Ribbens, Godmar Back
2007Automatic nonblocking communication for partitioned global address space programs.
Wei-Yu Chen, Dan Bonachea, Costin Iancu, Katherine A. Yelick
2007Characteristics of workloads used in high performance and technical computing.
Razvan Cheveresan, Matthew Ramsay, Chris Feucht, Ilya Sharapov
2007Compression in cache design.
Ali-Reza Adl-Tabatabai, Anwar M. Ghuloum, Shobhit O. Kanaujia
2007Cooperative cache partitioning for chip multiprocessors.
Jichuan Chang, Gurindar S. Sohi
2007Current trends in computer architectures: multi-cores, many-cores and special-cores.
Avi Mendelson
2007Executing irregular scientific applications on stream architectures.
Mattan Erez, Jung Ho Ahn, Jayanth Gummaraju, Mendel Rosenblum, William J. Dally
2007GridRod: a dynamic runtime scheduler for grid workflows.
Shahaan Ayyub, David Abramson
2007Harnessing massive parallelism in the era of parallelism for the masses.
Craig B. Stunkel
2007High performance MPI design using unreliable datagram for ultra-scale InfiniBand clusters.
Matthew J. Koop, Sayantan Sur, Qi Gao, Dhabaleswar K. Panda
2007Increasing cache capacity through word filtering.
Prateek Pujara, Aneesh Aggarwal
2007Locality of sampling and diversity in parallel system workloads.
Dror G. Feitelson
2007Modeling correlated workloads by combining model based clustering and a localized sampling algorithm.
Hui Li, Michael Muskulus, Lex Wolters
2007Optimization and bottleneck analysis of network block I/O in commodity storage systems.
Manolis Marazakis, Vassilis Papaefstathiou, Angelos Bilas
2007Optimization of data prefetch helper threads with path-expression based statistical modeling.
Tor M. Aamodt, Paul Chow
2007Performance driven data cache prefetching in a dynamic software optimization system.
Jean Christophe Beyler, Philippe Clauss
2007Proactive fault tolerance for HPC with Xen virtualization.
Arun Babu Nagarajan, Frank Mueller, Christian Engelmann, Stephen L. Scott
2007Proceedings of the 21th Annual International Conference on Supercomputing, ICS 2007, Seattle, Washington, USA, June 17-21, 2007
Burton J. Smith
2007Representation-transparent matrix algorithms with scalable performance.
Peter Gottschling, David S. Wise, Michael D. Adams
2007Scalability analysis of SPMD codes using expectations.
Cristian Coarfa, John M. Mellor-Crummey, Nathan Froyd, Yuri Dotsenko
2007Scalability of the Nutch search engine.
José E. Moreira, Maged M. Michael, Dilma Da Silva, Doron Shiloach, Parijat Dube, Li Zhang
2007Scheduling FFT computation on SMP and multicore systems.
Ayaz Ali, S. Lennart Johnsson, Jaspal Subhlok
2007Sensitivity analysis for automatic parallelization on multi-cores.
Silvius Vasile Rus, Maikel Pennings, Lawrence Rauchwerger
2007Sequencer virtualization.
Perry H. Wang, Jamison D. Collins, Gautham N. Chinya, Bernard Lint, Asit Mallick, Koichi Yamada, Hong Wang
2007Tradeoff between data-, instruction-, and thread-level parallelism in stream processors.
Jung Ho Ahn, Mattan Erez, William J. Dally