ICS A

53 papers

Year Title / Authors
2011A QHD-capable parallel H.264 decoder.
Chi Ching Chi, Ben H. H. Juurlink
2011A composite and scalable cache coherence protocol for large scale CMPs.
Yi Xu, Yu Du, Youtao Zhang, Jun Yang
2011Active pebbles: parallel programming for data-driven applications.
Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine
2011An execution strategy and optimized runtime support for parallelizing irregular reductions on modern GPUs.
Xin Huo, Vignesh T. Ravi, Wenjing Ma, Gagan Agrawal
2011An idiom-finding tool for increasing productivity of accelerators.
Laura Carrington, Mustafa M. Tikir, Catherine Olschanowsky, Michael Laurenzano, Joshua Peraza, Allan Snavely, Stephen Poole
2011Automatic SIMD vectorization of fast fourier transforms for the larrabee and AVX instruction sets.
Daniel S. McFarlin, Volodymyr Arbatov, Franz Franchetti, Markus Püschel
2011Automatic generation of executable communication specifications from parallel applications.
Xing Wu, Frank Mueller, Scott Pakin
2011Automating GPU computing in MATLAB.
Chun-Yu Shei, Pushkar Ratnalikar, Arun Chauhan
2011Challenges and opportunities in renewable energy and energy efficiency.
Steven W. Hammond
2011Characterizing the impact of soft errors on iterative methods in scientific computing.
Manu Shantharam, Sowmyalatha Srinivasmurthy, Padma Raghavan
2011Controlling cache utilization of HPC applications.
Swann Perarnau, Marc Tchiboukdjian, Guillaume Huard
2011Coordinating processor and main memory for efficient server power control.
Ming Chen, Xiaorui Wang, Xue Li
2011Cosmic microwave background map-making at the petascale and beyond.
Rajesh Sudarsan, Julian Borrill, Christopher Cantalupo, Theodore Kisner, Kamesh Madduri, Leonid Oliker, Yili Zheng, Horst D. Simon
2011Cost-effectively offering private buffers in SoCs and CMPs.
Zhen Fang, Li Zhao, Ravishankar R. Iyer, Carlos Flores Fajardo, German Fabila Garcia, Seung Eun Lee, Bin Li, Steve R. King, Xiaowei Jiang, Srihari Makineni
2011F
Jin Ouyang, Chuan Yang, Dimin Niu, Yuan Xie, Zhiwen Liu
2011Generic topology mapping strategies for large-scale parallel architectures.
Torsten Hoefler, Marc Snir
2011High performance linpack benchmark: a fault tolerant implementation without checkpointing.
Teresa Davies, Christer Karlsson, Hui Liu, Chong Ding, Zizhong Chen
2011Hystor: making the best use of solid state drives in high performance storage systems.
Feng Chen, David A. Koufaty, Xiaodong Zhang
2011Karma: scalable deterministic record-replay.
Arkaprava Basu, Jayaram Bobba, Mark D. Hill
2011MDR: performance model driven runtime for heterogeneous parallel platforms.
Jacques A. Pienaar, Anand Raghunathan, Srimat T. Chakradhar
2011MP-PIPE: a massively parallel protein-protein interaction prediction engine.
Andrew Schoenrock, Frank K. H. A. Dehne, James R. Green, Ashkan Golshani, Sylvain Pitre
2011Mint: realizing CUDA performance in 3D stencil methods with annotated C.
Didem Unat, Xing Cai, Scott B. Baden
2011Modeling the performance of an algebraic multigrid cycle on HPC platforms.
Hormozd Gahvari, Allison H. Baker, Martin Schulz, Ulrike Meier Yang, Kirk E. Jordan, William Gropp
2011Multiset signatures for transactional memory.
Ricardo Quislant, Eladio Gutiérrez, Oscar G. Plata, Emilio L. Zapata
2011Optimizing the datacenter for data-centric workloads.
Stijn Polfliet, Frederick Ryckbosch, Lieven Eeckhout
2011Optimizing throughput/power trade-offs in hardware transactional memory using DVFS and intelligent scheduling.
Clay Hughes, Tao Li
2011Page placement in hybrid memory systems.
Luiz E. Ramos, Eugene Gorbatov, Ricardo Bianchini
2011Performance impact and interplay of SSD parallelism through advanced commands, allocation strategy and data granularity.
Yang Hu, Hong Jiang, Dan Feng, Lei Tian, Hao Luo, Shu Ping Zhang
2011Performance modeling as the key to extreme scale computing.
William D. Gropp
2011Poster: DVFS management in real-processors.
Vasileios Spiliopoulos, Georgios Keramidas, Stefanos Kaxiras, Konstantinos Efstathiou
2011Poster: implications of merging phases on scalability of multi-core architectures.
Madhavan Manivannan, Ben H. H. Juurlink, Per Stenström
2011Poster: programming clusters of GPUs with OMPSs.
Javier Bueno, Alejandro Duran, Xavier Martorell, Eduard Ayguadé, Rosa M. Badia, Jesús Labarta
2011Poster: revisiting virtual channel memory for performance and fairness on multi-core architecture.
Licheng Chen, Yongbing Huang, Yungang Bao, Onur Mutlu, Guangming Tan, Mingyu Chen
2011Predictive coordination of multiple on-chip resources for chip multiprocessors.
Jian Chen, Lizy Kurian John
2011Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31 - June 04, 2011
David K. Lowenthal, Bronis R. de Supinski, Sally A. McKee
2011Processing data streams with hard real-time constraints on heterogeneous systems.
Uri Verner, Assaf Schuster, Mark Silberstein
2011Rethinking shared-memory languages and hardware.
Sarita V. Adve
2011SRC: Damaris - using dedicated i/o cores for scalable post-petascale HPC simulations.
Matthieu Dorier
2011SRC: FenixOS - a research operating system focused on high scalability and reliability.
Stavros Passas, Sven Karlsson
2011SRC: OpenSHMEM library development.
Swaroop Suhas Pophale
2011SRC: an automatic code overlaying technique for multicores with explicitly-managed memory hierarchies.
Choonki Jang
2011SRC: automatic extraction of SST/macro skeleton models.
Amruth Rudraiah Dakshinamurthy
2011SRC: enabling petascale data analysis for scientific applications through data reorganization.
Yuan Tian
2011SRC: facilitating efficient parallelization of information storage and retrieval on large data sets.
Steven Feldman
2011SRC: information retrieval as a persistent parallel service on supercomputer infrastructure.
Tobias Berka, Marián Vajtersic
2011SRC: soft error detection and recovery for high performance linpack.
Teresa Davies, Zizhong Chen
2011SRC: virtual i/o caching: dynamic storage cache management for concurrent workloads.
Michael R. Frasca, Ramya Prabhakar
2011Scalable fine-grained call path tracing.
Nathan R. Tallent, John M. Mellor-Crummey, Michael Franco, Reed Landrum, Laksono Adhianto
2011SecureME: a hardware-software approach to full system security.
Siddhartha Chhabra, Brian Rogers, Yan Solihin, Milos Prvulovic
2011The elephant and the mice: the role of non-strict fine-grain synchronization for modern many-core architectures.
Juergen Ributzka, Yuhei Hayashi, Joseph B. Manzano, Guang R. Gao
2011Transactional conflict decoupling and value prediction.
Fuad Tabba, Andrew W. Hay, James R. Goodman
2011Using GPUs to compute large out-of-card FFTs.
Liang Gu, Jakob Siegel, Xiaoming Li
2011ZEBRA: a data-centric, hybrid-policy hardware transactional memory design.
J. Rubén Titos Gil, Anurag Negi, Manuel E. Acacio, José M. García, Per Stenström