PACT B

49 papers

Year  Title / Authors
2013  A unified view of non-monotonic core selection and application steering in heterogeneous chip multiprocessors.
Sandeep Navada, Niket K. Choudhary, Salil V. Wadhavkar, Eric Rotenberg
2013  APOGEE: Adaptive prefetching on GPUs for energy efficiency.
Ankit Sethia, Ganesh S. Dasika, Mehrzad Samadi, Scott A. Mahlke
2013  An empirical model for predicting cross-core performance interference on multicore processors.
Jiacheng Zhao, Xiaobing Feng, Huimin Cui, Youliang Yan, Jingling Xue, Wensen Yang
2013  An opportunistic prediction-based thread scheduling to maximize throughput/watt in AMPs.
Arunachalam Annamalai, Rance Rodrigues, Israel Koren, Sandip Kundu
2013  Automatic OpenCL work-group size selection for multicore CPUs.
Sangmin Seo, Jun Lee, Gangwon Jo, Jaejin Lee
2013  Automatic vectorization of tree traversals.
Youngjoon Jo, Michael Goldfarb, Milind Kulkarni
2013  Breaking SIMD shackles with an exposed flexible microarchitecture and the access execute PDG.
Venkatraman Govindaraju, Tony Nowatzki, Karthikeyan Sankaralingam
2013  Building expressive, area-efficient coherence directories.
Lei Fang, Peng Liu, Qi Hu, Michael C. Huang, Guofan Jiang
2013  Can lock-free and combining techniques co-exist? A novel approach on concurrent queue.
Changwoo Min, Young Ik Eom
2013  Concurrent predicates: A debugging technique for every parallel programmer.
Justin Emile Gottschlich, Gilles Pokam, Cristiano Pereira, Youfeng Wu
2013  Coordinated power-performance optimization in manycores.
Hiroshi Sasaki, Satoshi Imamura, Koji Inoue
2013  DANBI: Dynamic scheduling of irregular stream programs for many-core systems.
Changwoo Min, Young Ik Eom
2013  Do inputs matter? using data-dependence profiling to evaluate thread level speculation in BG/Q.
Arnamoy Bhattacharyya
2013  Dynamic memory access monitoring based on tagged memory.
Mikhail A. Gorelov, Lev Mukhanov
2013  Exploring hybrid memory for GPU energy efficiency through software-hardware co-design.
Bin Wang, Bo Wu, Dong Li, Xipeng Shen, Weikuan Yu, Yizheng Jiao, Jeffrey S. Vetter
2013  Exposing ILP in custom hardware with a dataflow compiler IR.
Ali Mustafa Zaidi
2013  Fairness-aware scheduling on single-ISA heterogeneous multi-cores.
Kenzo Van Craeynest, Shoaib Akram, Wim Heirman, Aamer Jaleel, Lieven Eeckhout
2013  General chairs' welcome message.
Michael F. P. O'Boyle, Christian Fensch
2013  Generating efficient data movement code for heterogeneous architectures with distributed-memory.
Roshan Dathathri, Chandan Reddy, Thejas Ramashekar, Uday Bondhugula
2013  INSPIRE: The insieme parallel intermediate representation.
Herbert Jordan, Simone Pellegrini, Peter Thoman, Klaus Kofler, Thomas Fahringer
2013  Interprocedural strength reduction of critical sections in explicitly-parallel programs.
Rajkishore Barik, Jisheng Zhao, Vivek Sarkar
2013  Jigsaw: Scalable software-defined caches.
Nathan Beckmann, Daniel Sánchez
2013  Keynote talk: A comprehensive approach to HW/SW codesign.
David J. Kuck
2013  Keynote talk: Parallel programming for mobile computing.
Calin Cascaval
2013  Keynote talk: Towards automatic resource management in parallel architectures.
Per Stenström
2013  L1-bandwidth aware thread allocation in multicore SMT processors.
Josué Feliu, Julio Sahuquillo, Salvador Petit, José Duato
2013  Managing shared last-level cache in a heterogeneous multicore processor.
Vineeth Mekkat, Anup Holey, Pen-Chung Yew, Antonia Zhai
2013  McRouter: Multicast within a router for high performance network-on-chips.
Yuan He, Hiroshi Sasaki, Shinobu Miwa, Hiroshi Nakamura
2013  Meeting midway: Improving CMP performance with memory-side prefetching.
Praveen Yedlapalli, Jagadish Kotra, Emre Kultursay, Mahmut T. Kandemir, Chita R. Das, Anand Sivasubramaniam
2013  Memory-centric system interconnect design with Hybrid Memory Cubes.
Gwangsun Kim, John Kim, Jung Ho Ahn, Jaeha Kim
2013  Message from the program chairs.
André Seznec, François Bodin
2013  Neither more nor less: Optimizing thread-level parallelism for GPGPUs.
Onur Kayiran, Adwait Jog, Mahmut T. Kandemir, Chita R. Das
2013  PS-cache: An energy-efficient cache design for chip multiprocessors.
Joan J. Valls, Alberto Ros, Julio Sahuquillo, María Engracia Gómez
2013  Parallel flow-sensitive pointer analysis by graph-rewriting.
Vaivaswatha Nagaraj, R. Govindarajan
2013  Parallel frame rendering: Trading responsiveness for energy on a mobile GPU.
José-María Arnau, Joan-Manuel Parcerisa, Polychronis Xekalakis
2013  Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques, Edinburgh, United Kingdom, September 7-11, 2013
Christian Fensch, Michael F. P. O'Boyle, André Seznec, François Bodin
2013  RSVM: A Region-based Software Virtual Memory for GPU.
Feng Ji, Heshan Lin, Xiaosong Ma
2013  Reshaping cache misses to improve row-buffer locality in multicore systems.
Wei Ding, Jun Liu, Mahmut T. Kandemir, Mary Jane Irwin
2013  S-CAVE: Effective SSD caching to improve virtual machine storage performance.
Tian Luo, Siyuan Ma, Rubao Lee, Xiaodong Zhang, Deng Liu, Li Zhou
2013  SMT-centric power-aware thread placement in chip multiprocessors.
Augusto Vega, Alper Buyuktosunoglu, Pradip Bose
2013  Starchart: Hardware and software optimization using recursive partitioning regression trees.
Wenhao Jia, Kelly A. Shaw, Margaret Martonosi
2013  TCPT - Thread criticality-driven prefetcher throttling.
Biswabandan Panda, Shankar Balachandran
2013  Task sampling: Computer architecture simulation in the many-core era.
Thomas Grass
2013  The case for a scalable coherence protocol for complex on-chip cache hierarchies in many-core systems.
Lucia G. Menezo, Valentin Puente, José-Ángel Gregorio
2013  ThermOS: System support for dynamic thermal management of chip multi-processors.
Filippo Sironi, Martina Maggio, Riccardo Cattaneo, Giovanni F. Del Nero, Donatella Sciuto, Marco D. Santambrogio
2013  Traffic steering between a low-latency unswitched TL ring and a high-throughput switched on-chip interconnect.
Jungju Oh, Alenka G. Zajic, Milos Prvulovic
2013  Transparent CPU-GPU collaboration for data-parallel kernels on heterogeneous systems.
Janghaeng Lee, Mehrzad Samadi, Yongjun Park, Scott A. Mahlke
2013  Vectorization past dependent branches through speculation.
Majedul Haque Sujon, R. Clint Whaley, Qing Yi
2013  Writeback-aware bandwidth partitioning for multi-core systems with PCM.
Miao Zhou, Yu Du, Bruce R. Childers, Rami G. Melhem, Daniel Mossé