| 2012 | A Parallel Implementation of Gomory-Hu's Cut Tree Algorithm. Jaime Cohen, Luiz A. Rodrigues, Elias P. Duarte Jr. |
| 2012 | ACCGen: An Automatic ArchC Compiler Generator. Rafael Auler, Paulo Centoducatte, Edson Borin |
| 2012 | An OS-Hypervisor Infrastructure for Automated OS Crash Diagnosis and Recovery in a Virtualized Environment. Joefon Jann, R. Sarma Burugula, Ching-Farn Eric Wu, Kaoutar El Maghraoui |
| 2012 | Assessing Energy Efficiency of Fault Tolerance Protocols for HPC Systems. Esteban Meneses, Osman Sarood, Laxmikant V. Kalé |
| 2012 | BTL: A Framework for Measuring and Modeling Energy in Memory Hierarchies. Ioannis Manousakis, Dimitrios S. Nikolopoulos |
| 2012 | Beyond CPU Frequency Scaling for a Fine-grained Energy Control of HPC Systems. Ghislain Landry Tsafack Chetsa, Laurent Lefèvre, Jean-Marc Pierson, Patricia Stolf, Georges Da Costa |
| 2012 | CSHARP: Coherence and SHaring Aware Cache Replacement Policies for Parallel Applications. Biswabandan Panda, Shankar Balachandran |
| 2012 | Cloud Workload Analysis with SWAT. Maurício Breternitz, Keith Lowery, Anton Charnoff, Patryk Kaminski, Leonardo Piga |
| 2012 | Compression Speed Enhancements to LZO for Multi-core Systems. Jason Kane, Qing Yang |
| 2012 | Data and Instruction Uniformity in Minimal Multi-threading. Teo Milanez, Caroline Collange, Fernando Magno Quintão Pereira, Wagner Meira Jr., Renato Ferreira |
| 2012 | Divergence Analysis with Affine Constraints. Diogo Sampaio, Rafael Martins de Souza, Caroline Collange, Fernando Magno Quintão Pereira |
| 2012 | Efficient Sorting on the Tilera Manycore Architecture. Alessandro Morari, Antonino Tumeo, Oreste Villa, Simone Secchi, Mateo Valero |
| 2012 | Efficiently Handling Memory Accesses to Improve QoS in Multicore Systems under Real-Time Constraints. José Luis March, Salvador Petit, Julio Sahuquillo, Houcine Hassan, José Duato |
| 2012 | Energy Savings via Dead Sub-Block Prediction. Marco A. Z. Alves, Khubaib, Eiman Ebrahimi, Veynu Narasiman, Carlos Villavieja, Philippe Olivier Alexandre Navaux, Yale N. Patt |
| 2012 | Energy-Performance Tradeoffs in Software Transactional Memory. Alexandro Baldassin, Joao P. L. de Carvalho, Leonardo A. G. Garcia, Rodolfo Azevedo |
| 2012 | Exploiting Concurrent GPU Operations for Efficient Work Stealing on Multi-GPUs. João V. F. Lima, Thierry Gautier, Nicolas Maillard, Vincent Danjean |
| 2012 | Exploiting Phase-Change Memory in Cooperative Caches. Luiz E. Ramos, Ricardo Bianchini |
| 2012 | FusedOS: Fusing LWK Performance with FWK Functionality in a Heterogeneous Environment. Yoonho Park, Eric Van Hensbergen, Marius Hillenbrand, Todd Inglett, Bryan S. Rosenburg, Kyung Dong Ryu, Robert W. Wisniewski |
| 2012 | Global Data Re-allocation via Communication Aggregation in Chapel. Alberto Sanz, Rafael Asenjo, Juan López, Rafael Larrosa, Angeles G. Navarro, Vassily Litvinov, Sung-Eun Choi, Bradford L. Chamberlain |
| 2012 | HAT: Heterogeneous Adaptive Throttling for On-Chip Networks. Kevin Kai-Wei Chang, Rachata Ausavarungnirun, Chris Fallin, Onur Mutlu |
| 2012 | IEEE 24th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2012, New York, NY, USA, October 24-26, 2012 Jairo Panetta, José E. Moreira, David A. Padua, Philippe O. A. Navaux |
| 2012 | Integrating Dataflow Abstractions into the Shared Memory Model. Vladimir Gajinov, Srdjan Stipic, Osman S. Unsal, Tim Harris, Eduard Ayguadé, Adrián Cristal |
| 2012 | Level-3 BLAS on the TI C6678 Multi-core DSP. Murtaza Ali, Eric Stotzer, Francisco D. Igual, Robert A. van de Geijn |
| 2012 | Low Overhead Instruction-Cache Modeling Using Instruction Reuse Profiles. Muneeb Khan, Andreas Sembrant, Erik Hagersten |
| 2012 | Network Endpoints for Clusters of SMPs. Gabriel Ilie Tanase, Gheorghe Almási, Hanhong Xue, Charles Archer |
| 2012 | On the Efficiency of Register File versus Broadcast Interconnect for Collective Communications in Data-Parallel Hardware Accelerators. Ardavan Pedram, Andreas Gerstlauer, Robert A. van de Geijn |
| 2012 | Parallel Exact Inference on Multicore Using MapReduce. Nam Ma, Yinglong Xia, Viktor K. Prasanna |
| 2012 | Parallelizing Information Set Generation for Game Tree Search Applications. Mark Richards, Abhishek Gupta, Osman Sarood, Laxmikant V. Kalé |
| 2012 | Runtime Procedure for Energy Savings in Applications with Point-to-Point Communications. Vaibhav Sundriyal, Masha Sosonkina, Alexander Gaenko |
| 2012 | Scalable Algorithms for Distributed-Memory Adaptive Mesh Refinement. Akhil Langer, Jonathan Lifflander, Phil Miller, Kuo-Chuan Pan, Laxmikant V. Kalé, Paul M. Ricker |
| 2012 | Scalable Thread Scheduling in Asymmetric Multicores for Power Efficiency. Rance Rodrigues, Arunachalam Annamalai, Israel Koren, Sandip Kundu |
| 2012 | Scalable Triadic Analysis of Large-Scale Graphs: Multi-core vs. Multi-processor vs. Multi-threaded Shared Memory Architectures. George Chin Jr., Andrés Márquez, Sutanay Choudhury, John Feo |
| 2012 | Sparse Fast Fourier Transform on GPUs and Multi-core CPUs. Jiaxi Hu, Zhaosen Wang, Qiyuan Qiu, Weijun Xiao, David J. Lilja |
| 2012 | The Network Adapter: The Missing Link between MPI Applications and Network Performance. Germán Rodríguez, Cyriel Minkenberg, Ronald P. Luijten, Ramón Beivide, Patrick Geoffray, Jesús Labarta, Mateo Valero, Steve Poole |
| 2012 | Transactional Forwarding: Supporting Highly-Concurrent STM in Asynchronous Distributed Systems. Mohamed M. Saad, Binoy Ravindran |
| 2012 | Using Heterogeneous Networks to Improve Energy Efficiency in Direct Coherence Protocols for Many-Core CMPs. Alberto Ros, Ricardo Fernández-Pascual, Manuel E. Acacio |
| 2012 | VPC: Scalable, Low Downtime Checkpointing for Virtual Clusters. Peng Lu, Binoy Ravindran, Changsoo Kim |