| 2015 | A fast and accurate analytical technique to compute the AVF of sequential bits in a processor. Steven Raasch, Arijit Biswas, Jon Stephan, Paul Racunas, Joel S. Emer |
| 2015 | A scalable architecture for ordered parallelism. Mark C. Jeffrey, Suvinay Subramanian, Cong Yan, Joel S. Emer, Daniel Sánchez |
| 2015 | Adaptive guardband scheduling to improve system-level efficiency of the POWER7+. Yazhou Zu, Charles R. Lefurgy, Jingwen Leng, Matthew Halpern, Michael S. Floyd, Vijay Janapa Reddi |
| 2015 | An integrated concurrency and core-ISA architectural envelope definition, and test oracle, for IBM POWER multiprocessors. Kathryn E. Gray, Gabriel Kerneis, Dominic P. Mulligan, Christopher Pulte, Susmit Sarkar, Peter Sewell |
| 2015 | Architecture-aware automatic computation offload for native applications. Gwangmu Lee, Hyunjoon Park, Seonyeong Heo, Kyung-Ah Chang, Hyogun Lee, Hanjun Kim |
| 2015 | Authenticache: harnessing cache ECC for system authentication. Anys Bacha, Radu Teodorescu |
| 2015 | Avoiding information leakage in the memory controller with fixed service policies. Ali Shafiee, Akhila Gundu, Manjunath Shevgoor, Rajeev Balasubramonian, Mohit Tiwari |
| 2015 | Border control: sandboxing accelerators. Lena E. Olson, Jason Power, Mark D. Hill, David A. Wood |
| 2015 | Bungee jumps: accelerating indirect branches through HW/SW co-design. Daniel S. McFarlin, Craig B. Zilles |
| 2015 | CCICheck: using µhb graphs to verify the coherence-consistency interface. Yatin A. Manerkar, Daniel Lustig, Michael Pellauer, Margaret Martonosi |
| 2015 | CLEAN-ECC: high reliability ECC for adaptive granularity memory system. Seong-Lyong Gong, Minsoo Rhu, Jungrae Kim, Jinsuk Chung, Mattan Erez |
| 2015 | Characterizing, modeling, and improving the QoE of mobile devices with low battery level. Kaige Yan, Xingyao Zhang, Xin Fu |
| 2015 | Coherence domain restriction on large scale systems. Yaosheng Fu, Tri Minh Nguyen, David Wentzlaff |
| 2015 | Confluence: unified instruction supply for scale-out servers. Cansu Kaynak, Boris Grot, Babak Falsafi |
| 2015 | Control flow coalescing on a hybrid dataflow/von Neumann GPGPU. Dani Voitsechov, Yoav Etsion |
| 2015 | Cross-architecture performance prediction (XAPP) using CPU code to predict GPU performance. Newsha Ardalani, Clint Lestourgeon, Karthikeyan Sankaralingam, Xiaojin Zhu |
| 2015 | DCS: a fast and scalable device-centric server architecture. Jaehyung Ahn, Dongup Kwon, Youngsok Kim, Mohammadamin Ajdari, Jaewon Lee, Jangwoo Kim |
| 2015 | DeSC: decoupled supply-compute communication management for heterogeneous architectures. Tae Jun Ham, Juan L. Aragón, Margaret Martonosi |
| 2015 | Doppelgänger: a cache for approximate computing. Joshua San Miguel, Jorge Albericio, Andreas Moshovos, Natalie D. Enright Jerger |
| 2015 | DynaMOS: dynamic schedule migration for heterogeneous cores. Shruti Padmanabha, Andrew Lukefahr, Reetuparna Das, Scott A. Mahlke |
| 2015 | Efficient GPU synchronization without scopes: saying no to complex consistency models. Matthew D. Sinclair, Johnathan Alsop, Sarita V. Adve |
| 2015 | Efficient persist barriers for multicores. Arpit Joshi, Vijay Nagarajan, Marcelo Cintra, Stratis Viglas |
| 2015 | Efficient warp execution in presence of divergence with collaborative context collection. Farzad Khorasani, Rajiv Gupta, Laxmi N. Bhuyan |
| 2015 | Efficiently enforcing strong memory ordering in GPUs. Abhayendra Singh, Shaizeen Aga, Satish Narayanasamy |
| 2015 | Efficiently prefetching complex address patterns. Manjunath Shevgoor, Sahil Koladiya, Rajeev Balasubramonian, Chris Wilkerson, Seth H. Pugsley, Zeshan Chishti |
| 2015 | Enabling coordinated register allocation and thread-level parallelism optimization for GPUs. Xiaolong Xie, Yun Liang, Xiuhong Li, Yudong Wu, Guangyu Sun, Tao Wang, Dongrui Fan |
| 2015 | Enabling interposer-based disintegration of multi-core processors. Ajaykumar Kannan, Natalie D. Enright Jerger, Gabriel H. Loh |
| 2015 | Enabling portable energy efficiency with memory accelerated library. Qi Guo, Tze Meng Low, Nikolaos Alachiotis, Berkin Akin, Larry T. Pileggi, James C. Hoe, Franz Franchetti |
| 2015 | Execution time prediction for energy-efficient hardware accelerators. Tao Chen, Alexander Rucker, G. Edward Suh |
| 2015 | Exploiting commutativity to reduce the cost of updates to shared data in cache-coherent systems. Guowei Zhang, Webb Horn, Daniel Sánchez |
| 2015 | Fast support for unstructured data processing: the unified automata processor. Yuanwei Fang, Tung Thanh Hoang, Michela Becchi, Andrew A. Chien |
| 2015 | Filtered runahead execution with a runahead buffer. Milad Hashemi, Yale N. Patt |
| 2015 | Fork path: improving efficiency of ORAM by removing redundant memory accesses. Xian Zhang, Guangyu Sun, Chao Zhang, Weiqi Zhang, Yun Liang, Tao Wang, Yiran Chen, Jia Di |
| 2015 | Free launch: optimizing GPU dynamic kernel launches through thread reuse. Guoyang Chen, Xipeng Shen |
| 2015 | GPU register file virtualization. Hyeran Jeon, Gokul Subramanian Ravi, Nam Sung Kim, Murali Annavaram |
| 2015 | Gather-scatter DRAM: in-DRAM address translation to improve the spatial locality of non-unit strided accesses. Vivek Seshadri, Thomas Mullins, Amirali Boroumand, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry |
| 2015 | HyComp: a hybrid cache compression method for selection of data-type-specific compression methods. Angelos Arelakis, Fredrik Dahlgren, Per Stenström |
| 2015 | IMP: indirect memory prefetcher. Xiangyao Yu, Christopher J. Hughes, Nadathur Satish, Srinivas Devadas |
| 2015 | Improving DRAM latency with dynamic asymmetric subarray. Shih-Lien Lu, Ying-Chen Lin, Chia-Lin Yang |
| 2015 | Large pages and lightweight memory management in virtualized environments: can you have it both ways? Binh Pham, Ján Veselý, Gabriel H. Loh, Abhishek Bhattacharjee |
| 2015 | Locking down insecure indirection with hardware-based control-data isolation. William Arthur, Sahil Madeka, Reetuparna Das, Todd M. Austin |
| 2015 | Long term parking (LTP): criticality-aware resource allocation in OOO processors. Andreas Sembrant, Trevor E. Carlson, Erik Hagersten, David Black-Schaffer, Arthur Perais, André Seznec, Pierre Michaud |
| 2015 | MORC: a manycore-oriented compressed cache. Tri Minh Nguyen, David Wentzlaff |
| 2015 | Microarchitectural implications of event-driven server-side web applications. Yuhao Zhu, Daniel Richins, Matthew Halpern, Vijay Janapa Reddi |
| 2015 | Modeling the implications of DRAM failures and protection techniques on datacenter TCO. Panagiota Nikolaou, Yiannakis Sazeides, Lorena Ndreu, Marios Kleanthous |
| 2015 | More is less: improving the energy efficiency of data movement via opportunistic use of sparse codes. Yanwei Song, Engin Ipek |
| 2015 | Neural acceleration for GPU throughput processors. Amir Yazdanbakhsh, Jongse Park, Hardik Sharma, Pejman Lotfi-Kamran, Hadi Esmaeilzadeh |
| 2015 | Neuromorphic accelerators: a comparison between neuroscience and machine-learning approaches. Zidong Du, Daniel D. Ben-Dayan Rubin, Yunji Chen, Liqiang He, Tianshi Chen, Lei Zhang, Chengyong Wu, Olivier Temam |
| 2015 | Prediction-guided performance-energy trade-off for interactive applications. Daniel Lo, Taejoon Song, G. Edward Suh |
| 2015 | Proceedings of the 48th International Symposium on Microarchitecture, MICRO 2015, Waikiki, HI, USA, December 5-9, 2015. Milos Prvulovic |
| 2015 | Rubik: fast analytical power management for latency-critical systems. Harshad Kasture, Davide B. Bartolini, Nathan Beckmann, Daniel Sánchez |
| 2015 | SAWS: synchronization aware GPGPU warp scheduling for multiple independent warp schedulers. Jiwei Liu, Jun Yang, Rami G. Melhem |
| 2015 | Safe limits on voltage reduction efficiency in GPUs: a direct measurement approach. Jingwen Leng, Alper Buyuktosunoglu, Ramon Bertran, Pradip Bose, Vijay Janapa Reddi |
| 2015 | Self-contained, accurate precomputation prefetching. Islam Atta, Xin Tong, Vijayalakshmi Srinivasan, Ioana Baldini, Andreas Moshovos |
| 2015 | The CRISP performance model for dynamic voltage and frequency scaling in a GPGPU. Rajib Nath, Dean M. Tullsen |
| 2015 | The application slowdown model: quantifying and controlling the impact of inter-application interference at shared caches and main memory. Lavanya Subramanian, Vivek Seshadri, Arnab Ghosh, Samira Manabi Khan, Onur Mutlu |
| 2015 | The inner most loop iteration counter: a new dimension in branch history. André Seznec, Joshua San Miguel, Jorge Albericio |
| 2015 | ThyNVM: enabling software-transparent crash consistency in persistent memory systems. Jinglei Ren, Jishen Zhao, Samira Manabi Khan, Jongmoo Choi, Yongwei Wu, Onur Mutlu |
| 2015 | TimeTrader: exploiting latency tail to save datacenter energy for online search. Balajee Vamanan, Hamza Bin Sohail, Jahangir Hasan, T. N. Vijaykumar |
| 2015 | Ultra-low power render-based collision detection for CPU/GPU systems. Enrique de Lucas, Pedro Marcuello, Joan-Manuel Parcerisa, Antonio González |
| 2015 | WarpPool: sharing requests with inter-warp coalescing for throughput processors. John Kloosterman, Jonathan Beaumont, Mick Wollman, Ankit Sethia, Ronald G. Dreslinski, Trevor N. Mudge, Scott A. Mahlke |
| 2015 | vCache: architectural support for transparent and isolated virtual LLCs in virtualized environments. Daehoon Kim, Hwanju Kim, Nam Sung Kim, Jaehyuk Huh |