MICRO A*

62 papers

Year Title / Authors
2015 A fast and accurate analytical technique to compute the AVF of sequential bits in a processor.
Steven Raasch, Arijit Biswas, Jon Stephan, Paul Racunas, Joel S. Emer
2015 A scalable architecture for ordered parallelism.
Mark C. Jeffrey, Suvinay Subramanian, Cong Yan, Joel S. Emer, Daniel Sánchez
2015 Adaptive guardband scheduling to improve system-level efficiency of the POWER7+.
Yazhou Zu, Charles R. Lefurgy, Jingwen Leng, Matthew Halpern, Michael S. Floyd, Vijay Janapa Reddi
2015 An integrated concurrency and core-ISA architectural envelope definition, and test oracle, for IBM POWER multiprocessors.
Kathryn E. Gray, Gabriel Kerneis, Dominic P. Mulligan, Christopher Pulte, Susmit Sarkar, Peter Sewell
2015 Architecture-aware automatic computation offload for native applications.
Gwangmu Lee, Hyunjoon Park, Seonyeong Heo, Kyung-Ah Chang, Hyogun Lee, Hanjun Kim
2015 Authenticache: harnessing cache ECC for system authentication.
Anys Bacha, Radu Teodorescu
2015 Avoiding information leakage in the memory controller with fixed service policies.
Ali Shafiee, Akhila Gundu, Manjunath Shevgoor, Rajeev Balasubramonian, Mohit Tiwari
2015 Border control: sandboxing accelerators.
Lena E. Olson, Jason Power, Mark D. Hill, David A. Wood
2015 Bungee jumps: accelerating indirect branches through HW/SW co-design.
Daniel S. McFarlin, Craig B. Zilles
2015 CCICheck: using µhb graphs to verify the coherence-consistency interface.
Yatin A. Manerkar, Daniel Lustig, Michael Pellauer, Margaret Martonosi
2015 CLEAN-ECC: high reliability ECC for adaptive granularity memory system.
Seong-Lyong Gong, Minsoo Rhu, Jungrae Kim, Jinsuk Chung, Mattan Erez
2015 Characterizing, modeling, and improving the QoE of mobile devices with low battery level.
Kaige Yan, Xingyao Zhang, Xin Fu
2015 Coherence domain restriction on large scale systems.
Yaosheng Fu, Tri Minh Nguyen, David Wentzlaff
2015 Confluence: unified instruction supply for scale-out servers.
Cansu Kaynak, Boris Grot, Babak Falsafi
2015 Control flow coalescing on a hybrid dataflow/von Neumann GPGPU.
Dani Voitsechov, Yoav Etsion
2015 Cross-architecture performance prediction (XAPP) using CPU code to predict GPU performance.
Newsha Ardalani, Clint Lestourgeon, Karthikeyan Sankaralingam, Xiaojin Zhu
2015 DCS: a fast and scalable device-centric server architecture.
Jaehyung Ahn, Dongup Kwon, Youngsok Kim, Mohammadamin Ajdari, Jaewon Lee, Jangwoo Kim
2015 DeSC: decoupled supply-compute communication management for heterogeneous architectures.
Tae Jun Ham, Juan L. Aragón, Margaret Martonosi
2015 Doppelgänger: a cache for approximate computing.
Joshua San Miguel, Jorge Albericio, Andreas Moshovos, Natalie D. Enright Jerger
2015 DynaMOS: dynamic schedule migration for heterogeneous cores.
Shruti Padmanabha, Andrew Lukefahr, Reetuparna Das, Scott A. Mahlke
2015 Efficient GPU synchronization without scopes: saying no to complex consistency models.
Matthew D. Sinclair, Johnathan Alsop, Sarita V. Adve
2015 Efficient persist barriers for multicores.
Arpit Joshi, Vijay Nagarajan, Marcelo Cintra, Stratis Viglas
2015 Efficient warp execution in presence of divergence with collaborative context collection.
Farzad Khorasani, Rajiv Gupta, Laxmi N. Bhuyan
2015 Efficiently enforcing strong memory ordering in GPUs.
Abhayendra Singh, Shaizeen Aga, Satish Narayanasamy
2015 Efficiently prefetching complex address patterns.
Manjunath Shevgoor, Sahil Koladiya, Rajeev Balasubramonian, Chris Wilkerson, Seth H. Pugsley, Zeshan Chishti
2015 Enabling coordinated register allocation and thread-level parallelism optimization for GPUs.
Xiaolong Xie, Yun Liang, Xiuhong Li, Yudong Wu, Guangyu Sun, Tao Wang, Dongrui Fan
2015 Enabling interposer-based disintegration of multi-core processors.
Ajaykumar Kannan, Natalie D. Enright Jerger, Gabriel H. Loh
2015 Enabling portable energy efficiency with memory accelerated library.
Qi Guo, Tze Meng Low, Nikolaos Alachiotis, Berkin Akin, Larry T. Pileggi, James C. Hoe, Franz Franchetti
2015 Execution time prediction for energy-efficient hardware accelerators.
Tao Chen, Alexander Rucker, G. Edward Suh
2015 Exploiting commutativity to reduce the cost of updates to shared data in cache-coherent systems.
Guowei Zhang, Webb Horn, Daniel Sánchez
2015 Fast support for unstructured data processing: the unified automata processor.
Yuanwei Fang, Tung Thanh Hoang, Michela Becchi, Andrew A. Chien
2015 Filtered runahead execution with a runahead buffer.
Milad Hashemi, Yale N. Patt
2015 Fork path: improving efficiency of ORAM by removing redundant memory accesses.
Xian Zhang, Guangyu Sun, Chao Zhang, Weiqi Zhang, Yun Liang, Tao Wang, Yiran Chen, Jia Di
2015 Free launch: optimizing GPU dynamic kernel launches through thread reuse.
Guoyang Chen, Xipeng Shen
2015 GPU register file virtualization.
Hyeran Jeon, Gokul Subramanian Ravi, Nam Sung Kim, Murali Annavaram
2015 Gather-scatter DRAM: in-DRAM address translation to improve the spatial locality of non-unit strided accesses.
Vivek Seshadri, Thomas Mullins, Amirali Boroumand, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry
2015 HyComp: a hybrid cache compression method for selection of data-type-specific compression methods.
Angelos Arelakis, Fredrik Dahlgren, Per Stenström
2015 IMP: indirect memory prefetcher.
Xiangyao Yu, Christopher J. Hughes, Nadathur Satish, Srinivas Devadas
2015 Improving DRAM latency with dynamic asymmetric subarray.
Shih-Lien Lu, Ying-Chen Lin, Chia-Lin Yang
2015 Large pages and lightweight memory management in virtualized environments: can you have it both ways?
Binh Pham, Ján Veselý, Gabriel H. Loh, Abhishek Bhattacharjee
2015 Locking down insecure indirection with hardware-based control-data isolation.
William Arthur, Sahil Madeka, Reetuparna Das, Todd M. Austin
2015 Long term parking (LTP): criticality-aware resource allocation in OOO processors.
Andreas Sembrant, Trevor E. Carlson, Erik Hagersten, David Black-Schaffer, Arthur Perais, André Seznec, Pierre Michaud
2015 MORC: a manycore-oriented compressed cache.
Tri Minh Nguyen, David Wentzlaff
2015 Microarchitectural implications of event-driven server-side web applications.
Yuhao Zhu, Daniel Richins, Matthew Halpern, Vijay Janapa Reddi
2015 Modeling the implications of DRAM failures and protection techniques on datacenter TCO.
Panagiota Nikolaou, Yiannakis Sazeides, Lorena Ndreu, Marios Kleanthous
2015 More is less: improving the energy efficiency of data movement via opportunistic use of sparse codes.
Yanwei Song, Engin Ipek
2015 Neural acceleration for GPU throughput processors.
Amir Yazdanbakhsh, Jongse Park, Hardik Sharma, Pejman Lotfi-Kamran, Hadi Esmaeilzadeh
2015 Neuromorphic accelerators: a comparison between neuroscience and machine-learning approaches.
Zidong Du, Daniel D. Ben-Dayan Rubin, Yunji Chen, Liqiang He, Tianshi Chen, Lei Zhang, Chengyong Wu, Olivier Temam
2015 Prediction-guided performance-energy trade-off for interactive applications.
Daniel Lo, Taejoon Song, G. Edward Suh
2015 Proceedings of the 48th International Symposium on Microarchitecture, MICRO 2015, Waikiki, HI, USA, December 5-9, 2015
Milos Prvulovic
2015 Rubik: fast analytical power management for latency-critical systems.
Harshad Kasture, Davide B. Bartolini, Nathan Beckmann, Daniel Sánchez
2015 SAWS: synchronization aware GPGPU warp scheduling for multiple independent warp schedulers.
Jiwei Liu, Jun Yang, Rami G. Melhem
2015 Safe limits on voltage reduction efficiency in GPUs: a direct measurement approach.
Jingwen Leng, Alper Buyuktosunoglu, Ramon Bertran, Pradip Bose, Vijay Janapa Reddi
2015 Self-contained, accurate precomputation prefetching.
Islam Atta, Xin Tong, Vijayalakshmi Srinivasan, Ioana Baldini, Andreas Moshovos
2015 The CRISP performance model for dynamic voltage and frequency scaling in a GPGPU.
Rajib Nath, Dean M. Tullsen
2015 The application slowdown model: quantifying and controlling the impact of inter-application interference at shared caches and main memory.
Lavanya Subramanian, Vivek Seshadri, Arnab Ghosh, Samira Manabi Khan, Onur Mutlu
2015 The inner most loop iteration counter: a new dimension in branch history.
André Seznec, Joshua San Miguel, Jorge Albericio
2015 ThyNVM: enabling software-transparent crash consistency in persistent memory systems.
Jinglei Ren, Jishen Zhao, Samira Manabi Khan, Jongmoo Choi, Yongwei Wu, Onur Mutlu
2015 TimeTrader: exploiting latency tail to save datacenter energy for online search.
Balajee Vamanan, Hamza Bin Sohail, Jahangir Hasan, T. N. Vijaykumar
2015 Ultra-low power render-based collision detection for CPU/GPU systems.
Enrique de Lucas, Pedro Marcuello, Joan-Manuel Parcerisa, Antonio González
2015 WarpPool: sharing requests with inter-warp coalescing for throughput processors.
John Kloosterman, Jonathan Beaumont, Mick Wollman, Ankit Sethia, Ronald G. Dreslinski, Trevor N. Mudge, Scott A. Mahlke
2015 vCache: architectural support for transparent and isolated virtual LLCs in virtualized environments.
Daehoon Kim, Hwanju Kim, Nam Sung Kim, Jaehyuk Huh