MICRO - RankMe

62 papers

Year	Title / Authors
2015	A fast and accurate analytical technique to compute the AVF of sequential bits in a processor. Steven Raasch, Arijit Biswas, Jon Stephan, Paul Racunas, Joel S. Emer
2015	A scalable architecture for ordered parallelism. Mark C. Jeffrey, Suvinay Subramanian, Cong Yan, Joel S. Emer, Daniel Sánchez
2015	Adaptive guardband scheduling to improve system-level efficiency of the POWER7+. Yazhou Zu, Charles R. Lefurgy, Jingwen Leng, Matthew Halpern, Michael S. Floyd, Vijay Janapa Reddi
2015	An integrated concurrency and core-ISA architectural envelope definition, and test oracle, for IBM POWER multiprocessors. Kathryn E. Gray, Gabriel Kerneis, Dominic P. Mulligan, Christopher Pulte, Susmit Sarkar, Peter Sewell
2015	Architecture-aware automatic computation offload for native applications. Gwangmu Lee, Hyunjoon Park, Seonyeong Heo, Kyung-Ah Chang, Hyogun Lee, Hanjun Kim
2015	Authenticache: harnessing cache ECC for system authentication. Anys Bacha, Radu Teodorescu
2015	Avoiding information leakage in the memory controller with fixed service policies. Ali Shafiee, Akhila Gundu, Manjunath Shevgoor, Rajeev Balasubramonian, Mohit Tiwari
2015	Border control: sandboxing accelerators. Lena E. Olson, Jason Power, Mark D. Hill, David A. Wood
2015	Bungee jumps: accelerating indirect branches through HW/SW co-design. Daniel S. McFarlin, Craig B. Zilles
2015	CCICheck: using µhb graphs to verify the coherence-consistency interface. Yatin A. Manerkar, Daniel Lustig, Michael Pellauer, Margaret Martonosi
2015	CLEAN-ECC: high reliability ECC for adaptive granularity memory system. Seong-Lyong Gong, Minsoo Rhu, Jungrae Kim, Jinsuk Chung, Mattan Erez
2015	Characterizing, modeling, and improving the QoE of mobile devices with low battery level. Kaige Yan, Xingyao Zhang, Xin Fu
2015	Coherence domain restriction on large scale systems. Yaosheng Fu, Tri Minh Nguyen, David Wentzlaff
2015	Confluence: unified instruction supply for scale-out servers. Cansu Kaynak, Boris Grot, Babak Falsafi
2015	Control flow coalescing on a hybrid dataflow/von Neumann GPGPU. Dani Voitsechov, Yoav Etsion
2015	Cross-architecture performance prediction (XAPP) using CPU code to predict GPU performance. Newsha Ardalani, Clint Lestourgeon, Karthikeyan Sankaralingam, Xiaojin Zhu
2015	DCS: a fast and scalable device-centric server architecture. Jaehyung Ahn, Dongup Kwon, Youngsok Kim, Mohammadamin Ajdari, Jaewon Lee, Jangwoo Kim
2015	DeSC: decoupled supply-compute communication management for heterogeneous architectures. Tae Jun Ham, Juan L. Aragón, Margaret Martonosi
2015	Doppelgänger: a cache for approximate computing. Joshua San Miguel, Jorge Albericio, Andreas Moshovos, Natalie D. Enright Jerger
2015	DynaMOS: dynamic schedule migration for heterogeneous cores. Shruti Padmanabha, Andrew Lukefahr, Reetuparna Das, Scott A. Mahlke
2015	Efficient GPU synchronization without scopes: saying no to complex consistency models. Matthew D. Sinclair, Johnathan Alsop, Sarita V. Adve
2015	Efficient persist barriers for multicores. Arpit Joshi, Vijay Nagarajan, Marcelo Cintra, Stratis Viglas
2015	Efficient warp execution in presence of divergence with collaborative context collection. Farzad Khorasani, Rajiv Gupta, Laxmi N. Bhuyan
2015	Efficiently enforcing strong memory ordering in GPUs. Abhayendra Singh, Shaizeen Aga, Satish Narayanasamy
2015	Efficiently prefetching complex address patterns. Manjunath Shevgoor, Sahil Koladiya, Rajeev Balasubramonian, Chris Wilkerson, Seth H. Pugsley, Zeshan Chishti
2015	Enabling coordinated register allocation and thread-level parallelism optimization for GPUs. Xiaolong Xie, Yun Liang, Xiuhong Li, Yudong Wu, Guangyu Sun, Tao Wang, Dongrui Fan
2015	Enabling interposer-based disintegration of multi-core processors. Ajaykumar Kannan, Natalie D. Enright Jerger, Gabriel H. Loh
2015	Enabling portable energy efficiency with memory accelerated library. Qi Guo, Tze Meng Low, Nikolaos Alachiotis, Berkin Akin, Larry T. Pileggi, James C. Hoe, Franz Franchetti
2015	Execution time prediction for energy-efficient hardware accelerators. Tao Chen, Alexander Rucker, G. Edward Suh
2015	Exploiting commutativity to reduce the cost of updates to shared data in cache-coherent systems. Guowei Zhang, Webb Horn, Daniel Sánchez
2015	Fast support for unstructured data processing: the unified automata processor. Yuanwei Fang, Tung Thanh Hoang, Michela Becchi, Andrew A. Chien
2015	Filtered runahead execution with a runahead buffer. Milad Hashemi, Yale N. Patt
2015	Fork path: improving efficiency of ORAM by removing redundant memory accesses. Xian Zhang, Guangyu Sun, Chao Zhang, Weiqi Zhang, Yun Liang, Tao Wang, Yiran Chen, Jia Di
2015	Free launch: optimizing GPU dynamic kernel launches through thread reuse. Guoyang Chen, Xipeng Shen
2015	GPU register file virtualization. Hyeran Jeon, Gokul Subramanian Ravi, Nam Sung Kim, Murali Annavaram
2015	Gather-scatter DRAM: in-DRAM address translation to improve the spatial locality of non-unit strided accesses. Vivek Seshadri, Thomas Mullins, Amirali Boroumand, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry
2015	HyComp: a hybrid cache compression method for selection of data-type-specific compression methods. Angelos Arelakis, Fredrik Dahlgren, Per Stenström
2015	IMP: indirect memory prefetcher. Xiangyao Yu, Christopher J. Hughes, Nadathur Satish, Srinivas Devadas
2015	Improving DRAM latency with dynamic asymmetric subarray. Shih-Lien Lu, Ying-Chen Lin, Chia-Lin Yang
2015	Large pages and lightweight memory management in virtualized environments: can you have it both ways? Binh Pham, Ján Veselý, Gabriel H. Loh, Abhishek Bhattacharjee
2015	Locking down insecure indirection with hardware-based control-data isolation. William Arthur, Sahil Madeka, Reetuparna Das, Todd M. Austin
2015	Long term parking (LTP): criticality-aware resource allocation in OOO processors. Andreas Sembrant, Trevor E. Carlson, Erik Hagersten, David Black-Schaffer, Arthur Perais, André Seznec, Pierre Michaud
2015	MORC: a manycore-oriented compressed cache. Tri Minh Nguyen, David Wentzlaff
2015	Microarchitectural implications of event-driven server-side web applications. Yuhao Zhu, Daniel Richins, Matthew Halpern, Vijay Janapa Reddi
2015	Modeling the implications of DRAM failures and protection techniques on datacenter TCO. Panagiota Nikolaou, Yiannakis Sazeides, Lorena Ndreu, Marios Kleanthous
2015	More is less: improving the energy efficiency of data movement via opportunistic use of sparse codes. Yanwei Song, Engin Ipek
2015	Neural acceleration for GPU throughput processors. Amir Yazdanbakhsh, Jongse Park, Hardik Sharma, Pejman Lotfi-Kamran, Hadi Esmaeilzadeh
2015	Neuromorphic accelerators: a comparison between neuroscience and machine-learning approaches. Zidong Du, Daniel D. Ben-Dayan Rubin, Yunji Chen, Liqiang He, Tianshi Chen, Lei Zhang, Chengyong Wu, Olivier Temam
2015	Prediction-guided performance-energy trade-off for interactive applications. Daniel Lo, Taejoon Song, G. Edward Suh
2015	Proceedings of the 48th International Symposium on Microarchitecture, MICRO 2015, Waikiki, HI, USA, December 5-9, 2015. Milos Prvulovic
2015	Rubik: fast analytical power management for latency-critical systems. Harshad Kasture, Davide B. Bartolini, Nathan Beckmann, Daniel Sánchez
2015	SAWS: synchronization aware GPGPU warp scheduling for multiple independent warp schedulers. Jiwei Liu, Jun Yang, Rami G. Melhem
2015	Safe limits on voltage reduction efficiency in GPUs: a direct measurement approach. Jingwen Leng, Alper Buyuktosunoglu, Ramon Bertran, Pradip Bose, Vijay Janapa Reddi
2015	Self-contained, accurate precomputation prefetching. Islam Atta, Xin Tong, Vijayalakshmi Srinivasan, Ioana Baldini, Andreas Moshovos
2015	The CRISP performance model for dynamic voltage and frequency scaling in a GPGPU. Rajib Nath, Dean M. Tullsen
2015	The application slowdown model: quantifying and controlling the impact of inter-application interference at shared caches and main memory. Lavanya Subramanian, Vivek Seshadri, Arnab Ghosh, Samira Manabi Khan, Onur Mutlu
2015	The inner most loop iteration counter: a new dimension in branch history. André Seznec, Joshua San Miguel, Jorge Albericio
2015	ThyNVM: enabling software-transparent crash consistency in persistent memory systems. Jinglei Ren, Jishen Zhao, Samira Manabi Khan, Jongmoo Choi, Yongwei Wu, Onur Mutlu
2015	TimeTrader: exploiting latency tail to save datacenter energy for online search. Balajee Vamanan, Hamza Bin Sohail, Jahangir Hasan, T. N. Vijaykumar
2015	Ultra-low power render-based collision detection for CPU/GPU systems. Enrique de Lucas, Pedro Marcuello, Joan-Manuel Parcerisa, Antonio González
2015	WarpPool: sharing requests with inter-warp coalescing for throughput processors. John Kloosterman, Jonathan Beaumont, Mick Wollman, Ankit Sethia, Ronald G. Dreslinski, Trevor N. Mudge, Scott A. Mahlke
2015	vCache: architectural support for transparent and isolated virtual LLCs in virtualized environments. Daehoon Kim, Hwanju Kim, Nam Sung Kim, Jaehyuk Huh