MICRO - RankMe

62 papers

Year	Title / Authors
2017	A many-core architecture for in-memory data processing. Sandeep R. Agrawal, Sam Idicula, Arun Raghavan, Evangelos Vlachos, Venkatraman Govindaraju, Venkatanathan Varadarajan, Cagri Balkesen, Georgios Giannikis, Charlie Roth, Nipun Agarwal, Eric Sedlar
2017	Ambit: in-memory accelerator for bulk bitwise operations using commodity DRAM technology. Vivek Seshadri, Donghyuk Lee, Thomas Mullins, Hasan Hassan, Amirali Boroumand, Jeremie S. Kim, Michael A. Kozuch, Onur Mutlu, Phillip B. Gibbons, Todd C. Mowry
2017	An experimental microarchitecture for a superconducting quantum processor. Xiang Fu, Michiel Adriaan Rol, Cornelis Christiaan Bultink, J. van Someren, Nader Khammassi, Imran Ashraf, R. F. L. Vermeulen, J. C. de Sterke, W. J. Vlothuizen, R. N. Schouten, Carmen G. Almudéver, Leonardo DiCarlo, Koen Bertels
2017	Architecting hierarchical coherence protocols for push-button parametric verification. Opeoluwa Matthews, Daniel J. Sorin
2017	Architectural opportunities for novel dynamic EMI shifting (DEMIS). Daphne I. Gorman, Matthew R. Guthaus, Jose Renau
2017	Architectural tradeoffs for biodegradable computing. Ting-Jung Chang, Zhuozhi Yao, Paul J. Jackson, Barry P. Rand, David Wentzlaff
2017	BVF: enabling significant on-chip power savings via bit-value-favor for throughput processors. Ang Li, Wenfeng Zhao, Shuaiwen Leon Song
2017	Banshee: bandwidth-efficient DRAM caching via software/hardware cooperation. Xiangyao Yu, Christopher J. Hughes, Nadathur Satish, Onur Mutlu, Srinivas Devadas
2017	Beyond the socket: NUMA-aware GPUs. Ugljesa Milic, Oreste Villa, Evgeny Bolotin, Akhil Arunkumar, Eiman Ebrahimi, Aamer Jaleel, Alex Ramírez, David W. Nellans
2017	Bit-pragmatic deep neural network computing. Jorge Albericio, Alberto Delmas, Patrick Judd, Sayeh Sharify, Gerard O'Leary, Roman Genov, Andreas Moshovos
2017	CSALT: context switch aware large TLB. Yashwant Marathe, Nagendra Gulur, Jee Ho Ryoo, Shuang Song, Lizy K. John
2017	Cache automaton. Arun Subramaniyan, Jingcheng Wang, Ezhil R. M. Balasubramanian, David T. Blaauw, Dennis Sylvester, Reetuparna Das
2017	CirCNN: accelerating and compressing deep neural networks using block-circulant weight matrices. Caiwen Ding, Siyu Liao, Yanzhi Wang, Zhe Li, Ning Liu, Youwei Zhuo, Chao Wang, Xuehai Qian, Yu Bai, Geng Yuan, Xiaolong Ma, Yipeng Zhang, Jian Tang, Qinru Qiu, Xue Lin, Bo Yuan
2017	Constructing and characterizing covert channels on GPGPUs. Hoda Naghibijouybari, Khaled N. Khasawneh, Nael B. Abu-Ghazaleh
2017	Contutto: a novel FPGA-based prototyping platform enabling innovation in the memory subsystem of a server class processor. Bharat Sukhwani, Thomas Roewer, Charles L. Haymes, Kyu-Hyoun Kim, Adam J. McPadden, Daniel M. Dreps, Dean Sanner, Jan van Lunteren, Sameh W. Asaad
2017	DRISA: a DRAM-based reconfigurable in-situ accelerator. Shuangchen Li, Dimin Niu, Krishna T. Malladi, Hongzhong Zheng, Bob Brennan, Yuan Xie
2017	Data movement aware computation partitioning. Xulong Tang, Orhan Kislal, Mahmut T. Kandemir, Mustafa Karaköy
2017	DeftNN: addressing bottlenecks for DNN execution on GPUs via synapse vector elimination and near-compute data fission. Parker Hill, Animesh Jain, Mason Hill, Babak Zamirai, Chang-Hong Hsu, Michael A. Laurenzano, Scott A. Mahlke, Lingjia Tang, Jason Mars
2017	Detecting and mitigating data-dependent DRAM failures by exploiting current memory content. Samira Manabi Khan, Chris Wilkerson, Zhe Wang, Alaa R. Alameldeen, Donghyuk Lee, Onur Mutlu
2017	Efficient exception handling support for GPUs. Ivan Tanasic, Isaac Gelado, Marc Jordà, Eduard Ayguadé, Nacho Navarro
2017	Efficient support of position independence on non-volatile memory. Guoyang Chen, Lei Zhang, Richa Budhiraja, Xipeng Shen, Youfeng Wu
2017	Estimating and understanding architectural risk. Weilong Cui, Timothy Sherwood
2017	Exploiting heterogeneity for tail latency and energy efficiency. Md. Enamul Haque, Yuxiong He, Sameh Elnikety, Thu D. Nguyen, Ricardo Bianchini, Kathryn S. McKinley
2017	Fine-grained DRAM: energy-efficient DRAM for extreme bandwidth systems. Mike O'Connor, Niladrish Chatterjee, Donghyuk Lee, John M. Wilson, Aditya Agrawal, Stephen W. Keckler, William J. Dally
2017	GPUpd: a fast and scalable multi-GPU architecture using cooperative projection and distribution. Youngsok Kim, Jae-Eon Jo, Hanhwi Jang, Minsoo Rhu, Hanjun Kim, Jangwoo Kim
2017	Hardware supported persistent object address translation. Tiancong Wang, Sakthikumaran Sambasivam, Yan Solihin, James Tuck
2017	Harnessing voltage margins for energy efficiency in multicore CPUs. George Papadimitriou, Manolis Kaliorakis, Athanasios Chatzidimitriou, Dimitris Gizopoulos, Peter Lawthers, Shidhartha Das
2017	How secure is your cache against side-channel attacks? Zecheng He, Ruby B. Lee
2017	Hybrid analog-digital solution of nonlinear partial differential equations. Yipeng Huang, Ning Guo, Mingoo Seok, Yannis P. Tsividis, Kyle T. Mandli, Simha Sethumadhavan
2017	IDEAL: image denoising accelerator. Mostafa Mahmoud, Bojian Zheng, Alberto Delmas Lascorz, Felix Heide, Jonathan Assouline, Paul Boucher, Emmanuel Onzon, Andreas Moshovos
2017	Improving the effectiveness of searching for isomorphic chains in superword level parallelism. Joonmoo Huh, James Tuck
2017	Incidental computing on IoT nonvolatile processors. Kaisheng Ma, Xueqing Li, Jinyang Li, Yongpan Liu, Yuan Xie, Jack Sampson, Mahmut Taylan Kandemir, Vijaykrishnan Narayanan
2017	Load value prediction via path-based address prediction: avoiding mispredictions due to conflicting stores. Rami Sheikh, Harold W. Cain, Raguram Damodaran
2017	Memory cocktail therapy: a general learning-based framework to optimize dynamic tradeoffs in NVMs. Zhaoxia Deng, Lunkai Zhang, Nikita Mishra, Henry Hoffmann, Frederic T. Chong
2017	Mirage cores: the illusion of many out-of-order cores using in-order hardware. Shruti Padmanabha, Andrew Lukefahr, Reetuparna Das, Scott A. Mahlke
2017	Mosaic: a GPU memory manager with application-transparent support for multiple page sizes. Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller, Saugata Ghose, Jayneel Gandhi, Christopher J. Rossbach, Onur Mutlu
2017	Multiperspective reuse prediction. Daniel A. Jiménez, Elvira Teran
2017	Optimized surface code communication in superconducting quantum computers. Ali Javadi-Abhari, Pranav Gokhale, Adam Holmes, Diana Franklin, Kenneth R. Brown, Margaret Martonosi, Frederic T. Chong
2017	PARSNIP: performant architecture for race safety with no impact on precision. Yuanfeng Peng, Benjamin P. Wood, Joseph Devietti
2017	Pageforge: a near-memory content-aware page-merging architecture. Dimitrios Skarlatos, Nam Sung Kim, Josep Torrellas
2017	Pipelining a triggered processing element. Thomas J. Repetti, João Pedro Cerqueira, Martha A. Kim, Mingoo Seok
2017	Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2017, Cambridge, MA, USA, October 14-18, 2017 Hillery C. Hunter, Jaime Moreno, Joel S. Emer, Daniel Sánchez
2017	Proteus: a flexible and fast software supported hardware logging approach for NVM. Seunghee Shin, Satish Kumar Tirukkovalluri, James Tuck, Yan Solihin
2017	RHMD: evasion-resilient hardware malware detectors. Khaled N. Khasawneh, Nael B. Abu-Ghazaleh, Dmitry Ponomarev, Lei Yu
2017	RTLcheck: verifying the memory consistency of RTL designs. Yatin A. Manerkar, Daniel Lustig, Margaret Martonosi, Michael Pellauer
2017	Race-to-sleep + content caching + display caching: a recipe for energy-efficient video streaming on handhelds. Haibo Zhang, Prasanna Venkatesh Rengasamy, Shulin Zhao, Nachiappan Chidambaram Nachiappan, Anand Sivasubramaniam, Mahmut T. Kandemir, Ravi R. Iyer, Chita R. Das
2017	Regless: just-in-time operand staging for GPUs. John Kloosterman, Jonathan Beaumont, Davoud Anoushe Jamshidi, Jonathan Bailey, Trevor N. Mudge, Scott A. Mahlke
2017	SCRATCH: an end-to-end application-aware soft-GPGPU architecture and trimming tool. Pedro Duarte, Pedro Tomás, Gabriel Falcão
2017	Scale-out acceleration for machine learning. Jongse Park, Hardik Sharma, Divya Mahajan, Joon Kyung Kim, Preston Olds, Hadi Esmaeilzadeh
2017	Schedtask: a hardware-assisted task scheduler. Prathmesh Kallurkar, Smruti R. Sarangi
2017	Software-based gate-level information flow security for IoT systems. Hari Cherupalli, Henry Duwe, Weidong Ye, Rakesh Kumar, John Sartori
2017	Summarizer: trading communication with computing near storage. Gunjae Koo, Kiran Kumar Matam, Te I, H. V. Krishna Giri Narra, Jing Li, Hung-Wei Tseng, Steven Swanson, Murali Annavaram
2017	TMI: thread memory isolation for false sharing repair. Christian DeLozier, Ariel Eizenberg, Shiliang Hu, Gilles Pokam, Joseph Devietti
2017	Taming the instruction bandwidth of quantum computers via hardware-managed error correction. Swamit S. Tannu, Zachary A. Myers, Prashant J. Nair, Douglas M. Carmean, Moinuddin K. Qureshi
2017	UDP: a programmable accelerator for extract-transform-load workloads and more. Yuanwei Fang, Chen Zou, Aaron J. Elmore, Andrew A. Chien
2017	UNFOLD: a memory-efficient speech recognizer using on-the-fly WFST composition. Reza Yazdani, José-María Arnau, Antonio González
2017	Unleashing the power of GPU for physically-based rendering via dynamic ray shuffling. Ya-Shuai Lü, Libo Huang, Li Shen, Zhiying Wang
2017	Using branch predictors to predict brain activity in brain-machine implants. Abhishek Bhattacharjee
2017	Using intra-core loop-task accelerators to improve the productivity and performance of task-based parallel programs. Ji Kim, Shunning Jiang, Christopher Torng, Moyang Wang, Shreesha Srinath, Berkin Ilbeyi, Khalid Al-Hawaj, Christopher Batten
2017	Versapipe: a versatile programming framework for pipelined computing on GPU. Zhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen, Youngmin Yi, Wenguang Chen
2017	Wireframe: supporting data-dependent parallelism through dependency graph execution in GPUs. AmirAli Abdolrashidi, Devashree Tripathy, Mehmet Esat Belviranli, Laxmi Narayan Bhuyan, Daniel Wong
2017	Xylem: enhancing vertical thermal conduction in 3D processor-memory stacks. Aditya Agrawal, Josep Torrellas, Sachin Idgunji