MICRO A*

62 papers

YearTitle / Authors
2017A many-core architecture for in-memory data processing.
Sandeep R. Agrawal, Sam Idicula, Arun Raghavan, Evangelos Vlachos, Venkatraman Govindaraju, Venkatanathan Varadarajan, Cagri Balkesen, Georgios Giannikis, Charlie Roth, Nipun Agarwal, Eric Sedlar
2017Ambit: in-memory accelerator for bulk bitwise operations using commodity DRAM technology.
Vivek Seshadri, Donghyuk Lee, Thomas Mullins, Hasan Hassan, Amirali Boroumand, Jeremie S. Kim, Michael A. Kozuch, Onur Mutlu, Phillip B. Gibbons, Todd C. Mowry
2017An experimental microarchitecture for a superconducting quantum processor.
Xiang Fu, Michiel Adriaan Rol, Cornelis Christiaan Bultink, J. van Someren, Nader Khammassi, Imran Ashraf, R. F. L. Vermeulen, J. C. de Sterke, W. J. Vlothuizen, R. N. Schouten, Carmen G. Almudéver, Leonardo DiCarlo, Koen Bertels
2017Architecting hierarchical coherence protocols for push-button parametric verification.
Opeoluwa Matthews, Daniel J. Sorin
2017Architectural opportunities for novel dynamic EMI shifting (DEMIS).
Daphne I. Gorman, Matthew R. Guthaus, Jose Renau
2017Architectural tradeoffs for biodegradable computing.
Ting-Jung Chang, Zhuozhi Yao, Paul J. Jackson, Barry P. Rand, David Wentzlaff
2017BVF: enabling significant on-chip power savings via bit-value-favor for throughput processors.
Ang Li, Wenfeng Zhao, Shuaiwen Leon Song
2017Banshee: bandwidth-efficient DRAM caching via software/hardware cooperation.
Xiangyao Yu, Christopher J. Hughes, Nadathur Satish, Onur Mutlu, Srinivas Devadas
2017Beyond the socket: NUMA-aware GPUs.
Ugljesa Milic, Oreste Villa, Evgeny Bolotin, Akhil Arunkumar, Eiman Ebrahimi, Aamer Jaleel, Alex Ramírez, David W. Nellans
2017Bit-pragmatic deep neural network computing.
Jorge Albericio, Alberto Delmas, Patrick Judd, Sayeh Sharify, Gerard O'Leary, Roman Genov, Andreas Moshovos
2017CSALT: context switch aware large TLB.
Yashwant Marathe, Nagendra Gulur, Jee Ho Ryoo, Shuang Song, Lizy K. John
2017Cache automaton.
Arun Subramaniyan, Jingcheng Wang, Ezhil R. M. Balasubramanian, David T. Blaauw, Dennis Sylvester, Reetuparna Das
2017CirCNN: accelerating and compressing deep neural networks using block-circulant weight matrices.
Caiwen Ding, Siyu Liao, Yanzhi Wang, Zhe Li, Ning Liu, Youwei Zhuo, Chao Wang, Xuehai Qian, Yu Bai, Geng Yuan, Xiaolong Ma, Yipeng Zhang, Jian Tang, Qinru Qiu, Xue Lin, Bo Yuan
2017Constructing and characterizing covert channels on GPGPUs.
Hoda Naghibijouybari, Khaled N. Khasawneh, Nael B. Abu-Ghazaleh
2017Contutto: a novel FPGA-based prototyping platform enabling innovation in the memory subsystem of a server class processor.
Bharat Sukhwani, Thomas Roewer, Charles L. Haymes, Kyu-Hyoun Kim, Adam J. McPadden, Daniel M. Dreps, Dean Sanner, Jan van Lunteren, Sameh W. Asaad
2017DRISA: a DRAM-based reconfigurable in-situ accelerator.
Shuangchen Li, Dimin Niu, Krishna T. Malladi, Hongzhong Zheng, Bob Brennan, Yuan Xie
2017Data movement aware computation partitioning.
Xulong Tang, Orhan Kislal, Mahmut T. Kandemir, Mustafa Karaköy
2017DeftNN: addressing bottlenecks for DNN execution on GPUs via synapse vector elimination and near-compute data fission.
Parker Hill, Animesh Jain, Mason Hill, Babak Zamirai, Chang-Hong Hsu, Michael A. Laurenzano, Scott A. Mahlke, Lingjia Tang, Jason Mars
2017Detecting and mitigating data-dependent DRAM failures by exploiting current memory content.
Samira Manabi Khan, Chris Wilkerson, Zhe Wang, Alaa R. Alameldeen, Donghyuk Lee, Onur Mutlu
2017Efficient exception handling support for GPUs.
Ivan Tanasic, Isaac Gelado, Marc Jordà, Eduard Ayguadé, Nacho Navarro
2017Efficient support of position independence on non-volatile memory.
Guoyang Chen, Lei Zhang, Richa Budhiraja, Xipeng Shen, Youfeng Wu
2017Estimating and understanding architectural risk.
Weilong Cui, Timothy Sherwood
2017Exploiting heterogeneity for tail latency and energy efficiency.
Md. Enamul Haque, Yuxiong He, Sameh Elnikety, Thu D. Nguyen, Ricardo Bianchini, Kathryn S. McKinley
2017Fine-grained DRAM: energy-efficient DRAM for extreme bandwidth systems.
Mike O'Connor, Niladrish Chatterjee, Donghyuk Lee, John M. Wilson, Aditya Agrawal, Stephen W. Keckler, William J. Dally
2017GPUpd: a fast and scalable multi-GPU architecture using cooperative projection and distribution.
Youngsok Kim, Jae-Eon Jo, Hanhwi Jang, Minsoo Rhu, Hanjun Kim, Jangwoo Kim
2017Hardware supported persistent object address translation.
Tiancong Wang, Sakthikumaran Sambasivam, Yan Solihin, James Tuck
2017Harnessing voltage margins for energy efficiency in multicore CPUs.
George Papadimitriou, Manolis Kaliorakis, Athanasios Chatzidimitriou, Dimitris Gizopoulos, Peter Lawthers, Shidhartha Das
2017How secure is your cache against side-channel attacks?
Zecheng He, Ruby B. Lee
2017Hybrid analog-digital solution of nonlinear partial differential equations.
Yipeng Huang, Ning Guo, Mingoo Seok, Yannis P. Tsividis, Kyle T. Mandli, Simha Sethumadhavan
2017IDEAL: image denoising accelerator.
Mostafa Mahmoud, Bojian Zheng, Alberto Delmas Lascorz, Felix Heide, Jonathan Assouline, Paul Boucher, Emmanuel Onzon, Andreas Moshovos
2017Improving the effectiveness of searching for isomorphic chains in superword level parallelism.
Joonmoo Huh, James Tuck
2017Incidental computing on IoT nonvolatile processors.
Kaisheng Ma, Xueqing Li, Jinyang Li, Yongpan Liu, Yuan Xie, Jack Sampson, Mahmut Taylan Kandemir, Vijaykrishnan Narayanan
2017Load value prediction via path-based address prediction: avoiding mispredictions due to conflicting stores.
Rami Sheikh, Harold W. Cain, Raguram Damodaran
2017Memory cocktail therapy: a general learning-based framework to optimize dynamic tradeoffs in NVMs.
Zhaoxia Deng, Lunkai Zhang, Nikita Mishra, Henry Hoffmann, Frederic T. Chong
2017Mirage cores: the illusion of many out-of-order cores using in-order hardware.
Shruti Padmanabha, Andrew Lukefahr, Reetuparna Das, Scott A. Mahlke
2017Mosaic: a GPU memory manager with application-transparent support for multiple page sizes.
Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller, Saugata Ghose, Jayneel Gandhi, Christopher J. Rossbach, Onur Mutlu
2017Multiperspective reuse prediction.
Daniel A. Jiménez, Elvira Teran
2017Optimized surface code communication in superconducting quantum computers.
Ali Javadi-Abhari, Pranav Gokhale, Adam Holmes, Diana Franklin, Kenneth R. Brown, Margaret Martonosi, Frederic T. Chong
2017PARSNIP: performant architecture for race safety with no impact on precision.
Yuanfeng Peng, Benjamin P. Wood, Joseph Devietti
2017Pageforge: a near-memory content-aware page-merging architecture.
Dimitrios Skarlatos, Nam Sung Kim, Josep Torrellas
2017Pipelining a triggered processing element.
Thomas J. Repetti, João Pedro Cerqueira, Martha A. Kim, Mingoo Seok
2017Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2017, Cambridge, MA, USA, October 14-18, 2017
Hillery C. Hunter, Jaime Moreno, Joel S. Emer, Daniel Sánchez
2017Proteus: a flexible and fast software supported hardware logging approach for NVM.
Seunghee Shin, Satish Kumar Tirukkovalluri, James Tuck, Yan Solihin
2017RHMD: evasion-resilient hardware malware detectors.
Khaled N. Khasawneh, Nael B. Abu-Ghazaleh, Dmitry Ponomarev, Lei Yu
2017RTLcheck: verifying the memory consistency of RTL designs.
Yatin A. Manerkar, Daniel Lustig, Margaret Martonosi, Michael Pellauer
2017Race-to-sleep + content caching + display caching: a recipe for energy-efficient video streaming on handhelds.
Haibo Zhang, Prasanna Venkatesh Rengasamy, Shulin Zhao, Nachiappan Chidambaram Nachiappan, Anand Sivasubramaniam, Mahmut T. Kandemir, Ravi R. Iyer, Chita R. Das
2017Regless: just-in-time operand staging for GPUs.
John Kloosterman, Jonathan Beaumont, Davoud Anoushe Jamshidi, Jonathan Bailey, Trevor N. Mudge, Scott A. Mahlke
2017SCRATCH: an end-to-end application-aware soft-GPGPU architecture and trimming tool.
Pedro Duarte, Pedro Tomás, Gabriel Falcão
2017Scale-out acceleration for machine learning.
Jongse Park, Hardik Sharma, Divya Mahajan, Joon Kyung Kim, Preston Olds, Hadi Esmaeilzadeh
2017Schedtask: a hardware-assisted task scheduler.
Prathmesh Kallurkar, Smruti R. Sarangi
2017Software-based gate-level information flow security for IoT systems.
Hari Cherupalli, Henry Duwe, Weidong Ye, Rakesh Kumar, John Sartori
2017Summarizer: trading communication with computing near storage.
Gunjae Koo, Kiran Kumar Matam, Te I, H. V. Krishna Giri Narra, Jing Li, Hung-Wei Tseng, Steven Swanson, Murali Annavaram
2017TMI: thread memory isolation for false sharing repair.
Christian DeLozier, Ariel Eizenberg, Shiliang Hu, Gilles Pokam, Joseph Devietti
2017Taming the instruction bandwidth of quantum computers via hardware-managed error correction.
Swamit S. Tannu, Zachary A. Myers, Prashant J. Nair, Douglas M. Carmean, Moinuddin K. Qureshi
2017UDP: a programmable accelerator for extract-transform-load workloads and more.
Yuanwei Fang, Chen Zou, Aaron J. Elmore, Andrew A. Chien
2017UNFOLD: a memory-efficient speech recognizer using on-the-fly WFST composition.
Reza Yazdani, José-María Arnau, Antonio González
2017Unleashing the power of GPU for physically-based rendering via dynamic ray shuffling.
Ya-Shuai Lü, Libo Huang, Li Shen, Zhiying Wang
2017Using branch predictors to predict brain activity in brain-machine implants.
Abhishek Bhattacharjee
2017Using intra-core loop-task accelerators to improve the productivity and performance of task-based parallel programs.
Ji Kim, Shunning Jiang, Christopher Torng, Moyang Wang, Shreesha Srinath, Berkin Ilbeyi, Khalid Al-Hawaj, Christopher Batten
2017Versapipe: a versatile programming framework for pipelined computing on GPU.
Zhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen, Youngmin Yi, Wenguang Chen
2017Wireframe: supporting data-dependent parallelism through dependency graph execution in GPUs.
AmirAli Abdolrashidi, Devashree Tripathy, Mehmet Esat Belviranli, Laxmi Narayan Bhuyan, Daniel Wong
2017Xylem: enhancing vertical thermal conduction in 3D processor-memory stacks.
Aditya Agrawal, Josep Torrellas, Sachin Idgunji