| 2017 | A many-core architecture for in-memory data processing. Sandeep R. Agrawal, Sam Idicula, Arun Raghavan, Evangelos Vlachos, Venkatraman Govindaraju, Venkatanathan Varadarajan, Cagri Balkesen, Georgios Giannikis, Charlie Roth, Nipun Agarwal, Eric Sedlar |
| 2017 | Ambit: in-memory accelerator for bulk bitwise operations using commodity DRAM technology. Vivek Seshadri, Donghyuk Lee, Thomas Mullins, Hasan Hassan, Amirali Boroumand, Jeremie S. Kim, Michael A. Kozuch, Onur Mutlu, Phillip B. Gibbons, Todd C. Mowry |
| 2017 | An experimental microarchitecture for a superconducting quantum processor. Xiang Fu, Michiel Adriaan Rol, Cornelis Christiaan Bultink, J. van Someren, Nader Khammassi, Imran Ashraf, R. F. L. Vermeulen, J. C. de Sterke, W. J. Vlothuizen, R. N. Schouten, Carmen G. Almudéver, Leonardo DiCarlo, Koen Bertels |
| 2017 | Architecting hierarchical coherence protocols for push-button parametric verification. Opeoluwa Matthews, Daniel J. Sorin |
| 2017 | Architectural opportunities for novel dynamic EMI shifting (DEMIS). Daphne I. Gorman, Matthew R. Guthaus, Jose Renau |
| 2017 | Architectural tradeoffs for biodegradable computing. Ting-Jung Chang, Zhuozhi Yao, Paul J. Jackson, Barry P. Rand, David Wentzlaff |
| 2017 | BVF: enabling significant on-chip power savings via bit-value-favor for throughput processors. Ang Li, Wenfeng Zhao, Shuaiwen Leon Song |
| 2017 | Banshee: bandwidth-efficient DRAM caching via software/hardware cooperation. Xiangyao Yu, Christopher J. Hughes, Nadathur Satish, Onur Mutlu, Srinivas Devadas |
| 2017 | Beyond the socket: NUMA-aware GPUs. Ugljesa Milic, Oreste Villa, Evgeny Bolotin, Akhil Arunkumar, Eiman Ebrahimi, Aamer Jaleel, Alex Ramírez, David W. Nellans |
| 2017 | Bit-pragmatic deep neural network computing. Jorge Albericio, Alberto Delmas, Patrick Judd, Sayeh Sharify, Gerard O'Leary, Roman Genov, Andreas Moshovos |
| 2017 | CSALT: context switch aware large TLB. Yashwant Marathe, Nagendra Gulur, Jee Ho Ryoo, Shuang Song, Lizy K. John |
| 2017 | Cache automaton. Arun Subramaniyan, Jingcheng Wang, Ezhil R. M. Balasubramanian, David T. Blaauw, Dennis Sylvester, Reetuparna Das |
| 2017 | CirCNN: accelerating and compressing deep neural networks using block-circulant weight matrices. Caiwen Ding, Siyu Liao, Yanzhi Wang, Zhe Li, Ning Liu, Youwei Zhuo, Chao Wang, Xuehai Qian, Yu Bai, Geng Yuan, Xiaolong Ma, Yipeng Zhang, Jian Tang, Qinru Qiu, Xue Lin, Bo Yuan |
| 2017 | Constructing and characterizing covert channels on GPGPUs. Hoda Naghibijouybari, Khaled N. Khasawneh, Nael B. Abu-Ghazaleh |
| 2017 | Contutto: a novel FPGA-based prototyping platform enabling innovation in the memory subsystem of a server class processor. Bharat Sukhwani, Thomas Roewer, Charles L. Haymes, Kyu-Hyoun Kim, Adam J. McPadden, Daniel M. Dreps, Dean Sanner, Jan van Lunteren, Sameh W. Asaad |
| 2017 | DRISA: a DRAM-based reconfigurable in-situ accelerator. Shuangchen Li, Dimin Niu, Krishna T. Malladi, Hongzhong Zheng, Bob Brennan, Yuan Xie |
| 2017 | Data movement aware computation partitioning. Xulong Tang, Orhan Kislal, Mahmut T. Kandemir, Mustafa Karaköy |
| 2017 | DeftNN: addressing bottlenecks for DNN execution on GPUs via synapse vector elimination and near-compute data fission. Parker Hill, Animesh Jain, Mason Hill, Babak Zamirai, Chang-Hong Hsu, Michael A. Laurenzano, Scott A. Mahlke, Lingjia Tang, Jason Mars |
| 2017 | Detecting and mitigating data-dependent DRAM failures by exploiting current memory content. Samira Manabi Khan, Chris Wilkerson, Zhe Wang, Alaa R. Alameldeen, Donghyuk Lee, Onur Mutlu |
| 2017 | Efficient exception handling support for GPUs. Ivan Tanasic, Isaac Gelado, Marc Jordà, Eduard Ayguadé, Nacho Navarro |
| 2017 | Efficient support of position independence on non-volatile memory. Guoyang Chen, Lei Zhang, Richa Budhiraja, Xipeng Shen, Youfeng Wu |
| 2017 | Estimating and understanding architectural risk. Weilong Cui, Timothy Sherwood |
| 2017 | Exploiting heterogeneity for tail latency and energy efficiency. Md. Enamul Haque, Yuxiong He, Sameh Elnikety, Thu D. Nguyen, Ricardo Bianchini, Kathryn S. McKinley |
| 2017 | Fine-grained DRAM: energy-efficient DRAM for extreme bandwidth systems. Mike O'Connor, Niladrish Chatterjee, Donghyuk Lee, John M. Wilson, Aditya Agrawal, Stephen W. Keckler, William J. Dally |
| 2017 | GPUpd: a fast and scalable multi-GPU architecture using cooperative projection and distribution. Youngsok Kim, Jae-Eon Jo, Hanhwi Jang, Minsoo Rhu, Hanjun Kim, Jangwoo Kim |
| 2017 | Hardware supported persistent object address translation. Tiancong Wang, Sakthikumaran Sambasivam, Yan Solihin, James Tuck |
| 2017 | Harnessing voltage margins for energy efficiency in multicore CPUs. George Papadimitriou, Manolis Kaliorakis, Athanasios Chatzidimitriou, Dimitris Gizopoulos, Peter Lawthers, Shidhartha Das |
| 2017 | How secure is your cache against side-channel attacks? Zecheng He, Ruby B. Lee |
| 2017 | Hybrid analog-digital solution of nonlinear partial differential equations. Yipeng Huang, Ning Guo, Mingoo Seok, Yannis P. Tsividis, Kyle T. Mandli, Simha Sethumadhavan |
| 2017 | IDEAL: image denoising accelerator. Mostafa Mahmoud, Bojian Zheng, Alberto Delmas Lascorz, Felix Heide, Jonathan Assouline, Paul Boucher, Emmanuel Onzon, Andreas Moshovos |
| 2017 | Improving the effectiveness of searching for isomorphic chains in superword level parallelism. Joonmoo Huh, James Tuck |
| 2017 | Incidental computing on IoT nonvolatile processors. Kaisheng Ma, Xueqing Li, Jinyang Li, Yongpan Liu, Yuan Xie, Jack Sampson, Mahmut Taylan Kandemir, Vijaykrishnan Narayanan |
| 2017 | Load value prediction via path-based address prediction: avoiding mispredictions due to conflicting stores. Rami Sheikh, Harold W. Cain, Raguram Damodaran |
| 2017 | Memory cocktail therapy: a general learning-based framework to optimize dynamic tradeoffs in NVMs. Zhaoxia Deng, Lunkai Zhang, Nikita Mishra, Henry Hoffmann, Frederic T. Chong |
| 2017 | Mirage cores: the illusion of many out-of-order cores using in-order hardware. Shruti Padmanabha, Andrew Lukefahr, Reetuparna Das, Scott A. Mahlke |
| 2017 | Mosaic: a GPU memory manager with application-transparent support for multiple page sizes. Rachata Ausavarungnirun, Joshua Landgraf, Vance Miller, Saugata Ghose, Jayneel Gandhi, Christopher J. Rossbach, Onur Mutlu |
| 2017 | Multiperspective reuse prediction. Daniel A. Jiménez, Elvira Teran |
| 2017 | Optimized surface code communication in superconducting quantum computers. Ali Javadi-Abhari, Pranav Gokhale, Adam Holmes, Diana Franklin, Kenneth R. Brown, Margaret Martonosi, Frederic T. Chong |
| 2017 | PARSNIP: performant architecture for race safety with no impact on precision. Yuanfeng Peng, Benjamin P. Wood, Joseph Devietti |
| 2017 | Pageforge: a near-memory content-aware page-merging architecture. Dimitrios Skarlatos, Nam Sung Kim, Josep Torrellas |
| 2017 | Pipelining a triggered processing element. Thomas J. Repetti, João Pedro Cerqueira, Martha A. Kim, Mingoo Seok |
| 2017 | Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2017, Cambridge, MA, USA, October 14-18, 2017. Hillery C. Hunter, Jaime Moreno, Joel S. Emer, Daniel Sánchez |
| 2017 | Proteus: a flexible and fast software supported hardware logging approach for NVM. Seunghee Shin, Satish Kumar Tirukkovalluri, James Tuck, Yan Solihin |
| 2017 | RHMD: evasion-resilient hardware malware detectors. Khaled N. Khasawneh, Nael B. Abu-Ghazaleh, Dmitry Ponomarev, Lei Yu |
| 2017 | RTLcheck: verifying the memory consistency of RTL designs. Yatin A. Manerkar, Daniel Lustig, Margaret Martonosi, Michael Pellauer |
| 2017 | Race-to-sleep + content caching + display caching: a recipe for energy-efficient video streaming on handhelds. Haibo Zhang, Prasanna Venkatesh Rengasamy, Shulin Zhao, Nachiappan Chidambaram Nachiappan, Anand Sivasubramaniam, Mahmut T. Kandemir, Ravi R. Iyer, Chita R. Das |
| 2017 | Regless: just-in-time operand staging for GPUs. John Kloosterman, Jonathan Beaumont, Davoud Anoushe Jamshidi, Jonathan Bailey, Trevor N. Mudge, Scott A. Mahlke |
| 2017 | SCRATCH: an end-to-end application-aware soft-GPGPU architecture and trimming tool. Pedro Duarte, Pedro Tomás, Gabriel Falcão |
| 2017 | Scale-out acceleration for machine learning. Jongse Park, Hardik Sharma, Divya Mahajan, Joon Kyung Kim, Preston Olds, Hadi Esmaeilzadeh |
| 2017 | Schedtask: a hardware-assisted task scheduler. Prathmesh Kallurkar, Smruti R. Sarangi |
| 2017 | Software-based gate-level information flow security for IoT systems. Hari Cherupalli, Henry Duwe, Weidong Ye, Rakesh Kumar, John Sartori |
| 2017 | Summarizer: trading communication with computing near storage. Gunjae Koo, Kiran Kumar Matam, Te I, H. V. Krishna Giri Narra, Jing Li, Hung-Wei Tseng, Steven Swanson, Murali Annavaram |
| 2017 | TMI: thread memory isolation for false sharing repair. Christian DeLozier, Ariel Eizenberg, Shiliang Hu, Gilles Pokam, Joseph Devietti |
| 2017 | Taming the instruction bandwidth of quantum computers via hardware-managed error correction. Swamit S. Tannu, Zachary A. Myers, Prashant J. Nair, Douglas M. Carmean, Moinuddin K. Qureshi |
| 2017 | UDP: a programmable accelerator for extract-transform-load workloads and more. Yuanwei Fang, Chen Zou, Aaron J. Elmore, Andrew A. Chien |
| 2017 | UNFOLD: a memory-efficient speech recognizer using on-the-fly WFST composition. Reza Yazdani, José-María Arnau, Antonio González |
| 2017 | Unleashing the power of GPU for physically-based rendering via dynamic ray shuffling. Ya-Shuai Lü, Libo Huang, Li Shen, Zhiying Wang |
| 2017 | Using branch predictors to predict brain activity in brain-machine implants. Abhishek Bhattacharjee |
| 2017 | Using intra-core loop-task accelerators to improve the productivity and performance of task-based parallel programs. Ji Kim, Shunning Jiang, Christopher Torng, Moyang Wang, Shreesha Srinath, Berkin Ilbeyi, Khalid Al-Hawaj, Christopher Batten |
| 2017 | Versapipe: a versatile programming framework for pipelined computing on GPU. Zhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen, Youngmin Yi, Wenguang Chen |
| 2017 | Wireframe: supporting data-dependent parallelism through dependency graph execution in GPUs. AmirAli Abdolrashidi, Devashree Tripathy, Mehmet Esat Belviranli, Laxmi Narayan Bhuyan, Daniel Wong |
| 2017 | Xylem: enhancing vertical thermal conduction in 3D processor-memory stacks. Aditya Agrawal, Josep Torrellas, Sachin Idgunji |