| 2015 | A case for core-assisted bottleneck acceleration in GPUs: enabling flexible data compression with assist warps. Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita R. Das, Mahmut T. Kandemir, Todd C. Mowry, Onur Mutlu |
| 2015 | A fully associative, tagless DRAM cache. Yongjun Lee, Jongwon Kim, Hakbeom Jang, Hyunggyun Yang, Jangwoo Kim, Jinkyu Jeong, Jae W. Lee |
| 2015 | A scalable processing-in-memory accelerator for parallel graph processing. Junwhan Ahn, Sungpack Hong, Sungjoo Yoo, Onur Mutlu, Kiyoung Choi |
| 2015 | A variable warp size architecture. Timothy G. Rogers, Daniel R. Johnson, Mike O'Connor, Stephen W. Keckler |
| 2015 | Accelerating asynchronous programs through event sneak peek. Gaurav Chadha, Scott A. Mahlke, Satish Narayanasamy |
| 2015 | ArMOR: defending against memory consistency model mismatches in heterogeneous architectures. Daniel Lustig, Caroline Trippel, Michael Pellauer, Margaret Martonosi |
| 2015 | Architecting to achieve a billion requests per second throughput on a single key-value store server platform. Sheng Li, Hyeontaek Lim, Victor W. Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey |
| 2015 | BEAR: techniques for mitigating bandwidth bloat in gigascale DRAM caches. Chia-Chen Chou, Aamer Jaleel, Moinuddin K. Qureshi |
| 2015 | BlueDBM: an appliance for big data analytics. Sang Woo Jun, Ming Liu, Sungjin Lee, Jamey Hicks, John Ankcorn, Myron King, Shuotao Xu, Arvind |
| 2015 | Branch vanguard: decomposing branch functionality into prediction and resolution instructions. Daniel S. McFarlin, Craig B. Zilles |
| 2015 | CAWA: coordinated warp scheduling and cache prioritization for critical warp acceleration of GPGPU workloads. Shin-Ying Lee, Akhil Arunkumar, Carole-Jean Wu |
| 2015 | COP: to compress and protect main memory. David J. Palframan, Nam Sung Kim, Mikko H. Lipasti |
| 2015 | Callback: efficient synchronization without invalidation with a directory just for spin-waiting. Alberto Ros, Stefanos Kaxiras |
| 2015 | Clean: a race detector with cleaner semantics. Cedomir Segulja, Tarek S. Abdelrahman |
| 2015 | CloudMonatt: an architecture for security health monitoring and attestation of virtual machines in cloud computing. Tianwei Zhang, Ruby B. Lee |
| 2015 | Coherence protocol for transparent management of scratchpad memories in shared memory manycore architectures. Lluc Alvarez, Lluís Vilanova, Miquel Moretó, Marc Casas, Marc González, Xavier Martorell, Nacho Navarro, Eduard Ayguadé, Mateo Valero |
| 2015 | Computer performance microscopy with Shim. Xi Yang, Stephen M. Blackburn, Kathryn S. McKinley |
| 2015 | Cost-effective speculative scheduling in high performance processors. Arthur Perais, André Seznec, Pierre Michaud, Andreas Sembrant, Erik Hagersten |
| 2015 | Data reorganization in memory using 3D-stacked DRAM. Berkin Akin, Franz Franchetti, James C. Hoe |
| 2015 | DjiNN and Tonic: DNN as a service and its implications for future warehouse scale computers. Johann Hauswald, Yiping Kang, Michael A. Laurenzano, Quan Chen, Cheng Li, Trevor N. Mudge, Ronald G. Dreslinski, Jason Mars, Lingjia Tang |
| 2015 | DynaSpAM: dynamic spatial architecture mapping using out of order instruction schedules. Feng Liu, Heejin Ahn, Stephen R. Beard, Taewook Oh, David I. August |
| 2015 | Dynamic thread block launch: a lightweight execution mechanism to support irregular applications on GPUs. Jin Wang, Norm Rubin, Albert Sidelnik, Sudhakar Yalamanchili |
| 2015 | Efficient execution of memory access phases using dataflow specialization. Chen-Han Ho, Sung Jin Kim, Karthikeyan Sankaralingam |
| 2015 | Exploring the potential of heterogeneous von neumann/dataflow execution models. Tony Nowatzki, Vinay Gangadhar, Karthikeyan Sankaralingam |
| 2015 | FASE: finding amplitude-modulated side-channel emanations. Robert Locke Callan, Alenka G. Zajic, Milos Prvulovic |
| 2015 | FaultHound: value-locality-based soft-fault tolerance. Nitin, Irith Pomeranz, T. N. Vijaykumar |
| 2015 | Flexible auto-refresh: enabling scalable and energy-efficient DRAM refresh reductions. Ishwar Bhati, Zeshan Chishti, Shih-Lien Lu, Bruce L. Jacob |
| 2015 | Flexible software profiling of GPU architectures. Mark Stephenson, Siva Kumar Sastry Hari, Yunsup Lee, Eiman Ebrahimi, Daniel R. Johnson, David W. Nellans, Mike O'Connor, Stephen W. Keckler |
| 2015 | Fusion: design tradeoffs in coherent cache hierarchies for accelerators. Snehasish Kumar, Arrvindh Shriraman, Naveen Vedula |
| 2015 | HEB: deploying and managing hybrid energy buffers for improving datacenter efficiency and economy. Longjun Liu, Chao Li, Hongbin Sun, Yang Hu, Juncheng Gu, Tao Li, Jingmin Xin, Nanning Zheng |
| 2015 | Harmonia: balancing compute and memory power in high-performance GPUs. Indrani Paul, Wei Huang, Manish Arora, Sudhakar Yalamanchili |
| 2015 | Heracles: improving resource efficiency at scale. David Lo, Liqun Cheng, Rama K. Govindaraju, Parthasarathy Ranganathan, Christos Kozyrakis |
| 2015 | Hi-fi playback: tolerating position errors in shift operations of racetrack memory. Chao Zhang, Guangyu Sun, Xian Zhang, Weiqi Zhang, Weisheng Zhao, Tao Wang, Yun Liang, Yongpan Liu, Yu Wang, Jiwu Shu |
| 2015 | LaZy superscalar. Görkem Asilioglu, Zhaoxiang Jin, Murat Köksal, Omkar Javeri, Soner Önder |
| 2015 | MBus: an ultra-low power interconnect bus for next generation nanopower systems. Pat Pannuto, Yoonmyung Lee, Ye-Sheng Kuo, Zhiyoong Foo, Benjamin P. Kempke, Gyouho Kim, Ronald G. Dreslinski, David T. Blaauw, Prabal Dutta |
| 2015 | Manycore network interfaces for in-memory rack-scale computing. Alexandros Daglis, Stanko Novakovic, Edouard Bugnion, Babak Falsafi, Boris Grot |
| 2015 | MiSAR: minimalistic synchronization accelerator with resource overflow management. Ching-Kai Liang, Milos Prvulovic |
| 2015 | Multiple clone row DRAM: a low latency and area optimized DRAM. Jungwhan Choi, Wongyu Shin, Jaemin Jang, Jinwoong Suh, Yongkee Kwon, Youngsuk Moon, Lee-Sup Kim |
| 2015 | PIM-enabled instructions: a low-overhead, locality-aware processing-in-memory architecture. Junwhan Ahn, Sungjoo Yoo, Onur Mutlu, Kiyoung Choi |
| 2015 | Page overlays: an enhanced virtual memory framework to enable fine-grained memory management. Vivek Seshadri, Gennady Pekhimenko, Olatunji Ruwase, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry, Trishul M. Chilimbi |
| 2015 | PrORAM: dynamic prefetcher for oblivious RAM. Xiangyao Yu, Syed Kamran Haider, Ling Ren, Christopher W. Fletcher, Albert Kwon, Marten van Dijk, Srinivas Devadas |
| 2015 | Probable cause: the deanonymizing effects of approximate DRAM. Amir Rahmati, Matthew Hicks, Daniel E. Holcomb, Kevin Fu |
| 2015 | Proceedings of the 42nd Annual International Symposium on Computer Architecture, Portland, OR, USA, June 13-17, 2015 Deborah T. Marr, David H. Albonesi |
| 2015 | Profiling a warehouse-scale computer. Svilen Kanev, Juan Pablo Darago, Kim M. Hazelwood, Parthasarathy Ranganathan, Tipp Moseley, Gu-Yeon Wei, David M. Brooks |
| 2015 | Quantitative comparison of hardware transactional memory for Blue Gene/Q, zEnterprise EC12, Intel Core, and POWER8. Takuya Nakaike, Rei Odaira, Matthew Gaudet, Maged M. Michael, Hisanobu Tomari |
| 2015 | Reducing world switches in virtualized environment with flexible cross-world calls. Wenhao Li, Yubin Xia, Haibo Chen, Binyu Zang, Haibing Guan |
| 2015 | Redundant memory mappings for fast access to large memories. Vasileios Karakostas, Jayneel Gandhi, Furkan Ayar, Adrián Cristal, Mark D. Hill, Kathryn S. McKinley, Mario Nemirovsky, Michael M. Swift, Osman S. Unsal |
| 2015 | Rumba: an online quality management system for approximate computing. Daya Shanker Khudia, Babak Zamirai, Mehrzad Samadi, Scott A. Mahlke |
| 2015 | SHRINK: reducing the ISA complexity via instruction recycling. Bruno Cardoso Lopes, Rafael Auler, Luiz Ramos, Edson Borin, Rodolfo Azevedo |
| 2015 | SLIP: reducing wire energy in the memory hierarchy. Subhasis Das, Tor M. Aamodt, William J. Dally |
| 2015 | Semantic locality and context-based prefetching using reinforcement learning. Leeor Peled, Shie Mannor, Uri C. Weiser, Yoav Etsion |
| 2015 | ShiDianNao: shifting vision processing closer to the sensor. Zidong Du, Robert Fasthuber, Tianshi Chen, Paolo Ienne, Ling Li, Tao Luo, Xiaobing Feng, Yunji Chen, Olivier Temam |
| 2015 | Stash: have your scratchpad and cache it too. Rakesh Komuravelli, Matthew D. Sinclair, Johnathan Alsop, Muhammad Huzaifa, Maria Kotsifakou, Prakalp Srivastava, Sarita V. Adve, Vikram S. Adve |
| 2015 | The load slice core microarchitecture. Trevor E. Carlson, Wim Heirman, Osman Allam, Stefanos Kaxiras, Lieven Eeckhout |
| 2015 | Thermal time shifting: leveraging phase change materials to reduce cooling costs in warehouse-scale computers. Matt Skach, Manish Arora, Chang-Hong Hsu, Qi Li, Dean M. Tullsen, Lingjia Tang, Jason Mars |
| 2015 | Towards sustainable in-situ server systems in the big data era. Chao Li, Yang Hu, Longjun Liu, Juncheng Gu, Mingcong Song, Xiaoyao Liang, Jingling Yuan, Tao Li |
| 2015 | Unified address translation for memory-mapped SSDs with FlashMap. Jian Huang, Anirudh Badam, Moinuddin K. Qureshi, Karsten Schwan |
| 2015 | VIP: virtualizing IP chains on handheld platforms. Nachiappan Chidambaram Nachiappan, Haibo Zhang, Jihyun Ryoo, Niranjan Soundararajan, Anand Sivasubramaniam, Mahmut T. Kandemir, Ravishankar R. Iyer, Chita R. Das |
| 2015 | Warped-compression: enabling power efficient GPUs through register compression. Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, Murali Annavaram |