ISCA A*

59 papers

Year Title / Authors
2015 A case for core-assisted bottleneck acceleration in GPUs: enabling flexible data compression with assist warps.
Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita R. Das, Mahmut T. Kandemir, Todd C. Mowry, Onur Mutlu
2015 A fully associative, tagless DRAM cache.
Yongjun Lee, Jongwon Kim, Hakbeom Jang, Hyunggyun Yang, Jangwoo Kim, Jinkyu Jeong, Jae W. Lee
2015 A scalable processing-in-memory accelerator for parallel graph processing.
Junwhan Ahn, Sungpack Hong, Sungjoo Yoo, Onur Mutlu, Kiyoung Choi
2015 A variable warp size architecture.
Timothy G. Rogers, Daniel R. Johnson, Mike O'Connor, Stephen W. Keckler
2015 Accelerating asynchronous programs through event sneak peek.
Gaurav Chadha, Scott A. Mahlke, Satish Narayanasamy
2015 ArMOR: defending against memory consistency model mismatches in heterogeneous architectures.
Daniel Lustig, Caroline Trippel, Michael Pellauer, Margaret Martonosi
2015 Architecting to achieve a billion requests per second throughput on a single key-value store server platform.
Sheng Li, Hyeontaek Lim, Victor W. Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey
2015 BEAR: techniques for mitigating bandwidth bloat in gigascale DRAM caches.
Chia-Chen Chou, Aamer Jaleel, Moinuddin K. Qureshi
2015 BlueDBM: an appliance for big data analytics.
Sang Woo Jun, Ming Liu, Sungjin Lee, Jamey Hicks, John Ankcorn, Myron King, Shuotao Xu, Arvind
2015 Branch vanguard: decomposing branch functionality into prediction and resolution instructions.
Daniel S. McFarlin, Craig B. Zilles
2015 CAWA: coordinated warp scheduling and cache prioritization for critical warp acceleration of GPGPU workloads.
Shin-Ying Lee, Akhil Arunkumar, Carole-Jean Wu
2015 COP: to compress and protect main memory.
David J. Palframan, Nam Sung Kim, Mikko H. Lipasti
2015 Callback: efficient synchronization without invalidation with a directory just for spin-waiting.
Alberto Ros, Stefanos Kaxiras
2015 Clean: a race detector with cleaner semantics.
Cedomir Segulja, Tarek S. Abdelrahman
2015 CloudMonatt: an architecture for security health monitoring and attestation of virtual machines in cloud computing.
Tianwei Zhang, Ruby B. Lee
2015 Coherence protocol for transparent management of scratchpad memories in shared memory manycore architectures.
Lluc Alvarez, Lluís Vilanova, Miquel Moretó, Marc Casas, Marc González, Xavier Martorell, Nacho Navarro, Eduard Ayguadé, Mateo Valero
2015 Computer performance microscopy with Shim.
Xi Yang, Stephen M. Blackburn, Kathryn S. McKinley
2015 Cost-effective speculative scheduling in high performance processors.
Arthur Perais, André Seznec, Pierre Michaud, Andreas Sembrant, Erik Hagersten
2015 Data reorganization in memory using 3D-stacked DRAM.
Berkin Akin, Franz Franchetti, James C. Hoe
2015 DjiNN and Tonic: DNN as a service and its implications for future warehouse scale computers.
Johann Hauswald, Yiping Kang, Michael A. Laurenzano, Quan Chen, Cheng Li, Trevor N. Mudge, Ronald G. Dreslinski, Jason Mars, Lingjia Tang
2015 DynaSpAM: dynamic spatial architecture mapping using out of order instruction schedules.
Feng Liu, Heejin Ahn, Stephen R. Beard, Taewook Oh, David I. August
2015 Dynamic thread block launch: a lightweight execution mechanism to support irregular applications on GPUs.
Jin Wang, Norm Rubin, Albert Sidelnik, Sudhakar Yalamanchili
2015 Efficient execution of memory access phases using dataflow specialization.
Chen-Han Ho, Sung Jin Kim, Karthikeyan Sankaralingam
2015 Exploring the potential of heterogeneous von Neumann/dataflow execution models.
Tony Nowatzki, Vinay Gangadhar, Karthikeyan Sankaralingam
2015 FASE: finding amplitude-modulated side-channel emanations.
Robert Locke Callan, Alenka G. Zajic, Milos Prvulovic
2015 FaultHound: value-locality-based soft-fault tolerance.
Nitin, Irith Pomeranz, T. N. Vijaykumar
2015 Flexible auto-refresh: enabling scalable and energy-efficient DRAM refresh reductions.
Ishwar Bhati, Zeshan Chishti, Shih-Lien Lu, Bruce L. Jacob
2015 Flexible software profiling of GPU architectures.
Mark Stephenson, Siva Kumar Sastry Hari, Yunsup Lee, Eiman Ebrahimi, Daniel R. Johnson, David W. Nellans, Mike O'Connor, Stephen W. Keckler
2015 Fusion: design tradeoffs in coherent cache hierarchies for accelerators.
Snehasish Kumar, Arrvindh Shriraman, Naveen Vedula
2015 HEB: deploying and managing hybrid energy buffers for improving datacenter efficiency and economy.
Longjun Liu, Chao Li, Hongbin Sun, Yang Hu, Juncheng Gu, Tao Li, Jingmin Xin, Nanning Zheng
2015 Harmonia: balancing compute and memory power in high-performance GPUs.
Indrani Paul, Wei Huang, Manish Arora, Sudhakar Yalamanchili
2015 Heracles: improving resource efficiency at scale.
David Lo, Liqun Cheng, Rama K. Govindaraju, Parthasarathy Ranganathan, Christos Kozyrakis
2015 Hi-fi playback: tolerating position errors in shift operations of racetrack memory.
Chao Zhang, Guangyu Sun, Xian Zhang, Weiqi Zhang, Weisheng Zhao, Tao Wang, Yun Liang, Yongpan Liu, Yu Wang, Jiwu Shu
2015 LaZy superscalar.
Görkem Asilioglu, Zhaoxiang Jin, Murat Köksal, Omkar Javeri, Soner Önder
2015 MBus: an ultra-low power interconnect bus for next generation nanopower systems.
Pat Pannuto, Yoonmyung Lee, Ye-Sheng Kuo, Zhiyoong Foo, Benjamin P. Kempke, Gyouho Kim, Ronald G. Dreslinski, David T. Blaauw, Prabal Dutta
2015 Manycore network interfaces for in-memory rack-scale computing.
Alexandros Daglis, Stanko Novakovic, Edouard Bugnion, Babak Falsafi, Boris Grot
2015 MiSAR: minimalistic synchronization accelerator with resource overflow management.
Ching-Kai Liang, Milos Prvulovic
2015 Multiple clone row DRAM: a low latency and area optimized DRAM.
Jungwhan Choi, Wongyu Shin, Jaemin Jang, Jinwoong Suh, Yongkee Kwon, Youngsuk Moon, Lee-Sup Kim
2015 PIM-enabled instructions: a low-overhead, locality-aware processing-in-memory architecture.
Junwhan Ahn, Sungjoo Yoo, Onur Mutlu, Kiyoung Choi
2015 Page overlays: an enhanced virtual memory framework to enable fine-grained memory management.
Vivek Seshadri, Gennady Pekhimenko, Olatunji Ruwase, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry, Trishul M. Chilimbi
2015 PrORAM: dynamic prefetcher for oblivious RAM.
Xiangyao Yu, Syed Kamran Haider, Ling Ren, Christopher W. Fletcher, Albert Kwon, Marten van Dijk, Srinivas Devadas
2015 Probable cause: the deanonymizing effects of approximate DRAM.
Amir Rahmati, Matthew Hicks, Daniel E. Holcomb, Kevin Fu
2015 Proceedings of the 42nd Annual International Symposium on Computer Architecture, Portland, OR, USA, June 13-17, 2015
Deborah T. Marr, David H. Albonesi
2015 Profiling a warehouse-scale computer.
Svilen Kanev, Juan Pablo Darago, Kim M. Hazelwood, Parthasarathy Ranganathan, Tipp Moseley, Gu-Yeon Wei, David M. Brooks
2015 Quantitative comparison of hardware transactional memory for Blue Gene/Q, zEnterprise EC12, Intel Core, and POWER8.
Takuya Nakaike, Rei Odaira, Matthew Gaudet, Maged M. Michael, Hisanobu Tomari
2015 Reducing world switches in virtualized environment with flexible cross-world calls.
Wenhao Li, Yubin Xia, Haibo Chen, Binyu Zang, Haibing Guan
2015 Redundant memory mappings for fast access to large memories.
Vasileios Karakostas, Jayneel Gandhi, Furkan Ayar, Adrián Cristal, Mark D. Hill, Kathryn S. McKinley, Mario Nemirovsky, Michael M. Swift, Osman S. Unsal
2015 Rumba: an online quality management system for approximate computing.
Daya Shanker Khudia, Babak Zamirai, Mehrzad Samadi, Scott A. Mahlke
2015 SHRINK: reducing the ISA complexity via instruction recycling.
Bruno Cardoso Lopes, Rafael Auler, Luiz Ramos, Edson Borin, Rodolfo Azevedo
2015 SLIP: reducing wire energy in the memory hierarchy.
Subhasis Das, Tor M. Aamodt, William J. Dally
2015 Semantic locality and context-based prefetching using reinforcement learning.
Leeor Peled, Shie Mannor, Uri C. Weiser, Yoav Etsion
2015 ShiDianNao: shifting vision processing closer to the sensor.
Zidong Du, Robert Fasthuber, Tianshi Chen, Paolo Ienne, Ling Li, Tao Luo, Xiaobing Feng, Yunji Chen, Olivier Temam
2015 Stash: have your scratchpad and cache it too.
Rakesh Komuravelli, Matthew D. Sinclair, Johnathan Alsop, Muhammad Huzaifa, Maria Kotsifakou, Prakalp Srivastava, Sarita V. Adve, Vikram S. Adve
2015 The load slice core microarchitecture.
Trevor E. Carlson, Wim Heirman, Osman Allam, Stefanos Kaxiras, Lieven Eeckhout
2015 Thermal time shifting: leveraging phase change materials to reduce cooling costs in warehouse-scale computers.
Matt Skach, Manish Arora, Chang-Hong Hsu, Qi Li, Dean M. Tullsen, Lingjia Tang, Jason Mars
2015 Towards sustainable in-situ server systems in the big data era.
Chao Li, Yang Hu, Longjun Liu, Juncheng Gu, Mingcong Song, Xiaoyao Liang, Jingling Yuan, Tao Li
2015 Unified address translation for memory-mapped SSDs with FlashMap.
Jian Huang, Anirudh Badam, Moinuddin K. Qureshi, Karsten Schwan
2015 VIP: virtualizing IP chains on handheld platforms.
Nachiappan Chidambaram Nachiappan, Haibo Zhang, Jihyun Ryoo, Niranjan Soundararajan, Anand Sivasubramaniam, Mahmut T. Kandemir, Ravishankar R. Iyer, Chita R. Das
2015 Warped-compression: enabling power efficient GPUs through register compression.
Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, Murali Annavaram