ISCA - RankMe

59 papers

Year	Title / Authors
2015	A case for core-assisted bottleneck acceleration in GPUs: enabling flexible data compression with assist warps. Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Abhishek Bhowmick, Rachata Ausavarungnirun, Chita R. Das, Mahmut T. Kandemir, Todd C. Mowry, Onur Mutlu
2015	A fully associative, tagless DRAM cache. Yongjun Lee, Jongwon Kim, Hakbeom Jang, Hyunggyun Yang, Jangwoo Kim, Jinkyu Jeong, Jae W. Lee
2015	A scalable processing-in-memory accelerator for parallel graph processing. Junwhan Ahn, Sungpack Hong, Sungjoo Yoo, Onur Mutlu, Kiyoung Choi
2015	A variable warp size architecture. Timothy G. Rogers, Daniel R. Johnson, Mike O'Connor, Stephen W. Keckler
2015	Accelerating asynchronous programs through event sneak peek. Gaurav Chadha, Scott A. Mahlke, Satish Narayanasamy
2015	ArMOR: defending against memory consistency model mismatches in heterogeneous architectures. Daniel Lustig, Caroline Trippel, Michael Pellauer, Margaret Martonosi
2015	Architecting to achieve a billion requests per second throughput on a single key-value store server platform. Sheng Li, Hyeontaek Lim, Victor W. Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey
2015	BEAR: techniques for mitigating bandwidth bloat in gigascale DRAM caches. Chia-Chen Chou, Aamer Jaleel, Moinuddin K. Qureshi
2015	BlueDBM: an appliance for big data analytics. Sang Woo Jun, Ming Liu, Sungjin Lee, Jamey Hicks, John Ankcorn, Myron King, Shuotao Xu, Arvind
2015	Branch vanguard: decomposing branch functionality into prediction and resolution instructions. Daniel S. McFarlin, Craig B. Zilles
2015	CAWA: coordinated warp scheduling and cache prioritization for critical warp acceleration of GPGPU workloads. Shin-Ying Lee, Akhil Arunkumar, Carole-Jean Wu
2015	COP: to compress and protect main memory. David J. Palframan, Nam Sung Kim, Mikko H. Lipasti
2015	Callback: efficient synchronization without invalidation with a directory just for spin-waiting. Alberto Ros, Stefanos Kaxiras
2015	Clean: a race detector with cleaner semantics. Cedomir Segulja, Tarek S. Abdelrahman
2015	CloudMonatt: an architecture for security health monitoring and attestation of virtual machines in cloud computing. Tianwei Zhang, Ruby B. Lee
2015	Coherence protocol for transparent management of scratchpad memories in shared memory manycore architectures. Lluc Alvarez, Lluís Vilanova, Miquel Moretó, Marc Casas, Marc González, Xavier Martorell, Nacho Navarro, Eduard Ayguadé, Mateo Valero
2015	Computer performance microscopy with Shim. Xi Yang, Stephen M. Blackburn, Kathryn S. McKinley
2015	Cost-effective speculative scheduling in high performance processors. Arthur Perais, André Seznec, Pierre Michaud, Andreas Sembrant, Erik Hagersten
2015	Data reorganization in memory using 3D-stacked DRAM. Berkin Akin, Franz Franchetti, James C. Hoe
2015	DjiNN and Tonic: DNN as a service and its implications for future warehouse scale computers. Johann Hauswald, Yiping Kang, Michael A. Laurenzano, Quan Chen, Cheng Li, Trevor N. Mudge, Ronald G. Dreslinski, Jason Mars, Lingjia Tang
2015	DynaSpAM: dynamic spatial architecture mapping using out of order instruction schedules. Feng Liu, Heejin Ahn, Stephen R. Beard, Taewook Oh, David I. August
2015	Dynamic thread block launch: a lightweight execution mechanism to support irregular applications on GPUs. Jin Wang, Norm Rubin, Albert Sidelnik, Sudhakar Yalamanchili
2015	Efficient execution of memory access phases using dataflow specialization. Chen-Han Ho, Sung Jin Kim, Karthikeyan Sankaralingam
2015	Exploring the potential of heterogeneous von Neumann/dataflow execution models. Tony Nowatzki, Vinay Gangadhar, Karthikeyan Sankaralingam
2015	FASE: finding amplitude-modulated side-channel emanations. Robert Locke Callan, Alenka G. Zajic, Milos Prvulovic
2015	FaultHound: value-locality-based soft-fault tolerance. Nitin, Irith Pomeranz, T. N. Vijaykumar
2015	Flexible auto-refresh: enabling scalable and energy-efficient DRAM refresh reductions. Ishwar Bhati, Zeshan Chishti, Shih-Lien Lu, Bruce L. Jacob
2015	Flexible software profiling of GPU architectures. Mark Stephenson, Siva Kumar Sastry Hari, Yunsup Lee, Eiman Ebrahimi, Daniel R. Johnson, David W. Nellans, Mike O'Connor, Stephen W. Keckler
2015	Fusion: design tradeoffs in coherent cache hierarchies for accelerators. Snehasish Kumar, Arrvindh Shriraman, Naveen Vedula
2015	HEB: deploying and managing hybrid energy buffers for improving datacenter efficiency and economy. Longjun Liu, Chao Li, Hongbin Sun, Yang Hu, Juncheng Gu, Tao Li, Jingmin Xin, Nanning Zheng
2015	Harmonia: balancing compute and memory power in high-performance GPUs. Indrani Paul, Wei Huang, Manish Arora, Sudhakar Yalamanchili
2015	Heracles: improving resource efficiency at scale. David Lo, Liqun Cheng, Rama K. Govindaraju, Parthasarathy Ranganathan, Christos Kozyrakis
2015	Hi-fi playback: tolerating position errors in shift operations of racetrack memory. Chao Zhang, Guangyu Sun, Xian Zhang, Weiqi Zhang, Weisheng Zhao, Tao Wang, Yun Liang, Yongpan Liu, Yu Wang, Jiwu Shu
2015	LaZy superscalar. Görkem Asilioglu, Zhaoxiang Jin, Murat Köksal, Omkar Javeri, Soner Önder
2015	MBus: an ultra-low power interconnect bus for next generation nanopower systems. Pat Pannuto, Yoonmyung Lee, Ye-Sheng Kuo, Zhiyoong Foo, Benjamin P. Kempke, Gyouho Kim, Ronald G. Dreslinski, David T. Blaauw, Prabal Dutta
2015	Manycore network interfaces for in-memory rack-scale computing. Alexandros Daglis, Stanko Novakovic, Edouard Bugnion, Babak Falsafi, Boris Grot
2015	MiSAR: minimalistic synchronization accelerator with resource overflow management. Ching-Kai Liang, Milos Prvulovic
2015	Multiple clone row DRAM: a low latency and area optimized DRAM. Jungwhan Choi, Wongyu Shin, Jaemin Jang, Jinwoong Suh, Yongkee Kwon, Youngsuk Moon, Lee-Sup Kim
2015	PIM-enabled instructions: a low-overhead, locality-aware processing-in-memory architecture. Junwhan Ahn, Sungjoo Yoo, Onur Mutlu, Kiyoung Choi
2015	Page overlays: an enhanced virtual memory framework to enable fine-grained memory management. Vivek Seshadri, Gennady Pekhimenko, Olatunji Ruwase, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry, Trishul M. Chilimbi
2015	PrORAM: dynamic prefetcher for oblivious RAM. Xiangyao Yu, Syed Kamran Haider, Ling Ren, Christopher W. Fletcher, Albert Kwon, Marten van Dijk, Srinivas Devadas
2015	Probable cause: the deanonymizing effects of approximate DRAM. Amir Rahmati, Matthew Hicks, Daniel E. Holcomb, Kevin Fu
2015	Proceedings of the 42nd Annual International Symposium on Computer Architecture, Portland, OR, USA, June 13-17, 2015. Deborah T. Marr, David H. Albonesi
2015	Profiling a warehouse-scale computer. Svilen Kanev, Juan Pablo Darago, Kim M. Hazelwood, Parthasarathy Ranganathan, Tipp Moseley, Gu-Yeon Wei, David M. Brooks
2015	Quantitative comparison of hardware transactional memory for Blue Gene/Q, zEnterprise EC12, Intel Core, and POWER8. Takuya Nakaike, Rei Odaira, Matthew Gaudet, Maged M. Michael, Hisanobu Tomari
2015	Reducing world switches in virtualized environment with flexible cross-world calls. Wenhao Li, Yubin Xia, Haibo Chen, Binyu Zang, Haibing Guan
2015	Redundant memory mappings for fast access to large memories. Vasileios Karakostas, Jayneel Gandhi, Furkan Ayar, Adrián Cristal, Mark D. Hill, Kathryn S. McKinley, Mario Nemirovsky, Michael M. Swift, Osman S. Unsal
2015	Rumba: an online quality management system for approximate computing. Daya Shanker Khudia, Babak Zamirai, Mehrzad Samadi, Scott A. Mahlke
2015	SHRINK: reducing the ISA complexity via instruction recycling. Bruno Cardoso Lopes, Rafael Auler, Luiz Ramos, Edson Borin, Rodolfo Azevedo
2015	SLIP: reducing wire energy in the memory hierarchy. Subhasis Das, Tor M. Aamodt, William J. Dally
2015	Semantic locality and context-based prefetching using reinforcement learning. Leeor Peled, Shie Mannor, Uri C. Weiser, Yoav Etsion
2015	ShiDianNao: shifting vision processing closer to the sensor. Zidong Du, Robert Fasthuber, Tianshi Chen, Paolo Ienne, Ling Li, Tao Luo, Xiaobing Feng, Yunji Chen, Olivier Temam
2015	Stash: have your scratchpad and cache it too. Rakesh Komuravelli, Matthew D. Sinclair, Johnathan Alsop, Muhammad Huzaifa, Maria Kotsifakou, Prakalp Srivastava, Sarita V. Adve, Vikram S. Adve
2015	The load slice core microarchitecture. Trevor E. Carlson, Wim Heirman, Osman Allam, Stefanos Kaxiras, Lieven Eeckhout
2015	Thermal time shifting: leveraging phase change materials to reduce cooling costs in warehouse-scale computers. Matt Skach, Manish Arora, Chang-Hong Hsu, Qi Li, Dean M. Tullsen, Lingjia Tang, Jason Mars
2015	Towards sustainable in-situ server systems in the big data era. Chao Li, Yang Hu, Longjun Liu, Juncheng Gu, Mingcong Song, Xiaoyao Liang, Jingling Yuan, Tao Li
2015	Unified address translation for memory-mapped SSDs with FlashMap. Jian Huang, Anirudh Badam, Moinuddin K. Qureshi, Karsten Schwan
2015	VIP: virtualizing IP chains on handheld platforms. Nachiappan Chidambaram Nachiappan, Haibo Zhang, Jihyun Ryoo, Niranjan Soundararajan, Anand Sivasubramaniam, Mahmut T. Kandemir, Ravishankar R. Iyer, Chita R. Das
2015	Warped-compression: enabling power efficient GPUs through register compression. Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, Murali Annavaram