HPCA - RankMe – RankMe

56 papers

Year	Title / Authors
2015	21st IEEE International Symposium on High Performance Computer Architecture, HPCA 2015, Burlingame, CA, USA, February 7-11, 2015
2015	Adaptive-latency DRAM: Optimizing DRAM timing for the common-case. Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Manabi Khan, Vivek Seshadri, Kevin Kai-Wei Chang, Onur Mutlu
2015	Adrenaline: Pinpointing and reining in tail queries with quick voltage boosting. Chang-Hong Hsu, Yunqi Zhang, Michael A. Laurenzano, David Meisner, Thomas F. Wenisch, Jason Mars, Lingjia Tang, Ronald G. Dreslinski
2015	Alloy: Parallel-serial memory channel architecture for single-chip heterogeneous processor systems. Hao Wang, Chang-Jae Park, Gyungsu Byun, Jung Ho Ahn, Nam Sung Kim
2015	Architecture exploration for ambient energy harvesting nonvolatile processors. Kaisheng Ma, Yang Zheng, Shuangchen Li, Karthik Swaminathan, Xueqing Li, Yongpan Liu, Jack Sampson, Yuan Xie, Vijaykrishnan Narayanan
2015	Augmenting low-latency HPC network with free-space optical links. Ikki Fujiwara, Michihiro Koibuchi, Tomoya Ozaki, Hiroki Matsutani, Henri Casanova
2015	BRAINIAC: Bringing reliable accuracy into neurally-implemented approximate computing. Beayna Grigorian, Nazanin Farahpour, Glenn Reinman
2015	Balancing reliability, cost, and performance tradeoffs with FreeFault. Dong-Wan Kim, Mattan Erez
2015	Bamboo ECC: Strong, safe, and flexible codes for reliable computer memory. Jungrae Kim, Michael B. Sullivan, Mattan Erez
2015	BeBoP: A cost effective predictor infrastructure for superscalar value prediction. Arthur Perais, André Seznec
2015	CAFO: Cost aware flip optimization for asymmetric memories. Rakan Maddah, Seyed Mohammad Seyedzadeh, Rami G. Melhem
2015	CiDRA: A cache-inspired DRAM resilience architecture. Young Hoon Son, Sukhan Lee, Seongil O, Sanghyuk Kwon, Nam Sung Kim, Jung Ho Ahn
2015	Coordinated static and dynamic cache bypassing for GPUs. Xiaolong Xie, Yun Liang, Yu Wang, Guangyu Sun, Tao Wang
2015	Correction prediction: Reducing error correction latency for on-chip memories. Henry Duwe, Xun Jian, Rakesh Kumar
2015	Data retention in MLC NAND flash memory: Characterization, optimization, and recovery. Yu Cai, Yixin Luo, Erich F. Haratsch, Ken Mai, Onur Mutlu
2015	Domain knowledge based energy management in handhelds. Nachiappan Chidambaram Nachiappan, Praveen Yedlapalli, Niranjan Soundararajan, Anand Sivasubramaniam, Mahmut T. Kandemir, Ravishankar R. Iyer, Chita R. Das
2015	Event-based scheduling for energy-efficient QoS (eQoS) in mobile Web applications. Yuhao Zhu, Matthew Halpern, Vijay Janapa Reddi
2015	Exploiting compressed block size as an indicator of future reuse. Gennady Pekhimenko, Tyler Huberty, Rui Cai, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry
2015	Exploring architectural heterogeneity in intelligent vision systems. Nandhini Chandramoorthy, Giuseppe Tagliavini, Kevin M. Irick, Antonio Pullini, Siddharth Advani, Sulaiman Al Habsi, Matthew Cotter, John Sampson, Vijaykrishnan Narayanan, Luca Benini
2015	FTXen: Making hypervisor resilient to hardware faults on relaxed cores. Xinxin Jin, Soyeon Park, Tianwei Sheng, Rishan Chen, Zhiyong Shan, Yuanyuan Zhou
2015	Flask coherence: A morphable hybrid coherence protocol to balance energy, performance and scalability. Lucia G. Menezo, Valentin Puente, José-Ángel Gregorio
2015	GPGPU performance and power estimation using machine learning. Gene Y. Wu, Joseph L. Greathouse, Alexander Lyashevsky, Nuwan Jayasena, Derek Chiou
2015	GPU voltage noise: Characterization and hierarchical smoothing of spatial and temporal voltage noise interference in GPU architectures. Jingwen Leng, Yazhou Zu, Vijay Janapa Reddi
2015	Heterogeneous memory architectures: A HW/SW approach for mixing die-stacked and off-package memories. Mitesh R. Meswani, Sergey Blagodurov, David Roberts, John Slice, Mike Ignatowski, Gabriel H. Loh
2015	Hierarchical private/shared classification: The key to simple and efficient coherence for clustered cache hierarchies. Alberto Ros, Mahdad Davari, Stefanos Kaxiras
2015	High performing cache hierarchies for server workloads: Relaxing inclusion to capture the latency benefits of exclusive caches. Aamer Jaleel, Joseph Nuzman, Adrian Moga, Simon C. Steely Jr., Joel S. Emer
2015	Increasing multicore system efficiency through intelligent bandwidth shifting. Víctor Jiménez, Alper Buyuktosunoglu, Pradip Bose, Francis P. O'Connell, Francisco J. Cazorla, Mateo Valero
2015	Malware-aware processors: A framework for efficient online malware detection. Meltem Ozsoy, Caleb Donovick, Iakov Gorelik, Nael B. Abu-Ghazaleh, Dmitry V. Ponomarev
2015	Mascar: Speeding up GPU warps by reducing memory pitstops. Ankit Sethia, Davoud Anoushe Jamshidi, Scott A. Mahlke
2015	NDA: Near-DRAM acceleration architecture leveraging commodity DRAM devices and standard memory modules. Amin Farmahini Farahani, Jung Ho Ahn, Katherine Morrow, Nam Sung Kim
2015	Octopus-Man: QoS-driven task management for heterogeneous multicores in warehouse-scale computers. Vinicius Petrucci, Michael A. Laurenzano, John Doherty, Yunqi Zhang, Daniel Mossé, Jason Mars, Lingjia Tang
2015	Overcoming far-end congestion in large-scale networks. Jongmin Won, Gwangsun Kim, John Kim, Ted Jiang, Mike Parker, Steve Scott
2015	Overcoming the challenges of crossbar resistive memory architectures. Cong Xu, Dimin Niu, Naveen Muralimanohar, Rajeev Balasubramonian, Tao Zhang, Shimeng Yu, Yuan Xie
2015	Paying to save: Reducing cost of colocation data center via rewards. Mohammad A. Islam, A. Hasan Mahmud, Shaolei Ren, Xiaorui Wang
2015	Power punch: Towards non-blocking power-gating of NoC routers. Lizhong Chen, Di Zhu, Massoud Pedram, Timothy Mark Pinkston
2015	Prediction-based superpage-friendly TLB designs. Misel-Myrto Papadopoulou, Xin Tong, André Seznec, Andreas Moshovos
2015	Priority-based cache allocation in throughput processors. Dong Li, Minsoo Rhu, Daniel R. Johnson, Mike O'Connor, Mattan Erez, Doug Burger, Donald S. Fussell, Stephen W. Redder
2015	Quantifying sources of error in McPAT and potential impacts on architectural studies. Sam Likun Xi, Hans M. Jacobson, Pradip Bose, Gu-Yeon Wei, David M. Brooks
2015	Reducing read latency of phase change memory via early read and Turbo Read. Prashant J. Nair, Chia-Chen Chou, Bipin Rajendran, Moinuddin K. Qureshi
2015	Run-time monitoring with adjustable overhead using dataflow-guided filtering. Daniel Lo, Tao Chen, Mohamed Ismail, G. Edward Suh
2015	SCOC: High-radix switches made of bufferless clos networks. Nikolaos Chrysos, Cyriel Minkenberg, Mark Rudquist, Claude Basso, Brian Vanderpool
2015	SNNAP: Approximate computing on programmable SoCs via neural acceleration. Thierry Moreau, Mark Wyse, Jacob Nelson, Adrian Sampson, Hadi Esmaeilzadeh, Luis Ceze, Mark Oskin
2015	Scalable communication architecture for network-attached accelerators. Sarah Neuwirth, Dirk Frey, Mondrian Nuessle, Ulrich Brüning
2015	Scaling distributed cache hierarchies through computation and data co-scheduling. Nathan Beckmann, Po-An Tsai, Daniel Sánchez
2015	Studying the impact of multicore processor scaling on directory techniques via reuse distance analysis. Minshu Zhao, Donald Yeung
2015	Supporting superpages in non-contiguous physical memory. Yu Du, Miao Zhou, Bruce R. Childers, Daniel Mossé, Rami G. Melhem
2015	Tag tables. Sean Franey, Mikko H. Lipasti
2015	Talus: A simple way to remove cliffs in cache performance. Nathan Beckmann, Daniel Sánchez
2015	Understanding GPU errors on large-scale HPC systems and the implications for system design and operation. Devesh Tiwari, Saurabh Gupta, James H. Rogers, Don Maxwell, Paolo Rech, Sudharshan S. Vazhkudai, Daniel Oliveira, Dave Londo, Nathan DeBardeleben, Philippe Olivier Alexandre Navaux, Luigi Carro, Arthur S. Bland
2015	Understanding contention-based channels and using them for defense. Casen Hunger, Mikhail Kazdagli, Ankit Singh Rawat, Alexandros G. Dimakis, Sriram Vishwanath, Mohit Tiwari
2015	Understanding idle behavior and power gating mechanisms in the context of modern benchmarks on CPU-GPU Integrated systems. Manish Arora, Srilatha Manne, Indrani Paul, Nuwan Jayasena, Dean M. Tullsen
2015	Understanding the virtualization "Tax" of scale-out pass-through GPUs in GaaS clouds: An empirical study. Ming Liu, Tao Li, Neo Jia, Andy Currid, Vladimir Troy
2015	Unlocking bandwidth for GPUs in CC-NUMA systems. Neha Agarwal, David W. Nellans, Mike O'Connor, Stephen W. Keckler, Thomas F. Wenisch
2015	VSR sort: A novel vectorised sorting algorithm & architecture extensions for future microprocessors. Timothy Hayes, Oscar Palomar, Osman S. Unsal, Adrián Cristal, Mateo Valero
2015	XChange: A market-based approach to scalable dynamic multi-resource allocation in multicore architectures. Xiaodong Wang, José F. Martínez
2015	iPatch: Intelligent fault patching to improve energy efficiency. David J. Palframan, Nam Sung Kim, Mikko H. Lipasti