HPCA A*

56 papers

Year  Title / Authors
2015  21st IEEE International Symposium on High Performance Computer Architecture, HPCA 2015, Burlingame, CA, USA, February 7-11, 2015
2015  Adaptive-latency DRAM: Optimizing DRAM timing for the common-case.
Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Manabi Khan, Vivek Seshadri, Kevin Kai-Wei Chang, Onur Mutlu
2015  Adrenaline: Pinpointing and reining in tail queries with quick voltage boosting.
Chang-Hong Hsu, Yunqi Zhang, Michael A. Laurenzano, David Meisner, Thomas F. Wenisch, Jason Mars, Lingjia Tang, Ronald G. Dreslinski
2015  Alloy: Parallel-serial memory channel architecture for single-chip heterogeneous processor systems.
Hao Wang, Chang-Jae Park, Gyungsu Byun, Jung Ho Ahn, Nam Sung Kim
2015  Architecture exploration for ambient energy harvesting nonvolatile processors.
Kaisheng Ma, Yang Zheng, Shuangchen Li, Karthik Swaminathan, Xueqing Li, Yongpan Liu, Jack Sampson, Yuan Xie, Vijaykrishnan Narayanan
2015  Augmenting low-latency HPC network with free-space optical links.
Ikki Fujiwara, Michihiro Koibuchi, Tomoya Ozaki, Hiroki Matsutani, Henri Casanova
2015  BRAINIAC: Bringing reliable accuracy into neurally-implemented approximate computing.
Beayna Grigorian, Nazanin Farahpour, Glenn Reinman
2015  Balancing reliability, cost, and performance tradeoffs with FreeFault.
Dong-Wan Kim, Mattan Erez
2015  Bamboo ECC: Strong, safe, and flexible codes for reliable computer memory.
Jungrae Kim, Michael B. Sullivan, Mattan Erez
2015  BeBoP: A cost effective predictor infrastructure for superscalar value prediction.
Arthur Perais, André Seznec
2015  CAFO: Cost aware flip optimization for asymmetric memories.
Rakan Maddah, Seyed Mohammad Seyedzadeh, Rami G. Melhem
2015  CiDRA: A cache-inspired DRAM resilience architecture.
Young Hoon Son, Sukhan Lee, Seongil O, Sanghyuk Kwon, Nam Sung Kim, Jung Ho Ahn
2015  Coordinated static and dynamic cache bypassing for GPUs.
Xiaolong Xie, Yun Liang, Yu Wang, Guangyu Sun, Tao Wang
2015  Correction prediction: Reducing error correction latency for on-chip memories.
Henry Duwe, Xun Jian, Rakesh Kumar
2015  Data retention in MLC NAND flash memory: Characterization, optimization, and recovery.
Yu Cai, Yixin Luo, Erich F. Haratsch, Ken Mai, Onur Mutlu
2015  Domain knowledge based energy management in handhelds.
Nachiappan Chidambaram Nachiappan, Praveen Yedlapalli, Niranjan Soundararajan, Anand Sivasubramaniam, Mahmut T. Kandemir, Ravishankar R. Iyer, Chita R. Das
2015  Event-based scheduling for energy-efficient QoS (eQoS) in mobile Web applications.
Yuhao Zhu, Matthew Halpern, Vijay Janapa Reddi
2015  Exploiting compressed block size as an indicator of future reuse.
Gennady Pekhimenko, Tyler Huberty, Rui Cai, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry
2015  Exploring architectural heterogeneity in intelligent vision systems.
Nandhini Chandramoorthy, Giuseppe Tagliavini, Kevin M. Irick, Antonio Pullini, Siddharth Advani, Sulaiman Al Habsi, Matthew Cotter, John Sampson, Vijaykrishnan Narayanan, Luca Benini
2015  FTXen: Making hypervisor resilient to hardware faults on relaxed cores.
Xinxin Jin, Soyeon Park, Tianwei Sheng, Rishan Chen, Zhiyong Shan, Yuanyuan Zhou
2015  Flask coherence: A morphable hybrid coherence protocol to balance energy, performance and scalability.
Lucia G. Menezo, Valentin Puente, José-Ángel Gregorio
2015  GPGPU performance and power estimation using machine learning.
Gene Y. Wu, Joseph L. Greathouse, Alexander Lyashevsky, Nuwan Jayasena, Derek Chiou
2015  GPU voltage noise: Characterization and hierarchical smoothing of spatial and temporal voltage noise interference in GPU architectures.
Jingwen Leng, Yazhou Zu, Vijay Janapa Reddi
2015  Heterogeneous memory architectures: A HW/SW approach for mixing die-stacked and off-package memories.
Mitesh R. Meswani, Sergey Blagodurov, David Roberts, John Slice, Mike Ignatowski, Gabriel H. Loh
2015  Hierarchical private/shared classification: The key to simple and efficient coherence for clustered cache hierarchies.
Alberto Ros, Mahdad Davari, Stefanos Kaxiras
2015  High performing cache hierarchies for server workloads: Relaxing inclusion to capture the latency benefits of exclusive caches.
Aamer Jaleel, Joseph Nuzman, Adrian Moga, Simon C. Steely Jr., Joel S. Emer
2015  Increasing multicore system efficiency through intelligent bandwidth shifting.
Víctor Jiménez, Alper Buyuktosunoglu, Pradip Bose, Francis P. O'Connell, Francisco J. Cazorla, Mateo Valero
2015  Malware-aware processors: A framework for efficient online malware detection.
Meltem Ozsoy, Caleb Donovick, Iakov Gorelik, Nael B. Abu-Ghazaleh, Dmitry V. Ponomarev
2015  Mascar: Speeding up GPU warps by reducing memory pitstops.
Ankit Sethia, Davoud Anoushe Jamshidi, Scott A. Mahlke
2015  NDA: Near-DRAM acceleration architecture leveraging commodity DRAM devices and standard memory modules.
Amin Farmahini Farahani, Jung Ho Ahn, Katherine Morrow, Nam Sung Kim
2015  Octopus-Man: QoS-driven task management for heterogeneous multicores in warehouse-scale computers.
Vinicius Petrucci, Michael A. Laurenzano, John Doherty, Yunqi Zhang, Daniel Mossé, Jason Mars, Lingjia Tang
2015  Overcoming far-end congestion in large-scale networks.
Jongmin Won, Gwangsun Kim, John Kim, Ted Jiang, Mike Parker, Steve Scott
2015  Overcoming the challenges of crossbar resistive memory architectures.
Cong Xu, Dimin Niu, Naveen Muralimanohar, Rajeev Balasubramonian, Tao Zhang, Shimeng Yu, Yuan Xie
2015  Paying to save: Reducing cost of colocation data center via rewards.
Mohammad A. Islam, A. Hasan Mahmud, Shaolei Ren, Xiaorui Wang
2015  Power punch: Towards non-blocking power-gating of NoC routers.
Lizhong Chen, Di Zhu, Massoud Pedram, Timothy Mark Pinkston
2015  Prediction-based superpage-friendly TLB designs.
Misel-Myrto Papadopoulou, Xin Tong, André Seznec, Andreas Moshovos
2015  Priority-based cache allocation in throughput processors.
Dong Li, Minsoo Rhu, Daniel R. Johnson, Mike O'Connor, Mattan Erez, Doug Burger, Donald S. Fussell, Stephen W. Keckler
2015  Quantifying sources of error in McPAT and potential impacts on architectural studies.
Sam Likun Xi, Hans M. Jacobson, Pradip Bose, Gu-Yeon Wei, David M. Brooks
2015  Reducing read latency of phase change memory via early read and Turbo Read.
Prashant J. Nair, Chia-Chen Chou, Bipin Rajendran, Moinuddin K. Qureshi
2015  Run-time monitoring with adjustable overhead using dataflow-guided filtering.
Daniel Lo, Tao Chen, Mohamed Ismail, G. Edward Suh
2015  SCOC: High-radix switches made of bufferless clos networks.
Nikolaos Chrysos, Cyriel Minkenberg, Mark Rudquist, Claude Basso, Brian Vanderpool
2015  SNNAP: Approximate computing on programmable SoCs via neural acceleration.
Thierry Moreau, Mark Wyse, Jacob Nelson, Adrian Sampson, Hadi Esmaeilzadeh, Luis Ceze, Mark Oskin
2015  Scalable communication architecture for network-attached accelerators.
Sarah Neuwirth, Dirk Frey, Mondrian Nuessle, Ulrich Brüning
2015  Scaling distributed cache hierarchies through computation and data co-scheduling.
Nathan Beckmann, Po-An Tsai, Daniel Sánchez
2015  Studying the impact of multicore processor scaling on directory techniques via reuse distance analysis.
Minshu Zhao, Donald Yeung
2015  Supporting superpages in non-contiguous physical memory.
Yu Du, Miao Zhou, Bruce R. Childers, Daniel Mossé, Rami G. Melhem
2015  Tag tables.
Sean Franey, Mikko H. Lipasti
2015  Talus: A simple way to remove cliffs in cache performance.
Nathan Beckmann, Daniel Sánchez
2015  Understanding GPU errors on large-scale HPC systems and the implications for system design and operation.
Devesh Tiwari, Saurabh Gupta, James H. Rogers, Don Maxwell, Paolo Rech, Sudharshan S. Vazhkudai, Daniel Oliveira, Dave Londo, Nathan DeBardeleben, Philippe Olivier Alexandre Navaux, Luigi Carro, Arthur S. Bland
2015  Understanding contention-based channels and using them for defense.
Casen Hunger, Mikhail Kazdagli, Ankit Singh Rawat, Alexandros G. Dimakis, Sriram Vishwanath, Mohit Tiwari
2015  Understanding idle behavior and power gating mechanisms in the context of modern benchmarks on CPU-GPU Integrated systems.
Manish Arora, Srilatha Manne, Indrani Paul, Nuwan Jayasena, Dean M. Tullsen
2015  Understanding the virtualization "Tax" of scale-out pass-through GPUs in GaaS clouds: An empirical study.
Ming Liu, Tao Li, Neo Jia, Andy Currid, Vladimir Troy
2015  Unlocking bandwidth for GPUs in CC-NUMA systems.
Neha Agarwal, David W. Nellans, Mike O'Connor, Stephen W. Keckler, Thomas F. Wenisch
2015  VSR sort: A novel vectorised sorting algorithm & architecture extensions for future microprocessors.
Timothy Hayes, Oscar Palomar, Osman S. Unsal, Adrián Cristal, Mateo Valero
2015  XChange: A market-based approach to scalable dynamic multi-resource allocation in multicore architectures.
Xiaodong Wang, José F. Martínez
2015  iPatch: Intelligent fault patching to improve energy efficiency.
David J. Palframan, Nam Sung Kim, Mikko H. Lipasti