| 2014 | 20th IEEE International Symposium on High Performance Computer Architecture, HPCA 2014, Orlando, FL, USA, February 15-19, 2014 |
| 2014 | 3D stacking of high-performance processors. Philip G. Emma, Alper Buyuktosunoglu, Michael B. Healy, Krishnan Kailas, Valentin Puente, Roy Yu, Allan Hartstein, Pradip Bose, Jaime H. Moreno, Eren Kursun |
| 2014 | A Non-Inclusive Memory Permissions architecture for protection against cross-layer attacks. Jesse Elwell, Ryan Riley, Nael B. Abu-Ghazaleh, Dmitry Ponomarev |
| 2014 | A detailed GPU cache model based on reuse distance theory. Cedric Nugteren, Gert-Jan van den Braak, Henk Corporaal, Henri E. Bal |
| 2014 | A scalable multi-path microarchitecture for efficient GPU control flow. Ahmed ElTantawy, Jessica Wenjie Ma, Mike O'Connor, Tor M. Aamodt |
| 2014 | Accelerating decoupled look-ahead via weak dependence removal: A metaheuristic approach. Raj Parihar, Michael C. Huang |
| 2014 | Accordion: Toward soft Near-Threshold Voltage Computing. Ulya R. Karpuzcu, Ismail Akturk, Nam Sung Kim |
| 2014 | Adaptive placement and migration policy for an STT-RAM-based hybrid cache. Zhe Wang, Daniel A. Jiménez, Cong Xu, Guangyu Sun, Yuan Xie |
| 2014 | Atomic SC for simple in-order processors. Dibakar Gope, Mikko H. Lipasti |
| 2014 | BigDataBench: A big data benchmark suite from internet services. Lei Wang, Jianfeng Zhan, Chunjie Luo, Yuqing Zhu, Qiang Yang, Yongqiang He, Wanling Gao, Zhen Jia, Yingjie Shi, Shujie Zhang, Chen Zheng, Gang Lu, Kent Zhan, Xiaona Li, Bizhu Qiu |
| 2014 | CDTT: Compiler-generated data-triggered threads. Hung-Wei Tseng, Dean M. Tullsen |
| 2014 | CREAM: A Concurrent-Refresh-Aware DRAM Memory architecture. Tao Zhang, Matthew Poremba, Cong Xu, Guangyu Sun, Yuan Xie |
| 2014 | Concurrent and consistent virtual machine introspection with hardware transactional memory. Yutao Liu, Yubin Xia, Haibing Guan, Binyu Zang, Haibo Chen |
| 2014 | DASCA: Dead Write Prediction Assisted STT-RAM Cache Architecture. Junwhan Ahn, Sungjoo Yoo, Kiyoung Choi |
| 2014 | DraMon: Predicting memory bandwidth usage of multi-threaded programs with high accuracy and low overhead. Wei Wang, Tanima Dey, Jack W. Davidson, Mary Lou Soffa |
| 2014 | Dynamic management of TurboMode in modern multi-core chips. David Lo, Christos Kozyrakis |
| 2014 | Dynamically detecting and tolerating IF-Condition Data Races. Shanxiang Qi, Abdullah Muzahid, Wonsun Ahn, Josep Torrellas |
| 2014 | Exploiting thermal energy storage to reduce data center capital and operating expenses. Wenli Zheng, Kai Ma, Xiaorui Wang |
| 2014 | FADE: A programmable filtering accelerator for instruction-grain monitoring. Sotiria Fytraki, Evangelos Vlachos, Yusuf Onur Koçberber, Babak Falsafi, Boris Grot |
| 2014 | GPUdmm: A high-performance and memory-oblivious GPU architecture using dynamic memory management. Youngsok Kim, Jaewon Lee, Jae-Eon Jo, Jangwoo Kim |
| 2014 | Implications of high energy proportional servers on cluster-wide energy proportionality. Daniel Wong, Murali Annavaram |
| 2014 | Improving DRAM performance by parallelizing refreshes with accesses. Kevin Kai-Wei Chang, Donghyuk Lee, Zeshan Chishti, Alaa R. Alameldeen, Chris Wilkerson, Yoongu Kim, Onur Mutlu |
| 2014 | Improving GPGPU resource utilization through alternative thread block scheduling. Minseok Lee, Seokwoo Song, Joosik Moon, John Kim, Woong Seo, Yeon-Gon Cho, Soojung Ryu |
| 2014 | Improving cache performance using read-write partitioning. Samira Manabi Khan, Alaa R. Alameldeen, Chris Wilkerson, Onur Mutlu, Daniel A. Jiménez |
| 2014 | Improving in-memory database index performance with Intel® Transactional Synchronization Extensions. Tomas Karnagel, Roman Dementiev, Ravi Rajwar, Konrad Lai, Thomas Legler, Benjamin Schlegel, Wolfgang Lehner |
| 2014 | Improving system throughput and fairness simultaneously in shared memory CMP systems via Dynamic Bank Partitioning. Mingli Xie, Dong Tong, Kan Huang, Xu Cheng |
| 2014 | Increasing TLB reach by exploiting clustering in page translations. Binh Pham, Abhishek Bhattacharjee, Yasuko Eckert, Gabriel H. Loh |
| 2014 | Locality-aware data replication in the Last-Level Cache. George Kurian, Srinivas Devadas, Omer Khan |
| 2014 | Low-overhead and high coverage run-time race detection through selective meta-data management. Ruirui C. Huang, Erik Halberg, Andrew Ferraiuolo, G. Edward Suh |
| 2014 | MP3: Minimizing performance penalty for power-gating of Clos network-on-chip. Lizhong Chen, Lihang Zhao, Ruisheng Wang, Timothy Mark Pinkston |
| 2014 | MRPB: Memory request prioritization for massively parallel processors. Wenhao Jia, Kelly A. Shaw, Margaret Martonosi |
| 2014 | MemZip: Exploring unconventional benefits from memory compression. Ali Shafiee, Meysam Taassori, Rajeev Balasubramonian, Al Davis |
| 2014 | Mosaic: Exploiting the spatial locality of process variation to reduce refresh energy in on-chip eDRAM modules. Aditya Agrawal, Amin Ansari, Josep Torrellas |
| 2014 | NUAT: A non-uniform access time memory controller. Wongyu Shin, Jeongmin Yang, Jungwhan Choi, Lee-Sup Kim |
| 2014 | Over-clocked SSD: Safely running beyond flash memory chip I/O clock specs. Kai Zhao, Kalyana S. Venkataraman, Xuebin Zhang, Jiangpeng Li, Ning Zheng, Tong Zhang |
| 2014 | PVCoherence: Designing flat coherence protocols for scalable verification. Meng Zhang, Jesse D. Bingham, John Erickson, Daniel J. Sorin |
| 2014 | Practical data value speculation for future high-end processors. Arthur Perais, André Seznec |
| 2014 | Precision-aware soft error protection for GPUs. David J. Palframan, Nam Sung Kim, Mikko H. Lipasti |
| 2014 | QORE: A fault tolerant network-on-chip architecture with power-efficient quad-function channel (QFC) buffers. Dominic DiTomaso, Avinash Karanth Kodi, Ahmed Louri |
| 2014 | QuickRelease: A throughput-oriented approach to release consistency on GPUs. Blake A. Hechtman, Shuai Che, Derek R. Hower, Yingying Tian, Bradford M. Beckmann, Mark D. Hill, Steven K. Reinhardt, David A. Wood |
| 2014 | Reducing the cost of persistence for nonvolatile heaps in end user devices. Sudarsun Kannan, Ada Gavrilovska, Karsten Schwan |
| 2014 | Revolver: Processor architecture for power efficient loop execution. Mitchell Hayenga, Vignyan Reddy Kothinti Naresh, Mikko H. Lipasti |
| 2014 | STM: Cloning the spatial and temporal memory access behavior. Amro Awad, Yan Solihin |
| 2014 | Sandbox Prefetching: Safe run-time evaluation of aggressive prefetchers. Seth H. Pugsley, Zeshan Chishti, Chris Wilkerson, Peng-fei Chuang, Robert L. Scott, Aamer Jaleel, Shih-Lien Lu, Kingsum Chow, Rajeev Balasubramonian |
| 2014 | Scalably verifiable dynamic power management. Opeoluwa Matthews, Meng Zhang, Daniel J. Sorin |
| 2014 | Spare register aware prefetching for graph algorithms on GPUs. Nagesh B. Lakshminarayana, Hyesoon Kim |
| 2014 | Sprinkler: Maximizing resource utilization in many-chip solid state disks. Myoungsoo Jung, Mahmut T. Kandemir |
| 2014 | Stash directory: A scalable directory for many-core coherence. Socrates Demetriades, Sangyeun Cho |
| 2014 | Strategies for anticipating risk in heterogeneous system design. Marisabel Guevara, Benjamin Lubin, Benjamin C. Lee |
| 2014 | Supporting x86-64 address translation for 100s of GPU lanes. Jason Power, Mark D. Hill, David A. Wood |
| 2014 | Suppressing the Oblivious RAM timing channel while making information leakage and program efficiency trade-offs. Christopher W. Fletcher, Ling Ren, Xiangyao Yu, Marten van Dijk, Omer Khan, Srinivas Devadas |
| 2014 | TSO-CC: Consistency directed cache coherence for TSO. Marco Elver, Vijay Nagarajan |
| 2014 | Tangle: Route-oriented dynamic voltage minimization for variation-afflicted, energy-efficient on-chip networks. Amin Ansari, Asit K. Mishra, Jianping Xu, Josep Torrellas |
| 2014 | Timing channel protection for a shared memory controller. Yao Wang, Andrew Ferraiuolo, G. Edward Suh |
| 2014 | Transportation-network-inspired network-on-chip. Hanjoon Kim, Gwangsun Kim, Seungryoul Maeng, Hwasoo Yeo, John Kim |
| 2014 | Understanding the impact of gate-level physical reliability effects on whole program execution. Raghuraman Balasubramanian, Karthikeyan Sankaralingam |
| 2014 | Undersubscribed threading on clustered cache architectures. Wim Heirman, Trevor E. Carlson, Kenzo Van Craeynest, Ibrahim Hur, Aamer Jaleel, Lieven Eeckhout |
| 2014 | Up by their bootstraps: Online learning in Artificial Neural Networks for CMP uncore power management. Jae-Yeon Won, Xi Chen, Paul Gratz, Jiang Hu, Vassos Soteriou |
| 2014 | Warp-level divergence in GPUs: Characterization, impact, and mitigation. Ping Xiang, Yi Yang, Huiyang Zhou |