HPCA A*

57 papers

YearTitle / Authors
201925th IEEE International Symposium on High Performance Computer Architecture, HPCA 2019, Washington, DC, USA, February 16-20, 2019
2019A Hybrid Framework for Fast and Accurate GPU Performance Estimation through Source-Level Analysis and Trace-Based Simulation.
Xiebing Wang, Kai Huang, Alois C. Knoll, Xuehai Qian
2019A Scalable Priority-Aware Approach to Managing Data Center Server Power.
Yang Li, Charles R. Lefurgy, Karthick Rajamani, Malcolm S. Allen-Ware, Guillermo J. Silva, Daniel D. Heimsoth, Saugata Ghose, Onur Mutlu
2019Active-Routing: Compute on the Way for Near-Data Processing.
Jiayi Huang, Ramprakash Reddy Puli, Pritam Majumder, Sungkeun Kim, Rahul Boyapati, Ki Hwan Yum, Eun Jung Kim
2019Adaptive Voltage/Frequency Scaling and Core Allocation for Balanced Energy and Performance on Multicore CPUs.
George Papadimitriou, Athanasios Chatzidimitriou, Dimitris Gizopoulos
2019Analysis and Optimization of the Memory Hierarchy for Graph Processing Workloads.
Abanti Basak, Shuangchen Li, Xing Hu, Sang Min Oh, Xinfeng Xie, Li Zhao, Xiaowei Jiang, Yuan Xie
2019Architecting Waferscale Processors - A GPU Case Study.
Saptadeep Pal, Daniel Petrisko, Matthew Tomei, Puneet Gupta, Subramanian S. Iyer, Rakesh Kumar
2019BRB: Mitigating Branch Predictor Side-Channels.
Ilias Vougioukas, Nikos Nikoleris, Andreas Sandberg, Stephan Diestelhorst, Bashir M. Al-Hashimi, Geoff V. Merrett
2019Bingo Spatial Data Prefetcher.
Mohammad Bakhshalipour, Mehran Shakerinava, Pejman Lotfi-Kamran, Hamid Sarbazi-Azad
2019Bit Prudent In-Cache Acceleration of Deep Convolutional Neural Networks.
Xiaowei Wang, Jiecao Yu, Charles Augustine, Ravi R. Iyer, Reetuparna Das
2019CIDR: A Cost-Effective In-Line Data Reduction System for Terabit-Per-Second Scale SSD Arrays.
Mohammadamin Ajdari, Pyeongsu Park, Joonsung Kim, Dongup Kwon, Jangwoo Kim
2019Composite-ISA Cores: Enabling Multi-ISA Heterogeneity Using a Single ISA.
Ashish Venkat, Harsha Basavaraj, Dean M. Tullsen
2019Conditional Speculation: An Effective Approach to Safeguard Out-of-Order Execution Against Spectre Attacks.
Peinan Li, Lutan Zhao, Rui Hou, Lixin Zhang, Dan Meng
2019D-RaNGe: Using Commodity DRAM Devices to Generate True Random Numbers with Low Latency and High Throughput.
Jeremie S. Kim, Minesh Patel, Hasan Hassan, Lois Orosa, Onur Mutlu
2019Darwin-WGA: A Co-processor Provides Increased Sensitivity in Whole Genome Alignments with High Speedup.
Yatish Turakhia, Sneha D. Goenka, Gill Bejerano, William J. Dally
2019E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs.
Zhe Li, Caiwen Ding, Siyue Wang, Wujie Wen, Youwei Zhuo, Chang Liu, Qinru Qiu, Wenyao Xu, Xue Lin, Xuehai Qian, Yanzhi Wang
2019Early Visibility Resolution for Removing Ineffectual Computations in the Graphics Pipeline.
Martí Anglada, Enrique de Lucas, Joan-Manuel Parcerisa, Juan L. Aragón, Antonio González
2019Efficient Load Value Prediction Using Multiple Predictors and Filters.
Rami Sheikh, Derek Hower
2019Elastic Instruction Fetching.
Arthur Perais, Rami Sheikh, Luke Yen, Michael McIlvaine, Robert D. Clancy
2019Enabling Transparent Memory-Compression for Commodity Memory Systems.
Vinson Young, Sanjay Kariyappa, Moinuddin K. Qureshi
2019Enhancing Server Efficiency in the Face of Killer Microseconds.
Amirhossein Mirhosseini, Akshitha Sriraman, Thomas F. Wenisch
2019FPGA Accelerated INDEL Realignment in the Cloud.
Lisa Wu, David Bruns-Smith, Frank A. Nothaft, Qijing Huang, Sagar Karandikar, Johnny Le, Andrew Lin, Howard Mao, Brendan Sweeney, Krste Asanovic, David A. Patterson, Anthony D. Joseph
2019FPGA-Based High-Performance Parallel Architecture for Homomorphic Computing on Encrypted Data.
Sujoy Sinha Roy, Furkan Turan, Kimmo Järvinen, Frederik Vercauteren, Ingrid Verbauwhede
2019FUSE: Fusing STT-MRAM into GPUs to Alleviate Off-Chip Memory Access Overheads.
Jie Zhang, Myoungsoo Jung, Mahmut T. Kandemir
2019Featherlight Reuse-Distance Measurement.
Qingsen Wang, Xu Liu, Milind Chabbi
2019Fine-Tuning the Active Timing Margin (ATM) Control Loop for Maximizing Multi-core Efficiency on an IBM POWER Server.
Yazhou Zu, Daniel Richins, Charles Lefurgy, Vijay Janapa Reddi
2019Freeway: Maximizing MLP for Slice-Out-of-Order Execution.
Rakesh Kumar, Mehdi Alipour, David Black-Schaffer
2019Gables: A Roofline Model for Mobile SoCs.
Mark D. Hill, Vijay Janapa Reddi
2019HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array.
Linghao Song, Jiachen Mao, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen
2019Kelp: QoS for Accelerated Machine Learning Systems.
Haishan Zhu, David Lo, Liqun Cheng, Rama K. Govindaraju, Parthasarathy Ranganathan, Mattan Erez
2019Killi: Runtime Fault Classification to Deploy Low Voltage Caches without MBIST.
Shrikanth Ganapathy, John Kalamatianos, Bradford M. Beckmann, Steven Raasch, Lukasz G. Szafaryn
2019Machine Learning at Facebook: Understanding Inference at the Edge.
Carole-Jean Wu, David Brooks, Kevin Chen, Douglas Chen, Sy Choudhury, Marat Dukhan, Kim M. Hazelwood, Eldad Isaac, Yangqing Jia, Bill Jia, Tommer Leyvand, Hao Lu, Yang Lu, Lin Qiao, Brandon Reagen, Joe Spisak, Fei Sun, Andrew Tulloch, Peter Vajda, Xiaodong Wang, Yanghan Wang, Bram Wasti, Yiming Wu, Ran Xian, Sungjoo Yoo, Peizhao Zhang
2019NAND-Net: Minimizing Computational Complexity of In-Memory Processing for Binary Neural Networks.
Hyeonuk Kim, Jaehyeong Sim, Yeongjae Choi, Lee-Sup Kim
2019NoMap: Speeding-Up JavaScript Using Hardware Transactional Memory.
Thomas Shull, Jiho Choi, María Jesús Garzarán, Josep Torrellas
2019PIM-VR: Erasing Motion Anomalies In Highly-Interactive Virtual Reality World with Customized Memory Cube.
Chenhao Xie, Xingyao Zhang, Ang Li, Xin Fu, Shuaiwen Song
2019POWERT Channels: A Novel Class of Covert CommunicationExploiting Power Management Vulnerabilities.
S. Karen Khatamifard, Longfei Wang, Amitabh Das, Selçuk Köse, Ulya R. Karpuzcu
2019PageSeer: Using Page Walks to Trigger Page Swaps in Hybrid Memory Systems.
Apostolos Kokolis, Dimitrios Skarlatos, Josep Torrellas
2019Pliant: Leveraging Approximation to Improve Datacenter Resource Efficiency.
Neeraj Kulkarni, Feng Qi, Christina Delimitrou
2019Poise: Balancing Thread-Level Parallelism and Memory System Performance in GPUs Using Machine Learning.
Saumay Dublish, Vijay Nagarajan, Nigel P. Topham
2019Poly: Efficient Heterogeneous System and Application Management for Interactive Applications.
Shuo Wang, Yun Liang, Wei Zhang
2019Power Aware Heterogeneous Node Assembly.
Bilge Acun, Alper Buyuktosunoglu, Eun Kyung Lee, Yoonho Park
2019R3-DLA (Reduce, Reuse, Recycle): A More Efficient Approach to Decoupled Look-Ahead Architectures.
Sushant Kondguli, Michael C. Huang
2019Recycling Data Slack in Out-of-Order Cores.
Gokul Subramanian Ravi, Mikko H. Lipasti
2019Reliability Evaluation of Mixed-Precision Architectures.
Fernando Fernandes dos Santos, Caio B. Lunardi, Daniel Oliveira, Fabiano Libano, Paolo Rech
2019Rendering Elimination: Early Discard of Redundant Tiles in the Graphics Pipeline.
Martí Anglada, Enrique de Lucas, Joan-Manuel Parcerisa, Juan L. Aragón, Pedro Marcuello, Antonio González
2019Resilient Low Voltage Accelerators for High Energy Efficiency.
Nandhini Chandramoorthy, Karthik Swaminathan, Martin Cochet, Arun Paidimarri, Schuyler Eldridge, Rajiv V. Joshi, Matthew M. Ziegler, Alper Buyuktosunoglu, Pradip Bose
2019Shortcut Mining: Exploiting Cross-Layer Shortcut Reuse in DCNN Accelerators.
Arash AziziMazreah, Lizhong Chen
2019Stretch: Balancing QoS and Throughput for Colocated Server Workloads on SMT Cores.
Artemiy Margaritov, Siddharth Gupta, Rekai González-Alberquilla, Boris Grot
2019String Figure: A Scalable and Elastic Memory Network Architecture.
Matheus Ogleari, Ye Yu, Chen Qian, Ethan L. Miller, Jishen Zhao
2019The Accelerator Wall: Limits of Chip Specialization.
Adi Fuchs, David Wentzlaff
2019The Best of IEEE Computer Architecture Letters in 2018.
Paul Gratz
2019The What's Next Intermittent Computing Architecture.
Karthik Ganesan, Joshua San Miguel, Natalie D. Enright Jerger
2019Understanding the Future of Energy Efficiency in Multi-Module GPUs.
Akhil Arunkumar, Evgeny Bolotin, David W. Nellans, Carole-Jean Wu
2019Understanding the Impact of Socket Density in Density Optimized Servers.
Manish Arora, Matt Skach, Wei Huang, Xudong An, Jason Mars, Lingjia Tang, Dean M. Tullsen
2019VIP: A Versatile Inference Processor.
Skand Hurkat, José F. Martínez
2019eQASM: An Executable Quantum Instruction Set Architecture.
Xiang Fu, Leon Riesebos, M. A. Rol, Jeroen van Straten, J. van Someren, Nader Khammassi, Imran Ashraf, R. F. L. Vermeulen, V. Newsum, K. K. L. Loh, J. C. de Sterke, W. J. Vlothuizen, R. N. Schouten, Carmen G. Almudéver, Leonardo DiCarlo, Koen Bertels
2019μDPM: Dynamic Power Management for the Microsecond Era.
Chih-Hsun Chou, Laxmi N. Bhuyan, Daniel Wong