PACT B

51 papers

Year  Title / Authors
2017  26th International Conference on Parallel Architectures and Compilation Techniques, PACT 2017, Portland, OR, USA, September 9-13, 2017
2017  A DSL for Performance Orchestration.
Thiago Santos Faria Xavier Teixeira, David A. Padua, William Gropp
2017  A GPU-Friendly Skiplist Algorithm.
Nurit Moscovici, Nachshon Cohen, Erez Petrank
2017  A Generalized Framework for Automatic Scripting Language Parallelization.
Taewook Oh, Stephen R. Beard, Nick P. Johnson, Sergiy Popovych, David I. August
2017  An Ultra Low-Power Hardware Accelerator for Acoustic Scoring in Speech Recognition.
Hamid Tabani, José-María Arnau, Jordi Tubella, Antonio González
2017  Application Clustering Policies to Address System Fairness with Intel's Cache Allocation Technology.
Vicent Selfa, Julio Sahuquillo, Lieven Eeckhout, Salvador Petit, María Engracia Gómez
2017  Architecting a Novel Hybrid Cache with Low Energy.
Jiacong He, Joseph Callenes-Sloan
2017  Avoiding TLB Shootdowns Through Self-Invalidating TLB Entries.
Amro Awad, Arkaprava Basu, Sergey Blagodurov, Yan Solihin, Gabriel H. Loh
2017  Cache Automaton: Repurposing Caches for Automata Processing.
Arun Subramaniyan, Jingcheng Wang, Ezhil R. M. Balasubramanian, David T. Blaauw, Dennis Sylvester, Reetuparna Das
2017  DRUT: An Efficient Turbo Boost Solution via Load Balancing in Decoupled Look-Ahead Architecture.
Raj Parihar, Michael C. Huang
2017  DrMP: Mixed Precision-Aware DRAM for High Performance Approximate and Precise Computing.
Xianwei Zhang, Youtao Zhang, Bruce R. Childers, Jun Yang
2017  Efficient Checkpointing of Loop-Based Codes for Non-volatile Main Memory.
Hussein Elnawawy, Mohammad A. Alshboul, James Tuck, Yan Solihin
2017  End-to-End Deep Learning of Optimization Heuristics.
Chris Cummins, Pavlos Petoumenos, Zheng Wang, Hugh Leather
2017  Exploiting Asymmetric SIMD Register Configurations in ARM-to-x86 Dynamic Binary Translation.
Yu-Ping Liu, Ding-Yong Hong, Jan-Jan Wu, Sheng-Yu Fu, Wei-Chung Hsu
2017  Graphie: Large-Scale Asynchronous Graph Traversals on Just a GPU.
Wei Han, Daniel Mawhirter, Bo Wu, Matthew Buland
2017  In-memory Data Flow Processor.
Daichi Fujiki, Scott A. Mahlke, Reetuparna Das
2017  Introspective Computing.
Karl Taht, Rajeev Balasubramonian
2017  Large Scale Data Clustering Using Memristive k-Median Computation.
Yomi Karthik Rupesh, Mahdi Nazm Bojnordi
2017  Leeway: Addressing Variability in Dead-Block Prediction for Last-Level Caches.
Priyank Faldu, Boris Grot
2017  Lightweight Provenance Service for High-Performance Computing.
Dong Dai, Yong Chen, Philip H. Carns, John Jenkins, Robert B. Ross
2017  MultiGraph: Efficient Graph Processing on GPUs.
Changwan Hong, Aravind Sukumaran-Rajam, Jinsung Kim, P. Sadayappan
2017  Multilayer Compute Resource Management with Robust Control Theory.
Raghavendra Pradyumna Pothukuchi, Sweta Yamini Pothukuchi, Petros G. Voulgaris, Josep Torrellas
2017  Near-Memory Address Translation.
Javier Picorel, Djordje Jevdjic, Babak Falsafi
2017  Nexus: A New Approach to Replication in Distributed Shared Caches.
Po-An Tsai, Nathan Beckmann, Daniel Sánchez
2017  POSTER: Accelerate GPU Concurrent Kernel Execution by Mitigating Memory Pipeline Stalls.
Hongwen Dai, Zhen Lin, Chao Li, Chen Zhao, Fei Wang, Nanning Zheng, Huiyang Zhou
2017  POSTER: Application-Driven Near-Data Processing for Similarity Search.
Vincent T. Lee, Amrita Mazumdar, Carlo C. del Mundo, Armin Alaghi, Luis Ceze, Mark Oskin
2017  POSTER: BACM: Barrier-Aware Cache Management for Irregular Memory-Intensive GPGPU Workloads.
Yuxi Liu, Xia Zhao, Zhibin Yu, Zhenlin Wang, Xiaolin Wang, Yingwei Luo, Lieven Eeckhout
2017  POSTER: BigBus: A Scalable Optical Interconnect.
Eldhose Peter, Janibul Bashir, Smruti R. Sarangi
2017  POSTER: Bridge the Gap Between Neural Networks and Neuromorphic Hardware.
Yu Ji, Youhui Zhang, Wenguang Chen, Yuan Xie
2017  POSTER: Bridging the Gap Between Deep Learning and Sparse Matrix Format Selection.
Yue Zhao, Jiajia Li, Chunhua Liao, Xipeng Shen
2017  POSTER: Cutting the Fat: Speeding Up RBM for Fast Deep Learning Through Generalized Redundancy Elimination.
Lin Ning, Randall Pittman, Xipeng Shen
2017  POSTER: DaQueue: A Data-Aware Work-Queue Design for GPGPUs.
Ya-Shuai Lü, Libo Huang, Li Shen
2017  POSTER: Design Space Exploration for Performance Optimization of Deep Neural Networks on Shared Memory Accelerators.
Swagath Venkataramani, Jungwook Choi, Vijayalakshmi Srinivasan, Kailash Gopalakrishnan, Leland Chang
2017  POSTER: Elastic Reconfiguration for Heterogeneous NoCs with BiNoCHS.
Amirhossein Mirhosseini, Mohammad Sadrosadati, Behnaz Soltani, Hamid Sarbazi-Azad, Thomas F. Wenisch
2017  POSTER: Exploiting Approximations for Energy/Quality Tradeoffs in Service-Based Applications.
Liu Liu, Sibren Isaacman, Abhishek Bhattacharjee, Ulrich Kremer
2017  POSTER: Improving Datacenter Efficiency Through Partitioning-Aware Scheduling.
Harshad Kasture, Xu Ji, Nosayba El-Sayed, Nathan Beckmann, Xiaosong Ma, Daniel Sánchez
2017  POSTER: Improving NUMA System Efficiency with a Utilization-Based Co-scheduling.
Younghyun Cho, Camilo A. Celis Guzman, Bernhard Egger
2017  POSTER: Location-Aware Computation Mapping for Manycore Processors.
Orhan Kislal, Jagadish Kotra, Xulong Tang, Mahmut Taylan Kandemir, Myoungsoo Jung
2017  POSTER: NUMA-Aware Power Management for Chip Multiprocessors.
Changmin Ahn, Camilo A. Celis Guzman, Bernhard Egger
2017  POSTER: Putting the G back into GPU/CPU Systems Research.
Andreas Sembrant, Trevor E. Carlson, Erik Hagersten, David Black-Schaffer
2017  POSTER: Statement Reordering to Alleviate Register Pressure for Stencils on GPUs.
Prashant Singh Rawat, Aravind Sukumaran-Rajam, Atanas Rountev, Fabrice Rastello, Louis-Noël Pouchet, P. Sadayappan
2017  POSTER: The Liberation Day of Nondeterministic Programs.
Enrico Armenio Deiana, Vincent St-Amour, Peter A. Dinda, Nikos Hardavellas, Simone Campanoni
2017  Performance Improvement via Always-Abort HTM.
Joseph Izraelevitz, Lingxiang Xiang, Michael L. Scott
2017  Proxy Benchmarks for Emerging Big-Data Workloads.
Reena Panda, Lizy Kurian John
2017  RCU-HTM: Combining RCU with HTM to Implement Highly Efficient Concurrent Binary Search Trees.
Dimitrios Siakavaras, Konstantinos Nikas, Georgios I. Goumas, Nectarios Koziris
2017  Redesigning Go's Built-In Map to Support Concurrent Operations.
Louis Jenkins, Tingzhe Zhou, Michael F. Spear
2017  SAM: Optimizing Multithreaded Cores for Speculative Parallelism.
Maleen Abeydeera, Suvinay Subramanian, Mark C. Jeffrey, Joel S. Emer, Daniel Sánchez
2017  Sthira: A Formal Approach to Minimize Voltage Guardbands under Variation in Networks-on-Chip for Energy Efficiency.
Raghavendra Pradyumna Pothukuchi, Amin Ansari, Bhargava Gopireddy, Josep Torrellas
2017  SuperGraph-SLP Auto-Vectorization.
Vasileios Porpodas
2017  Transparent Dual Memory Compression Architecture.
Seikwon Kim, Seonyoung Lee, Taehoon Kim, Jaehyuk Huh
2017  Weak Memory Models: Balancing Definitional Simplicity and Implementation Flexibility.
Sizhuo Zhang, Muralidaran Vijayaraghavan, Arvind