SBAC-PAD C

35 papers

YearTitle / Authors
20222022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Bordeaux, France, November 2-5, 2022
2022A Multi-GPU Python Solver for Low-Temperature Non-Equilibrium Plasmas.
James Almgren-Bell, Nader Al Awar, Dilip S. Geethakrishnan, Milos Gligoric, George Biros
2022A Test for FLOPs as a Discriminant for Linear Algebra Algorithms.
Aravind Sankaran, Paolo Bientinesi
2022A predictive approach for dynamic replication of operators in distributed stream processing systems.
Daniel Wladdimiro, Luciana Arantes, Pierre Sens, Nicolas Hidalgo
2022An MPI-Parallel Algorithm for Static and Dynamic Top-k Harmonic Centrality.
Alexander van der Grinten, Geert Custers, Duy Le Thanh, Henning Meyerhenke
2022Analyzing Power Decisions in Data Center Powered by Renewable Sources.
Igor Fontana De Nardin, Patricia Stolf, Stéphane Caux
2022Approximate Memory with Protected Static Allocation.
João Fabrício Filho, Isaías B. Felzmann, Lucas Wanner
2022Automatic aggregation of subtask accesses for nested OpenMP-style tasks.
Omar Shaaban, Jimmy Aguilar Mena, Vicenç Beltran, Paul M. Carpenter, Eduard Ayguadé, Jesús Labarta Mancho
2022Avoiding Unnecessary Caching with History-Based Preemptive Bypassing.
Arthur M. Krause, Paulo C. Santos, Philippe O. A. Navaux
2022Characterizing Prefetchers using CacheObserver.
Guillaume Didier, Clémentine Maurice, Antoine Geimer, Walid J. Ghandour
2022Convergence of HPC and Big Data in extreme-scale data analysis through the DCEx programming model.
Javier García-Blas, Javier Fernández Muñoz, Jesús Carretero, Fabrizio Marozzo, Domenico Talia, Paolo Trunfio, Alberto Fernández-Pena, Daniel Martín de Blas
2022Convolution Operators for Deep Learning Inference on the Fujitsu A64FX Processor.
Manuel F. Dolz, Héctor Martínez, Pedro Alonso, Enrique S. Quintana-Ortí
2022Dynamic Set Stealing to Improve Cache Performance.
Brady Testa, Samira Mirbagher Ajorpaz, Daniel A. Jiménez
2022Efficient Strategies for Graph Pattern Mining Algorithms on GPUs.
Samuel Ferraz, Vinícius Vitor dos Santos Dias, Carlos H. C. Teixeira, George Teodoro, Wagner Meira Jr.
2022Exploring the Effects of Silent Data Corruption in Distributed Deep Learning Training.
Elvis Rojas, Diego Pérez, Esteban Meneses
2022FiBHA: Fixed Budget Hybrid CNN Accelerator.
Fareed Qararyah, Muhammad Waqar Azhar, Pedro Trancoso
2022IntP: Quantifying cross-application interference via system-level instrumentation.
Miguel G. Xavier, Carlos H. C. Cano, Vinícius Meyer, César A. F. De Rose
2022Ion-Molecule Collision Cross-Section Simulation using Linked-cell and Trajectory Parallelization.
Samuel Cajahuaringa, Leandro N. Zanotto, Daniel L. Z. Caetano, Sandro Rigo, Hervé Yviquel, Munir S. Skaf, Guido Araujo
2022Memory-Side Acceleration and Sparse Compression for Quantized Packed Convolutions.
Alex Weaver, Krishna Kavi, Pranathi Vasireddy, Gayatri Mehta
2022Metrics for Packing Efficiency and Fairness of HPC Cluster Batch Job Scheduling.
Alexander V. Goponenko, Kenneth Lamar, Christina L. Peterson, Benjamin A. Allan, Jim M. Brandt, Damian Dechev
2022Mitigating Unnecessary Throttling in Linux CFS Bandwidth Control.
Odin Ugedal, Rakesh Kumar
2022Mixed and Multi-Precision SpMV for GPUs with Row-wise Precision Selection.
Erhan Tezcan, Tugba Torun, Fahrican Kosar, Kamer Kaya, Didem Unat
2022NUMA-Aware Dense Matrix Factorizations and Inversion with Look-Ahead on Multicore Processors.
Sandra Catalán, Francisco D. Igual, Rafael Rodríguez-Sánchez, José R. Herrero, Enrique S. Quintana-Ortí
2022Optimizing Execution Time and Costs of Cross-Silo Federated Learning Applications with Datasets on different Cloud Providers.
Rafaela C. Brum, Pierre Sens, Luciana Arantes, Maria Clicia Stelling de Castro, Lúcia Maria de A. Drummond
2022Parallelizing Git Checkout: a Case Study of I/O Parallelism.
Matheus Tavares Bernardino, Alfredo Goldman
2022Performance Improvements of Parallel Applications thanks to MPI-4.0 Hints.
Maxim Moraru, Adrien Roussel, Hugo Taboada, Christophe Jaillet, Marc Pérache, Michaël Krajecki
2022Prof5: A RISC-V profiler tool.
Jonathas Silveira, Lucas Castro, Victor Araújo, Rodrigo Zeli, Daniel Lazari, Marcelo Guedes, Rodolfo Azevedo, Lucas Wanner
2022STEER: Asymmetry-aware Energy Efficient Task Scheduler for Cluster-based Multicore Architectures.
Jing Chen, Madhavan Manivannan, Bhavishya Goel, Mustafa Abduljabbar, Miquel Pericàs
2022Seriema: RDMA-based Remote Invocation with a Case-Study on Monte-Carlo Tree Search.
Hammurabi Mendes, Bryce Wiedenbeck, Aidan O'Neill
2022Setting up an experimental framework for analysing an immersion cooling system.
Thierry Arrabal, Lucas Betencourt, Eddy Caron, Laurent Lefèvre
2022Strategies for Fault-Tolerant Tightly-Coupled HPC Workloads Running on Low-Budget Spot Cloud Infrastructures.
Vanderlei Munhoz, Márcio Castro, Odorico M. Mendizabal
2022Study of the Processor and Memory Power and Energy Consumption of Coupled Sparse/Dense Solvers.
Emmanuel Agullo, Marek Felsöci, Amina Guermouche, Hervé Mathieu, Guillaume Sylvand, Bastien Tagliaro
2022TCUDA: A QoS-based GPU Sharing Framework for Autonomous Navigation Systems.
Pangbo Sun, Hao Wu, Jiangming Jin, Ziyue Jiang, Yifan Gong
2022Taming the Big Data Monster: Managing Petabytes of Data with Multi-Model Databases.
Yang Chen, Feng Zhang, Yinhao Hong, Yunpeng Chai, Wei Lu, Hong Chen, Xiaoyong Du, Peipei Wang, Le Mi, Jintao Li, Xilin Tang, Yanliang Zhou, Wei Zhou, Peng Zhang, Fengyi Chen, Pengfei Li, Yu Li
2022gem5-ndp: Near-Data Processing Architecture Simulation From Low Level Caches to DRAM.
João Vieira, Nuno Roma, Gabriel Falcão, Pedro Tomás