SBAC-PAD C

25 papers

YearTitle / Authors
202537th IEEE/SBC International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2025, Bonito, Mato Grosso do Sul, Brazil, October 28-31, 2025
2025A Distributed and Storage-Aware Approach to Large-Scale Cholesky Factorization.
Carla Cusihuallpa, Rodrigo Ceccato, Sandro Rigo, Guido Araujo, Hervé Yviquel
2025A Framework for Analytical Performance and Energy Prediction of DL Training on GPUs.
Roblex Nana Tchakoute, Claude Tadonki, Petr Dokládal, Youssef Mesri
2025A-Flow: managing dataflows on the computing continuum using abstract communication channels.
Catherine Alessandra Torres Charles, Dante D. Sánchez-Gallegos, Diana Carrizales-Espinoza, José Luis González Compeán, Jesús Carretero
2025Accelerating GNN Inference via Automated Parallel Execution on Edge Heterogeneous Platforms.
Yi-Chien Lin, Haoyang Fan, Sameh Gobriel, Nilesh Jain, Viktor K. Prasanna
2025Data Management in the Continuum: Cross-facility Object-based Data Transfers.
Jean Luca Bez, Houjun Tang, Chen Wang, Suren Byna
2025DynaMap: A Map Equation-based Parallel Algorithm for Detecting Communities on Dynamic Graphs.
Gabriel G. Dos Santos, Kartik Lakhotia, César A. F. De Rose
2025Efficient Multi-Workload Execution for Sustainable GPU Performance.
Matheus M. Costa, Philippe O. A. Navaux, Silvio Rizzi, Bronson Messer, Arthur Francisco Lorenzon
2025Evaluating Code Portability for Carbon-Efficient RTM Computing.
Arthur Francisco Lorenzon, Philippe O. A. Navaux, Alexandre Sardinha, Bronson Messer
2025Extraction and Representation of Sparsity Patterns for Efficient Data Transfer on Accelerators.
Yang Su, Toshiyuki Ichiba, Katsuhiro Yoda, Yasuhiro Watanabe, Takahide Yoshikawa, Tarek S. Abdelrahman
2025Fine-grained Communication Phase based Analytical Performance Modeling and Analysis.
Vishal Deka, Preeti Malakar
2025Generative Fabrication of Medical Images for Machine Learning Training.
Andres G. Calzada-Jasso, Andrei Tchernykh, Ixchel D. Avendaño-Pacheco, Jorge M. Cortés-Mendoza, Bernardo Pulido-Gaytan, Mikhail G. Babenko, Alfredo Goldman, Horacio González-Vélez
2025Heuristics for Energy-Efficient Instruction-Level Approximate Computing.
Gregório K. Neto, Felipe Sovernigo, Daniela Catelan, Ricardo Santos, Liana Duenha
2025Hierarchical Dynamic Multilevel Graph Partitioning for Load Balancing in Distributed Agent-Based Simulations.
Cristina Quesada Peralta, Eduardo César Galobardes, Andreu Moreno Vendrell, Anna Sikora
2025MIDAS: A Mapping Infrastructure for Configurable, Data-Streaming Based Domain Specific Accelerators.
Martim Bento, Nuno Neves, Pedro Tomás, Nuno Roma
2025Mobility-aware placement of service-composed applications on Cloud-Edge Continuum.
Paulo Roberto Albuquerque, Guilherme P. Koslovski, Maurício A. Pillon, Tiago C. Ferreto
2025Obstruction-Free Software Transactional Memory for GPUs.
Tiago Perlin, André Rauber Du Bois, Gerson G. H. Cavalheiro
2025Performance, Portability, and Productivity of HIP on GPUs with NAS Parallel Benchmarks.
Gabriell Alves de Araujo, Dalvan Griebler, Luiz Gustavo Fernandes
2025Profiler-Guided Execution of Recurrent OpenMP Task Graphs on Heterogeneous Clusters.
Rémy Neveu, Rodrigo Ceccato, Adrian Munera, Sara Royuela, Jose Manuel Monsalve Diaz, Hervé Yviquel
2025SPINN: a Tool for Distributed Patch Inference on Massive Data Samples.
João Seródio, Júlio César Faracco, Fernando Gubitoso, Otávio O. Napoli, Alan Souza, Daniel Miranda, Carlos A. Astudillo, Edson Borin
2025Scalable and Efficient Deep Learning for Diabetic Retinopathy Classification on ARM.
Thiago da Silva Araújo, Beatriz Schaan, Carla Maria Dal Sasso Freitas, Philippe O. A. Navaux
2025Spotting the Right Cloud Instances with Multiple AWS EC2 Fleets.
Daniel B. Sodré, Lucas Serrano, Miguel De Lima, Cristina Boeres, Lúcia M. A. Drummond, Vinod E. F. Rebello
2025Super-Stencil: A Memory-Efficient Superstep Wave Propagation Method for Seismic Imaging.
George Gigilas, Pedro S. Peixoto, Hermes Senger, Hervé Yviquel
2025TRAP: Time-Aware Probabilistic In-Dram RowHammer Solution.
Samiksha Verma, Virendra Singh
2025Towards Portability at Scale: A Cross-Architecture Performance Evaluation of a GPU-enabled Shallow Water Solver.
Johansell Villalobos, Daniel Caviedes-Voullième, Silvio Rizzi, Esteban Meneses