CLUSTER C

68 papers

YearTitle / Authors
2022A Comprehensive I/O Knowledge Cycle for Modular and Automated HPC Workload Analysis.
Zhaobin Zhu, Sarah Neuwirth, Thomas Lippert
2022A Performance Evaluation of Adaptive MPI for a Particle-In-Cell Code.
Christian Asch, Diego Jiménez, Markus Rampp, Erwin Laure, Esteban Meneses
2022A framework for hierarchical single-copy MPI collectives on multicore nodes.
George Katevenis, Manolis Ploumidis, Manolis Marazakis
2022ACCLAiM: Advancing the Practicality of MPI Collective Communication Autotuning Using Machine Learning.
Michael Wilkins, Yanfei Guo, Rajeev Thakur, Peter A. Dinda, Nikos Hardavellas
2022ALBADross: Active Learning Based Anomaly Diagnosis for Production HPC Systems.
Burak Aksar, Efe Sencan, Benjamin Schwaller, Omar Aaziz, Vitus J. Leung, Jim M. Brandt, Brian Kulis, Ayse K. Coskun
2022An Analysis of Performance Variability on Dragonfly+topology.
Majid Salimi Beni, Biagio Cosenza
2022An Asynchronous Parallel Algorithm to Improve the Scalability of Finite Element Solvers.
Zhuo Tian, Changyou Zhang
2022An Efficient Sparse CNNs Accelerator on FPGA.
Yonghua Zhang, Hongxu Jiang, Xiaobin Li, Haojie Wang, Dong Dong, Yongxiang Cao
2022Assessing the State of Autovectorization Support based on SVE.
Bine Brank, Dirk Pleiter
2022Assessment of the I/O and Storage Subsystem in Modular Supercomputing Architectures.
Sarah Neuwirth
2022AutoPipe: A Fast Pipeline Parallelism Approach with Balanced Partitioning and Micro-batch Slicing.
Weijie Liu, Zhiquan Lai, Shengwei Li, Yabo Duan, Keshi Ge, Dongsheng Li
2022Automatic vehicle counting area creation based on vehicle Deep Learning detection and DBSCAN.
Gerardo Alvarez Piña, Eduardo Ulises Moya-Sánchez, Abraham Sánchez Pérez, Ulises Cortés
2022BALA-CPD: BALanced and Asynchronous Distributed Tensor Decomposition.
Zheng Miao, Jiajia Li, Jon C. Calhoun, Rong Ge
2022Be SMART, Save I/O: A Probabilistic Approach to Avoid Uncorrectable Errors in Storage Systems.
Md. Arifuzzaman, Masudul Hasan Masud Bhuiyan, Mehmet Gümüs, Engin Arslan
2022Bridging the Gap between Application Performance Analysis and System Monitoring.
Thomas Ilsche, Mario Bielert, Christian von Elm
2022Bring the BitCODE-Moving Compute and Data in Distributed Heterogeneous Systems.
Wenbin Lu, Luis E. Peña, Pavel Shamis, Valentin Churavy, Barbara M. Chapman, Steve Poole
2022Call Scheduling to Reduce Response Time of a FaaS System.
Pawel Zuk, Bartlomiej Przybylski, Krzysztof Rzadca
2022Deadlock Detection for MPI Programs Based on Refined Match-sets.
Shushan Li, Meng Wang, Hong Zhang
2022Distributed Continuation Stealing is More Scalable than You Might Think.
Shumpei Shiina, Kenjiro Taura
2022Early Experiences of Noise-Sensitivity Performance Analysis of a Distributed Deep Learning Framework.
Elvis Rojas, Michael Knobloch, Nour Daoud, Esteban Meneses, Bernd Mohr
2022Efficient Hierarchical State Vector Simulation of Quantum Circuits via Acyclic Graph Partitioning.
Bo Fang, M. Yusuf Özkaya, Ang Li, Ümit V. Çatalyürek, Sriram Krishnamoorthy
2022Empirical Study on the GPU-accelerated HPL Performance: Effects of PCIe Communication.
Jieun Choi, Yosang Jeong, Ji Hoon Kang, Gibeom Gu, Hoon Ryu
2022Enabling Dynamic Virtual Frequency Scaling for Virtual Machines in the Cloud.
Emile Cadorel, Romain Rouvoy
2022Exploring Light-weight Cryptography for Efficient and Secure Lossy Data Compression.
Ruiwen Shan, Sheng Di, Jon C. Calhoun, Franck Cappello
2022Extracting and characterizing I/O behavior of HPC workloads.
Hariharan Devarajan, Kathryn M. Mohror
2022FaaSt: Optimize makespan of serverless workflows in federated commercial FaaS.
Sashko Ristov, Philipp Gritsch
2022Fast Dynamic Updates and Dynamic SpGEMM on MPI-Distributed Graphs.
Alexander van der Grinten, Geert Custers, Duy Le Thanh, Henning Meyerhenke
2022Fast(er) Construction of Round-optimal $n$-Block Broadcast Schedules.
Jesper Larsson Träff
2022H2M: Towards Heuristics for Heterogeneous Memory.
Clément Foyer, Brice Goglin, Emmanuel Jeannot, Jannis Klinkenberg, Anara Kozhokanova, Christian Terboven
2022HPC Storage Service Autotuning Using Variational- Autoencoder -Guided Asynchronous Bayesian Optimization.
Matthieu Dorier, Romain Egele, Prasanna Balaprakash, Jaehoon Koo, Sandeep Madireddy, Srinivasan Ramesh, Allen D. Malony, Robert B. Ross
2022HPH: Hybrid Parallelism on Heterogeneous Clusters for Accelerating Large-scale DNNs Training.
Yabo Duan, Zhiquan Lai, Shengwei Li, Weijie Liu, Keshi Ge, Peng Liang, Dongsheng Li
2022High Performance Adaptive Physics Refinement to Enable Large-Scale Tracking of Cancer Cell Trajectory.
Daniel F. Puleri, Sayan Roychowdhury, Peter Balogh, John Gounley, Erik W. Draeger, Jeff Ames, Adebayo Adebiyi, Simbarashe Chidyagwai, Benjamín Hernández, Seyong Lee, Shirley V. Moore, Jeffrey S. Vetter, Amanda Randles
2022Hvac: Removing I/O Bottleneck for Large-Scale Deep Learning Applications.
Awais Khan, Arnab K. Paul, Christopher Zimmer, Sarp Oral, Sajal Dash, Scott Atchley, Feiyi Wang
2022Hybrid Analysis of Fusion Data for Online Understanding of Complex Science on Extreme Scale Computers.
Eric Suchyta, Jong Youl Choi, Seung-Hoe Ku, David Pugmire, Ana Gainaru, Kevin A. Huck, Ralph Kube, Aaron Scheinberg, Frédéric Suter, Choong-Seock Chang, Todd S. Munson, Norbert Podhorszki, Scott Klasky
2022IEEE International Conference on Cluster Computing, CLUSTER 2022, Heidelberg, Germany, September 5-8, 2022
2022Impact of Containerization on Low-Cost Post Moore Computing Architectures.
Pablo Josue Rojas Yepes, Carlos Jaime Barrios Hernández, Luiz Angelo Steffenel
2022IncProf: Efficient Source-Oriented Phase Identification for Application Behavior Understanding.
Omar Aaziz, Mohammad Al-Tahat, Strahinja Trecakov, Jonathan Cook
2022Integrating process, control-flow, and data resiliency layers using a hybrid Fenix/Kokkos approach.
Matthew Whitlock, Nicolas M. Morales, George Bosilca, Aurelien Bouteiller, Bogdan Nicolae, Keita Teranishi, Elisabeth Giem, Vivek Sarkar
2022LDMS Darshan Connector: For Run Time Diagnosis of HPC Application I/O Performance.
Sara Walton, Omar Aaziz, Ana Luisa Veroneze Solórzano, Benjamin Schwaller
2022Last-mile Matters: Mitigating the Tail Latency of Virtualized Networks with Multipath Data Plane.
Dian Shen, Yi Zhai, Fang Dong, Junzhou Luo
2022Learning tenant behavior and evolutionary approach for demand response in colocation datacenters.
Jonathan Muraña, Santiago Iturriaga, Sergio Nesmachnow
2022Lossy all-to-all exchange for accelerating parallel 3-D FFTs on hybrid architectures with GPUs.
Sébastien Cayrols, Jiali Li, George Bosilca, Stanimire Tomov, Alan Ayala, Jack J. Dongarra
2022MRSch: Multi-Resource Scheduling for HPC.
Boyang Li, Yuping Fan, Matthew T. Dearing, Zhiling Lan, Paul Rich, William E. Allcock, Michael E. Papka
2022Matching-based Scheduling of Asynchronous Data Processing Workflows on the Computing Continuum.
Narges Mehran, Zahra Najafabadi Samani, Dragi Kimovski, Radu Prodan
2022MemGaze: Rapid and Effective Load-Level Memory Trace Analysis.
Ozgur O. Kilic, Nathan R. Tallent, Yasodha Suriyakumar, Chenhao Xie, Andrés Márquez, Stéphane Eranian
2022On Using Linux Kernel Huge Pages with FLASH, an Astrophysical Simulation Code.
Alan C. Calder, Catherine Feldman, Eva Siegmann, John Dey, Anthony Curtis, Smeet Chheda, Robert J. Harrison
2022Optimizations of H-matrix-vector Multiplication for Modern Multi-core Processors.
Tetsuya Hoshino, Akihiro Ida, Toshihiro Hanawa
2022Optimizing Irregular-Shaped Matrix-Matrix Multiplication on Multi-Core DSPs.
Shangfei Yin, Qinglin Wang, Ruochen Hao, Tianyang Zhou, Songzhu Mei, Jie Liu
2022PYTHIA: an oracle to guide runtime system decisions.
Alexis Colin, François Trahay, Denis Conan
2022Painless Transposition of Reproducible Distributed Environments with NixOS Compose.
Quentin Guilloteau, Jonathan Bleuzen, Millian Poquet, Olivier Richard
2022Performance analysis of a state vector quantum circuit simulation on A64FX processor.
Miwako Tsuji, Mitsuhisa Sato
2022Performance of an Astrophysical Radiation Hydrodynamics Code under Scalable Vector Extension Optimization.
Dennis C. Smolarski, F. Douglas Swesty, Alan C. Calder
2022Productivity meets Performance: Julia on A64FX.
Mosè Giordano, Milan Klöwer, Valentin Churavy
2022Protecting Metadata Servers From Harm Through Application-level I/O Control.
Ricardo Macedo, Mariana Miranda, Yusuke Tanimura, Jason Haga, Amit Ruhela, Stephen Lien Harrell, Richard Todd Evans, João Paulo
2022Pushing the Boundaries of Small Tasks: Scalable Low-Overhead Data-Flow Programming in TTG.
Joseph Schuchart, Poornima Nookala, Thomas Hérault, Edward F. Valeev, George Bosilca
2022Recursive Multi-Section on the Fly: Shared-Memory Streaming Algorithms for Hierarchical Graph Partitioning and Process Mapping.
Marcelo Fonseca Faraj, Christian Schulz
2022SKV: A SmartNIC-Offloaded Distributed Key-Value Store.
Shangyi Sun, Rui Zhang, Ming Yan, Jie Wu
2022SVAGC: Garbage Collection with a Scalable Virtual Address Swapping Technique.
Ismail Ataie, Weikuan Yu
2022Scalable Architectures to Support Sustainable Advanced Information Technologies.
Oscar Carrillo, Carlos Jaime Barrios Hernández, Frédéric Le Mouël, Harold Enrique Castro Barrera, Yves Denneulin, José Tiberio Hernández, Fernando Jiménez Vargas, Lola Xiomara Bautista Rozo, Claudia Roncancio, Michel Riveill
2022Shasta Log Aggregation, Monitoring and Alerting in HPC Environments with Grafana Loki and ServiceNow.
Elizabeth Bautista, Nitin Sukhija, Siqi Deng
2022Spark Meets MPI: Towards High-Performance Communication Framework for Spark using MPI.
Kinan Al-Attar, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda
2022The Cost of Flexibility: Embedded versus Discrete Routers in CGRAs for HPC.
Boma A. Adhi, Carlos Cortes, Yiyu Tan, Takuya Kojima, Artur Podobas, Kentaro Sano
2022The role of storage target allocation in applications' I/O performance with BeeGFS.
Francieli Boito, Guillaume Pallez, Luan Teylo
2022Towards Real- Time Classification of HPC Workloads via Out-of-band Telemetry.
Steven Presser
2022Towards Virtual Certification of Gas Turbine Engines With Performance-Portable Simulations.
Gihan R. Mudalige, István Z. Reguly, Arun Prabhakar, Dario Amirante, Leigh Lapworth, Stephen A. Jarvis
2022What does Inter-Cluster Job Submission and Execution Behavior Reveal to Us?
Tirthak Patel, Devesh Tiwari, Raj Kettimuthu, William E. Allcock, Paul Rich, Zhengchun Liu
2022ecoHMEM: Improving Object Placement Methodology for Hybrid Memory Systems in HPC.
Marc Jordà, Siddharth Rai, Eduard Ayguadé, Jesús Labarta, Antonio J. Peña
2022fairDMS: Rapid Model Training by Data and Model Reuse.
Ahsan Ali, Hemant Sharma, Rajkumar Kettimuthu, Peter Kenesei, Dennis Trujillo, Antonino Miceli, Ian T. Foster, Ryan N. Coffee, Jana Thayer, Zhengchun Liu