CLOUD B

49 papers

YearTitle / Authors
202518th IEEE International Conference on Cloud Computing, CLOUD 2025, Helsinki, Finland, July 7-12, 2025
Rong N. Chang, Carl K. Chang, Jingwei Yang, Nimanthi Atukorala, Dan Chen, Sumi Helal, Sasu Tarkoma, Qiang He, Tevfik Kosar, Claudio A. Ardagna, Feras Awaysheh, Volker Hilt, Yogesh Simmhan
2025Accelerating RL-Based Scheduler Adaptation with Transfer Learning in Evolving HPC Architectures.
Lingfei Wang, Maria A. Rodriguez, Nir Lipovetzky
2025An Experimental Validation of Architectural Measures for Cloud-Native Quality Evaluations.
Robin Lichtenthäler, Guido Wirtz
2025Automated LLM Deployment and Evaluation: A Cloud-Native Approach Using LLM-as-a-Judge.
Ansar Rafique, Brian D. Marsden
2025Avoiding Pitfalls in Networked Key-Value Store for Tiered Memory.
Seungmin Shin, Leeiu Kim, Wookyung Lee, Eyee Hyun Nam, Seungmin Kim, Bryan S. Kim, Sungjin Lee, Eunji Lee
2025Carbon-Aware Temporal Data Transfer Scheduling Across Cloud Datacenters.
Elvis Rodrigues, Jacob Goldverg, Tevfik Kosar
2025Causal Latency Modelling for Cloud Microservices.
Christopher Lohse, Diego Tsutsumi, Amadou Ba, Pavithra Harsha, Chitra Subramanian, Martin Straesser, Marco Ruffini
2025ClusterLink: Redefining Application Connectivity for the Multi-cloud Era.
Kfir Toledo, Pravein Govindan Kannan, Michal Malka, Etai Lev-Ran, Or Ozeri, Vita Bortnikov, Ziv Nevo, Kathy Barabash
2025Cost-Efficient VM Selection for Cloud-Based LLM Inference with KV Cache Offloading.
Kihyun Kim, Jinwoo Kim, Hyunsun Chung, Myung-Hoon Cha, Hong-Yeon Kim, Youngjae Kim
2025DNN-Adapt: Reinforcement Learning-Based Hybrid Batching for Efficient DNN Serving.
Milind Varma, Sai Venkat Malreddy, Liting Hu
2025Disk-Based Shared KV Cache Management for Fast Inference in Multi-Instance LLM RAG Systems.
Hyungwoo Lee, Kihyun Kim, Jinwoo Kim, Jungmin So, Myung-Hoon Cha, Hong-Yeon Kim, James J. Kim, Youngjae Kim
2025Dynamic In-node Group-Aware Scheduling for Multi-Tenant Machine Learning Services on Kubernetes.
Peini Liu, Jordi Guitart
2025ESTHER: Application-First Hardware-Level QoS-Enforcement for Cloud Native Environments.
Oliver Larsson, Thijs Metsch, Cristian Klein, Erik Elmroth
2025Efficient Microservice Monitoring Via Kernel Transformation and FFT Forecasting.
Marianna Ojanen, Maryam Sabzevari, Sándor Szedmák
2025Efficient Versioning for Unikernels.
Gaulthier Gain, Benoit Knott, Laurent Mathy
2025Energy-Aware Resource Allocation and Container Migration in Distributed Data Centers Under Variable Energy Pricing: A Genetic Programming Hyper-Heuristic Approach.
Mathew Falloon, Hui Ma, Gang Chen
2025EnergyLess: An Energy-Aware Serverless Workflow Batch Orchestration on the Computing Continuum.
Reza Farahani, Radu Prodan
2025Game-Theoretic Reinforcement Learning for Task Optimization Under Time-Sensitive Constraints.
Emanuele Carlini, Patrizio Dazzi, Matteo Mordacchini
2025HEART: Heterogeneous-Aware Traffic Allocation in Multi-Replica Deployments on Kubernetes.
Hokun Park, Donggyun Kim, Hyungjun Kim, Gyujeong Lim, HeonChang Yu
2025Helm-ET: Reducing Exposure to Lateral Movement in Kubernetes Artifacts.
Jacopo Bufalino, Jose Luiz Martin Navarro, Aleksi Peltonen, Tuomas Aura
2025HeteroScheduler: Dynamic Task Scheduling for CPU-GPU Optimization and Contention Mitigation in Cloud Data Centers.
Seokwon Choi, Hyeonsang Eom
2025HotSwap: Enabling Live Dependency Sharing in Serverless Computing.
Rui Li, Devesh Tiwari, Gene Cooperman
2025Is Your Cluster Truly Fully Loaded? Exploring Shadow Resources in Host State Synchronization.
Jiawen Liu, Yuehao Xu, Zhijun Ding
2025Korel: Mitigating Stragglers via Real-Time Automatic Mixed Precision in Distributed Deep Learning Environments.
Hyunseung Jung, Hyungjun Kim, HeonChang Yu
2025LLM-Powered Automated Cloud Forensics: From Log Analysis to Investigation.
Dalal Alharthi, Rozhin Yasaei
2025MOBOS: Co-Optimizing Cost and Execution Time in Serverless Workflow with Multi-Objective Bayesian Optimization.
Minjae Kang, HeonChang Yu
2025MSTH-Former: Optimizing Workload Prediction in Edge-Cloud Continuum with Multi-Scale Temporal and Hierarchical Knowledge Convergence and Distillation.
Sharmen Akhter, Eui-Nam Huh
2025Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference.
Pol G. Recasens, Ferran Agullo, Yue Zhu, Chen Wang, Eun Kyung Lee, Olivier Tardieu, Jordi Torres, Josep Lluís Berral
2025Multi-Agent Reinforcement Learning-Based In-Place Scaling Engine for Edge-Cloud Systems.
Jovan Prodanov, Blaz Bertalanic, Carolina Fortuna, Shih-Kai Chou, Matjaz B. Juric, Ramon Sanchez-Iborra, Jernej Hribar
2025Optimizing Receive Flow Steering for Mixed Traffic in High-Performance Cloud Datacenters.
Junseo Jang, Jaehyun Hwang
2025PROBA: Enhancing Serverless Edge Computing via Adaptive Task Scheduling and Probabilistic Resource Sharing.
Manish Pandey, Byungchul Tak, Young-Woo Kwon
2025Precomputation-Optimized Lakehouse Architecture for Online Analytical Processing Tasks.
Haida Zhang, Lin Sun, Zhengtong Zhang, Jiayang Xia, Ziang Huang, Jiansi Wang, Haopeng Chen, Yan Jiao, Yongming Xu
2025QPS- Fit: An Efficient and Performant Parallel Algorithm for Hybrid Optical and Packet Switching.
Dongzhao Song, Jingfan Meng, Qianru Yu, Jun Jim Xu
2025RACS-SADL: Robust and Understandable Randomized Consensus in the Cloud.
Pasindu Tennage, Antoine Desjardins, Lefteris Kokoris-Kogias
2025ReSACO: A Meta Reinforcement Learning Method for Fast Offloading in Mobile Edge Computing.
Myeongjun Kim, HeonChang Yu
2025Real-Time Interference-Aware CPU and I/O Capping Mechanism for Multi-Tenant Containers.
MohammadReza HoseinyFarahabady, Albert Y. Zomaya
2025Revisiting SQL Statement Logging for SQLite on AWS S3.
Yewon Shin, Jonghyeok Park
2025Routing Strategies for RoCE Networks in AI Clouds.
Abdul Alim, Ali Sydney, Liran Schour, Abdullah Kayi, Laurent Schares, Pavlos Maniotis, Anand Singh, Bengi Karacali
2025SLO-Aware Container Orchestration on Kubernetes Clusters.
Angelo Marchese, Orazio Tomarchio
2025Serverless Data Analytics (Finally) Bridging the Gap: Introducing the Ortzi DataFrame.
Germán T. Eizaguirre, Marc Hostau, Marc Sánchez Artigas
2025Speeding up Model Loading with Fastsafetensors.
Takeshi Yoshimura, Tatsuhiro Chiba, Manish Sethi, Daniel G. Waddington, Swaminathan Sundararaman
2025Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework.
Julien Soulé, Jean-Paul Jamont, Michel Occello, Louis-Marie Traonouez, Paul Théron
2025Temporal Fusion Transformer Based Vertical Scaling Management for Kubernetes.
Kemalcan Bora, Elli Kartsakli, Eduardo Quiñones Moreno
2025The IoT Whisperer: A Framework for Intelligent IoT Service Composition Through LLMs.
Ewan Warburton, Abdessalam Elhabbash, Saad Ezzini, Yehia Elkhatib
2025Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference.
Yue Zhu, Hao Yu, Chen Wang, Zhuoran Liu, Eun Kyung Lee
2025Towards Secure Cloud-Native Computing: Unveiling Kubernetes Misconfigurations with Large Language Models.
Mostafa Anouar Ghorab, Mohamed Aymen Saied
2025TraceWizard: End-to-End Distributed Tracing Across Host and Network Devices in Cloud.
Kuangyuan Li, Jingrun Zhang, Pengfei Chen, Hongyang Chen, Ruipeng Hong, Wanqi Yang, Chen Sun
2025Universal Workers: A Vision for Eliminating Cold Starts in Serverless Computing.
Saman Akbari, Manfred Hauswirth
2025ZipNN: Lossless Compression for AI Models.
Moshik Hershcovitch, Andrew Wood, Leshem Choshen, Guy Girmonsky, Roy Leibovitz, Or Ozeri, Ilias Ennmouri, Michal Malka, Sang (Peter) Chin, Swaminathan Sundararaman, Danny Harnik