Part of the 2025 IEEE World Congress on SERVICES
July 7-12
Helsinki, Finland
IEEE INTERATIONAL CONFERENCE ON CLOUD COMPUTING
CLOUD 2025 - Conference Program
CLOUD-CON-S1 (Session I)
AI for Cloud Optimization and Deployment
10:50 – 12:00
Room: F4050
Session Chair: TBA
CLD_REG_24: Accelerating RL-Based Scheduler Adaptation with Transfer Learning in Evolving HPC Architectures
Lingfei Wang, Maria A. Rodriguez and Nir Lipovetzky
CLD_REG_83: LLM-Powered Automated Cloud Forensics: From Log Analysis to Investigation
Dalal Alharthi and Rozhin Yasaei
CLD_REG_84: Korel: Dynamic AMP-based Straggler Mitigation in Multi-Tenant Distributed Deep Learning Environments
Hyunseung Jung, Hyungjun Kim and Heonchang Yu
CLOUD Keynote
13:40 – 14:50
Room: F4050
Session Chair: TBA
Keynote: Wolfgang John
Going Beyond Connectivity Services in 6G
CLOUD-CON-S2 (Session II)
AI for Cloud Optimization and Deployment
15:10 – 16:20
Room: F4050
Session Chair: TBA
CLD_REG_171: Multi-agent Reinforcement Learning-based In-place Scaling Engine for Edge-cloud Systems
Jovan Prodanov, Blaz Bertalanic, Carolina Fortuna, Shih-Kai Chou, Matjaž Branko Jurič, Ramon Sanchez and Jernej Hribar
CLD_REG_175: Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework
Julien Soulé, Jean-Paul Jamont, Michel Occello, Louis-Marie Traonouez and Paul Théron
CON_REG_6: The IoT Whisperer: A Framework for Intelligent IoT Service Composition through LLMs
Ewan Warburton, Abdessalam Elhabbash, Saad Ezzini and Yehia Elkhatib
CLOUD-CON-S3 (Session III)
8:30 – 9:40
Resource Management, Scheduling, and Orchestration
Room: F4050
Session Chair: TBA
CLD_REG_23: Dynamic In-node Group-aware Scheduling for Multi-tenant Machine Learning Services on Kubernetes
Peini Liu and Jordi Guitart
CLD_REG_26: ESTHER: Application-First Hardware-Level QoS-Enforcement for Cloud Native Environments
Oliver Larsson, Thijs Metsch, Cristian Klein and Erik Elmroth
CLD_REG_138: Towards Secure Cloud-Native Computing: Unveiling Kubernetes Misconfigurations with Large Language Models
Mostafa Anouar Ghorab and Mohamed Aymen Saied
CLOUD Keynote
13:40 – 14:50
Room: F4050
Session Chair: TBA
Keynote: Victor Chang
From Cloud Computing to Software-Defined AI-based Neuromorphic Computing
CLOUD-CON-S4 (Session IV)
Resource Management, Scheduling, and Orchestration
15:10 – 16:20
Room: F4050
Session Chair: TBA
CLD_REG_59: Is Your Cluster Truly Fully Loaded? Exploring Shadow Resources in Host State Synchronization
Jiawen Liu, Yuehao Xu and Zhijun Ding
CLD_REG_98: Helm-ET: Reducing Exposure to Lateral Movement in Kubernetes Artifacts
Jacopo Bufalino, Jose Luis Martin-Navarro, Aleksi Peltonen and Tuomas Aura
CLD_REG_112: HeteroScheduler: Dynamic Task Scheduling for CPU-GPU Optimization and Contention Mitigation in Cloud Data Centers
Seokwon Choi and Hyeonsang Eom
CLOUD-CON-S5 (Session V)
Serverless & Function-as-a-Service
16:20 – 17:10
Room: F4050
Session Chair: TBA
CLD_REG_38: MOBOS: Co-optimizing Cost and Execution Time in Serverless Workflow with Multi-Objective Bayesian Optimization
Minjae Kang and Heonchang Yu
CLD_REG_49: Causal Latency Modelling for Cloud Microservices
Christopher Lohse, Diego Tsutsumi, Amadou Ba, Pavithra Harsha, Chitra Subramanian, Martin Straesser and Marco Ruffini
CLD_REG_56: HotSwap: Enabling Live Dependency Sharing in Serverless Computing
Rui Li, Devesh Tiwari and Gene Cooperman
CLOUD-CON-S6 (Session VI)
Infrastructure for ML & AI
8:30 – 9:40
Room: F4050
Session Chair: TBA
CLD_REG_20: Speeding up Model Loading with fastsafetensors
Takeshi Yoshimura, Tatsuhiro Chiba, Manish Sethi, Daniel Waddington and Swaminathan Sundararaman
CLD_REG_85: Cost-Efficient VM Selection for Cloud-Based LLM Inference with KV Cache Offloading
Kihyun Kim, Jinwoo Kim, Hyunsun Chung, Myung Hoon Cha, Hong-Yeon Kim and Youngjae Kim
CLD_REG_96: LOSSLESS COMPRESSION FOR AI MODELS
Moshik Hershcovitch, Andrew Wood, Leshem Choshen, Guy Girmonsky, Roy Leibovitz, Or Ozeri, Ilias Ennmouri, Michal Malka, Peter Chin, Swaminathan Sundararaman and Danny Harnik
CLOUD-CON-S7 (Session VII)
Programming Abstractions & Data Processing
11:10 – 12:30
Room: F4050
Session Chair: TBA
CLD_REG_100: Disk-Based Shared KV Cache Management for Fast Inference in Multi-Instance LLM RAG Systems
Hyungwoo Lee, Kihyun Kim, Jinwoo Kim, Jungmin So, Myung-Hoon Cha, Hongyeon Kim, Jongman Kim and Youngjae Kim
CLD_REG_130: ClusterLink: Redefining Application Connectivity for the Multi-cloud Era
Kfir Toledo, Pravein Govindan Kannan, Kathy Barabash, Etai Lev-Ran, Or Ozeri, Michal Malka, Vita Bortnikov and Ziv Nevo
CLD_REG_156: Precomputation-Optimized Lakehouse Architecture for Online Analytical Processing Tasks
Haida Zhang, Lin Sun, Zhengtong Zhang, Jiayang Xia, Ziang Huang, Jiansi Wang, Haopeng Chen, Yan Jiao and Yongming Xu
CLOUD Keynote
13:40 – 14:50
Room: F4050
Session Chair: TBA
Keynote: Jiannong Cao
Serving Collaborative Edge Computing for Ubiquitous AI
CLOUD-CON-S8 (Session VIII)
15:10 – 16:20
Room: F4050
Session Chair: TBA
CLD_REG_86: Energy-Aware Resource Allocation and Container Migration in Distributed Data Centers under Variable Energy Pricing: A Genetic Programming Hyper-Heuristic Approach
Mathew Falloon, Hui Ma and Gang Chen
CLD_REG_128: EnergyLess: An Energy-Aware Serverless Workflow Batch Orchestration on the Computing Continuum
Reza Farahani and Radu Prodan
CON_REG_56: Carbon-Aware Temporal Data Transfer Scheduling Across Cloud Datacenters
Elvis Rodrigues, Jacob Goldverg and Tevfik Kosar
CLOUD-CON-S9 (Session IX)
System Monitoring & Analysis
8:30 – 9:40
Room: F4050
Session Chair: TBA
CLD_REG_45: TraceWizard: End-to-End Distributed Tracing Across Host and Network Devices in Cloud
Kuangyuan Li, Jingrun Zhang, Pengfei Chen, Hongyang Chen, Ruipeng Hong, Wanqi Yang and Chen Sun
CLD_REG_119: Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference
Pol Garcia, Ferran Agullo, Yue Zhu, Chen Wang, Eun Kyung Lee, Olivier Tardieu, Jordi Torres and Josep Lluis Berral
CLD_REG_199: Efficient Microservice Monitoring via Kernel Transformation and FFT Forecasting
Marianna Ojanen, Maryam Sabzevari and Sandor Szedmak
CLOUD-CON-S10 (Session X)
11:10 – 12:20
Room: F4050
Session Chair: TBA
CLD_REG_52: Efficient Versioning for Unikernels
Gaulthier Gain, Benoît Knott and Laurent Mathy
CLD_REG_75: Real-Time Interference-Aware CPU and I/O Capping Mechanism for Multi-Tenant Containers
Mohammadreza Hoseinyfarahabady and Albert Zomaya
CLD_REG_191: SLO-Aware Container Orchestration on Kubernetes Clusters
Angelo Marchese and Orazio Tomarchio
CLOUD-CON-S11 (Session XI)
Edge Computing
13:40 – 14:50
Room: F4050
Session Chair: TBA
CLD_REG_40: ReSACO: A Meta Reinforcement Learning Method for Fast Offloading in Mobile Edge Computing
Myeong Jun Kim and Heonchang Yu
CLD_REG_53: MSTH-Former: Optimizing Workload Prediction in Edge-Cloud Continuum with Multi-Scale Temporal and Hierarchical Knowledge Convergence and Distillation
Sharmen Akhter and Eui-Nam Huh
CLD_REG_151: PROBA: Enhancing Serverless Edge Computing via Adaptive Task Scheduling and Probabilistic Resource Sharing
Manish Pandey, Byungchul Tak and Young-Woo Kwon
CLOUD-CON-S12 (Session XII)
Security, Quality, Consensus
15:10 – 16:20
Room: F4050
Session Chair: TBA
CLD_REG_1: Robust and Understandable Consensus in the Cloud
Pasindu Tennage, Lefteris Kokoris-Kogias and Antoine Desjardins
CLD_REG_70: An experimental validation of architectural measures for cloud-native quality evaluations
Robin Lichtenthäler and Guido Wirtz
CLD_REG_136: Routing Strategies for RoCE Networks in AI Clouds
Abdul Alim Alim, Ali Sydney, Liran Schour, Abdullah Kayi, Laurent Schares, Pavlos Maniotis, Anand Singh and Bengi Karacali-Akyamac
CLOUD-CON-S13 (Session XIII)
Network and Workload Optimization
16:20 – 17:55
Room: F4050
Session Chair: TBA
CLD_REG_57: QPS-Fit: An Efficient and Performant Parallel Algorithm for Hybrid Optical and Packet Switching
Dongzhao Song, Jingfan Meng, Qianru Yu and Jun Xu
CLD_REG_76: HEART: Heterogeneous-Aware Traffic Scheduling in Multi-Replica Deployments on Edge Cloud
Hokun Park, Heonchang Yu, Donggyun Kim, Hyungjun Kim and Gyujeong Lim
CLD_REG_81: Optimizing Receive Flow Steering for Mixed Traffic in High-Performance Cloud Datacenters
Junseo Jang and Jaehyun Hwang
CLD_REG_157: Avoiding Pitfalls in Networked Key-Value Store for Tiered Memory
Seungmin Shin, Leeju Kim, Wookyung Lee, Eyee Hyun Nam, Seungmin Kim, Bryan S. Kim, Sungjin Lee and Eunji Lee
CLOUD-CON-S14 (Session XIV)
Short & Work-in-Progress Papers
15:10 – 17:30
Room: F4050
Session Chair: TBA
CLD_WIP_91: Universal Workers: A Vision for Eliminating Cold Starts in Serverless Computing
Saman Akbari and Manfred Hauswirth
CLD_WIP_107: Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference
Yue Zhu, Hao Yu, Chen Wang, Zhuoran Liu and Eun Kyung Lee
CLD_WIP_113: Automated LLM Deployment and Evaluation: A Cloud-Native Approach Using LLM-as-a-Judge
Ansar Rafique and Brian Marsden
CLD_WIP_186: DNN-Adapt: Reinforcement Learning-based Hybrid Batching for Efficient DNN Serving
Milind Varma Penumathsa, Sai Venkat Malreddy and Liting Hu
CLD_WIP_205: Game-Theoretic Reinforcement Learning for Task Optimization under Time-Sensitive Constraints
Patrizio Dazzi, Emanuele Carlini and Matteo Mordacchini
CLD_WIP_66: Revisiting SQL Statement Logging for SQLite on AWS S3
Yewon Shin and Jonghyeok Park
CLD_SHT_62: Serverless Data Analytics (Finally) Bridging the Gap: Introducing the Ortzi DataFrame
Germán T. Eizaguirre, Marc Hostau and Marc Sánchez-Artigas
CLD_SHT_46: Temporal Fusion Transformer Based Vertical Scaling Management for Kubernetes
Kemalcan Bora, Elli Kartsakli and Eduardo Quiñones Moreno