The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Europe 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.
Please note: This schedule is automatically displayed in British Summer Time (BST) (UTC +1). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis.
Sign up or log in to bookmark your favorites and sync them to your phone or calendar.
Recent years have seen a proliferation of large language models (LLMs) that extend beyond traditional language tasks to generative AI. This includes models like ChatGPT and Stable Diffusion. As this generative AI focus continues to grow, there is a rising need for a cloud native infrastructure that
Provides solid and scalable multi-cluster machine learning platform.
This talks will explore how these rising needs are addressed by leverage Volcano and Karmada that enable multi-cluster job queuing, management, enhanced scheduling.
This talk will cover: - The challenges for LLM training in single Kubernetes cluster\ - How to combine Volcano and Karmada to build a multi-cluster training platform - How to handle to job queuing across multi-cluster to ensure the fairness and SLA - How to balance the workload performance and multi-cluster utilization. - Scheduling policies to avoid busy waiting and dead lock