In-person
1-4 April 2025



Tuesday April 1, 2025 15:04 - 15:09 BST
Recent years have seen a proliferation of large language models (LLMs) that extend beyond traditional language tasks to generative AI, including models like ChatGPT and Stable Diffusion. As this focus on generative AI continues to grow, there is a rising need for cloud native infrastructure that provides a solid and scalable multi-cluster machine learning platform.

This talk will explore how these needs are addressed by leveraging Volcano and Karmada, which together enable multi-cluster job queuing, job management, and enhanced scheduling.

This talk will cover:
- The challenges of LLM training in a single Kubernetes cluster
- How to combine Volcano and Karmada to build a multi-cluster training platform
- How to handle job queuing across multiple clusters to ensure fairness and meet SLAs
- How to balance workload performance against multi-cluster utilization
- Scheduling policies to avoid busy waiting and deadlock
Platinum Suite | Level 3