In-person
1-4 April 2025

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Europe 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is displayed in British Summer Time (BST) (UTC +1). The schedule is subject to change, and session seating is available on a first-come, first-served basis.
Wednesday April 2, 2025 11:15 - 11:45 BST
Scaling a Kubernetes platform is no fairy tale—it’s a quest with unexpected twists, chaos, and the occasional missing treasure map. In this talk, we’ll recount our journey taming the complexity of multi-cluster platforms with SLIs, SLOs, and observability dashboards.
From defining meaningful metrics to designing actionable SLO dashboards, we’ll share insights, lessons learned, and practical tips for maintaining platform reliability, whether you’re deploying in the cloud, on-prem, or in a hybrid environment. Through real-life lessons and battle-tested strategies, we’ll dive into the role of SLIs and SLOs in helping ensure platform robustness, discuss how to design platform observability, and highlight best practices for maintaining reliability at scale. You’ll leave equipped with the knowledge to design observability practices that help your AI workloads run smoothly, even at scale. Join us as we demystify SLI/SLO strategies with practical examples from our AI platform.
Speakers

Ankita Chaudhari

Senior Technical Product Manager, Bloomberg
Ankita is a Senior Technical Product Manager for the AI Platforms team in the Office of the CTO at Bloomberg. She focuses on the product strategy and development of cutting-edge solutions that power GenAI workloads at scale. She drives initiatives that involve optimizing performance...

Alexa Nicole Griffith

Senior Software Engineer, Bloomberg LP
Alexa Griffith is a Senior Software Engineer in Bloomberg’s Cloud Native Compute Services organization. She works on building an inference platform for ML workflows and on the open source project KServe. She enjoys solving engineering challenges at scale and writing code in Go. She...
Level 1 | Hall Entrance S10 | Room B
  Platform Engineering
  • Content Experience Level: Any
