Loading…
In-person
1-4 April 2025
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Europe 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in British Summer Time (BST) (UTC +1). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis. 
Wednesday April 2, 2025 12:00 - 12:30 BST
Scaling Prometheus to support 50 billion active time series across 20 regions on Kubernetes is a monumental challenge. This session delves into the architecture, processes, and tools that make it possible. We will explore the design of stateful sets and zone-aware deployments to ensure reliability and scalability, alongside deployment processes tailored for high availability and fault tolerance. Learn how cellular architecture enables granular scaling and fault isolation, and discover our approach to multi-tenancy, including protection mechanisms against noisy neighbors such as shuffle sharding, throttling with token buckets . We’ll also discuss the journey of scaling each cell to 1 billion active time series, highlighting the Kubernetes challenges we faced and solved along the way. Attendees will leave with actionable insights into building resilient, efficient, and scalable systems using Kubernetes in the cloud-native ecosystem.
Speakers
avatar for Alan Protasio

Alan Protasio

Software Developer Enginner, AWS
Alan is a core contributor and maintainer of Cortex and currently serves as a Senior Software Engineer at AWS, where he works on the Amazon Managed Prometheus Service. With over 15 years of experience in the tech industry, Alan has played a pivotal role in shaping several AWS services... Read More →
avatar for Orcun Berkem

Orcun Berkem

Principal Engineer, AWS
Orcun is a seasoned engineer with expertise in building scalable, resilient systems and leading large teams. As a Principal Engineer at AWS Open Source Observability, he focuses on scaling Cortex, along with working on AWS Distribution of OpenTelemetry, Grafana, and OpenSearch, and... Read More →
Wednesday April 2, 2025 12:00 - 12:30 BST
Level 1 | Hall Entrance S10 | Room C
  Operations + Performance

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link