KubeCon + CloudNativeCon Europe 2025: Full Schedule

In-person
1-4 April 2025

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Europe 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is displayed in British Summer Time (BST) (UTC +1). The schedule is subject to change and session seating is available on a first-come, first-served basis.

11:15 BST

Dashboards & Dragons: Crafting SLOs To Tame the AI Platform Chaos - Alexa Griffith & Ankita Chaudhari, Bloomberg

Wednesday April 2, 2025 11:15 - 11:45 BST

Level 1 | Hall Entrance S10 | Room B

Scaling a Kubernetes platform is no fairy tale—it’s a quest with unexpected twists, chaos, and the occasional missing treasure map. In this talk, we’ll recount our journey taming the complexity of multi-cluster platforms with SLIs, SLOs, and observability dashboards.
From defining meaningful metrics to designing actionable SLO dashboards, we’ll share insights, lessons learned, and practical tips for maintaining platform reliability, whether you’re deploying in the cloud, on-prem, or in a hybrid environment. Through real-life lessons and battle-tested strategies, we’ll dive into the role of SLIs and SLOs in helping ensure platform robustness, discuss how to design platform observability, and highlight best practices for maintaining reliability at scale. You’ll leave equipped with the knowledge to design observability practices that ensure your AI workloads run smoothly, even at scale. Join us as we demystify SLI/SLO strategies with practical examples from our AI platform.

Speakers

Ankita Chaudhari

Senior Technical Product Manager, Bloomberg

Ankita is a Senior Technical Product Manager for the AI Platforms team in the Office of the CTO at Bloomberg. She focuses on the product strategy and development of cutting-edge solutions that power GenAI workloads at scale. She drives initiatives that involve optimizing performance...

Alexa Nicole Griffith

Senior Software Engineer, Bloomberg LP

Alexa Griffith is a Senior Software Engineer in Bloomberg’s Cloud Native Compute Services organization. She works on building an inference platform for ML workflows and the open source project KServe. She enjoys solving engineering challenges at scale and writing code in Go. She...

Platform Engineering

Content Experience Level Any

11:15 BST

Lessons Learned From Architecting the Highest-scale Operational Systems in the World - Artur Bergman, Fastly

Wednesday April 2, 2025 11:15 - 11:45 BST

Level 0 | ICC Capital Hall | Room 2

Platform engineering for accelerating modern, resilient cloud-native systems requires a ruthless focus on the experience of both your customers and your developers. Restrictive vendor experiences, made worse by overreliance on single-point solutions, and the isolated bash script approaches from the past introduce unacceptable compromises to performance, security, and quality for continuous operations. As the founder and CTO of Fastly, Artur Bergman has spent decades optimizing the vendors in his stack and how he uses them to build a cohesive developer toolchain for Fastly’s internal teams and customer platform teams worldwide. This talk will cover: lessons learned from testing the limits of vendor systems to meet business needs, evaluating when to build versus buy platform engineering systems from first principles, and how to apply a rigorous experience design lens when architecting platforms for team success.

Speakers

Artur Bergman

Founder and CTO, Fastly

Artur Bergman currently serves as Chief Technology Officer of Fastly, Inc., a leading edge cloud platform. Artur founded Fastly in 2011 and served as its CEO until 2020, guiding the company through its IPO in 2019. Prior to becoming CTO in 2024, he held the role of Chief Architect...

Platform Engineering

Content Experience Level Intermediate

12:00 BST

Day-2’000 - Migration From Kubeadm+Ansible To ClusterAPI+Talos: A Swiss Bank’s Journey - Clément Nussbaumer, PostFinance

Wednesday April 2, 2025 12:00 - 12:30 BST

Level 1 | Hall Entrance S10 | Room B

Is it even possible to migrate 35 clusters in an air-gapped environment with a custom PKI to ClusterAPI without downtime? We'll show you why and how this can be pulled off, and how you could do the same.

The journey starts with our legacy provisioning setup (a mix of kubeadm, Ansible, and Puppet), followed by the migration path and tooling. Along the road, we'll discover a series of challenges such as loss of etcd quorum, matching legacy/new kube-apiserver configuration, mismatching etcd encryption keys, and more.

After a live demo of a migration, the session explores managing our fleet of clusters with ArgoCD (with a focus on simple Talos configuration files in our repositories thanks to a few templating tricks, and a clean ClusterAPI workload cluster overview through ArgoCD ApplicationSets).

The presentation concludes by addressing a critical puzzle: solving the chicken/egg bootstrapping problem of the first ClusterAPI management cluster(s).

Speakers

Clément Nussbaumer

Systems Engineer, PostFinance

🇨🇭 Systems Engineer living on a farm 🐄 Kubernetes clusters during the day, helping out on the farm whenever needed, and playing music in the evening 🎺

Platform Engineering

Content Experience Level Intermediate

12:00 BST

Leveraging Internal Knowledge: Building AiKA at Spotify - Majd Salman & Jofre Mateu Matesanz, Spotify

Wednesday April 2, 2025 12:00 - 12:30 BST

Level 0 | ICC Capital Hall | Room 2

In the fast-paced world of technology, access to the right information at the right time is crucial for innovation and efficiency. Enter AiKA, Spotify's RAG-based internal “artificial intelligence knowledge assistance” platform, designed to empower our developers by providing instant access to the vast pool of internal knowledge through various surfaces. We'll cover why we developed AiKA, detailing the challenges of managing and retrieving information across a large organization. Learn about the technologies and methodologies we employed and how we integrated AiKA seamlessly into our existing infrastructure.

We'll highlight how AiKA's flexible API allows engineers to ingest their own custom knowledge, tailoring the tool to meet the unique needs of different teams. Discover how it not only enhances productivity but also fosters a culture of self-service and continuous learning.

Speakers

Jofre Mateu Matesanz

Software Engineer, Spotify

Jofre is a Senior Data Engineer at Spotify with a focus on making internal knowledge assistance and productivity tools for engineers.

Majd Salman

Senior Data Engineer, Spotify

Majd Salman is a Senior Data Engineer at Spotify with a focus on making internal knowledge assistance and productivity tools for engineers.

Platform Engineering

Content Experience Level Intermediate

14:30 BST

Don't Write Controllers Like Charlie Don't Does: Avoiding Common Kubernetes Controller Mistakes - Nick Young, Isovalent at Cisco

Wednesday April 2, 2025 14:30 - 15:00 BST

Level 1 | Hall Entrance S10 | Room B

So you've learned about Custom Resource Definition (CRD) design errors, you've designed your CRD to avoid common mistakes, and now you're ready to write the controller.

Turns out there are a lot of gotchas in that process as well!

This talk explores the common pitfalls that the ever-unlucky Charlie Don't, who always makes the worst decisions, runs into when implementing a controller.

The talk should be particularly useful for anyone writing reconciliation loops that use Kubernetes objects, whether they are CRDs or not. You can expect to come away from this talk having learned about common mistakes like: straining the apiserver with too many status updates, missing updates in complex systems of CRDs, and having scaling problems from not using caching correctly.

No knowledge of the previous talks is required, so come and have a chuckle at poor old Charlie Don't's bad luck while picking up some tips for yourself.

Speakers

Nick Young

Senior Software Engineer, Isovalent at Cisco

Nick has been working to prevent the entropic downfall of systems for 25 years, across datacenters, clouds, networking, and others. He's a Staff Engineer at Isovalent, and a maintainer on the Kubernetes Gateway API project, where he works on improving the ingress and mesh experiences...

Platform Engineering

Content Experience Level Intermediate

14:30 BST

Many Cooks, One Platform: Balancing Ownership and Contribution for the Perfect Broth - Lian Li, lianmakesthings

Wednesday April 2, 2025 14:30 - 15:00 BST

Level 0 | ICC Capital Hall | Room 2

When I started contracting with the Dutch government to build a new internal developer platform, I found myself navigating competing demands from different teams. Development teams wanted support tailored to their processes, neighboring infrastructure teams aimed to protect their areas of responsibility, and management expected visible progress. These conflicting priorities kept pulling my team in multiple directions, making it challenging to stay aligned and focused.

Since I have a background in Developer Relations, I soon made it my goal to engage all involved parties, giving users a sense of ownership and collaboration, while keeping the platform cohesive.

In this talk, I’ll share the tools and processes that helped address these challenges and provide practical insights for aligning diverse stakeholders. If you’ve ever faced the challenge of “too many cooks,” this session will show how to turn competing demands into a recipe for success.

Speakers

Lian Li

Cloud Native Human, lianmakesthings

Lian always wanted to save the world. After leaving law school, she decided to work with computers instead. While in Web Dev, she started attending tech events, and soon fell in love with the community. In her roles as Consultant and DevRel, Lian combined technical knowledge with...

Platform Engineering

Content Experience Level Any

15:15 BST

More Data Please: Hands on Green Cloud Experiments - Leonard Pahlke, BWI GmbH & Antonio Di Turi, Data Reply

Wednesday April 2, 2025 15:15 - 15:45 BST

Level 0 | ICC Capital Hall | Room 2

Sustainable cloud computing has been a topic for over a decade, but we lack concrete data on Kubernetes energy consumption. This session shares a case study of a microservice running on a k3s cluster, providing real energy metrics at every stage of Platform Engineering: Day 0 (manual setup with k3s, Cilium, microservice deployment), Day 1 (introducing ArgoCD, Falco for security), and Day 2 (adding observability with Prometheus, Grafana, OpenTelemetry, and Kepler). We use bare-metal environments to ensure clean, measurable energy data, from idle setup to fully operational.

We’ll explore how tools like Kepler estimate energy consumption for Kubernetes components and compare them to actual plug measurements. For Day 3, we’ll present experiments: changing programming languages, OS images, VPA and KEDA. By sharing practical insights and data, we aim to inspire engineers to innovate and build a more sustainable cloud-native ecosystem.

Presented by TAG Environmental Sustainability Leads.

Speakers

Antonio Di Turi

Data Engineer, Data Reply

Co-chair of WG Green Reviews in the CNCF TAG Environmental Sustainability. I am determined and dynamic; I like the crowd and I like to be exposed to new stimuli. DevOps and Sustainability are my passions. I feel very lucky because in my job I always find some fun.

Leonard Pahlke

Senior Expert Cloud Native Engineering, BWI GmbH

Leonard is a dedicated open source contributor and leader, currently chairing the CNCF TAG Environmental Sustainability. Previously, Leonard led the K8s release team for v1.26 and served as the emeritus advisor for v1.28. With a strong focus on emerging technologies, he advocates for open...

Platform Engineering

Content Experience Level Intermediate

15:15 BST

Zero Forks Given: Minimizing Friction When Adopting OSS - Alexander Perlman & Narayanamurthi Mari, Capital One

Wednesday April 2, 2025 15:15 - 15:45 BST

Level 1 | Hall Entrance S10 | Room B

Open source software often does not meet internal requirements at large enterprises, especially those with elevated security and regulatory requirements. Leveraging these projects often requires modifying or extending them to meet internal mandates.

In this talk, we will review different patterns for “internalizing” external open source projects and discuss the pros and cons of each approach. These patterns are upstream contribution, forking, wrapping, and mutation.

We will review specific case studies using popular open source projects (including Kubeflow, Argo Workflows, Dask, and more) and how we fulfilled internal requirements using the four aforementioned approaches.

In particular, we want to highlight the comparative benefits of Kubernetes mutating admission control (with Kyverno) when adopting open source projects. We hope that audiences will walk away with concrete tools to streamline open source adoption.

Speakers

Alexander Perlman

Senior Lead Software Engineer, Capital One

Alexander Perlman is a senior lead software engineer in Capital One's Machine Learning Experience organization. His areas of focus include distributed compute and workflow orchestration. He lives in the NYC metro area (aka NJ and ashamed) with his wife and three young children. He...

Narayanamurthi Mari

Distinguished Engineer, Capital One

Moorthy is a distinguished engineer in Capital One's Machine Learning Experience organization. His areas of focus include Site Reliability, Platform Engineering and Workflow Orchestration. He lives in New Jersey with his wife and two young children.

Platform Engineering

Content Experience Level Intermediate

16:15 BST

How the SIG-Multicluster API Specifications Are Used for Real World Multicluster Management - August Simonelli, Red Hat & Ryan Zhang, Microsoft

Wednesday April 2, 2025 16:15 - 16:45 BST

Level 1 | Hall Entrance N10 | Room H

Nearly everyone touches multiple clusters today, often resorting to bespoke management systems. But did you know that Kubernetes SIG-Multicluster has published specifications covering multicluster management which are actively used in production environments today?

This talk will review real-world implementations as demonstrated in the Open Cluster Management project (OCM-io) and KubeFleet (kubernetes-fleet.io).

We'll begin with an overview of key Multicluster API concepts from SIG-Multicluster, exploring how the upcoming ClusterProfile API provides a standard way to represent clusters. We'll demo how OCM-io and KubeFleet use some of these APIs, such as the Work API for workload placement across clusters and the Multicluster Services API for managing endpoints and traffic policies.

If you manage, or plan to manage, multiple Kubernetes clusters across public and private clouds, please join us to learn how these specifications can improve your multi-cluster management experience.

Speakers

August Simonelli

Principal Product Manager, Red Hat

August Simonelli is a Principal Product Manager at Red Hat. He has worked with customers around the world to help them adopt, use, improve, and implement open source technologies. Raised in Boulder, Colorado, August now lives in Sydney, Australia and is a strong advocate for using...

Ryan Zhang

Principal Software Engineering Manager, Microsoft

Dr. Ryan Zhang is a Principal Software Engineering Manager at Microsoft, working on the Azure Kubernetes Service team. Ryan has been working on Cloud Native open source projects for the past few years including CloudEvents, Open Application Model (OAM) and multi-cluster related initiati...

Platform Engineering

Content Experience Level Intermediate

16:15 BST

Making CRDs Delightful: Beyond the Pitfalls - Evan Anderson, Stacklok, Inc

Wednesday April 2, 2025 16:15 - 16:45 BST

Level 0 | ICC Capital Hall | Room 2

CRDs have a lot of traps for new operator authors; this is a different talk about developing for Kubernetes! If you're building Kubernetes resource types, let's talk about how to make them satisfying and enjoyable for your users. Using examples from multiple popular projects, Evan will provide 10 tips on how to make your APIs friendly to Kubernetes beginners and experts alike.

* Use status for humans and machines
* Condition super-powers with one simple rule!
* How to avoid needing to build a CLI
* When to build one anyway
* Day-1 RBAC for everyone
* Supporting GitOps gracefully
* Status-free objects: Policies and Classes
* The beauty of zero
* Borrowing is best: embedding known types
* Operating someone else's CRD: labels and annotations

Evan has been extending and operating Kubernetes for the last 6 years. The above patterns will be illustrated with examples from his experience with ArgoCD, Cert-Manager, Gateway-API, Knative, and Kubernetes, among others.

Speakers

Evan Anderson

Software Engineer, Stacklok, Inc

Founder and maintainer on Knative serverless project. Currently at Stacklok working on supply chain security, previously at Google and VMware; recovering SRE.

Platform Engineering

Content Experience Level Advanced

16:15 BST

The GPUs on the Bus Go ‘Round and ‘Round - Natalie Bandel & Ryan Hallisey, NVIDIA

Wednesday April 2, 2025 16:15 - 16:45 BST

Level 1 | Hall Entrance S10 | Room B

Come take a behind-the-scenes look at NVIDIA’s large-scale GPU deployment. NVIDIA’s GPU Cloud has taken on the challenges of day-2 maintenance for 60,000+ GPUs in production, uncovering hard truths and surprising revelations along the way, from problems we didn’t even know existed to pushing the limits of device uptime. We’ve spent years experimenting, fine-tuning, and learning what works and what doesn’t.

As Kubernetes increases its support for allocating accelerators with Dynamic Resource Allocation (DRA), day-2 device management is becoming more important. We’ll speak about:
- Techniques we use to uncover device failures
- How we keep devices healthy
- How we remediate failures with operational transparency and without impacting running workloads

Speakers

Ryan Hallisey

Software Engineer, NVIDIA

Ryan is a software engineer at NVIDIA. He works on building data centers powered by Kubernetes and KubeVirt for NVIDIA products.

Natalie Bandel

Senior Software Engineer, Cloud Computing, NVIDIA

Natalie is a Senior Software Engineer at NVIDIA. She works on building software for cloud infrastructure powered by Kubernetes, KubeVirt and strong coffee.

Platform Engineering

Content Experience Level Beginner

17:00 BST

Platform Engineering for Software Developers and Architects (Redux) - Daniel Bryant, Syntasso

Wednesday April 2, 2025 17:00 - 17:30 BST

Level 0 | ICC Capital Hall | Room 2

Building on my KubeCon EU 2022 talk, "From Kubernetes to PaaS to... err, what's next," I aim to introduce platform engineering to the software developer and architect communities.

My primary goal is for developers to understand "what good looks like" in a successful platform build, and to help them understand how a platform can influence the SDLC (for better or worse!).

Key takeaways from the session:
- Explore how platform architecture influences software architecture and vice versa
- Learn why the principles of coupling and cohesion apply to platform components (and configuration) in the same way as they do with software components
- Understand what to expect from an effective platform, including how applications are built, shipped, and run
- Learn about key platform metrics grounded in developer experience frameworks such as DORA, SPACE, and DevEx

Speakers

Daniel Bryant

Platform Engineer and Head of Product Marketing, Syntasso

Daniel Bryant is a platform engineer and the Head of Product Marketing at Syntasso. Daniel is a long-time coder, platform engineer, and Java Champion, and he contributes to several open source projects. He also writes for InfoQ, O’Reilly, and The New Stack, and regularly presents...

Platform Engineering

Content Experience Level Any

17:00 BST

The Explorer's Guide To Cloud Native GenAI Platform Engineering - Max Körbächer, Liquid Reply & Alexa Griffith, Bloomberg

Wednesday April 2, 2025 17:00 - 17:30 BST

Level 1 | Hall Entrance S10 | Room B

The rapid evolution of generative AI has introduced a complex ecosystem of tools, but many conversations focus narrowly on isolated features and solutions. This can leave engineers wondering: What does a complete, functional GenAI platform actually look like?

This talk provides a practical roadmap for building generative AI infrastructure from the ground up. We’ll guide you through three key phases, starting with a Thinnest Viable Platform (TVP), demonstrating the minimal setup needed for LLM inference on Kubernetes. Moving to MVP, we'll explore essential platform components: LLM gateways, vector databases, and KServe deployments. Finally, we'll dive into advanced platform features: intelligent load balancing for LLMs, observability patterns, and performance optimization techniques.

This session is more than a showcase of tools — it’s a roadmap for navigating the AI platform landscape. Join us for practical insights and lessons learned from real-world GenAI platform engineering.

Speakers

Alexa Nicole Griffith

Senior Software Engineer, Bloomberg LP

Max Körbächer

Technology Advisor & Managing Director, Liquid Reply

Max is Founder and Cloud Native Advisor at Liquid Reply based in Munich. His focus is on building cloud-native solutions on/with Kubernetes and platform engineering to simplify the current challenges of complex target environments. He is Co-Chair of the CNCF Environmental Sustainability...

Platform Engineering

Content Experience Level Beginner

17:45 BST

Scale Smarter Not Harder: How Extending Cluster Autoscaler Saves Millions - Rahul Rangith & Ben Hinthorne, Datadog

Wednesday April 2, 2025 17:45 - 18:15 BST

Level 0 | ICC Capital Hall | Room 2

“I need 100 instances with 32 CPUs and 128GB of memory each, with remote storage and up to 10GB/s of network bandwidth, and I need them now!” At Datadog, we make scaling requests like this thousands of times a day, across dozens of clusters in multiple cloud providers. At this scale, and with so many machine specifications to choose from, we realized the importance of asking the question: how do I select the best instance type in every environment?
Join us to learn how answering this question with every scale-up decision significantly reduces our cloud costs. We’ll discuss the tools we use to score instance types, and strategies to plug these recommendations into the Kubernetes Cluster Autoscaler via its gRPC expander. Whether you’re operating a single cluster or a massive Kubernetes platform, this talk will teach you how to upgrade your infrastructure to make informed instance type selections that minimize your cloud spend.

Speakers

Rahul Rangith

Software Engineer, Datadog

Rahul Rangith has worked at Datadog since 2022 after graduating from the University of Waterloo. He works on Datadog’s Compute team which is responsible for the company’s Kubernetes platform. On the team, he focuses on node management and autoscaling. Rahul is active in the Kubernetes...

Ben Hinthorne

Software Engineer, Datadog

Ben Hinthorne joined Datadog’s Compute team in 2021. The team is responsible for building and scaling the company’s Kubernetes platform. Recently, he has focused on the autoscaling ecosystem, working to optimize application performance, infrastructure cost, and resiliency through opinionated...

Platform Engineering

Content Experience Level Intermediate

17:45 BST

The API Gateway Maturity Matrix: Where Do You Rank? - Joel Hans, ngrok

Wednesday April 2, 2025 17:45 - 18:15 BST

Level 1 | Hall Entrance S10 | Room B

Every API needs a front door, but many organizations struggle to define what "done" means for their API gateway implementation. Are authentication and rate limiting enough? What about multi-region failover or self-service development environments?

In this talk, we'll build on the CNCF's Cloud Native Maturity Model to create a practical framework for API gateway evolution across five key phases: Build (choosing fundamentals), Operate (implementing CI/CD), Scale (mastering multi-region), Improve (balancing control with velocity), and Adapt (enabling advanced patterns).

Through real-world examples and interactive audience polling, we'll identify where most organizations get stuck and discuss concrete solutions using popular tools. You'll walk away with a clear assessment of your current API gateway maturity and practical tips for implementing critical capabilities like GitOps workflows, nuanced rate limiting, and self-service developer environments.

Speakers

Joel Hans

Senior Developer Educator, ngrok

Joel Hans is a senior developer educator at ngrok. He’s been sharing know-how on infrastructure and networking for more than a decade, with stints in companies doing open source, observability, cloud native tooling, and more. Away from how-to guides and demo apps, you can find him...

Platform Engineering

Content Experience Level Any

11:00 BST

Extending Kubernetes Resource Model (KRM) Beyond Kubernetes Workloads - Mangirdas Judeikis, Cast AI & Nabarun Pal, Broadcom

Thursday April 3, 2025 11:00 - 11:30 BST

Level 0 | ICC Capital Hall | Room 2

Writing consistent APIs is hard. The Kubernetes Resource Model (KRM) is the foundation of Kubernetes’ success because it is consistent, predictable, and easy to understand, and it provides a declarative approach to managing infrastructure and applications. But what if KRM could transcend Kubernetes itself?

This talk will explore the paradigm shift of using KRM with kcp, or a generic Kubernetes control plane, to provide more than just workload management. This is not a new concept: Crossplane and many other tools are already doing this. But what if we could take this further? What if each cloud API looked and felt like the Kubernetes API? We will extensively cover how “kcp + friends” in the CNCF ecosystem fulfill that vision.

At the end of the talk, the audience will walk away with knowledge of KRM++ and of approaches to building a scalable multi-tenant control plane for managing resources in their multi-cluster, Kubernetes-based, and possibly hybrid-cloud infrastructure.

Speakers

MJ / Mangirdas Judeikis

Staff Engineer, kcp maintainer, Cast AI

Control planes, distributed systems and open source. All Kubernetes and kcp! A decade of Kubernetes experience, focused on Kubernetes-based platform engineering. Doing platform engineering before it was cool. :) I thrive on Go, Kubernetes, and an Open Source...

Nabarun Pal

Principal Software Engineer, Broadcom

Nabarun is a Principal Software Engineer at Broadcom, a maintainer of the Kubernetes project, a chair of Kubernetes SIG Contributor Experience and an emeritus Kubernetes Steering Committee member. He has contributed to kcp in various ways in the recent past. He is a Release Manager...

Platform Engineering

Content Experience Level Intermediate

11:00 BST

Starting and Scaling a Platform Engineering Team - Camille Fournier, Independent & Ian Nowland, Junction Labs

Thursday April 3, 2025 11:00 - 11:30 BST

Level 1 | Hall Entrance N10 | Room F

This talk draws on highlights from our book to discuss the basics of starting and scaling Platform Engineering teams. We will cover when to start, the skills the team needs, how to set out a successful product and execution strategy, and how to deal with challenging stakeholders in order to create success. The audience will leave this talk with concrete takeaways that will help them wherever they are on the platform engineering journey, and clear ideas for what success looks like.

Speakers

Camille Fournier

Author, CTO, Open Athena AI

Camille Fournier is a writer, speaker, and entrepreneur, who has held technology executive positions at companies ranging from startups to the Fortune 50. She has participated in numerous open source foundations and projects including CNCF, FINOS, and Apache ZooKeeper, and is the...

Ian Nowland

Co-founder, Junction Labs

Ian Nowland has been in the software industry for 25 years. He is currently a co-founder at a seed stage startup, Junction Labs, building tools for platform teams. Prior to that, he co-authored a book on Platform Engineering with Camille Fournier. This included his learnings from...

Platform Engineering

Content Experience Level Any

11:45 BST

Building a 5* Kubernetes Hotel - Dean Fuller, Fidelity International & Rachael Wonnacott, Fidelity International

Thursday April 3, 2025 11:45 - 12:15 BST

Level 0 | ICC Capital Hall | Room 2

When Fidelity International's public cloud journey began to slow, it became clear that our barrier to cloud was too high: with lower-cognitive-load platforms readily available on premises (CloudFoundry), why would anyone move? This sparked the realisation that we needed to build a public cloud container hosting platform that could provide the experience our developers had been used to for so many years. What was born became known as the "Kubernetes Hotel". By abstracting much of the K8s infrastructure complexity from our internal developers, it allowed them to focus on the business logic, leaving the platform team to do the heavy engineering. In this talk we'll explore the highs and lows of the K8s hotel business, how our MVP was more of a motel, and what we believe a 5* K8s hotel might look like as we progress further on our journey.

Speakers

Rachael Wonnacott

Associate Director - Container Platform Engineering, Fidelity International

Rachael has spent the last decade focused on platform engineering. She places a conscious emphasis on improving flow and is on the quest to smooth the application lifecycle for developers in the enterprise. With a background in astrophysics, Rachael brings her scientific approach...

Dean Fuller

Director of Developer Platform Engineering, Fidelity International

Dean Fuller has spent the last 20 years working in the technology infrastructure domain, always looking for opportunities to challenge the approach and focusing on the value and quality of outcomes. Today Dean oversees the Developer Platform Engineering group at Fidelity International...

Platform Engineering

Content Experience Level Any

11:45 BST

Platform Engineering Loves Security: Shift Down To Your Platform, Not Left To Your Developers! - Maxime Coquerel, Royal Bank of Canada - RBC & Mathieu Benoit, Humanitec

Thursday April 3, 2025 11:45 - 12:15 BST

Level 1 | Hall Entrance N10 | Room F

In the evolving cloud native landscape of software development, the paradigm of "shifting left" has championed embedding security, and its complexity, into the software development lifecycle (SDLC). A well-designed platform security threat model underpins this strategy by mapping out potential attack surfaces such as API vulnerabilities, misconfigured RBAC, unscanned container images, and a lack of runtime anomaly detection.
Platform Engineering promotes a "shift down" approach, embedding a robust security posture directly into the platform. By integrating governance and scalable security controls into the infrastructure, platform teams enable developers to focus on delivering code and business value without being burdened by security complexities or productivity blockers.
With this talk, attendees will walk away with real-life examples based on successful implementations for regulated entities such as financial companies, including actionable best practices for security controls and threat models.

Speakers

Mathieu Benoit

Cloud Native Ambassador & Customer Success Engineer, Humanitec

I’m passionate about Cloud Native Computing technologies driven by Open Source, Cloud, Security, SRE, Containers, DevOps, Platform Engineering and Kubernetes. Based on my past experiences as a software engineer, IT consultant, solution architect and customer success engineer, I now...

Maxime Coquerel

Principal Cloud Security Architect, Royal Bank of Canada - RBC

In my current role at RBC - Royal Bank of Canada, I lead the Kubernetes Security program, overseeing security architecture, cloud threat research, threat modeling, and risk assessment of cloud designs and patterns. Additionally, I collaborate closely with our strategic partners, cultivating...

Platform Engineering

Content Experience Level Beginner

14:15 BST

Conveying the Importance of Platform as a Product in the Cloud Native Ecosystem - Valentina Rodriguez Sosa, Red Hat; Danielle Cook, StackGen; Simon Forster, Stackegy; Colin Griffin, Krumware

Thursday April 3, 2025 14:15 - 14:45 BST

Level 0 | ICC Capital Hall | Room 1

A problem in the cloud native community is explaining the value of cloud native to stakeholders. Yet AI and other emerging technologies are driving the need for cloud native and for the support that platforms provide. Implementing a Platform as a Product can increase the value delivered to diverse teams and businesses by shipping features faster, with a methodology that supports self-service and focuses on a user-centric approach.

By defining a platform as a product and not a project, platform teams establish the platform as core to the business and as an asset the business will invest in and iterate on over time, increasing its ability to provide the capabilities a business needs to deliver value to all customers.

Panellists will discuss a product approach to platforms and how platforms can be managed, iterated on, and matured within the organization. They will draw on materials such as the Platforms Whitepaper, the Platform Maturity Model, and new content about Platform as a Product built by the Platform Working Group.

Speakers

Colin Griffin

CEO, Krumware

Colin Griffin is CEO at Krumware and a Co-Chair of the CNCF Platforms Working Group. He is a software engineer by trade, specializing in cloud-native application and infrastructure development, with an emphasis on developer enablement and platform engineering. He founded...

Valentina Rodriguez Sosa

Principal Architect, Red Hat

Valentina Rodriguez is a Principal Technical Marketing Manager at Red Hat, focusing on the developer journeys in Kubernetes and emerging technologies. She loves contributing to the community and the industry, for example by co-organizing KCD NY, and has spoken at conferences such as O'Reilly...

Simon Forster

Technical Architect and CNCF Ambassador, Stackegy

Simon Forster is a CNCF Ambassador and cloud native technology architect and engineer based in London. Simon has extensive experience working in heavily regulated financial institutions on the design, delivery and security of critical cloud native applications. He has a specific focus...

Danielle Cook

VP, StackGen

Danielle Cook has worked in the cloud native industry since 2016 helping organizations adopt the technologies that make cloud native enterprise ready. She co-authored and launched the CNCF Cloud Native Maturity Model in 2021, is a co-chair of the CNCF Cartografos Working Group and...

Platform Engineering

Content Experience Level Any

14:15 BST

Set Your Developers Free: Fleet Management at Spotify - Tim Hansen, Spotify

Thursday April 3, 2025 14:15 - 14:45 BST

Level 0 | ICC Capital Hall | Room 2

Migrations, security patches, and dependency upgrades are a necessary toil, but not one that your developers have to suffer through. Learn about Spotify’s approach to managing its fleet of over 10,000 software components — and how we patched the Log4J vulnerability across most of our software in 6 hours.

Fleet Management has freed our developers to focus on impactful software development — rather than the toil of dependency upgrades and migrations. Through automation, our percentage of software that’s up-to-date jumped from 10% to 80%, and security vulnerabilities were cut in half. Spotify orchestrates hundreds of changes, across thousands of repositories, and releases them to production — all without developer intervention.

Speakers

Tim Hansen

Staff Engineer, Spotify

Tim is a staff engineer at Spotify who works in the Platform organization to decrease infrastructure toil for Spotify developers, focused on the open-source Backstage platform. Prior to this, he worked in FinOps at Spotify, focused on reducing cloud infrastructure costs.

Platform Engineering

Content Experience Level Any

14:15 BST

Simplify Kubernetes Operator Development With a Modular Design Pattern - Mostafa Hadadian & Alexander Lazovik, University of Groningen

Thursday April 3, 2025 14:15 - 14:45 BST

Level 1 | Hall Entrance N10 | Room F

Kubernetes operators automate complex application management. However, building and maintaining them poses significant challenges. Custom Resource Definitions (CRDs) are painful to evolve once established, and controllers’ logic becomes increasingly complex over time. We learned these lessons the hard way over the years, but you don't have to.

We present a design pattern that simplifies Kubernetes operator development by decomposing CRDs into manageable pieces and controllers into more focused microcontrollers. This pattern decouples K8s instructions from controllers' logic by leveraging Helm charts for translating CRD specifications into Kubernetes resources. As a result, our solution reduces code and maintenance complexities, accelerates iteration, and provides an efficient development workflow.

Finally, we share a real-world implementation of our design in the Netherlands' water sector that accelerates AI stream processing application delivery.

Speakers

Mostafa Hadadian

AI/MLOps Innovator | Founder & CEO, University of Groningen | CAIDEL

Mostafa is Founder and CEO of CAIDEL: Continuous AI Deliver. He is also completing his PhD in Computer Science at the University of Groningen. His work lies in cloud native and machine learning development, emphasizing MLOps. Complementing his academic pursuits, he brings a wealth...

Alexander Lazovik

Professor in Distributed Systems, University of Groningen

Alexander Lazovik, Professor of Distributed Systems at the University of Groningen since 2009, specializes in AI, optimization in distributed environments, cloud computing, and scalable IT infrastructures. He earned his Ph.D. from the University of Trento in 2006 on the topic of Interaction...

Platform Engineering

Content Experience Level Intermediate

15:00 BST

Breaking Free From the Cloud: Banking on Self-Hosted Kubernetes - Kārlis Akots Gribulis & Per Hedegaard Christiansen, Saxo Bank

Thursday April 3, 2025 15:00 - 15:30 BST

Level 0 | ICC Capital Hall | Room 2

What drives a global investment bank to transition from a managed cloud Kubernetes service to a self-hosted, on-premises solution? While managed Kubernetes in the cloud can simplify deployments, it often comes with significant trade-offs. At Saxo Bank, we made the decision to regain control by shifting to a self-hosted, on-premises Kubernetes platform.

This session will unpack our motivations, such as decreasing costs by 80%, reducing cluster creation time fifteenfold, and improving our CIS benchmark standing by 30%. We’ll dive into the architecture we adopted, the lessons learned from overcoming performance and resilience challenges, and how this change has reshaped our infrastructure, positioning Kubernetes as Saxo Bank’s cornerstone for the future.

Speakers

Per Hedegaard Christiansen

Head of Container Platform Engineering, Saxo Bank

Passionate about container technology and always eager to explore new tech stacks. With extensive experience in Docker, Kubernetes, and microservices, I design and optimize scalable, secure container environments. Constantly learning and embracing cutting-edge tools, I thrive in agile...

Kārlis Akots Gribulis

Senior Container Platform Engineer, Saxo Bank

Kārlis Akots Gribulis has hands-on experience working across various companies in the cloud-native space. Throughout his career, he has been deeply involved in deploying, managing, and optimizing Kubernetes clusters, helping organizations harness the full power of cloud-native technologies...

Platform Engineering

Content Experience Level Intermediate

15:00 BST

Building a Platform Framework: Lessons Learned From Developing a Multi-Cluster Kubernetes Operator - Cat Morris & Jake Klein, Syntasso

Thursday April 3, 2025 15:00 - 15:30 BST

Level 1 | Hall Entrance N10 | Room F

Running multi-cluster Kubernetes environments introduces significant operational complexities that challenge even skilled engineering teams. In developing Kratix, an open source platform framework to manage these complexities, we encountered technical, organisational, and strategic challenges that reshaped our approach to building reliable, customer-centric software.

As the founding engineer and product manager of Kratix, we have found that while technical expertise is crucial, it often falls short in solving complex, cross-cluster issues. We learnt the importance of diverse perspectives, customer involvement, and embracing existing CNCF projects.

By sharing our successes and setbacks, we aim to equip you with a holistic approach to managing multi-cluster Kubernetes environments, fostering resilience and adaptability in an evolving technological landscape.

Speakers

Cat Morris

Staff Product Manager, Syntasso

Cat is the Product Manager at Syntasso delivering Kratix, an open-source cloud-native framework for building internal platforms. She has worked in tech for over 10 years, the last 6 of them in Platform Engineering across all kinds of domains. She specialises in bringing Product...

Jake Klein

Staff Software Engineer, Syntasso

Jake has over five years of experience working in the Kubernetes and platform space. He started his career working on CloudFoundry, a popular CNCF Platform as a Service project. He then went on to work at Weaveworks, where he worked on the open source project EKSctl, which is a...

Platform Engineering

Content Experience Level Any

15:00 BST

🤔🔧 Can You Maintain 1000 Apps? WasmCloud & K8s: The Ultimate Golden Template - Liam Randall, Cosmonic

Thursday April 3, 2025 15:00 - 15:30 BST

Level 0 | ICC Capital Hall | Room 1

You can deploy 1,000 applications to Kubernetes, but can you maintain them? Kubernetes excels as an infrastructure abstraction, but today's app management demands better abstractions for applications and their capabilities. This talk introduces CNCF wasmCloud (incubating) as the ultimate golden template for platform engineering. With wasmCloud, you can manage common capabilities like blob stores, HTTP, messaging, and secrets centrally, enabling pluggable, reusable components that scale. wasmCloud simplifies migrations and secures operations across diverse computing environments, which is essential as data locality laws balkanize the world's compute. By shifting to pluggable capability abstractions, platform engineers can update thousands of apps at once while freeing development teams to focus on building their business logic. This demonstration-heavy talk is based on real-world adoption and deployments across the F100 in financial services, tech, and the startup ecosystem.

Speakers

Liam Randall

Founder, CNCF wasmCloud, Cosmonic

Liam lives at the intersection of open source and enterprise and has contributed to dozens of major open-source platforms and standards. A serial entrepreneur, he has built and scaled companies around Bro/Zeek, Kubernetes, OS Query, Cloud Custodian, and today WebAssembly.

Platform Engineering

Content Experience Level Beginner

16:00 BST

A Day in the Life of a Kubernetes Engineer - Rajas Kakodkar & Nikhita Raghunath, Broadcom; Amine Hilaly, AWS; Shane Lawrence, Shopify; Kasper Borg Nissen, Dash0

Thursday April 3, 2025 16:00 - 16:30 BST

Level 0 | ICC Capital Hall | Room 1

In the cloud native universe, Kubernetes engineers are the unsung heroes who wrangle YAML, tame logs and brave production crises. While AI steals the spotlight, the challenges of managing Kubernetes remain in the shadows. Platform engineers, often grappling with relentless war room calls, rarely get the stage they deserve.
For the first time, we’ll dive into the untold stories of platform engineers: triumphs and the grit it takes to navigate the complexities of Kubernetes. This is a celebration of the human side of cloud native, told through anecdotes, technical insights and lessons learned from the trenches.

Join industry experts as they explore:
- Critical skills for resolving production issues
- Successes & failures that define their journeys
- Balancing ecosystem health
- Secure practices for AI workloads
- Reducing Kubernetes complexity

Join us for unfiltered insights, empowering stories and actionable takeaways to inspire your Kubernetes journey—you, too, deserve to be heard.

Speakers

Kasper Borg Nissen

Developer Relations Engineer, Dash0

Kasper is a Developer Relations Engineer at Dash0, where he is thrilled to step into the observability space. He looks forward to promoting open standards with OpenTelemetry and Perses, helping teams gain deeper insights into their systems. Previously, Kasper served as a Staff Platform...

Shane Lawrence

Sr Staff Engineer, Shopify

Shane is a Senior Staff Infrastructure Security Engineer at Shopify, where he's working on a multi-tenant platform that allows developers to securely build scalable apps and services for crafters, entrepreneurs, and businesses of all sizes.

Nikhita Raghunath

Principal Engineer, Broadcom

Nikhita is a Principal Engineer at Broadcom, past co-chair of KubeCon and a maintainer of the Kubernetes project. She is the vice chair of the CNCF Technical Oversight Committee and won the CNCF Top Committer Award in 2021 for her technical contributions. She was also a member...

Amine Hilaly

Software Engineer, AWS

Amine is a Software Development Engineer at Amazon Web Services who has worked on Kubernetes and open-source-related projects for about three years. Amine is a Go, open-source, and Kubernetes fanatic.

Rajas Kakodkar

Senior Member of Technical Staff | Tech Lead TAG Runtime CNCF, Broadcom

Rajas is a senior member of technical staff at Broadcom and a tech lead of the CNCF Technical Advisory Group, Runtime. He is actively involved in the AI working group in the CNCF. He is a Kubernetes contributor and has been a maintainer of the Kube Proxy Next Gen Project. He has also...

Platform Engineering

Content Experience Level Any

16:00 BST

How We Progressively Deliver Changes To Kubernetes Using Canary Deployments and Feature Flags - Bob Walker, Octopus Deploy

Thursday April 3, 2025 16:00 - 16:30 BST

Level 0 | ICC Capital Hall | Room 2

This is the case study of how we changed how we ship software.

With thousands of customers, each in their own Kubernetes container, deploying updates was tough. Off-hours schedules meant it took over 24 hours to push a new version. If something broke, we had to scramble. Canary deployments let us update small groups of customers at a time. We built a tool to stop rollouts fast when issues appeared, limiting the damage.

In the past, new features went to everyone at once. Rolling back wasn't an option. If something failed it'd leave customers stuck in the mess. Now, using OpenFeature, we hide new functionality behind feature flags. We release features to small groups, gather feedback, and test internally for weeks. If things go wrong, we flip the flag off and move on.

This two-pronged approach lets us avoid risky big-bang releases. We went from deploying every 10 days to every 4, with fewer than 1% high-severity defects. Most of these are resolved before customers notice them.

Speakers

Bob Walker

Field CTO, Octopus Deploy

Bob Walker is a Field CTO at Octopus Deploy. Bob started as a developer in the early days of .NET when web forms were the hottest new thing, and manual deployments were the norm. After one too many five-hour 2 AM Saturday deployments, he searched for any automation to stop that pain...

Platform Engineering

Content Experience Level Intermediate

16:00 BST

Practical Zombie Hunting for Kubernetes Users - Holly Cummins, Red Hat

Thursday April 3, 2025 16:00 - 16:30 BST

Level 1 | Hall Entrance N10 | Room F

Zombies? Yup, zombies. Zombies are servers which aren’t doing useful work. They’re everywhere, costing money, eating electricity, and belching carbon. And they’re useless! Sadly, the cloud has *not* helped our zombie problem, and even Kubernetes hasn't helped.

One of the reasons zombies don’t get switched off is that no one knows they’re there. So how do we get rid of our pesky zombies? In this talk, Holly will explain the underlying technical and organisational factors that lead to zombies, and introduce a range of real-world zombie-hunting strategies. These include getting to grips with elasticity and utilisation, LightSwitchOps, FinOps, and the eco-monkey (it’s like the chaos monkey, but greener). Technologies covered include absurdly simple scripts, DailyClean, Kruize Autotune, and Backstage.

Speakers

Holly Cummins

Senior Principal Software Engineer, Red Hat

Holly Cummins is a Senior Principal Software Engineer on the Red Hat Quarkus team. Before joining Red Hat, Holly was a long-time IBMer, in a range of roles from cloud consultant, full-stack JavaScript developer, WebSphere Liberty devops architect, JVM performance engineer, to innovation...

Platform Engineering

Content Experience Level Intermediate

16:45 BST

A Journey To Modernizing a Regulated Cloud Control Plane - Pranita Praveen, Macquarie Group Pty Ltd & Steven Borrelli, Upbound

Thursday April 3, 2025 16:45 - 17:15 BST

Level 0 | ICC Capital Hall | Room 1

At Macquarie, we have embarked on a transformative journey to modernize our cloud control plane. Initially designed for a single-cloud environment (AWS) to facilitate our move away from data centers, it is now evolving towards a multi-cloud solution underpinned by GitOps principles and foundational tooling made possible through the CNCF ecosystem. Our focus is on Kubernetes, Crossplane, OPA, and Argo, among others, which have been instrumental in our progress.

We aim to share our successes and the lessons learned throughout this journey, built for engineers in a globally regulated environment comprising four distinct lines of business. Our experience underscores the vital role of the CNCF in our modernization efforts, and we are eager to give back to the community that has provided us with indispensable resources and support.

Speakers

Steven Borrelli

Principal Solutions Architect, Upbound

Steven is a Principal Solutions Architect for Upbound, where he helps customers adopt Crossplane.

Pranita Praveen

Head of Enterprise Multi-Cloud, Macquarie Group Pty Ltd

I am a cloud platform engineer, passionate about creating robust, simple, and easy-to-operate solutions.

Platform Engineering

Content Experience Level Intermediate

16:45 BST

From Hours To Minutes: The Evolution of Platform Engineering at Decathlon - Adrien Gillard & Christophe Furmaniak, Decathlon

Thursday April 3, 2025 16:45 - 17:15 BST

Level 1 | Hall Entrance N10 | Room F

The platform engineering mindset is spreading like wildfire. As a way to accelerate business and empower developers, it is quickly becoming a “must-have”.

However, there are still few real-world accounts of the process. That’s why we offer attendees a look at Decathlon’s journey into platform engineering, from its inception (when we did not yet call it platform engineering) to its current state, where users are able to deploy infrastructure (including workloads, databases, and service exposure) to host their applications in full autonomy, while keeping strong standardization.

We will share how we can now provide our internal users with their environments in 20 minutes, and discuss the challenges we faced and the choices we had to make. We will also look into the next steps and improvements we intend for the future.

Speakers

Adrien Gillard

Senior Ops Engineer, Decathlon

After a dozen years in the IT industry, nearly half of them working with containers and Kubernetes for IT hosters and service providers, Adrien has acquired experience across a spectrum from on-premises datacenters to serverless functions in public clouds. He currently works at Decathlon, the...

Christophe Furmaniak

Staff Engineer, Decathlon

Christophe started as a passionate and responsible developer and this led him to pay close attention to all aspects of the project lifecycle, from the developer's workstation right through to deployment on production environments. His current main areas of interest are related to...

Platform Engineering

Content Experience Level Any

16:45 BST

How Do You Measure Developer Productivity? - Jennifer Riggins, The New Stack; Cat Morris, Syntasso; Akshaya Aradhya, Oscilar; Laura Tacho, DX; Helen Greul, Multiverse.io

Thursday April 3, 2025 16:45 - 17:15 BST

Level 0 | ICC Capital Hall | Room 2

Engineering is a science, so we know we can't improve what we don't measure. But many ways of measuring developer productivity focus too much on output, and aren’t trusted by developers.
So how should we measure developer productivity, and quantify the impact of processes, tools, Gen AI and culture on the developer experience (DevEx)?
Then, how do you take this data and turn it into something that's actionable and effective? Should we collect quantitative vs qualitative measurements? What about business impact? Cognitive load? Is there a way to measure the maturity of your platform strategy?
Join this panel to learn how from those who have been working with a Platform-as-a-Product mindset for years now. Join Multiverse's (ex-Backstage) Helen Greul, Oscilar’s (ex-GitHub, Netflix) Akshaya Aradhya, DX's Laura Tacho and Syntasso's Cat Morris in this epic panel hosted by The New Stack's Jennifer Riggins.

Speakers

Jennifer Riggins

Technology Journalist, The New Stack

Jennifer Riggins is a tech storyteller, journalist, writer, and event and podcast host, helping to share the stories where culture and technology collide and to translate the impact of the tech we are building. She has been a working writer since 2003, and is currently based in L...

Cat Morris

Staff Product Manager, Syntasso

Helen Greul

VP Engineering at Multiverse, Multiverse.io

Helen is an engineering leader, speaker and a strong advocate for creating developer ecosystems that empower teams to thrive. Her journey has taken her from hands-on coding to steering engineering and platform teams, providing her with a holistic perspective on the challenges and...

Akshaya Aradhya

VP of Engineering, Oscilar

Akshaya is a seasoned engineering executive with deep technical knowledge of data, cloud, platform, machine learning, AI and infrastructure. Prior to joining Oscilar, she worked at companies like GitHub, Netflix, LiveRamp and Intuit. She is passionate about building high performing...

Laura Tacho

CTO, DX

Laura Tacho is CTO at DX, a developer intelligence platform. She previously led teams at companies like CloudBees, Aula Education, and Nova Credit, and is a Docker Captain alumna.

Platform Engineering

Content Experience Level Any

17:30 BST

Automating Kubernetes Cluster Updates: Achieving Zero Downtime Effortlessly - Haitao Zhang, CloudPilot AI; Baofa Fan, DaoCloud; Ling Ling, Independent; Wei Jiang, Huawei

Thursday April 3, 2025 17:30 - 18:00 BST

Level 0 | ICC Capital Hall | Room 1

Upgrading a Kubernetes cluster is an ongoing task. The biggest challenge for teams maintaining Kubernetes clusters is avoiding service disruptions or system crashes during the upgrade process. With Karpenter's disruption mechanism, we can now automate Kubernetes cluster upgrades on major cloud platforms such as AWS, Azure, and AlibabaCloud in a controlled way, with zero downtime. To date, Karpenter supports these cloud vendors and will expand to more platforms in the future. This mechanism makes Kubernetes cluster upgrades safe, controllable, easy, and efficient, and significantly reduces the operation and maintenance pressure on DevOps teams. In this session, we will discuss how Karpenter's disruption mechanism works, show examples of its use on major cloud platforms, and help you achieve smooth upgrades and ensure the continuous, stable operation of services.

Speakers

Wei Jiang

Tech Leader, CloudPilot AI

Wei Jiang serves as a Tech Leader at CloudPilot AI. He primarily works on open-source projects, focusing on node scaling with Karpenter and other technologies that achieve high utilization and cost-effectiveness.

Xinxia Ling

Open Source & AI Enthusiast, CloudPilot AI Inc.

With experience in promoting cloud-native solutions like Karpenter and Rancher, Ling offers valuable insights on how developers can cut cloud costs while scaling their infrastructure efficiently.

Fan Baofa

Software Engineer, DaoCloud

Baofa Fan (GitHub @carlory) is an active reviewer in the Kubernetes, Kubernetes-sigs and Kubernetes-csi organizations, currently working mainly on sig-storage. He is also a reviewer of the Karmada project, which focuses on the multi-cluster area.

Haitao Zhang

Software Engineer, CloudPilot AI

Haitao Zhang (GitHub @helen-frank) is a major contributor and reviewer of karpenter-provider-alibabacloud, and a member of kubernetes-sigs and karmada.

Platform Engineering

Content Experience Level Intermediate

17:30 BST

Cloudy With a Chance of Kubernetes: Going From One To Three Cloud Providers - Laurent Bernaille & Maxime Visonneau, Datadog

Thursday April 3, 2025 17:30 - 18:00 BST

Level 0 | ICC Capital Hall | Room 2

Over the past five years, Datadog expanded from operating in a single region to six regions across three cloud providers. Kubernetes facilitated this expansion by abstracting the differences between cloud environments. However, we encountered several interesting challenges as some implementation details leaked through the abstraction.

This talk will begin with our rationale for adopting a multi-cloud strategy and the constraints it introduced. We will then share our insights on leveraging Kubernetes, the disparities among cloud provider implementations, and how these inconsistencies sometimes breached the Kubernetes abstraction. Finally, we will discuss how our platform teams created additional abstractions hiding most of these differences and the few remaining details that we have to expose to teams deploying on our platform.

Speakers

Maxime Visonneau

Engineering Manager, Datadog

Maxime is an experienced systems and software engineer known for his passion for building robust infrastructures for small to large businesses. Having successfully led his startup to acquisition by Twitter in 2021, he is currently leading teams in charge of the Kubernetes platform...

Laurent Bernaille

Principal Engineer, Datadog

Laurent Bernaille worked several years as a consultant specializing in cloud, containers, and automation and helped organizations migrate to the public cloud and adopt containers. He is now Principal Engineer at Datadog and works closely with infrastructure teams, which are responsible...

Platform Engineering

Content Experience Level Any

17:30 BST

From Metal To Apps: LinkedIn’s Kubernetes-based Compute Platform - Ahmet Alp Balkan & Ronak Nathani, LinkedIn

Thursday April 3, 2025 17:30 - 18:00 BST

Level 1 | Hall Entrance N10 | Room F

What does it take to design a Kubernetes-based fleet management stack that bridges the gap between bare-metal servers in data centers and a platform capable of hosting thousands of microservices, large-scale stateful applications, and a GPU fleet running AI workloads?

At LinkedIn, we use Kubernetes as a foundational primitive in our compute platform. We run thousands of microservices, manage large stateful applications with our custom scheduler, and operate a large fleet of GPUs, all while performing regular maintenance on the bare metal hosts with no downtime or manual intervention.

In this talk, we’ll discuss how we architected and built an API-driven, Kubernetes-based compute stack with a large-scale microservices platform, a workload-agnostic stateful scheduler, and a multi-tenant ML/batch jobs platform. We’ll share insights on scaling Kubernetes for diverse workloads while maintaining tenant isolation, resilience, flexibility, and ease of use for developers.

Speakers

Ahmet Alp Balkan

Sr. Staff Software Engineer, LinkedIn

Ahmet is working on building LinkedIn's next generation compute cluster management stack using Kubernetes. In open source, he maintains projects like Krew (kubectl plugin manager) and kubectx.

Ronak Nathani

Sr. Staff Software Engineer, LinkedIn

Ronak leads the Kubernetes team at LinkedIn, spearheading the company's transition to Kubernetes over the past few years. Prior to this role, he contributed to the development and management of LinkedIn's home-grown scheduler and internal private cloud. In addition to his day job...

Thursday April 3, 2025 17:30 - 18:00 BST
Level 1 | Hall Entrance N10 | Room F

Platform Engineering

Content Experience Level Intermediate

13:45 BST

How Millennium Bcp Leverages Radius To Empower Developer + Operator Collaboration - Nuno Guedes, Millennium bcp & Jonathan Smith, Microsoft

Friday April 4, 2025 13:45 - 14:10 BST

Level 1 | Hall Entrance N10 | Room H

Collaboration between developers and operators is essential to building scalable, resilient application platforms, but achieving this alignment is often easier said than done. In this session, learn how Millennium bcp, Portugal’s largest privately-owned bank, has successfully bridged this gap using Radius, a CNCF sandbox project. By adopting Radius as a universal language for their application platform, Millennium bcp has streamlined the application lifecycle and enhanced team collaboration.

We’ll explore how Radius helps decouple applications from infrastructure to foster operational flexibility and how GitOps tools like Flux complement Radius to enable declarative management of resources, ensuring consistency and reliability across environments.

Whether you’re a developer, operator, or platform engineer, this session will provide practical strategies, actionable insights, and inspiration for building cohesive, future-ready application platforms.

Speakers

Nuno Guedes

Cloud Compute Lead, Millennium bcp

I am the Cloud Compute Lead at Millennium bcp, Portugal's largest privately-owned bank, with several years of experience in designing and leading the implementation of cloud-based solutions prioritizing containers. In addition to the Cloud Compute team, I am responsible for the Infrastructure...

Jonathan Smith

Product Manager, Azure Open Source Incubations, Microsoft

Jonathan Smith is a veteran product leader at Microsoft focused most recently on early incubations, including the Parallel Computing Platform, Cloud Engineering Systems, IoT, and cloud-based production of film and television content. Jonathan currently leads product management for...

Friday April 4, 2025 13:45 - 14:10 BST
Level 1 | Hall Entrance N10 | Room H

Platform Engineering

Content Experience Level Any

15:15 BST

Stateful Connections in Kubernetes: The Scaling Secrets Nobody Talks About - André Mocke & Rodrigo Fior Kuntzer, Miro

Friday April 4, 2025 15:15 - 15:45 BST

Level 1 | Hall Entrance N10 | Room H

Dive into how Miro scales real-time collaboration with long-lived TCP connections at its core. Learn how we built and deployed a custom WebSocket manager in Kubernetes, leveraging connection rebalancing, draining, and graceful shutdown techniques, while maintaining enterprise-level compliance. Discover the k8s operators that made it possible, the design decisions we nailed (and the ones we regretted), and how we tackled unforeseen challenges. This is your backstage pass to engineering the intelligent canvas!

Speakers

Rodrigo Fior Kuntzer

Staff Site Reliability Engineer, Miro

A Software Engineer and Cloud Native Specialist with 20 years of experience, currently serving as Staff Site Reliability Engineer at Miro. Specializing in building high-performance platforms and ensuring system reliability, I leverage extensive experience with Docker, Kubernetes...

André Mocke

Software Engineer, Miro

I'm a full-stack engineer with north of a decade of experience in a variety of industries, from agriculture to finance and, now, multiplayer online games where we get sued if we lose data (Miro). More recently I've taken the opportunity to dive deeper into developing platforms for Infrastructure...

Friday April 4, 2025 15:15 - 15:45 BST
Level 1 | Hall Entrance N10 | Room H

Platform Engineering

Content Experience Level Intermediate

15:15 BST

Taming the Beast: Advanced Resource Management With Kubernetes - Lucy Sweet, Uber & Dawn Chen, Google

Friday April 4, 2025 15:15 - 15:45 BST

Level 1 | Hall Entrance N10 | Room F

Are you struggling to optimize resource utilization for demanding workloads like databases?

Kubernetes 1.30 to 1.32 introduced a set of powerful new features to help you tame resource-hungry applications and achieve peak cluster efficiency. In this session, Dawn Chen (Software Engineer at Google & Tech Lead SIG Node) and Lucy Sweet (Software Engineer at Uber) will guide you through the latest advancements in pod resource management, including in-place pod resizing, pod-level resource limits, and node swap memory.

Learn how to leverage these features to reduce infrastructure costs, improve application performance, and prevent resource contention in your clusters. Discover best practices for resource allocation, QoS configuration, and troubleshooting, and get a glimpse into the future of pod resource management in Kubernetes.

Speakers

Dawn Chen

Principal Software Engineer, Google

Dawn Chen is a principal software engineer at Google. Dawn has worked on Kubernetes and Google Container Engine (GKE) since before the project was founded. She has been one of the tech leads in both Kubernetes and GKE. Prior to Kubernetes, she was one of the tech leads for Google internal...

Lucy Sweet

Senior Software Engineer, Uber

Lucy is a Senior Software Engineer at Uber Denmark who works on platform infrastructure.

Friday April 4, 2025 15:15 - 15:45 BST
Level 1 | Hall Entrance N10 | Room F

Platform Engineering

Content Experience Level Intermediate