KubeCon + CloudNativeCon Europe 2023: Full Schedule

In-person + Virtual
18-21 April
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon Europe 2023 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in Central European Summer Time (UTC +2). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis.

11:00 CEST

Node Resource Management: The Big Picture - Sascha Grunert & Swati Sehgal, Red Hat; Alexander Kanevskiy, Intel; Evan Lezar, NVIDIA; David Porter, Google

Resource management is a fundamental area in Kubernetes that focuses on how to properly reserve, allocate, and isolate finite resources on nodes such as CPU, memory, disk, network, accelerators, etc. Resource Management is a hot topic, with multiple proposals raised recently on how to improve things both in Kubernetes and container runtimes: Dynamic Resource Allocation, QoS class resources, improvements to CPU Management, to container lifecycle management and statistics, support in CRI-enabled container runtimes for advanced low-level runtimes such as Kata containers, Firecracker, gVisor, and Confidential Containers and many more. In this presentation, speakers will present the “big picture” for these proposals, how they are interconnected, how they are different, which problems they are targeting to solve, and what they mean for Kubernetes users. This presentation will be helpful for cluster administrators and users to understand the future direction in their resource management area and give a framework for them to provide feedback that can help shape these future efforts. We will also describe opportunities for folks who are more interested to get involved with the open source SIG-Node and runtime communities to drive these efforts forward.

Speakers

Alexander Kanevskiy

Principal Engineer, Cloud Software, Intel

Alexander is currently employed by Intel as Principal Engineer, Cloud Software, focusing on various aspects in Kubernetes: Resource Management, Device plugins for hardware accelerators, Cluster Lifecycle and Cluster APIs. Alexander has over 25+ years of experience in areas of Linux... Read More →

Swati Sehgal

Principal Software Engineer, Red Hat

Swati Sehgal is a Principal Software Engineer in the Ecosystem Engineering Group at Red Hat. She works to enhance OpenShift and its platform to deliver best-in-class networking applications, leading edge solutions and innovative enhancements across the stack. Her work includes working... Read More →

David Porter

Senior Software Engineer, Google

David Porter is a Senior Software Engineer at Google on Kubernetes GKE node team. David’s focus is on the kubelet node agent and the resource management area. He is primary maintainer of cAdvisor, a resource monitoring library widely used in kubernetes, reviewer of a SIG Node, and... Read More →

Sascha Grunert

Senior Software Engineer, Red Hat

Sascha is a Senior Software Engineer at Red Hat, where he works on many different container related open-source projects like Kubernetes. He joined the open-source community in November 2018. Sascha's passions include contributing to open source, as well as giving talks and evangelizing... Read More →

Evan Lezar

Senior Systems Software Engineer, NVIDIA

Evan Lezar is a Senior Systems Software Engineer on the Cloud Native team at NVIDIA. His focus is making GPUs and other NVIDIA devices easily accessible from containerized environments. This includes driving development and adoption of the Container Device Interface (CDI).

Node Resource Management The Big Picture Kubecon EU 2023 pdf

Wednesday April 19, 2023 11:00 - 11:35 CEST
Hall 7, Room C | Ground Floor | Europe Complex

Runtime Performance + Constrained Environments

Content Experience Level Advanced
Talk Type In-Person
Presentation Slides Attached Yes

11:55 CEST

Device Plugins 2.0: How to Build a Driver for Dynamic Resource Allocation - Kevin Klues, NVIDIA & Alexey Fomenko, Intel

Dynamic Resource Allocation (DRA) is a new Kubernetes feature that puts resource scheduling in the hands of 3rd-party developers. From an end-users perspective, it moves away from the limited "countable" interface for requesting access to resources (e.g. "nvidia.com/gpu: 2"), providing an API more akin to that of persistent volumes. Using GPUs as an example, DRA unlocks a host of new features without the need for awkward solutions shoehorned on top of the existing device plugin API. These features include: * Controlled GPU Sharing (both within a pod and across pods) * Multiple GPU models per node (e.g. T4 and A100) * Specifying arbitrary constraints for a GPU (min/max memory, device model, etc.) * Dynamic allocation of MIG devices * Dynamic repurposing of a GPU from full to MIG mode * Dynamic repurposing of a GPU for use as Passthrough vs. vGPU * ... the list goes on ... In this talk, you will learn how to build your own resource driver for DRA. This includes details of how to use Kubernetes's in-tree helper libraries for DRA, where to find an example driver to get you started, as well as best-practices for architecting the driver itself. Throughout this talk, we will use our existing NVIDIA and Intel GPU drivers as a guide, concluding with a demo of these drivers in action.

Speakers

Kevin Klues

Distinguished Engineer, NVIDIA

Kevin Klues is a distinguished engineer on the NVIDIA Cloud Native team. Kevin has been involved in the design and implementation of a number of Kubernetes technologies, including the Topology Manager, the Kubernetes stack for Multi-Instance GPUs, and Dynamic Resource Allocation (DRA... Read More →

Alexey Fomenko

Cloud Software Developer, Intel

Started using computers at the age of 7, hacking at 10, programming at 14. Been using Linux-based OS for last 20 years. Working 15 years by now, with quite many different fields: a bit of Linux OS core components maintenance for mobile phones, a bit of B2B consulting, a little Big... Read More →

KubeCon EU23 Device Plugins 2.0 How to Build a Driver for Dynamic Resource Allocation pdf

Wednesday April 19, 2023 11:55 - 12:30 CEST
G104-105 | First Floor | Congress Centre

Runtime Performance + Constrained Environments

Content Experience Level Intermediate
Talk Type In-Person
Presentation Slides Attached Yes

14:30 CEST

Emergent Load Testing: Rules for Organized Chaos - Nicole van der Hoeven, Grafana Labs

When we write load testing scripts against our applications, we write them sequentially: A, then B, then C. But this doesn't accurately reflect the organized chaos of a system in production, nor does it prepare the system for the unexpected. Emergence is a phenomenon where parts of a whole independently develop properties not originally present in the whole. Emergence is what helps ant workers develop roles without leadership, prompts animals to evolve adaptive traits without forethought, and facilitates non-toxic communities without moderators. The growing field of emergent software applies this swarm logic to the programs that we write. What would it take to write emergent load testing scripts? It turns out that there are a few ingredients for emergence: a large population size, opportunities to interact, feedback, and an element of control. In this talk, Nicole van der Hoeven discusses how to bring these elements to load testing by writing a script in Grafana k6 that can independently decide what requests to make next, modify Kubernetes app pods, and disrupt services based on a continual feed of results during runtime-- all without manual intervention. She shows how to wield this new breed of load testing to improve confidence in the complex systems we build.

Speakers

Nicole van der Hoeven

Senior Developer Advocate, Grafana Labs

Nicole is a performance engineer with over a decade of experience in breaking software and learning to build it back up again. She has lived in the Philippines, the US, Australia, the Netherlands, and Portugal, helping teams all over the world scale up their load tests on the cloud... Read More →

Wednesday April 19, 2023 14:30 - 15:05 CEST
G104-105 | First Floor | Congress Centre

Runtime Performance + Constrained Environments

Content Experience Level Intermediate
Talk Type In-Person

15:25 CEST

Efficient Access to Shared GPU Resources: Mechanisms and Use Cases - Diogo Filipe Tomas Guerra & Diana Gaponcic, CERN

GPUs and accelerators are changing traditional High Energy Physics (HEP) deployments while also being the key to enable efficient machine learning. GPU scheduling in Kubernetes has been limited until now. Not being able to easily share access to single GPUs by multiple workloads leads to inefficiencies when those are light or spiky. At the same time these resources are scarce, expensive and in high demand. In this talk we explore the different possibilities to improve overall usage of GPU resources. We explore the multiple options for GPU scheduling, time sharing and the recently introduced Nvidia Multi-Instance-GPU (MIG) for physical partitioning. We cover the features and limitations of each option and present extensive benchmark results that helped us assign each workload to the most appropriate layout. Finally we describe how we manage GPUs in a centralized way, ensuring optimal resource utilization for services like continuous integration, machine learning and batch.

Speakers

Diogo Guerra

Cloud Engineer, CERN

Diogo is a Computing Engineer in the CERN Kubernetes service offering focusing on containerized deployments and supporting infrastructure. His main contributions focus on the automatically set up of cluster monitoring and other features like hardware accelerators configuration. He... Read More →

Diana Gaponcic

CERN

Diana is a Computing Engineer in the CERN IT department. After an internship at CERN focusing on containerization of ETL applications she later joined the Kubernetes team. Her current focus is on optimizing the usage of GPUs and other Accelerators for simulation and machine learning... Read More →

Efficient Access to Shared GPU Resources pdf

Efficient Access to Shared GPU Resources odp

Efficient Access to Shared GPU Resources pptx

Wednesday April 19, 2023 15:25 - 16:00 CEST
G104-105 | First Floor | Congress Centre

Runtime Performance + Constrained Environments

Content Experience Level Intermediate
Talk Type In-Person
Presentation Slides Attached Yes

16:30 CEST

Love, Death and Robots - with Wasm & K8s on Boston Dynamics Spot - Max Körbächer, Liquid Reply

Can containers and Kubernetes run anywhere? Yes, nearly. We have seen in the past fighter jets, fully isolated environments, security critical infrastructure and more with Kubernetes. So it is no wonder that Boston Dynamics Spot, the most advanced mobile quadrupled robot, is running on containers too. But this wasn’t enough for us. How and why we tweaked the (real world) bot a little and what are our lessons learned is part of this talk. We will show you the easy steps to migrate to K8s, the experimental integrations with Wasm and ideas on how to manage Spot like any other Kubernetes. Our targets are to provide a highly reliable, self-healing software infrastructure for industrial great robots that are secure, fast and autonomous.

Speakers

Max Körbächer

Co-Founder & Cloud Native Advisor, Liquid Reply

Max is Founder and Cloud Native Advocate at Liquid Reply. He is Co-Chair of the CNCF Environmental Sustainability Technical Advisory Group, CNCF Ambassador, Linux Foundation Europe Advisory Board inaugural member and served 3 years at the Kubernetes release team. In his work he supports... Read More →

love death and robots.pptx pdf

Wednesday April 19, 2023 16:30 - 17:05 CEST
G104-105 | First Floor | Congress Centre

Runtime Performance + Constrained Environments

Content Experience Level Any
Talk Type In-Person

11:00 CEST

Setting up Etcd with Kubernetes to Host Clusters with Thousands of Nodes - Marcel Zięba, Isovalent & Laurent Bernaille, Datadog

Setting up clusters that need thousands of nodes can be challenging especially when it comes to etcd architecture and configuration. It’s especially common in use cases like large processing farms for AI/ML/HPC workloads,or in case of internet scale serving applications. In this session you’ll be able to learn best practices around etcd deployments architecture and configuration from tech leads from DataDog and Google Cloud. DataDog has been running their own Kubernetes clusters with thousands of nodes for many years already. Google Cloud has been offering managed clusters up to 15000 nodes since 2020. You’ll be able to hear from practitioners in the space how to squeeze performance, reliability and scale from etcd instances in your clusters. You'll be able to hear about topics like handling disk io or network throughput bottlenecks or how to handle api server restarts and their impact on etcd.

Speakers

Laurent Bernaille

Principal Engineer, Datadog

Laurent Bernaille worked several years as a consultant specializing in cloud, containers, and automation and helped organizations migrate to the public cloud and adopt containers. He is now Principal Engineer at Datadog and works closely with infrastructure teams, which are responsible... Read More →

Marcel Zięba

Senior Software Engineer, Isovalent

Marcel Zięba is a Senior Software Engineer at Isovalent and is leading SIG Scalability in the Kubernetes open-source community. Previously, Marcel worked on Kubernetes and Google Kubernetes Engine since 2020 focusing mainly on performance and scalability. Now he is focusing on the... Read More →

Setting up Etcd with Kubernetes to Host Clusters with Thousands of Nodes pdf

Thursday April 20, 2023 11:00 - 11:35 CEST
G104-105 | First Floor | Congress Centre

Runtime Performance + Constrained Environments

Content Experience Level Advanced
Talk Type In-Person

11:55 CEST

Taming Tactical Cluster Federation at the Edge - Anna Magdalena Kosek, TNO & Stefan van Gastel, Dutch Ministry of Defence

Cluster federation sounds easy: you take several k8s clusters and treat them as one. Right? Easy! What if the network is unstable? What if clusters are resource-diverse and on the move? What if clusters join, unjoin, and re-join spontaneously? What if a cluster running a stateful application leaves the federation? Cluster federation does not sound so easy anymore. Join us on an adventure of bringing cloud federation to the edge and uncovering just how far a mesh of elastic mobile clouds can stretch. See the use Liqo for distributed federation; a combination of TAS (Telemetry Aware Scheduler) and OLSR (Optimized Link State Routing) for network aware scheduling; and Chaos Mesh to simulate network effects. We will show you a tactical cloud concept developed together by TNO and the Dutch Ministry of Defence, where manned or unmanned vehicles join spontaneously in ad-hoc cloud constellations to deliver a resilient, distributed, and collaborative computation.

Speakers

Stefan van Gastel

Head of Innovation and Research at JIVC, Dutch Ministry of Defence

Starting his career as a web developer, Stefan van Gastel has seen and joined the rise of innovative technologies, methods, and principles. Being responsible for starting movements to implement CI/CD, DevOps, containerization, and other modern software development practices within... Read More →

Anna Magdalena Kosek

Lead Developer, Helin Data

PhD Anna Magdalena Kosek is a Lead Developer at Helin Data and previously a senior software engineering and integration expert at TNO, Netherlands Organization for Applied Scientific Research . She has a background in mathematics and computer science and is an experienced software... Read More →

Taming Tactical Cluster Federation at the Edge KubeCon2023 pdf

Thursday April 20, 2023 11:55 - 12:30 CEST
G104-105 | First Floor | Congress Centre

Runtime Performance + Constrained Environments

Content Experience Level Beginner
Talk Type In-Person
Presentation Slides Attached Yes

14:00 CEST

Sharpen the Edge with K3s and Containerized Operating Systems - Rey Lejano, SUSE

Containers allow applications to run anywhere and on anything (with a container runtime on Linux). This ability to run anywhere is perfect for Edge or IoT devices where device management may be complex. How do we bring container orchestration out to the Edge, IoT devices, and other distant, difficult, or constrained environments – K3s. K3s is a lightweight Kubernetes distribution that can run on as little as 1 CPU and 512MB of RAM and does not need etcd. K3s is a CNCF sandbox project that is perfect to deliver Kubernetes out to the Edge, IoT devices, and constrained environments. This talk isn't a K3s overview and goes beyond deploying Kubernetes at the Edge or on IoT devices. This talk is about "Cloud Nativifying" a few sysadmin duties for Edge and IoT devices where access is difficult. This talk covers how to control OS updates of Edge and IoT devices via Kubernetes by making the OS a Kubernetes resource. First we make a bootable Linux derivative OCI container image and store it in a container registry. Updates are delivered via container registry, controlled by Kubernetes, and accomplished by switching active/passive images. Not only do you manage an application through Kubernetes but also the underlying OS of the machine.

Speakers

Rey Lejano

Cloud Native Solution Architect, SUSE

Rey Lejano is a Field Engineer at SUSE by way of Rancher Labs and is the co-chair of Kubernetes SIG Docs. Rey contributes to various Kubernetes Special Interest Groups such as Contributor Experience, Docs, Release, and Security. He is a member of seven Kubernetes Release Teams including... Read More →

Sharpen the Edge with K3s and Containerized Operating Systems pdf

Friday April 21, 2023 14:00 - 14:35 CEST
Hall 7, Room A | Ground Floor | Europe Complex

Runtime Performance + Constrained Environments

Content Experience Level Any
Talk Type In-Person
Presentation Slides Attached Yes