In the process of building a monitoring and analytics platform that ingests trillions of data points a day, Datadog has learned many lessons about scalable, distributed systems in the cloud. We'd like to share those experiences with our community in this series: "Datadog on..."
Each episode will offer a conversation with the engineers who build Datadog. They'll share real-world experiences architecting, building, operating, and monitoring modern systems giving you actionable information you can apply at your organization. With plenty of time left for Q&A, we'd like you to join the discussion.
Datadog on Kubernetes Node Management
October 10, 2023
Adrien Trouillaud, David Benque and Ara Pulido
Datadog, the observability platform used by thousands of companies, runs on dozens of self-managed Kubernetes clusters in a multi-cloud environment, adding up to tens of thousands of nodes, or hundreds of thousands of pods. This infrastructure is used by a wide variety of engineering teams at Datadog, with different feature and capacit...