WEBINAR

How to find Kubernetes reliability risks with Gremlin

Office Hours
April 17th, 2025 at 11 am

Register Now

Thank you for registering! We'll email you a link to join the webinar on April 17, 2025.

Most Kubernetes clusters have reliability risks lurking just below the surface. You could spend hours or even days manually finding these risks, but what if someone could find them for you?

With Detected Risks, Gremlin automates the work involved in finding and tracking reliability risks across your Kubernetes clusters. Surface failed Pods, mismatched image versions, missing resource definitions, and single points of failure, all without having to run a single test.

Learn how Gremlin uses automatic risk detection to scan your Kubernetes clusters for reliability risks. You’ll also learn where to find your risks in the Gremlin web app, strategies for resolving risks, and how to generate a risk report for leadership.

  • Where to find detected risks in the Gremlin web app
  • Techniques for resolving detected risks before they can cause an incident or outage
  • How to confirm that your fixes address the underlying problem
  • How to add or remove Detected Risks from your reliability score
About the speakers

Andre Newman

Sr. Reliability Specialist
Gremlin

At Gremlin, Andre promotes the benefits of Chaos Engineering and reliability testing to engineering teams around the world, including at some of the largest enterprise organizations. Prior to Gremlin, he created technical content explaining Kubernetes and containerization, the shift to cloud computing, DevOps, observability, and more. His work has been featured in The New Stack, DZone, Software Engineering Daily, TechBeacon, and StatusCode Weekly.

Avoid downtime. Use Gremlin to turn failure into resilience.

Gremlin empowers you to proactively root out failure before it causes downtime. See how you can harness chaos to build resilient systems by requesting a demo of Gremlin.

Product Hero ImageShape