Simple Kubernetes Targeting for Your Chaos Experiments
Today we’re excited to introduce native Kubernetes support to the Gremlin Reliability as a Service platform. Gremlin users can now easily discover, visualize, and target their Kubernetes objects within our web app and using our API. Now, when kicking off an experiment, Gremlin will intelligently select the containers underlying the specified Kubernetes objects so that you can be confident that your application runs the way you expect it to on Kubernetes.
Simplicity
In the past if you’ve installed Gremlin on a container in a Kubernetes cluster, you dealt with containers being spun up and down by the Kubernetes orchestrator. Targeting the containers underlying the service on which you’d like to experiment has been a bit like a game of whack-a-mole.
[ Related: How to Install and use Gremlin with EKS ]
At the core, Kubernetes is a system to automate and abstract the complexity around deploying, scaling, and managing containerized applications. As Kubernetes has become a production ready platform used across the industry, we’ve brought this simplicity to Gremlin, allowing safe and easy targeting of your desired services when running Chaos Engineering experiments, allowing you to think about your infrastructure in abstract terms, exactly as you do with Kubernetes.
Gremlin enables us to run chaos experiments on specific Kubernetes services. This significantly reduces the amount of time it takes to do negative testing and increases our systems' resilience to failure.
Your Kubernetes Cluster
Once you’ve updated the Gremlin agent for Kubernetes using our helm chart, open our web app and get started by creating a new attack to simply target your services running on Kubernetes objects.
What you see next should be very familiar to you, but with the newly added Kubernetes option. Your list of clusters and namespaces are available and searchable to let you filter down the set of Kubernetes objects that are available for experimentation. Your objects are then broken down by Deployment, DaemonSet, ReplicaSet, StatefulSet, and Pod to allow easy and safe targeting. Try clicking on one or many objects and you’ll notice the visual map of your cluster light up to represent what will be impacted when this attack is run.
Once you’ve selected and configured the Gremlin of your choice, unleashing the attack will interpret the Kubernetes objects you’ve selected and intelligently target the appropriate containers. You no longer have to look for specific containers from a long list and wonder if you’ve found them all. Instead, the containers targeted by this attack will show up in the attack details screen, grouped together by the Kubernetes object they belong to.
All of the container details and logs will be available for your review as you’d expect when running any type of Gremlin experiment.
Get Started Now
The native integration with Kubernetes is available now for all users. Get started by heading to the web app, create a new attack and click on Kubernetes. Targeting Kubernetes will soon be possible within a Scenario, however is not available just yet.
For more information on using Kubernetes with Gremlin, checkout our docs, download a copy of our comprehensive eBook, “Kubernetes Reliability at Scale,” or join the #support channel in the Chaos Community Slack.
Gremlin's automated reliability platform empowers you to find and fix availability risks before they impact your users. Start finding hidden risks in your systems with a free 30 day trial.
sTART YOUR TRIALTo learn more about Kubernetes failure modes and how to prevent them at scale, download a copy of our comprehensive ebook
Get the Ultimate GuideWhat is Failure Flags? Build testable, reliable software—without touching infrastructure
Building provably reliable systems means building testable systems. Testing for failure conditions is the only way to...
Building provably reliable systems means building testable systems. Testing for failure conditions is the only way to...
Read moreIntroducing Custom Reliability Test Suites, Scoring and Dashboards
Last year, we released Reliability Management, a combination of pre-built reliability tests and scoring to give you a consistent way to define, test, and measure progress toward reliability standards across your organization.
Last year, we released Reliability Management, a combination of pre-built reliability tests and scoring to give you a consistent way to define, test, and measure progress toward reliability standards across your organization.
Read more