An easy to use and powerful chaos engineering experiment toolkit.（阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具）
ChaosBlade is an Alibaba open source experimental injection tool that follows the principles of chaos engineering and chaos experimental models to help enterprises improve the fault tolerance of distributed systems and ensure business continuity during the process of enterprises going to cloud or moving to cloud native systems.
Chaosblade is an internal open source project of MonkeyKing. It is based on Alibaba's nearly ten years of failure testing and drill practice, and combines the best ideas and practices of the Group's businesses.
ChaosBlade is not only easy to use, but also supports rich experimental scenarios. The scenarios include: * Basic resources: such as CPU, memory, network, disk, process and other experimental scenarios; * Java applications: such as databases, caches, messages, JVM itself, microservices, etc. You can also specify any class method to inject various complex experimental scenarios; * C ++ applications: such as specifying arbitrary methods or experimental lines of code injection delay, tampering with variables and return values; * Docker container: such as killing the container, the CPU in the container, memory, network, disk, process and other experimental scenarios; * Cloud-native platforms: For example, CPU, memory, network, disk, and process experimental scenarios on Kubernetes platform nodes, Pod network and Pod itself experimental scenarios such as killing Pods, and container experimental scenarios such as the aforementioned Docker container experimental scenario;
Encapsulating scenes by domain into individual projects can not only standardize the scenes in the domain, but also facilitate the horizontal and vertical expansion of the scenes. By following the chaos experimental model, the chaosblade cli can be called uniformly. The items currently included are: * chaosblade: Chaos experiment management tool, including commands for creating experiments, destroying experiments, querying experiments, preparing experimental environments, and canceling experimental environments. It is the execution of chaotic experiments. Tools, execution methods include CLI and HTTP. Provides complete commands, experimental scenarios, and scenario parameter descriptions, and the operation is simple and clear. * chaosblade-spec-go: Chaos experimental model Golang language definition, scenes implemented using Golang language are easy to implement based on this specification. * chaosblade-exec-os: Implementation of basic resource experimental scenarios. * chaosblade-exec-docker: Docker container experimental scenario implementation, standardized by calling the Docker API. * chaosblade-operator: Kubernetes platform experimental scenario is implemented, chaos experiments are defined by Kubernetes standard CRD method, it is very convenient to use Kubernetes resource operation method To create, update, and delete experimental scenarios, including using kubectl, client-go, etc., and also using the chaosblade cli tool described above. * chaosblade-exec-jvm: Java application experimental scenario implementation, using Java Agent technology to mount dynamically, without any access, zero-cost use It also supports uninstallation and completely recycles various resources created by the Agent. * chaosblade-exec-cplus: C ++ application experimental scenario implementation, using GDB technology to implement method and code line level experimental scenario injection.
You can download the latest chaosblade toolkit from Releases and extract it and use it. If you want to inject Kubernetes related fault scenarios, you need to install chaosblade-operator. For detailed Chinese usage documents, please see chaosblade-help-zh-cn .
chaosblade supports CLI and HTTP invocation methods. The supported commands are as follows: * prepare: alias is p, preparation before the chaos engineering experiment, such as drilling Java applications, you need to attach the java agent. For example, to drill an application whose application name is business, execute
blade p jvm --process businesson the target host. If the attach is successful, return the uid for status query or agent revoke. * revoke: alias is r, undo chaos engineering experiment preparation before, such as detaching java agent. The command is
blade revoke UID* create: alias is c, create a chaos engineering experiment. The command is
blade create [TARGET] [ACTION] [FLAGS]. For example, if you implement a Dubbo consumer call xxx.xxx.Service interface delay 3s, the command executed is
blade create dubbo delay --consumer --time 3000 --Service xxx.xxx.Service, if the injection is successful, return the experimental uid for status query and destroy the experiment. * destroy: alias is d, destroy a chaos engineering experiment, such as destroying the Dubbo delay experiment mentioned above, the command is
blade destroy UID* status: alias s, query preparation stage or experiment status, the command is
blade status UIDor
blade status --type create* server: start the web server, expose the HTTP service, and call chaosblade through HTTP requests. For example, execute on the target machine xxxx:
blade server start -p 9526to perform a CPU full load experiment:
blade help [COMMAND]or
blade [COMMAND] -hcommand to view help
Download the chaosblade demo image and experience the use of the blade toolkit
Download image command：
shell script docker pull chaosbladeio/chaosblade-demoRun the demo container：
shell script docker run -it --privileged chaosbladeio/chaosblade-demoAfter entering the container, you can read the README.txt file to implement the chaos experiment, Enjoy it.
chaosblade-operator The project is a chaos experiment injection tool for cloud-native platforms. It follows the chaos experiment model to standardize the experimental scenario and defines the experiment as Kubernetes CRD Resources, mapping experimental models to Kubernetes resource attributes, and very friendly combination of chaotic experimental models with Kubernetes declarative design. While relying on chaotic experimental models to conveniently develop scenarios, it can also well integrate Kubernetes design concepts, through kubectl or Write code to directly call the Kubernetes API to create, update, and delete chaotic experiments, and the resource status can clearly indicate the execution status of the experiment, and standardize Kubernetes fault injection. In addition to using the above methods to perform experiments, you can also use the chaosblade cli method to execute kubernetes experimental scenarios and query the experimental status very conveniently. For details, please read the chinese document: Chaos Engineering Practice under Cloud Native
This project is written in golang, so you need to install the latest golang version first. The minimum supported version is 1.11. After the Clone project, enter the project directory and execute the following command to compile:
shell script makeIf on a mac system, compile the current system version, execute:
shell script make build_darwinIf you want to compile linux system version on mac system, execute:
shell script make build_linuxYou can also compile selectively, for example, you only need to compile cli and os scenes, then execute: ```shell script make build_with cli os
make buildwith cli osdarwin
ARGS="cli os" make buildwithlinux ```
For bug report, questions and discussions please submit GitHub Issues.
We welcome every contribution, even if it is just punctuation. See details of CONTRIBUTING
The original intention of our open source project is to lower the threshold for chaos engineering to be implemented in enterprises, so we highly value the use of the project in enterprises. Welcome everyone here ISSUE. After registration, you will be invited to join the corporate mail group to discuss the problems encountered by Chaos Engineering in the landing of the company and share the landing experience.
This project exists thanks to all the people who contribute. [Contribute].
Chaosblade is licensed under the Apache License, Version 2.0. See LICENSE for the full license text.