
Tumult — Rust-Native Chaos Engineering Platform


Tumult Conceptual Banner

What is Tumult?

Tumult is a modern, modular chaos engineering platform written in Rust. It serves as a fast, portable, and inherently observable alternative to Python-based tools like Chaos Toolkit.

Tumult is designed for the modern cloud-native landscape. It doesn't just create disruption; it provides the native observability required to understand exactly how systems respond, and the structured data format necessary for modern analytics and automated tooling to analyze those responses at scale. Every result flows into a structured data pipeline — token-efficient formats, embedded SQL analytics, and OpenTelemetry traces — making the platform as readable by LLMs and data tools as it is by engineers.

Try it

curl -sSL https://raw.githubusercontent.com/mwigge/tumult/main/install.sh | sh

One command. Builds the binary, starts Docker infrastructure, runs a verification experiment. Requires Rust and Docker.

Then run your first chaos experiment:

tumult run examples/redis-chaos.toon

Run a full GameDay (E2E)

One script — starts infrastructure, runs 4 PostgreSQL resilience experiments via MCP, scores results, maps to DORA compliance:

./scripts/gameday-demo.sh
GameDay: Q2 PostgreSQL Resilience Programme
Status: COMPLIANT
Resilience Score: 1.00
  #1 [PASS] PostgreSQL connection kill under load
  #2 [PASS] PostgreSQL container pause — total unavailability
  #3 [PASS] PostgreSQL CPU stress — resource pressure
  #4 [PASS] PostgreSQL memory stress — resource pressure

Compliance: DORA EU 2022/2554 Art. 11, 24, 25 | NIS2

See QUICKSTART.md for the full guided walkthrough, or watch the interactive demo on the website.

Demo: validate → run → analyze → compliance
❯ tumult validate examples/redis-chaos.toon
Experiment: Redis resilience — verify recovery after disruption
Method steps: 3 | Hypothesis: Redis responds to ping (1 probes)
Validation passed.

❯ tumult run examples/redis-chaos.toon
Running experiment: Redis resilience — verify recovery after disruption
INFO experiment.completed status=Completed duration_ms=314 deviations=0
Status: Completed | Duration: 314ms | Method steps: 3 executed
Journal written to: journal.toon

❯ tumult analyze --query "SELECT title, status, duration_ms FROM experiments"
title                                      status     duration_ms
─────────────────────────────────────────  ─────────  ───────────
Redis resilience — verify recovery         completed  314
PostgreSQL failover — kill connections     completed  241
Pumba network latency injection            completed  12787
SSH remote execution — stress test         completed  3385

❯ tumult compliance --framework dora .
=== DORA — Digital Operational Resilience Act (EU 2022/2554) ===
Success rate: 100.0% | Overall: COMPLIANT
Source: https://eur-lex.europa.eu/eli/reg/2022/2554/oj

❯ tumult store stats
Experiments: 47 | Activities: 142 | File size: 2.76 MB

Core Concepts (Why Rust-Native?)

Legacy chaos engineering tools are powerful but face significant hurdles in modern production environments: Python runtime overhead, complex dependency deployments, and verbose JSON data structures that are costly and inefficient for advanced analysis.

Tumult solves these issues by being built in Rust:

  1. Speed & Single Binary: Compiles to a single binary per platform. It executes faster and "just runs" without runtime dependencies.
  2. Observability-First: Every action, probe, and lifecycle event is emitted as a real OpenTelemetry span with resilience.* attributes. Each activity gets its own span with unique trace/span IDs. OTLP gRPC export is built-in.
  3. Data-Driven Analysis: Tumult uses TOON (Token-Oriented Object Notation) for experiments and journals. Journals flow through Apache Arrow into embedded DuckDB for SQL analytics, and export to Parquet for any data tool. TOON is 40-50% more token-efficient than JSON.


Architecture

Tumult uses a decoupled engine and adapter layer architecture, allowing the core engine to be orchestrated by a CLI, an API, or any automated orchestration system via the Model Context Protocol (MCP).

Tumult Architecture Diagram

The Chaos Engineering Landscape

Chaos Engineering Landscape

Experiment Format & Plugin Model

Compatibility

Tumult retains the familiar conceptual model of Chaos Toolkit, allowing you to transfer existing knowledge of:

  • Steady-State Hypotheses
  • Methods (Sequential and Background steps)
  • Probes & Actions
  • Controls (Lifecycle Hooks)
  • Rollbacks

TOON Experiments

Experiments are defined in TOON (.toon), replacing verbose JSON with a concise, token-efficient format designed for both humans and advanced tooling.

Community Plugins: Script-Based

The script-based plugin model enables the community to contribute chaos capabilities without needing to know Rust. Community plugins are simply directories containing executable scripts (Bash, Python, etc.) and a TOON manifest declaring their capabilities.

tumult-plugin-kafka/
├── plugin.toon              # declares actions, probes, arguments
├── actions/
│   ├── kill-broker.sh
├── probes/
│   ├── consumer-lag.sh

Native Rust Plugins

Native plugins (for performance-critical or SDK-heavy tasks like kube-rs or cloud provider SDKs) are built directly into the core and enabled via Cargo feature flags.

cargo install tumult --features kubernetes,aws

Available Plugins

Plugin Type Capabilities
tumult-core Native (Rust) Experiment runner, five-phase lifecycle, controls, rollbacks
tumult-otel Native (Rust) OTLP gRPC export, per-activity spans, resilience.* attributes
tumult-analytics Native (Rust) DuckDB embedded SQL, Arrow columnar, Parquet/CSV/IPC export
tumult-baseline Native (Rust) Statistical baseline derivation, percentiles, deviation detection
tumult-ssh Native (Rust) SSH remote execution, key/agent auth, file upload
tumult-kubernetes Native (Rust) Pod delete, node drain, deployment scale, network policy, label selectors
tumult-mcp Native (Rust) MCP server with 14 tools (stdio + HTTP/SSE) for AI-assisted chaos engineering
tumult-clickhouse Native (Rust) ClickHouse backend — shared storage with SigNoz for cross-correlation
tumult-stress Script CPU/memory/IO stress via stress-ng, utilization probes
tumult-containers Script Docker/Podman kill, stop, pause, resource limits, health probes
tumult-process Script Process kill/suspend/resume by PID/name/pattern, resource probes
tumult-db-postgres Script Kill connections, lock tables, inject latency, exhaust connection pool
tumult-db-mysql Script Kill connections, lock tables
tumult-db-redis Script FLUSHALL, CLIENT PAUSE, DEBUG SLEEP, connection/memory probes
tumult-kafka Script Kill broker, partition broker, add latency, consumer lag probes
tumult-network Script tc netem latency/loss/corruption, DNS block, host partition
tumult-pumba Script Container-scoped network chaos via Pumba — netem delay/loss/duplicate/corrupt/rate, iptables, container kill/pause/stop, stress injection. Cross-platform (works anywhere Docker runs).

See docs/plugins/ for detailed documentation per plugin.

MCP Server (AI Integration)

Tumult ships a built-in Model Context Protocol server, enabling AI assistants to run, analyze, and create chaos experiments natively.

# Stdio transport (IDE integration)
tumult-mcp

# HTTP/SSE transport (containers, agent fleets, CI/CD)
tumult-mcp --transport http --port 3100

# Docker (HTTP mode by default)
docker run --network tumult-e2e -p 3100:3100 tumult-mcp

# With authentication (recommended in production)
TUMULT_MCP_TOKEN=my-secret tumult-mcp --transport http
MCP Tool Description
tumult_run_experiment Execute an experiment and return the journal
tumult_validate Validate experiment syntax and provider support
tumult_analyze SQL query over journals via embedded DuckDB
tumult_read_journal Read a TOON journal and return contents
tumult_list_journals List .toon journal files in a directory
tumult_discover List all plugins, actions, and probes
tumult_create_experiment Create a new experiment from a template
tumult_query_traces Query trace data (trace/span IDs) for observability correlation
tumult_store_stats Return persistent store statistics
tumult_analyze_store SQL query directly against the persistent DuckDB store
tumult_list_experiments List experiment .toon files in a directory
tumult_gameday_run Run a coordinated GameDay campaign
tumult_gameday_analyze Analyze GameDay results with resilience scoring
tumult_gameday_list List available GameDay definitions

Authentication

Set TUMULT_MCP_TOKEN to require bearer token authentication on all tool calls (constant-time comparison, no timing attack surface). If unset, the server runs without auth and emits a log warning.

Data-Driven Chaos Engineering

Tumult is data-driven by design. Every experiment produces structured evidence — not just pass/fail, but columnar analytics data that flows through a modern data pipeline.

Experiment → TOON Journal → Apache Arrow (columnar) → DuckDB (embedded SQL) → Parquet (export)

Every probe result, every action timing, every hypothesis evaluation is captured as structured columnar data — queryable with SQL, exportable as Parquet for any data tool, and token-efficient for LLM analysis.

# Run experiments — data is captured automatically
tumult run experiment.toon

# Query your experiment data with SQL
tumult analyze journals/ --query "
    SELECT status, count(*) as runs, avg(duration_ms) as avg_ms
    FROM experiments GROUP BY status"

# Export to Parquet — portable to Spark, Polars, pandas, Jupyter
tumult export journal.toon --format parquet

Why this matters:

  • Transparency — all experiment evidence is in standard Parquet format, auditable by anyone
  • Reusability — query across hundreds of experiment runs with SQL, no custom scripts
  • LLM-friendly — TOON journals are 40-50% fewer tokens than JSON equivalents
  • No infrastructure — DuckDB is embedded, Arrow is in-memory, Parquet is a file

See Analytics Guide for table schemas, SQL examples, and export options.

Load Testing During Chaos

Tumult runs load tests concurrently with chaos injection. k6 or JMeter hammer your system while faults are active — measuring the real impact on latency, throughput, and error rates.

# Via experiment config (load: section in TOON)
tumult run examples/pg-load-chaos.toon

# Via CLI flags (override or add load to any experiment)
tumult run experiment.toon --load k6 --load-script load.js --load-vus 50 --load-duration 30s

The load runs as a background process while the method executes in the foreground. Both share the same OTel parent trace — the resilience.load span runs in parallel with resilience.action spans in the SigNoz waterfall.
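The background/foreground split described above can be sketched in a few lines of std-only Rust. This is a hypothetical illustration of the pattern, not Tumult's implementation; `sleep` stands in for the real load tool invocation.

```rust
use std::io;
use std::process::{Child, Command};

// Hypothetical sketch: the load tool runs as a background child process
// while the chaos method executes in the foreground; the load process
// is reaped once the method completes.
fn spawn_load(cmd: &str, args: &[&str]) -> io::Result<Child> {
    Command::new(cmd).args(args).spawn()
}

fn main() -> io::Result<()> {
    // Stand-in for something like `k6 run load.js`; `sleep` keeps the
    // sketch runnable anywhere.
    let mut load = spawn_load("sleep", &["1"])?;

    // Foreground: the experiment method would run its actions here,
    // with both sides reporting into the same OTel parent trace.
    println!("chaos method running while load is active");

    let status = load.wait()?; // reap the background load process
    println!("load finished, success = {}", status.success());
    Ok(())
}
```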

Results flow into the TOON journal as load_result and into DuckDB for SQL analytics:

load_result:
  tool: k6
  duration_s: 10.5
  vus: 5
  latency_p50_ms: 101.0
  latency_p95_ms: 157.0
  error_rate: 0.003
  total_requests: 300
  thresholds_met: true
SELECT e.title, l.latency_p95_ms, l.error_rate, l.total_requests
FROM experiments e
JOIN load_results l ON e.experiment_id = l.experiment_id
WHERE l.error_rate > 0.01

GameDay Orchestration

A GameDay is a coordinated campaign of experiments that runs under shared load and maps results to regulatory compliance articles. It's the difference between "we ran a test" and "we executed a quarterly resilience testing programme per DORA Article 24."

# Create a GameDay from existing experiments
tumult gameday create q2-postgres-resilience \
  --load k6 --load-script examples/k6/smoke-test.js \
  --experiments gamedays/pg-connection-kill.toon,gamedays/pg-container-pause.toon,gamedays/pg-cpu-stress.toon,gamedays/pg-mem-stress.toon \
  --framework dora

# Run all experiments under shared load
tumult gameday run q2-postgres-resilience.gameday.toon

# View aggregate results with resilience score
tumult gameday analyze q2-postgres-resilience.gameday.toon

Output:

GameDay: Q2 PostgreSQL Resilience Programme
Status:  4/4 PASS (COMPLIANT)
Duration: 45.2s

  #1 [PASS] PG connection kill (8200ms)
  #2 [PASS] PG container pause (10700ms)
  #3 [PASS] PG CPU stress (15300ms)
  #4 [PASS] PG memory stress (12100ms)

Resilience Score: 0.91
  Pass rate:    1.00  Recovery: 0.85  Load: 0.90  Compliance: 0.80

Compliance:
  DORA Art. 24 — Testing programme:   MET
  DORA Art. 25 — Scenario testing:    MET
  DORA Art. 11 — Response & recovery: MET

See gamedays/q2-postgres-resilience.gameday.toon for the reference example.
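To make the score breakdown above concrete, here is a minimal sketch of a weighted composite over the four published components (pass rate, recovery, load, compliance). The weights are hypothetical, chosen only so the sample components land near the reported 0.91; the actual weighting used by `tumult gameday analyze` may differ.

```rust
// Illustrative only: hypothetical weights over the four score
// components shown in the GameDay output above.
fn resilience_score(pass: f64, recovery: f64, load: f64, compliance: f64) -> f64 {
    let score = 0.40 * pass + 0.25 * recovery + 0.20 * load + 0.15 * compliance;
    (score * 100.0).round() / 100.0 // round to two decimals for display
}

fn main() {
    // With these assumed weights, the sample components
    // (1.00, 0.85, 0.90, 0.80) yield 0.91.
    println!("score = {:.2}", resilience_score(1.00, 0.85, 0.90, 0.80));
}
```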

OpenTelemetry Observability

Tumult creates real OpenTelemetry spans across every module — not just the experiment runner, but SSH, Kubernetes, plugin execution, baseline acquisition, analytics pipeline, MCP dispatch, and ClickHouse storage.

resilience.experiment           (root span — tumult-core)
├── resilience.hypothesis.before
│   └── resilience.probe        (per probe)
├── resilience.action           (per action)
│   ├── ssh.connect / ssh.execute   (tumult-ssh)
│   ├── k8s.pod.delete / k8s.node.drain  (tumult-kubernetes)
│   └── script.execute          (tumult-plugin)
├── resilience.hypothesis.after
│   └── resilience.probe
├── resilience.rollback
├── baseline.acquire            (tumult-baseline)
├── resilience.analytics.ingest (tumult-analytics → DuckDB or ClickHouse)
│   ├── resilience.analytics.query
│   └── resilience.analytics.export
└── mcp.tool.call               (tumult-mcp)

Every span carries structured events per OTel semantic conventions: journal.ingested, script.completed, drain.completed, tolerance.derived, anomaly.detected.

SigNoz Traces — Tumult experiment span waterfall SigNoz showing a Tumult experiment trace: resilience.experiment → hypothesis → actions → probes → hypothesis after, with timing and span attributes.

Dual-Mode Analytics (DuckDB + ClickHouse)

# Default: DuckDB embedded (works offline, zero dependencies)
tumult run experiment.toon

# With SigNoz: ClickHouse shared storage for cross-correlation
TUMULT_CLICKHOUSE_URL=http://localhost:8123 tumult run experiment.toon

When ClickHouse mode is active, experiment data lives in the same database as SigNoz traces/metrics/logs, enabling queries like:

SELECT e.title, e.status, t.serviceName
FROM tumult.experiments e
JOIN signoz_traces.signoz_index_v2 t ON e.experiment_id = t.traceID

Observability Platform

Tumult ships a composable Docker observability stack powered by SigNoz (MIT licensed) — traces, metrics, and logs in a single UI.

# Full platform: chaos targets + SigNoz observability
make up
open http://localhost:13301    # SigNoz UI

# Chaos targets only (PostgreSQL, Redis, Kafka, SSH)
make up-targets

# Observability only (deploy alongside existing infra)
make up-observe

# Classic stack (Jaeger + Prometheus + Grafana)
make up-classic

The OTel Collector automatically scrapes all services:

Source Receiver Key Metrics
PostgreSQL postgresql connections, rows, locks, WAL
Redis redis clients, memory, ops/sec
Kafka kafkametrics brokers, topics, consumer lag
Docker docker_stats CPU, memory, network per container
Host hostmetrics CPU, memory, disk, network
Tumult OTLP experiment spans, analytics gauges, script counters

Docker Images

Development and evaluation only. The Docker images shipped with this repository are convenience images for local development, demos, and sandbox testing. They are not hardened for production use — they run as root, bind to 0.0.0.0, and do not enforce authentication by default. For production deployments, build your own images with your organisation's security baseline (non-root user, pinned base image digests, network policies, mandatory auth tokens).

Pre-built images are published to GitHub Container Registry on every release.

# Pull pre-built images (no Rust toolchain needed)
docker pull ghcr.io/mwigge/tumult:latest        # CLI tool
docker pull ghcr.io/mwigge/tumult-mcp:latest     # MCP server (HTTP/SSE)

# Run CLI commands
docker run --rm ghcr.io/mwigge/tumult discover
docker run --rm ghcr.io/mwigge/tumult --help

# Start MCP server
docker run -p 3100:3100 --network tumult-e2e ghcr.io/mwigge/tumult-mcp

Both images contain the full platform: all 11 Rust crates, 10 plugins (48 actions), example experiments, and GameDay definitions. The only difference is the default entrypoint.

Image Entrypoint Use case
ghcr.io/mwigge/tumult tumult (CLI) Run experiments, analyze, export, GameDays
ghcr.io/mwigge/tumult-mcp tumult-mcp --transport http MCP server for agent fleets and CI/CD

Local End-to-End Environment

Tumult provides composable Docker bundles for a complete chaos engineering lab with observability. Use start.sh for one-command setup:

./start.sh                  # infra + observe (default)
./start.sh infra            # chaos targets only
./start.sh infra observe    # targets + observability
./start.sh tumult           # MCP server (HTTP, needs infra)
./start.sh all              # everything
./start.sh down             # stop all
./start.sh status           # container health

Full stack architecture

┌─────────────────────────────────────────────────────────────────────────┐
│  ./start.sh all                                                         │
├─────────────────┬─────────────────┬────────────────┬────────────────────┤
│  infra bundle   │  observe bundle │  tumult bundle │  aqe bundle        │
│                 │                 │                │                    │
│  PostgreSQL 16  │  SigNoz UI      │  tumult-mcp    │  Agentic QE Fleet  │
│  :15432         │  :3301          │  :3100 (HTTP)  │  (autonomous QE)   │
│                 │                 │                │                    │
│  Redis 7        │  OTel Collector │  14 MCP tools  │  Connects to       │
│  :16379         │  :14317 (OTLP)  │  DuckDB store  │  tumult-mcp:3100   │
│                 │  :18889 (prom)  │  10 plugins    │                    │
│  Kafka 3.8      │                 │  48 actions    │                    │
│  :19092         │  ClickHouse     │                │                    │
│                 │  (inside SigNoz)│                │                    │
│  SSH Server     │                 │                │                    │
│  :12222         │                 │                │                    │
└─────────────────┴─────────────────┴────────────────┴────────────────────┘
                          │                  │
                          ▼                  ▼
              ┌──────────────────────────────────────┐
              │  tumult-e2e Docker network            │
              │  All services communicate internally  │
              └──────────────────────────────────────┘

Port map

Most ports use the 1xxxx range to avoid conflicts with local services; the SigNoz UI (3301) and the MCP server (3100) are the exceptions.

Bundle Service Port Purpose
infra PostgreSQL 16 15432 Database chaos target
infra Redis 7 16379 Cache chaos target
infra Kafka 3.8 (KRaft) 19092 Message broker chaos target
infra SSH Server 12222 Remote execution target
observe SigNoz UI 3301 Traces, metrics, logs dashboard
observe OTel Collector (OTLP) 14317 OTLP gRPC ingest
observe OTel Collector (Prom) 18889 Prometheus metrics (host + APM)
observe OTel Collector (health) 13133 Collector health endpoint
tumult MCP Server (HTTP/SSE) 3100 MCP tools for agents and CI/CD
classic Jaeger 16686 Trace visualization (optional)
classic Grafana 13000 Dashboards (optional)

End-to-end workflow

# 1. Start infrastructure + observability
./start.sh infra observe

# 2. Run a chaos experiment (traces flow to SigNoz automatically)
export OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:14317
tumult run examples/postgres-failover.toon

# 3. Analyze results with SQL
tumult analyze --all --query "SELECT title, status, duration_ms FROM experiments"

# 4. View traces in SigNoz
open http://localhost:3301    # → Traces → Filter by service: tumult

# 5. Run a GameDay campaign
tumult gameday run gamedays/q2-postgres-resilience.gameday.toon

# 6. Export for external tools
tumult export --format parquet --output results.parquet

# 7. Start MCP server for agent access
./start.sh tumult
# Agents connect to http://localhost:3100/mcp

# 8. Stop everything
./start.sh down

OTel data flow

tumult run experiment.toon
    │
    ▼
┌───────────────────────┐
│  tumult-otel          │  7 canonical spans:
│  (built into CLI)     │  resilience.experiment, .hypothesis.before/after,
│                       │  .action, .probe, .rollback, .analytics.ingest
└───────────┬───────────┘
            │ OTLP gRPC :14317
            ▼
┌───────────────────────┐
│  OTel Collector       │  Contrib image with:
│  (tumult-collector)   │  - OTLP + Arrow receivers
│                       │  - Span-to-metrics (APM)
│                       │  - Host metrics
│                       │  - Prometheus exporter (:18889)
└───────────┬───────────┘
            │ OTLP gRPC :4317
            ▼
┌───────────────────────┐
│  SigNoz Standalone    │  - ClickHouse storage
│  (all-in-one)         │  - Trace explorer
│                       │  - Metrics dashboard
│                       │  - Log aggregation
│                       │  - Alerting
│  UI: :3301            │
└───────────────────────┘

The OTel Collector uses the standard Contrib image — no custom build. See docker/tumult-collector/config.yaml for the pipeline configuration.

Platform Test Protocol

Full functional validation of all platform components is documented in docs/testprotocol.md.

166 tests across 22 categories covering CLI, experiment engine, TOON format, plugins (10 including Pumba), Arrow/DuckDB pipeline, OpenTelemetry observability, custom OTel Collector, SigNoz, ClickHouse, containers, SSH, baseline statistics, analytics/reporting, compliance frameworks, MCP server, and end-to-end scenarios.

Category Tests Pass
CLI + Core Engine 17 17
TOON Format + Plugins 8 8
Script Plugins (10 plugins, 48 actions) 15 13
Arrow + DuckDB Analytics 12 12
OpenTelemetry (7 canonical spans) 10 10
Custom OTel Collector (build + signals) 10 9
Pumba Network Chaos (netem, iptables, container) 15 15
SigNoz + ClickHouse + Containers + SSH 23 23
Analytics, Reporting, Compliance (7 frameworks) 14 14
End-to-End Pipelines + Quickstart 16 16
Unit Tests (580 Rust tests) 7 7
Total 162 161 (99.4%)

161 of 162 tests passing (99.4%). See the full test protocol for detailed results per test.

See docker/README.md for detailed setup instructions.

Phasing & Roadmap

Phase Scope Status
0 — Foundation tumult-core, tumult-plugin, tumult-cli, tumult-otel Done
1 — Essential Plugins SSH, stress, containers, process, Kubernetes Done
2 — Analytics & Data DuckDB, Arrow, Parquet export, trend analysis, databases, Kafka, network Done
3 — Automation MCP server (14 tools, stdio + HTTP/SSE), AI-assisted chaos engineering Done
4 — Persistent Analytics DuckDB + ClickHouse dual-mode, SigNoz integration, backup/restore Done
5 — Regulatory Compliance DORA (EU 2022/2554), NIS2, PCI-DSS evidence reporting Done
6 — Hardening SSH session pool, MCP auth, streaming baseline, experiment templates, signal handlers, audit log, proptest, fuzz Done
7 — Infrastructure SigNoz observability platform, Docker Compose stacks Done
8 — GameDay GameDay orchestration with resilience scoring, DORA/NIS2 compliance mapping Done
9 — Containerization HTTP/SSE transport, GHCR Docker images, composable bundles, e2e environment Done
10 — Agentic QE AQE Fleet integration, autonomous chaos engineering, demo pipeline In Progress

Security

Tumult is built entirely in safe Rust — zero unsafe blocks across all 11 crates. The full security posture is documented in docs/security-assessment.md and vulnerability reporting in SECURITY.md.

Area Status
Unsafe code (our crates) 0 blocks — completely memory-safe
.unwrap() in production 0 calls — all error paths use ? or .context()
SQL injection 0 vectors — no string-formatted queries, Arrow record batch inserts
Command injection Mitigated — null-byte validation, env var passing (not shell interpolation)
Hardcoded credentials 0 — secrets resolved from environment at runtime
cargo-audit 0 HIGH/CRITICAL — 5 low-severity transitive warnings (unmaintained crates)
Dependency tree 675 crates scanned against RustSec Advisory Database on every commit

Script plugins execute shell scripts as subprocesses with timeout enforcement (kill_on_drop), argument validation, and captured stdout/stderr. This is a trust boundary by design — same model as kubectl plugins or Git hooks. See the security assessment for the full analysis including integer cast review, deserialization surface, and supply chain audit.
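The timeout-enforced subprocess model can be sketched with std-only Rust. This is a simplified, blocking illustration of the trust boundary described above, not the real code path (which is async and uses `kill_on_drop`); the polling interval and error messages are invented.

```rust
use std::process::Command;
use std::thread;
use std::time::{Duration, Instant};

// Std-only sketch: run a script as a subprocess, poll for completion,
// and kill the child if it overruns its timeout.
fn run_with_timeout(mut cmd: Command, timeout: Duration) -> Result<bool, String> {
    let mut child = cmd.spawn().map_err(|e| e.to_string())?;
    let deadline = Instant::now() + timeout;
    loop {
        match child.try_wait().map_err(|e| e.to_string())? {
            Some(status) => return Ok(status.success()), // script finished
            None if Instant::now() >= deadline => {
                child.kill().ok(); // timeout: terminate the script
                child.wait().ok(); // reap to avoid a zombie
                return Err("script timed out".into());
            }
            None => thread::sleep(Duration::from_millis(20)), // still running
        }
    }
}

fn main() {
    // `sleep` stands in for a plugin script such as kill-broker.sh.
    let mut quick = Command::new("sleep");
    quick.arg("0");
    println!("{:?}", run_with_timeout(quick, Duration::from_secs(5)));
}
```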

Hardening

This phase focused on production readiness, test coverage, and security.

SSH Session Pool

tumult-ssh now maintains a connection pool (SshPool) that reuses SSH sessions across multiple actions in an experiment. This eliminates repeated TCP + SSH handshake overhead for experiments with many remote steps:

let pool = SshPool::new();
let session = pool.get_or_connect(&config).await?;
session.exec("systemctl stop myservice").await?;
// Next call to the same host reuses the connection:
session.exec("systemctl start myservice").await?;

Experiment Templates

Experiments support ${VARIABLE} substitution in titles and activity names, allowing a single template to be reused across environments:

tumult run template.toon --var env=staging --var cluster=eu-west-1

Undefined variables cause a hard error at startup (not silent).
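The substitution rule can be sketched as follows. This is an illustrative std-only implementation of `${VARIABLE}` expansion with the documented hard-error behaviour, not Tumult's actual parser; the function name and error strings are invented.

```rust
use std::collections::HashMap;

// Sketch of `${VARIABLE}` substitution: an undefined variable is a hard
// error, never silently left in place.
fn substitute(input: &str, vars: &HashMap<&str, &str>) -> Result<String, String> {
    let mut out = String::new();
    let mut rest = input;
    while let Some(start) = rest.find("${") {
        out.push_str(&rest[..start]);
        let tail = &rest[start + 2..];
        let end = tail.find('}').ok_or_else(|| "unclosed ${".to_string())?;
        let name = &tail[..end];
        let value = vars
            .get(name)
            .ok_or_else(|| format!("undefined variable: {name}"))?;
        out.push_str(value);
        rest = &tail[end + 1..];
    }
    out.push_str(rest);
    Ok(out)
}

fn main() {
    let mut vars = HashMap::new();
    vars.insert("env", "staging");
    println!("{:?}", substitute("Deploy to ${env}", &vars)); // expands
    println!("{:?}", substitute("in ${cluster}", &vars));    // hard error
}
```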

Streaming Baseline Acquisition

tumult-baseline exposes AcquisitionStream for incremental sample collection. This is used by the runner to start computing statistics as soon as warmup ends, without buffering the full sample set.
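Incremental statistics of this kind are commonly built on Welford's online algorithm, which updates mean and variance one sample at a time with no buffering. The sketch below shows that idea together with a `mean ± sigma·stddev` tolerance band; the struct and method names are illustrative, not the `AcquisitionStream` API.

```rust
// Welford's online algorithm: streaming mean/stddev with O(1) state.
struct StreamingStats {
    n: u64,
    mean: f64,
    m2: f64, // running sum of squared deviations
}

impl StreamingStats {
    fn new() -> Self {
        Self { n: 0, mean: 0.0, m2: 0.0 }
    }

    fn push(&mut self, x: f64) {
        self.n += 1;
        let delta = x - self.mean;
        self.mean += delta / self.n as f64;
        self.m2 += delta * (x - self.mean);
    }

    fn stddev(&self) -> f64 {
        if self.n < 2 { 0.0 } else { (self.m2 / (self.n - 1) as f64).sqrt() }
    }

    // A mean ± sigma·stddev band, in the spirit of mean_stddev baselines.
    fn band(&self, sigma: f64) -> (f64, f64) {
        let s = sigma * self.stddev();
        (self.mean - s, self.mean + s)
    }
}

fn main() {
    let mut stats = StreamingStats::new();
    for x in [10.0, 12.0, 11.0, 13.0, 9.0] {
        stats.push(x); // statistics are usable after every sample
    }
    println!("mean={} band={:?}", stats.mean, stats.band(2.0));
}
```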

MCP Authentication

The MCP server supports opt-in bearer token authentication via TUMULT_MCP_TOKEN. Token comparison uses constant-time equality (subtle crate) to prevent timing attacks. A Semaphore(10) rate-limits concurrent tool calls.
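The constant-time comparison can be illustrated in plain std (the real implementation uses the `subtle` crate). XOR-accumulating every byte means the running time does not depend on where the first mismatch occurs, which is what defeats the timing attack.

```rust
// Illustrative constant-time comparison (Tumult uses `subtle`).
fn tokens_equal(a: &[u8], b: &[u8]) -> bool {
    if a.len() != b.len() {
        return false; // length leak is acceptable for bearer tokens
    }
    let mut diff = 0u8;
    for (x, y) in a.iter().zip(b.iter()) {
        diff |= x ^ y; // accumulate, never early-return
    }
    diff == 0
}

fn main() {
    println!("{}", tokens_equal(b"my-secret", b"my-secret")); // true
    println!("{}", tokens_equal(b"my-secret", b"my-secreT")); // false
}
```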

Audit Log

Every experiment run emits structured audit events (experiment.started, experiment.completed) as tracing::info! with experiment_id, title, status, and duration_ms fields. These flow into log aggregators (Loki, Elasticsearch) correlated with the OTel trace.

Signal Handlers

tumult run wires SIGINT/SIGTERM to a CancellationToken. In-flight activities complete their current step, rollbacks execute, and the journal is written before exit. The experiment status is recorded as Interrupted.

Trace Context Propagation

Script plugins receive TRACEPARENT and TRACESTATE environment variables, allowing subprocess-emitted OTel spans to attach as children of the script.execute span without any changes to existing scripts.
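The value a script finds in `TRACEPARENT` follows the W3C Trace Context format: version `00`, a 32-hex trace ID, a 16-hex parent span ID, and 2-hex flags, joined by dashes. A script-side tracer only needs to split those fields out, as this sketch (not Tumult code) shows:

```rust
// Parse a W3C `traceparent` value into (trace_id, parent_span_id).
fn parse_traceparent(tp: &str) -> Option<(String, String)> {
    let parts: Vec<&str> = tp.split('-').collect();
    if parts.len() != 4 || parts[0] != "00" {
        return None; // only version 00 is handled here
    }
    let (trace_id, span_id) = (parts[1], parts[2]);
    if trace_id.len() != 32 || span_id.len() != 16 {
        return None;
    }
    Some((trace_id.to_string(), span_id.to_string()))
}

fn main() {
    // Example value taken from the W3C Trace Context specification.
    let tp = "00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01";
    println!("{:?}", parse_traceparent(tp));
}
```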

Test Infrastructure

  • 566 tests across the workspace (up from 391)
  • Property-based tests (proptest) for all statistical functions in tumult-baseline
  • Fuzz target for experiment TOON deserialization (tumult-core/fuzz/)
  • tumult-test-utils crate — shared MockPlugin, EventLog, and experiment builders for integration tests
  • Criterion benchmarks for baseline statistics (tumult-baseline/benches/)

Example Experiment

Here's a complete experiment in TOON that validates database failover with automatic reconnection:

title: Database failover validates automatic reconnection
description: Kill PostgreSQL primary connections and verify app reconnects

tags[2]: database, resilience

configuration:
  db_host:
    type: env
    key: DATABASE_HOST

estimate:
  expected_outcome: recovered
  expected_recovery_s: 15.0
  expected_degradation: moderate
  expected_data_loss: false
  confidence: high
  rationale: Tested monthly with consistent recovery
  prior_runs: 5

baseline:
  duration_s: 120.0
  warmup_s: 15.0
  interval_s: 2.0
  method: mean_stddev
  sigma: 2.0
  confidence: 0.95

steady_state_hypothesis:
  title: Application responds healthy
  probes[1]:
    - name: health-check
      activity_type: probe
      provider:
        type: http
        method: GET
        url: http://localhost:8080/health
        timeout_s: 5.0
      tolerance:
        type: exact
        value: 200

method[1]:
  - name: kill-db-connections
    activity_type: action
    provider:
      type: native
      plugin: tumult-db
      function: terminate_connections
      arguments:
        database: myapp
    pause_after_s: 5.0
    background: false

rollbacks[1]:
  - name: restore-connections
    activity_type: action
    provider:
      type: native
      plugin: tumult-db
      function: reset_connection_pool
    background: false

regulatory:
  frameworks[1]: DORA
  requirements[1]:
    - id: DORA-Art24
      description: ICT resilience testing
      evidence: Recovery within RTO

Quick Start

Prerequisites

You need three things installed. Everything else (Tokio, DuckDB, Arrow, OpenTelemetry, etc.) is handled automatically by cargo build.

Tool    Install                                                              Verify
Rust    rustup.rs: curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh   rustc --version
Docker  Docker Desktop, or Colima (brew install colima && colima start)      docker ps
Git     git-scm.com, brew install git, or apt install git                    git --version

Install and run

# 1. Clone
git clone https://github.com/mwigge/tumult.git
cd tumult

# 2. Build (downloads all Rust dependencies automatically, takes ~3 min first time)
cargo build --release

# 3. Start Docker infrastructure (PostgreSQL, Redis, Kafka, SSH)
make up-targets

# 4. Run your first chaos experiment
./target/release/tumult run examples/redis-chaos.toon

Or use the one-liner that does all of the above:

git clone https://github.com/mwigge/tumult.git && cd tumult && ./install.sh

After install

tumult run examples/redis-chaos.toon       # break Redis, watch it recover
tumult run examples/postgres-failover.toon  # kill PG connections
tumult run examples/pumba-latency.toon      # inject 200ms network latency
tumult analyze --query "SELECT title, status, duration_ms FROM experiments"
tumult discover                             # list all 10 plugins and 48 actions
tumult init                                 # create your own experiment

See QUICKSTART.md for the full guided walkthrough including observability, SSH experiments, compliance reports, and bring-your-own-target instructions.

Pre-built binaries

Download from Releases — pre-built for macOS (Intel + Apple Silicon), Linux (x86_64 + aarch64), and Windows. No Rust toolchain needed for pre-built binaries.

Usage

# Create a new experiment
tumult init

# Validate an experiment
tumult validate experiment.toon

# Dry run — see the execution plan without running
tumult run experiment.toon --dry-run

# Run the experiment
tumult run experiment.toon

# Run with template variable substitution
tumult run template.toon --var env=staging --var cluster=eu-west-1

# Run with custom rollback strategy
tumult run experiment.toon --rollback-strategy always

# List discovered plugins
tumult discover

# Analyze experiment results with SQL
tumult analyze journal.toon
tumult analyze journals/ --query "SELECT status, count(*) FROM experiments GROUP BY status"

# Persistent store management
tumult store stats
tumult store backup --output ~/tumult-backup
tumult store purge --older-than-days 90

# Cross-run trend analysis
tumult trend journals/ --metric resilience_score

# Regulatory compliance report
tumult compliance journals/ --framework dora

# Export to Parquet for external tools
tumult export journal.toon --format parquet

# Start MCP server (stdio for IDE, HTTP for containers)
tumult-mcp                                # stdio (default)
tumult-mcp --transport http --port 3100   # HTTP/SSE
TUMULT_MCP_TOKEN=my-secret tumult-mcp --transport http  # with auth

See CLI Reference for full command documentation.

Development (Makefile)

The root Makefile provides all development and infrastructure commands:

make up              # Full platform: chaos targets + SigNoz observability
make up-targets      # Chaos targets only (PostgreSQL, Redis, Kafka, SSH)
make up-observe      # Observability only (SigNoz + OTel Collector)
make up-classic      # Classic stack (Jaeger + Prometheus + Grafana)
make down            # Stop everything
make status          # Container health check
make ssh-key         # Extract SSH test key to /tmp/tumult-test-key
make test            # Run all Rust tests
make e2e             # Run e2e tests against Docker infra
make lint            # cargo fmt --check + clippy
make build           # Build release binary
make clean           # cargo clean + docker compose down

Direct Comparison to Chaos Toolkit

Chaos Toolkit Component Tumult Equivalent Key Advantage
chaostoolkit (CLI) tumult-cli Single binary, no runtime dependencies
chaostoolkit-lib (engine) tumult-core Rust speed, five-phase lifecycle with baseline
Python extensions Script plugins + Native Rust plugins Community plugins without Rust; native for performance
JSON experiments TOON experiments 40-50% fewer tokens, human-readable
opentracing control Built-in OTel (per-activity spans) Real spans with resilience.* attributes, always on
Manual analysis tumult-analytics (DuckDB + Arrow) Embedded SQL over journals, Parquet export
No AI integration tumult-mcp (14 MCP tools) AI assistants run experiments natively
Ad-hoc infrastructure Docker Compose e2e stack One command to spin up test services

Acknowledgements

Tumult is inspired by and builds upon the concepts pioneered by the Chaos Toolkit project. Chaos Toolkit's experiment model — steady-state hypothesis, method, rollbacks, controls, and declarative experiment format — established the foundational patterns that Tumult reimagines in Rust.

We are grateful to Russ Miles, the ChaosIQ team, and the entire Chaos Toolkit community for making chaos engineering accessible and standardized.

Tumult also leverages:

  • TOON by Johann Schopplich — token-efficient data format
  • Tokio — async runtime powering all concurrent execution
  • OpenTelemetry — vendor-neutral observability standard
  • Apache Arrow + DuckDB — in-memory columnar analytics
  • Pumba by Alexei Ledenev — container-scoped network chaos and fault injection for Docker
  • SigNoz — open-source observability platform (traces, metrics, logs)
  • Agentic QE Framework by Dragan Spiridonov — autonomous quality engineering

License

Licensed under the Apache License, Version 2.0. See LICENSE for details.

Copyright 2026 Tumult Contributors.

About

Rust-native chaos engineering platform with native OpenTelemetry, embedded DuckDB analytics, and Apache Arrow data pipelines. 10 plugins (Kubernetes, Docker, Pumba, PostgreSQL, Redis, Kafka, SSH). 7 regulatory compliance frameworks (DORA, NIS2, PCI-DSS). Single binary, zero unsafe.
