Small local failures (design for them)
Малые локальные провалы · Chaos engineering · Small failures
Build systems where small components fail often, cheaply, and visibly - so the whole system gets information without ever risking ruin.
Forest fires that are suppressed too well accumulate fuel until a catastrophic fire. Companies that prevent every micro-failure accumulate fragility until a big one. Antifragile design accepts - even invites - small failures as information, then learns from them.
If your system never fails small, it's accumulating the conditions to fail big.
Netflix's Chaos Monkey: deliberately kills production services in random patterns. The system has to handle this. Big outages become almost impossible.
- 1Where can you afford a small failure today that will teach you something?
This part of the knowledge base is inspired by the book. Go to the Ukrainian edition to explore the concept in depth.
- · Антихрупкость - local failures