Just Out! "Beyond Root Cause: A Better Approach to Understanding Complex System Failures"
I'm excited to share my latest article, which explains why traditional root cause analysis and the 5 Whys approach fall short in complex systems.
I hope you enjoy it!
www.resiliumlabs.com/blog/beyond-...
Posts by Adrian Hornsby
Resilience Bites #9 - LinkedIn Rewind (week 14) is out!
I've discussed a few key concepts, including Sherlock Holmes' "Dogs Not Barking", the tension between high standards and adaptability, and why sometimes "turning it off and on again" contains hidden wisdom.
adhorn.me/posts/resili...
Holly smoke .. .I hadn't seen it. Now I can't unsee it.
That's not what I mean. I just think it isn't easy as that :)
(last) Anyway, thanks a lot for the feedback and for making me think :)
(7/n) I'm not saying teams shouldn't be responsible, but I am just wondering if our traditional ideas about accountability need to evolve as these systems become more autonomous.
(6/n) It's like trying to hold someone accountable for the weather. Yes, they can build the forecasting system, but at some point, the complexity makes understanding impossible.
(5/n) But here's my struggle. As these AI systems get more complex and their decision-making more opaque, can we honestly say the teams fully understand what's happening anymore?
(4/n) Your point about responsibility really got me thinking. While I love blameless postmortems too, the accountability question gets tricky with these systems. In theory, yes, the human teams should be responsible.
(3/n) And good catch on the redundant "famous" - definitely missed that one in editing!
(2/n) Yea, I should've been clearer about the difference between regular AIOps (where humans still make the final calls with AI help) and what I'm calling "meta-operators" (where the AI is actually making the decisions itself). Thanks for picking up on that while still following my main points!
(1/n) Thanks so much for the thoughtful feedback, Dave!
Let me know how you like it or not please :)
🚀🚀🚀New blog post! 🚀🚀🚀
I have been thinking a lot about AI meta-operators, which are AI agents that will manage our systems and make operational decisions.
In this blog post, I am sharing some thoughts and asking questions.
I hope you enjoy it!
medium.com/the-cloud-ar...
#AI
🚀 New blog post out! 🚀
This post discusses the 70% problem with AI-generated code, Bainbridge's automation ironies, and what chaos engineering can teach us about managing complexity in the age of AI.
I hope you enjoy it!
Happy weekend!
adhorn.medium.com/chaos-engine...
In every system, something works.
Rather than asking what's wrong and how to fix it, ask what's working and how to get more of it.
The best time to test your runbook is before the incident, not during it.
"A good traveler has no fixed plans and is not intent on arriving."
- Lao Tzu
“In a wicked world, relying upon experience from a single domain is not only limiting, it can be disastrous.”
― David Epstein, Range: Why Generalists Triumph in a Specialized World
Yes, really good piece indeed.
That is pretty scary.
Will 2025 finally mark the rise of the Chief Resilience Officer?
Right. The years and billions of dollars spent preparing are why Y2K didn’t “live up to the hype.” They *fixed* it. Before it happened. Which is good. Yes.
😱 r/where/were
“Some problems are better evaded than solved.”
Tony Hoare
"There are two methods in software design. One is to make the program so simple, there are obviously no errors. The other is to make it so complicated, there are no obvious errors."
Tony Hoare
en.wikipedia.org/wiki/Tony_Ho...
"Awareness is the greatest agent for change."
- Eckhart Tolle