Adrian Hornsby (@adhorn.me) Bsky

Beyond Root Cause: A Better Approach to Understanding Complex System Failures Discover why traditional root cause analysis and 5 Whys frameworks fall short in complex systems. Learn practical alternatives and the 'Trojan Horse' approach to implement meaningful change in your or...

Just Out! "Beyond Root Cause: A Better Approach to Understanding Complex System Failures"

I'm excited to share my latest article, which explains why traditional root cause analysis and the 5 Whys approach fall short in complex systems.

I hope you enjoy it!

www.resiliumlabs.com/blog/beyond-...

10 months ago 1 0 1 0

Resilience Bites #9 - LinkedIn Rewind (week 14) — adhorn.me Discover the latest insights, innovations, and discussions about resilience engineering. Stay ahead with Resilience Bites.

Resilience Bites #9 - LinkedIn Rewind (week 14) is out!

I've discussed a few key concepts, including Sherlock Holmes' "Dogs Not Barking", the tension between high standards and adaptability, and why sometimes "turning it off and on again" contains hidden wisdom.

adhorn.me/posts/resili...

1 year ago 1 0 0 0

Holly smoke .. .I hadn't seen it. Now I can't unsee it.

1 year ago 0 0 0 0

That's not what I mean. I just think it isn't easy as that :)

1 year ago 0 0 0 0

(last) Anyway, thanks a lot for the feedback and for making me think :)

1 year ago 0 0 0 0

(7/n) I'm not saying teams shouldn't be responsible, but I am just wondering if our traditional ideas about accountability need to evolve as these systems become more autonomous.

1 year ago 0 0 1 0

(6/n) It's like trying to hold someone accountable for the weather. Yes, they can build the forecasting system, but at some point, the complexity makes understanding impossible.

1 year ago 0 0 2 0

(5/n) But here's my struggle. As these AI systems get more complex and their decision-making more opaque, can we honestly say the teams fully understand what's happening anymore?

1 year ago 0 0 1 0

(4/n) Your point about responsibility really got me thinking. While I love blameless postmortems too, the accountability question gets tricky with these systems. In theory, yes, the human teams should be responsible.

1 year ago 0 0 1 0

(3/n) And good catch on the redundant "famous" - definitely missed that one in editing!

1 year ago 0 0 1 0

(2/n) Yea, I should've been clearer about the difference between regular AIOps (where humans still make the final calls with AI help) and what I'm calling "meta-operators" (where the AI is actually making the decisions itself). Thanks for picking up on that while still following my main points!

1 year ago 0 0 1 0

(1/n) Thanks so much for the thoughtful feedback, Dave!

1 year ago 0 0 1 0

Let me know how you like it or not please :)

1 year ago 1 0 1 0

When AI Makes the Call Questions About Meta-Operators and System Responsibility

🚀🚀🚀New blog post! 🚀🚀🚀

I have been thinking a lot about AI meta-operators, which are AI agents that will manage our systems and make operational decisions.

In this blog post, I am sharing some thoughts and asking questions.

I hope you enjoy it!

medium.com/the-cloud-ar...

#AI

1 year ago 1 0 1 0

Chaos Engineering in the Age of AI: Surfacing Hidden Complexity The rise of AI in software development presents a fascinating paradox. While AI tools make it easier than ever to generate complex systems…

🚀 New blog post out! 🚀

This post discusses the 70% problem with AI-generated code, Bainbridge's automation ironies, and what chaos engineering can teach us about managing complexity in the age of AI.

I hope you enjoy it!

Happy weekend!

adhorn.medium.com/chaos-engine...

1 year ago 2 2 0 0

In every system, something works.

Rather than asking what's wrong and how to fix it, ask what's working and how to get more of it.

1 year ago 1 0 0 0

The best time to test your runbook is before the incident, not during it.

1 year ago 1 0 0 0

"A good traveler has no fixed plans and is not intent on arriving."

- Lao Tzu

1 year ago 1 0 0 0

“In a wicked world, relying upon experience from a single domain is not only limiting, it can be disastrous.”

― David Epstein, Range: Why Generalists Triumph in a Specialized World

1 year ago 2 0 1 0

Yes, really good piece indeed.

1 year ago 0 0 0 0

The Canva outage: another tale of saturation and resilience Today’s public incident writeup comes courtesy of Brendan Humphries, the CTO of Canva. Like so many other incidents that came before, this is another tale of saturation, where the failure mod…

@adhorn.me - another one stacked with greatest-hits: surfingcomplexity.blog/2024/12/21/t...

1 year ago 2 1 1 0

That is pretty scary.

1 year ago 0 0 0 0

Will 2025 finally mark the rise of the Chief Resilience Officer?

1 year ago 1 0 1 0

Right. The years and billions of dollars spent preparing are why Y2K didn’t “live up to the hype.” They *fixed* it. Before it happened. Which is good. Yes.

1 year ago 1496 267 62 25

😱 r/where/were

1 year ago 0 0 0 0

James Cameron: The Lessons of Titanic and other Reflections Excerpt from: “Risk and Exploration: Earth, Sea and Sky” NASA Administrator’s Symposium September 26-29 Naval Postgraduate School Monterey, California I am also honored to be part of this august panel...

"Luck is not a factor. Hope is not a strategy. Fear is not an option."

spaceref.com/status-repor...

1 year ago 0 0 0 0

“Some problems are better evaded than solved.”

Tony Hoare

1 year ago 1 1 0 0

Tony Hoare - Wikipedia

"There are two methods in software design. One is to make the program so simple, there are obviously no errors. The other is to make it so complicated, there are no obvious errors."

Tony Hoare

en.wikipedia.org/wiki/Tony_Ho...

1 year ago 2 0 0 0

"Awareness is the greatest agent for change."

- Eckhart Tolle

1 year ago 3 0 1 0

Amazon ECS now supports network fault injection experiments on AWS Fargate - AWS Discover more about what's new at AWS with Amazon ECS now supports network fault injection experiments on AWS Fargate

New HUGE Launch - AWS FIS now supports networking actions on AWS Fargate!!!!

Network latency, Network blackhole, and Network packet loss are ready to be used!

Long awaited. I'm super happy to see this one out for the end of 2024!

Have fun!

aws.amazon.com/about-aws/wh...

1 year ago 5 2 0 0

Posts by Adrian Hornsby