Advertisement · 728 × 90

Posts by Chris Evans

Post image Post image Post image Post image

On-call? More like on-cool 😎

9 months ago 6 1 0 0
How Picnic uses incident.io to combine digital and operational reliability
How Picnic uses incident.io to combine digital and operational reliability

And as if that’s not enough, they’re also one of the nicest and most motivated teams we have the pleasure of working with. Thanks team Picnic! 🚚🍞🛒

Full video here:
www.youtube.com/watch?v=uJt...

11 months ago 1 0 0 0

“We are now moving to a level where we are building up structural technical resilience, meaning whenever there's an incident happening, there's no longer customers affected, and that is basically the ultimate goal that we are striving for.”

11 months ago 1 0 1 0

This comes off the back of a full migration from PagerDuty to @incident_io On-call, and a replacement of their in-house incident response tool too.

And in terms of impact, I really can’t put it better than Daniel Gebler, their CTO:

11 months ago 0 0 1 0

With all this in mind, it really is a huge honour for @incident_io to be the software they use to help detect, respond to, and mitigate issues as they happen.

11 months ago 0 0 1 0

And when Picnic talk about combining the digital and operational worlds, they really mean it. There’s bespoke software running everywhere, and Grafana dashboards monitoring the status of thousands of devices, sensors, and systems.

11 months ago 0 0 1 0

and was treated to a tour of one of their automated fulfilment centres. Suffice to say, it’s an incredible operation, where robots and humans work side-by-side to get groceries ready for delivery.

11 months ago 0 0 1 0
Advertisement

I had the absolute pleasure of visiting the team to discuss their rollout of @incident_io

11 months ago 0 0 1 0

Behind the scenes, it's a complex hybrid business, combining a digital platform with a vast physical network of warehouses, delivery hubs, and a fleet of thousands of electric vehicles — all running in perfect coordination to keep millions of customers stocked up and smiling.

11 months ago 0 0 1 0

Picnic is an online supermarket on a mission to make grocery shopping simple, fun, and affordable. And they're clearly a company with great taste, choosing @incident_io to help them achieve their high bar for reliability.

11 months ago 2 0 2 0
Avoiding the ironies of automation | Building with AI Incidents happen when the normal playbook fails—so why would we let AI run them solo? Inspired by Bainbridge’s Ironies of automation, this post unpacks how AI can go wrong in high-stakes situations, and shares the principles guiding our approach to building tools that make humans sharper, not sidelined.

I wrote a little more about how we’re doing this. Link below if you're curious.

incident.io/building-wi...

11 months ago 0 0 0 0

At incident, we’re trying to build in the opposite direction. Instead of replacing people, we’re designing AI to show up like your best engineer. Not the brilliant jerk who fixes the thing while you're grabbing a coffee, but the one that helps you understand, decide, and act when things are on fire.

11 months ago 1 0 1 0

People get good at watching systems, not running them. Then something breaks, and we expect them to swoop in and save the day, with little context and limited practice.

11 months ago 0 0 1 0

But there’s a catch. Lisanne Bainbridge highlighted it back in 1983 in her paper "Ironies of Automation": the more we automate, the more we deskill.

11 months ago 0 1 1 0
Advertisement

My take: the only viable solution is to use AI to support the people operating these systems. It’s not unlike modern aircraft—too complex for a human alone, so we rely on automation to augment the pilot, not replace them.

11 months ago 1 0 1 0

As AI enables faster software development and systems become harder to fully understand, we’re left with a growing problem: how do we safely operate what we no longer fully grok?

11 months ago 0 0 1 0

After six months of watching our engineering team deep in the AI trenches, I’m more convinced than ever: AI isn't just a nice-to-have in incident management—it’s a necessity.

11 months ago 0 0 1 0
Video

So when you're debriefing next and every fibre of your being wants to criticise someone for taking a risky action that made things worse, consider how different you'd be feeling if their heroics actually paid off.

11 months ago 0 0 0 0

Take the forklift-jet-engine saga... Had the forklift flipped over and injured 4 adults, this would be a very different post. And yet, I'm here congratulating the ingenuity.

11 months ago 1 0 1 0

What's interesting to me is that these off-piste actions are often highly dangerous, and when it comes to debriefing, there's a fine line between blame if it goes wrong, and celebration when it goes right.

11 months ago 0 0 1 0

But what does this have to do with incidents?

Incidents are a place where things that have never been done before, happen all the time. And when it comes to debriefing, it can be fascinating to dig into the actions that are taken to understand how things really get done.

11 months ago 0 0 1 0

Then, they fired up a small tractor, balanced it's front loader on the pickup, pushed down so its full weight was on the forklift, and lo-and-behold it worked.

But don't take my word for it. Check out the video of the whole thing below!

11 months ago 0 0 1 0
Advertisement

What did they do?

First up, they got 4 adults to stand on the back of the forklift as a counterbalance. Turns out we were nowhere near heavy enough.

11 months ago 0 0 1 0

But the implications of doing so were high: The truck couldn't wait, so would have to take the engine to be unloaded elsewhere, the cost of the forklift was sunk, and the cost of hiring a crane likely significant.

<Insert joke here about them being able to afford it>

11 months ago 0 0 1 0

In any kind of normal mode of operation, this is where you'd go back to the drawing board, replan and probably order a crane to unload it.

11 months ago 0 0 1 0

It showed up on a truck the size of a small country, and the owner had a forklift ready to unload it.

Only the forklift was severely unpowered, and insufficiently weighted to lift the thing up. Turns out jet engines are pretty heavy at ~4.5 tons.

11 months ago 0 0 1 0

I recently stayed at an Airbnb where the owner (a self-confessed plane nerd) had ordered a disused 747 jet engine to be a show piece on the property.

11 months ago 0 0 1 0

What does unloading a 747 jet engine from a trailer have to do with incident management?

Great question...

11 months ago 0 0 1 0
Building an AI incident responder
Building an AI incident responder YouTube video by incident-io

I sat down with @lawrencejones.dev and our AI PM earlier this week to chat more about it.

Full recording here if you're interested: youtu.be/rNpwZPOUhuE?...

1 year ago 3 0 0 1
Advertisement
Video

We're building an AI incident responder 🔥

Not to replace humans, but to work jointly alongside them.

Initially, we're focused on accelerating the investigation and diagnosis stages of an incident, helping search and correlate data points, signals and context from across your organization.

1 year ago 4 1 1 0