Most AI agent tools let you watch your agents fail ๐ฟ
Restate lets you do something about it ๐ฎ
โธ๏ธ Pause during outages
โ Kill stuck retry loops
๐ Restart from a specific step, keep the work done
๐ฌ Update in-flight agent calls
Your agents deserve a remote control: www.restate.dev/blog/a-remot...
Posts by Restate
@restate.dev v1.6.1 + v1.6.2 just dropped ๐
โ ๏ธ fixed leadership thrashing that could destabilize clusters
๐ง slashed per-invocation memory (gigabytes saved)
๐ fixed lazy state keys stuck in retry loops
๐ฆ weekly Docker security refreshes
details โ github.com/restatedev/r...
Restate + Vercel AI SDK =
โ
Recover from failures without losing progress
โ
Built-in state management
โ
Trace every decision and action
โ
From simple agents to complex multi-agent systems
โ
Resilient human approvals, timeouts, and rollback
www.restate.dev/blog/buildin...
Restate + Vercel AI SDK = enterprise-grade durability for your AI agents with just a few lines of code.
Survive crashes, timeouts, and restarts - even for multi-turn conversations, human approval and long-running research agents.
@vercel.com @sdk.vercel.ai
AWS's recent us-east-1 outage hit ChatGPT, Snapchat, and Perplexity. In 2021: Netflix, Disney+, Robinhood.
The problem? No true geo-replication.
We show how @restate.dev makes it simple with zero downtime during a simulated region failure:
www.restate.dev/blog/geo-rep...
If yesterday's AWS outage made you rethink your infrastructure, this blog post shows exactly how to build apps that stay up when the cloud goes down.
๐ Full blog post: www.restate.dev/blog/geo-rep...
We ran a Restate app deployed in multiple regions, then killed an entire region.
Result:
โ
Zero downtime
โ
Auto-failover in <60s
โ
No data loss
โ
All requests processed
With Restate, geo-replication is a deployment detail, not an application concern.
Write normal business logic. Deploy across regions with one config line:
'default-replication = "{region: 2, node: 3}"'
Traditional geo-replication is brutally hard:
- Complex distributed state management
- Custom replication logic & conflict resolution
- Leader election
- Restructuring your entire application code
It's an expert-level undertaking that costs companies months of engineering time.
AWSโs us-east-1 outage took down ChatGPT, Snapchat & Perplexity.
The same happened in 2021: Netflix, Disney+, and Robinhood all went dark.
The reason? Often a lack of true geo-replication.
Here's how you can run apps with zero downtime during a region failure using Restate ๐งต
Restate Cloud is now publicly available! This webinar takes you through everything you need to know to get started smoothly:
youtu.be/W2iDBqSIbqU?...
โLast week, we announced that Restate Cloud is now publicly available. Join us on Wednesday for a hands-on walkthrough of Restate Cloud. Learn how to quickly set up your first project, and start building resilient applications, without managing any infra.
luma.com/bdspjxof
You can also swap it around and add LLM/agentic steps to workflows. For example first parse some text to structured form with an LLM and then execute additional durable steps calling other services/APIs/databases, updating state, etc
You read our mind! Here is an example of a resilient mcp server with Restate: github.com/restatedev/a... Indeed a good fit for Durable Execution together with Restate's task control (start, cancel, re-attach, long-running tasks)
Good question! Restate helps not just agents but workflows in general. The agent might kick off a tool/workflow, that you only want to execute exactly once: maybe it sends an email or writes to a database or updates K/V state. It also lets long/heavy tasks replay instead of redoing work on retry.
The Restate UI has improved so much over the last releases. The UI now shows a live timeline of execution steps, including retries, nested RPC calls, events, and even cancellation signals. And it's so beautiful!
๐ Yesterday, we announced that ๐ฅ๐ฒ๐๐๐ฎ๐๐ฒ ๐๐น๐ผ๐๐ฑ is now publicly available. Today, weโre announcing ๐ฅ๐ฒ๐๐๐ฎ๐๐ฒ ๐ญ.๐ฑ!
This release brings key features for building, controlling and observing your durable agents and workflows.
Sign up for the free tier with a few clicks and get your first durable agent running in seconds with the built-in tutorial!
restate.dev/cloud
Restate Cloud has all you need to develop transactional, resilient, secure applications.
Connect services running on serverless compute platforms like @cloudflare.social, @vercel.com, @deno.land, or AWS Lambda. Or on your own infra on AWS, GCP, or Kubernetes.
www.restate.dev/blog/announc...
๐ Exciting news! ๐ฅ๐ฒ๐๐๐ฎ๐๐ฒ ๐๐น๐ผ๐๐ฑ is now publicly available! Build workflows and AI agents with durable execution, observability, and production-grade availability without running any stateful infra yourself.
A lot is brewing at Restate! Hereโs the first in a series of rollouts coming over the next few weeks โจ Stay tuned!
๐ The simplest way to build resilient applications by Giselle van Dongen @ Spring I/O 2025
โถ๏ธ Video: youtu.be/lYcEa8APBTQ
๐ Slides: 2025.springio.net/slides/the-s...
๐ ๏ธ Repo: github.com/gvdongen/res... #springio25 @restate.dev
๐ Restate v1.4.3 is here!
โจ New features:
๐ง Shell completions for CLI tools
โต Enhanced Helm charts
๐ OTLP+HTTP tracing support
Seamless upgrade, no migration needed! ๐
๐ฆ Download: github.com/restatedev/r...
๐ Get started: docs.restate.dev/get_started/...
In case you missed our latest blog post, check it out here:
"Durable AI Loops: Fault Tolerance Across Frameworks (Without Handcuffs)"
๐ restate.dev/blog/durable...
Why Durable Execution matters:
๐ Automatic retries & recovery
โณ Long-running workflows
๐ง Resilient human-in-the-loop
๐น๏ธ Agent orchestration
๐ Built-in observability
โถ๏ธ Watch the full session: www.youtube.com/watch?v=Bawf...
๐ฅ Build Durable AI Agents with Restate + Vercel AI SDK
Here are the highlights from our recent community meeting on building resilient AI agents using Restate with your favorite Agent SDK (like Vercel AI or OpenAI).