Thanks for the head's up. We're aware of the issue and we've got folks get the binaries back into the channels where they don't cache it, like homebrew. Downloading the binary directly from our downloads page still does wok. Sorry for the troubles!
github.com/ngrok/homebr...
Posts by ngrok
Set it up in the dashboard or API. Currently free on pay-as-you-you plans.
ngrok.com/blog/2026-03...
We gave ngrok domains a `resolves_to` property and, what do you know, made the *routing* part of data residency a one-click.
Pin your traffic to specific PoPs so European data stays in Europe, then rearrange the whole thing tomorrow if you want.
Expose local apps and APIs to the internet with your coding agent and our new `expose-localhost` skill.
Then ask it to add in OAuth, OWASP protection, rate limits, and more if you're feeling plucky.
$ npx skills add ngrok/agent-skills
github.com/ngrok/agent-...
Quantization can make an LLM 4x smaller and 2x faster, with barely any quality loss. But what *is* it? @samwho.dev crafted a beautiful interactive essay explaining it from first principles, aimed at coders, not mathematicians.
ngrok.com/blog/quantiz...
I spent 2 months learning about quantization and am extremely proud of the post I've written about it. I think these are some of the nicest visuals I've ever made, and I love how this compression technique invented in 1898 is being used on the bleeding edge in 2026.
ngrok.com/blog/quantiz...
Route between OpenAI and Anthropic with one API key without an account for either.
ngrok's AI Gateway now has its own API keys and prepaid credits. One key covers both, and you can just set your model to "ngrok/auto" if you don't even want to pick.
ngrok.com/blog/ai-gate...
ngrok's AI gateway now supports the Anthropic SDK natively. Change your base_url, keep your prompt caching, extended thinking, all of it.
The diff is one line & there's no step two.
ngrok.com/blog/native-...
What do LLMs see?
I wrote a lil' tool that extracts the attention matrices out of open models and creates this typing visual, with each token's opacity changing according to its average attention score as the prompt progresses. Dimmer words are considered less important to the model.
This week on Overcommitted, we got to sit down with Bluesky's favorite tech blogger @samwho.dev and it did not disappoint!
Sam makes some of the coolest tech content on the internet, and if you haven't heard from him yet, you should! Full episode out now: overcommitted.dev/interactive-...
We're hiring a full-time video creator. Looking for someone who always aims for quality above cadence, is eager to deep-dive on AI and networking, wants to be part of a small and taste-obsessed team.
Details: job-boards.greenhouse.io/ngrokinc/job...
Did you know that SWE-bench only tests how good models are at Python? Or the big drama around the FrontierMath benchmark? We didn't either! But now you will, because @samwho.dev put together an extensive explanation of 14 popular AI benchmarks.
ngrok.com/blog/ai-benc...
These all started as product feedback from folks who couldn't use ngrok without them or felt one papercut too many.
Give one or all of em a *hey i want this please* click on our early access page in the dash.
dashboard.ngrok.com/developer-pr...
Three ships coming soon we're excited about:
◆ region pinning for data residency
◆ similarly, dedicated IPs instead of our default range of multi-tenant IPs
◆ faster & more reliable certificate provisioning for ngrok/custom/wildcard domains ↓
A favorite beat on the new ngrok.com: We put ngrok on top of ngrok.com so you can ngrok while you learn about ngrok. If we're going to ask you to use us for anything, we better do it ourselves first, in prod, and also show our work.
And we finally made a homepage that says that, and much more, with some real *gusto*.
ngrok is one gateway for all your traffic.
So, time to introduce the new ngrok.com
For years we've helped you secure, transform, and route to services running anywhere. localhost to prod, APIs to AI models, devices in the field to databases in customer networks.
We heard you like paging through dozens or hundreds of objects in API responses to find the one you want.
Just kidding, nobody's ever said that, so thank goodness that server-side filtering of the ngrok API is now GA.
ngrok.com/blog/api-fil...
An enjoyable look at how LLMs work under the hood from @samwho.dev. The whole chain of tokenizers (text chunks), embeddings (vectorized text chunks), transformers (the "T" in ChatGPT), caching (reused chunks), and all the math that goes into returning a response.
ngrok.com/blog/prompt-...
Learned more about LLM's under the hood from this post than my stupid $30 online course
ngrok.com/blog/prompt-...
@samwho.dev doing @samwho.dev things 🙌
Privacy is at our core! 🛡️
New post! ✨ Prompt caching ✨
My first big project post for @ngrok.com. 5 weeks, 12217 lines of code, 195 commits. I poured a lot into this one, and learned a lot in the process.
I really hope you enjoy it ❤️
Yesterday we launched ngrok.ai into early access. Today we're bringing you a deep dive into LLM internals with beautiful visuals crafted by our very own @samwho.dev. Discover exactly what gets cached to offer you 10x cheaper input tokens.
ngrok.com/blog/prompt-...
One gateway for every AI model. Change your baseURL and we handle routing, failover, key rotation, and more. Early access is open—and we're building the roadmap with you.
🔗 ngrok.com/blog/ngrok-a...
At ngrok, we embedded the OWASP Core Rule Set and Coraza into ngrok's own Traffic Policy engine, and then dogfooded it across 300M+ requests to ngrok.com. 🛡️
In this post, Ben Chan walks you through how we built (and battle-tested) ngrok’s WAF! ⚔️
🔗 ngrok.com/blog/ngrok-w...
Shape #1: The Database Gateway
Gateways aren’t just for APIs. This one lives between your users and the database you need to (securely) put online, offering mTLS, rate limiting, query filtering, and secrets support, all at the edge.
🔗 ngrok.com/docs/univers...
1/5
ngrok Documentation: The many shapes of API Gateways and where ngrok fits in
To celebrate the many shapes of gateways (🔗 RE: ngrok.com/blog/api-gat...), this week we'll walk through a few of our favorite gateway patterns; how they work, what they solve, and where ngrok fits in each one!
ngrok Blog: Sync secrets from external sources to ngrok with Kubernetes External Secrets
If your secrets live on a platform like HashiCorp Vault, AWS Secrets Manager, or Google Secret Manager, you may want to sync them into ngrok Vaults via the External Secrets Operator so your Traffic Policies reference a single, rotated source of truth. 🔑
🔗 Check: ngrok.com/blog/kuberne...
ngrok Blog: Why didn't ngrok go down in last week's AWS outage?
Last week’s AWS outage took down half the internet. ngrok stayed up, and a bunch of people asked "HOW?" 🤔
Our Senior Customer Engineer, Peter Yoakum, covers just that in the latest on our blog. 🕸️
🔗 Read here: ngrok.com/blog/dont-us...
ngrok Blog: What are AI gateways, and do you even need them?
“The AI gold rush has led to a very modern problem: too many shovels, too little gold.”
AI gateways are how teams keep their models, costs, and chaos under control. 💸
Here’s a guide on how they work and whether you actually need one...
🔗 ngrok.com/blog/ai-gate...