Advertisement · 728 × 90

Posts by Rita Zhang

It’s basically moving from 'it works on my cluster' to 'it’s ready for the product' in one step.

4 weeks ago 2 0 1 0

No Engine Lock-in: They can swap backends (vLLM, SGlang, LlamaCPP, etc.) without rewriting their deployment logic.

Unified Lifecycle: If the rest of the company’s stack is on K8s, AI Runway lets them use the same observability, security, and CI/CD tools as everyone else.

4 weeks ago 1 0 1 0

Zero-to-Inference: Instead of writing complex Slurm scripts or wrapping models in custom Flask containers, AI Runway provides a standard interface to deploy LLMs with production-grade scaling and routing out of the box.

4 weeks ago 3 0 1 0

The pitch for AI Runway to AI/ML teams isn't about 'changing how you work,' but about 'removing the wall between your model and the user.'

4 weeks ago 2 0 1 0

AI Runway is aimed at platform teams and AI users who want a Kubernetes-native way to deploy and operate inference workloads across multiple serving frameworks, with a simpler UX and tighter integration with the Kubernetes ecosystem.

4 weeks ago 5 0 1 0

Maintainer for AI Runway here 👋. Slurm on Kubernetes is useful if you want HPC-style job scheduling, but it does not provide a common kubernetes inference API or a model-centric control plane.

4 weeks ago 2 0 0 0
Ralph on stage with a slide describing the gpu and node layer.

Ralph on stage with a slide describing the gpu and node layer.

Why run inference on Kubernetes: better economics, more control, built for production.

Why run inference on Kubernetes: better economics, more control, built for production.

Why AI Runway is the AI inference platform for k8s platform teams, ai engineers, organizations.

Why AI Runway is the AI inference platform for k8s platform teams, ai engineers, organizations.

One interface, many backends: no engine lock in, capability aware routing, community extensible.

One interface, many backends: no engine lock in, capability aware routing, community extensible.

AI Runway as explained by @squillace.bsky.social at the #KubeCon Azure pre-day: deploy and manage large language models on Kubernetes. github.com/kaito-projec...

4 weeks ago 11 3 1 0
Preview
How NVIDIA Dynamo 1.0 Powers Multi-Node Inference at Production Scale | NVIDIA Technical Blog | Rita Zhang 🚀 Huge milestone! Congrats to the NVIDIA team on Dynamo v1.0.0! Great to be at the #Dynamo #GTC2026 event seeing real-world adoptions and exciting to see Microsoft as one of the top external contribu...

"Great to be at the #Dynamo #GTC2026 event seeing real-world adoptions and exciting to see Microsoft as one of the top external contributors to help strengthen the platform and grow the OSS community." @ritazh.bsky.social www.linkedin.com/posts/ritazh...

1 month ago 5 4 0 0
Preview
Debugging Kubernetes Network Issues with Inspektor Gadget's tcpdump Gadget | Inspektor Gadget Debugging Kubernetes Network Issues with Inspektor Gadget's tcpdump Gadget

❄️Our holiday gift to you❄️ tcpdump for Kubernetes! In this blog we share how we've taken tcpdump to the next level with K8s context to make it easier than ever to debug networking issues - including with Wireshark! inspektor-gadget.io/blog/2025/12...

4 months ago 3 1 0 0
Kat, Rita, Maciej on stage

Kat, Rita, Maciej on stage

Ask the experts: Kubernetes Steering Committee at #KubeCon. Lots of good discussion about supporting contributors with @kat.lol @ritazh.bsky.social @soltysh.bsky.social.

5 months ago 29 3 0 0
Advertisement
Rita, Janet, and Federico on stage.

Rita, Janet, and Federico on stage.

At the Kubernetes AI Conformance discussion at #KubeCon Maintainer Summit, @ritazh.bsky.social talks about the value of collaborating across open source communities. github.com/kubernetes-s...

5 months ago 5 3 0 0

If you, ALSO, are motivated by superfast, hardware protected micro vms, this starts in 5 minutes! www.youtube.com/live/tROp-nm...

5 months ago 3 1 1 0
Scaling multi-node LLM inference with NVIDIA Dynamo and ND GB200 NVL72 GPUs on AKS | AKS Engineering Blog Learn how optimizing routing, planning, and system setup boosts the efficiency and reliability of multi-node LLM inference on AKS with OSS Dynamo and ND GB200-v6 GPUs.

New blog post: Scaling multi-node LLM inference with NVIDIA Dynamo and ND GB200 NVL72 GPUs on AKS (by Sachi Desai, @ritazh.bsky.social, &
@sozercan.bsky.social) blog.aks.azure.com/2025/10/24/d...

5 months ago 1 3 1 0
Preview
🚀 Internship Opportunity in Systems Research The Microsoft Research - Systems Group https://lnkd.in/dPCb5fqC) and the Azure Research - Systems https://lnkd.in/dhf6Xx-s) are hiring an intern to help… ... 🚀 Internship Opportunity in Systems Research The Microsoft Research - Systems Group https://lnkd.in/dPCb5fqC) and the Azure Research - Systems https://lnkd.in/dhf6Xx-s) are hiring an intern to help b...

For those Master's or Ph.D. students doing systems work, you might have a look at this internship with @microsoft.com's research team: www.linkedin.com/posts/pedroh...

Yes, you'll do some immediately important work, and it's really cool, too.

6 months ago 4 3 0 0
AI-Aware Kubernetes Infrastructure - Upstream@AKS - Episode1 - Azure Kubernetes Service (AKS)
AI-Aware Kubernetes Infrastructure - Upstream@AKS - Episode1 - Azure Kubernetes Service (AKS) YouTube video by Azure Kubernetes Service (AKS)

Interested in the upstream Kubernetes ecosystem? @lachie.bsky.social is hosting a new conversation series where he chats with an Azure colleague about exciting new topics in K8s! In the first episode, he discusses AI-aware Kubernetes infrastructure with Jack Francis! www.youtube.com/watch?v=7G-z...

7 months ago 6 4 0 0
Preview
Wassette: A bridge between Wasm and MCP Microsoft’s new open-source tool kit offers a way to assemble lightweight and secure Model Context Protocol servers from WebAssembly components.

Wassette: A bridge between Wasm and MCP by Simon Bisson (@sbisson.com) www.infoworld.com/article/4039...

8 months ago 2 1 0 0
Preview
CVE-2025-5187: Nodes can delete themselves by adding an OwnerReference · Issue #133471 · kubernetes/kubernetes CVSS Rating: CVSS:3.1/AV:N/AC:L/PR:H/UI:N/S:U/C:H/I:H/A:L - Medium (6.7) A vulnerability exists in the NodeRestriction admission controller where node users can delete their corresponding node obje...
8 months ago 48 7 6 0

I'm super excited for Wassette, so I put together a quick article on why it's important and a comparison with our current options for MCP.

8 months ago 11 4 1 3
Advertisement
Preview
Introducing Wassette: WebAssembly-based tools for AI agents - Microsoft Open Source Blog Wassette is a secure, open-source Model Context Protocol server that leverages WebAssembly to provide a trusted execution environment for untrusted tools.

Introducing Wassette: WebAssembly-based tools for AI agents by Yosh Wuyts opensource.microsoft.com/blog/2025/08...

8 months ago 10 6 0 4
Post image

TIL @microsoft.com is the largest cloud contributor to #oss @cncf.io projects #MSBuild

11 months ago 24 8 2 2
Preview
Simplifying InfiniBand on AKS Learn the what, the whys, and the hows of configuring InfiniBand networking for high performance compute (HPC) workloads on AKS

Need to run really large LLMs that require the power of RDMA with InfiniBand?
Check out this blog to see how to get it working on Azure Kubernetes Service:
azure.github.io/AKS/2025/04/...

#AKS #Kubernetes #InfiniBand #AI #LLM #RDMA #Azure

1 year ago 4 3 0 0
Maintainers of the containerd project speaking at KubeCon EU

Maintainers of the containerd project speaking at KubeCon EU

Maintainers of the containerd project presenting at KubeCon EU providing an update on recent changes and releases

Maintainers of the containerd project presenting at KubeCon EU providing an update on recent changes and releases

The @containerd.dev maintainer session is underway at #KubeCon EU in London with an awesome cross-section of contributors and maintainers from both the core project and subprojects like nerdctl and runwasi.

1 year ago 17 5 1 1

👋 Betty!!!

1 year ago 1 0 0 0

I will be there on behalf of sig auth and the k8s security response committee. See you there! #kubecon

1 year ago 6 0 1 0
Preview
KubeCon + CloudNativeCon Europe 2025: Kubernetes SIG Meet & Greet Lunch and Le... View more about this event at KubeCon + CloudNativeCon Europe 2025

Day 2 of #KubeCon begins tomorrow! On Day 1, we learned from keynotes & sessions & enjoyed the #Kubecrawl party. We hope you're ready to do it all again tomorrow! Don't forget to stop by the Project Pavillion during lunch to meet the Kubernetes community!

1 year ago 9 3 0 0
Post image

Standing room only for the talk on Policy as code by @ritazh.bsky.social, Joe Betz, Andy Sunderman, and Jim Bugwadia

1 year ago 5 1 0 0
Rita Zhang speaking at KubeCon

Rita Zhang speaking at KubeCon

Rita introducing Gatekeeper

Rita introducing Gatekeeper

Chart about where Gatekeeper fills in gaps versus VAP/MAP

Chart about where Gatekeeper fills in gaps versus VAP/MAP

Gatekeeper instead of VAP: examples

Gatekeeper instead of VAP: examples

OPA Gatekeeper (open-policy-agent.github.io/gatekeeper/) as explained by @ritazh.bsky.social at #KubeCon: tooling to help enforce your Kubernetes policies.

1 year ago 5 1 0 0
Advertisement
Post image

huge fan of headlamp moving into kubernetes to improve the dashboard and devex experience for all k8s users! kudos @ahrkrak.bsky.social and team! #kubecon #cloudnativecon

1 year ago 20 11 0 0
Greg on stage with a slide “Linux runs the world”

Greg on stage with a slide “Linux runs the world”

Fantastic to have Greg K-H on the #kubecon keynote stage!

1 year ago 12 2 0 0
Post image

Making kubernetes easier to use. better UX = happier humans with @headlamp_ui #KubeCon keynote. Now part of sig UI @ahrkrak.bsky.social

1 year ago 31 5 0 0