Joe Hellerstein (@joehellerstein) Bsky

Playing for Complications—and Why Systems Shouldn't In the last post I argued that coordination is commitment—a vow to avoid futures with undesirable outcomes. I also raised a question: when coordination is…

It turns out that the 'weakest' tic-tac-toe opening is actually the best one — if your opponent occasionally makes mistakes.

New post on what games can teach us about coordination cost in distributed systems: jhellerstein.github.io/blog/two-com...

2 weeks ago 5 0 0 0

Coding Agents Meet Distributed Reality AI is about to write most of the code in the world. Most of the code in the world participates in a distributed system. And distributed code is where our worst…

📄 New blog post!

AI will soon write most distributed code.
Distributed code is where our worst bugs —Heisenbugs— live.

The real lever isn’t “test more,” it’s **aim better**: AI should target frameworks where correctness contracts are explicit and checkable.

jhellerstein.github.io/blog/codegen...

2 months ago 11 2 0 0

Nebula Film Festival Help us raise money to host this years Nebula Film Festival at the Notting Hill Picture house!

Nebula Film Festival www.crowdfunder.co.uk/p/nebula-fil...

9 months ago 2 0 0 0

Let’s collaborate on democratizing insights from tabular data in Amsterdam! ✨

PhD directions: 1) fundamental techniques for tabular foundation models, 2) reliable mechanisms for AI-powered tabular data analysis.

Sharing w/ friends appreciated! ⬇️

10 months ago 3 3 2 0

CRDTs #4: Convergence, Determinism, Lower Bounds and Inflation The CRDT literature sometimes leaves room for mathematical ambiguity. Maybe because the bulk of the work tends to be targeted at systems researchers and…

The last blog post in my miniseries on CRDTs is up!

jhellerstein.github.io/blog/crdt-in...

Mix of pragmatism and formalism.

There's actually a small result in there that may be novel: Strong Eventual Consistency !=> Determinism. Curious to hear whether they've seen this result elsewhere.

10 months ago 13 2 0 0

CRDTs #3: Do Not Read! Ever used a CRDT, thought you were safe, and—boom—you bought a Ferrari you didn't mean to? It could happen to you! The truth is that CRDTs are dangerous to…

posted today!

BTW I peeked at the automerge Rust? Collaborative editing is an example where one probably *has* to resort to unsafe behavior (you're the expert there!) so I'm mostly advocating for more encapsulation/comments in that case.

jhellerstein.github.io/blog/crdt-do...

10 months ago 6 4 0 0

CRDTs #3: Do Not Read! Ever used a CRDT, thought you were safe, and—boom—you bought a Ferrari you didn't mean to? It could happen to you! The truth is that CRDTs are dangerous to…

Next blog post in the CRDT Series is up!

This one is for the developers... stay safe out there, folks.

jhellerstein.github.io/blog/crdt-do...

10 months ago 14 2 1 2

Good thread. Thoughtful as always.

10 months ago 3 0 0 0

Really early and well seen, definitely influenced me and my team! Hats off.

10 months ago 1 0 0 0

Depends what you want the “set of lists” semantics to mean. I’d think you likely want a 2P-map lattice of RGAs (2P-map would be like a 2P-set but with a lattice value associated with each unique item in adds). If you want more detail please comment in the blog so it’s easier for others to find it.

10 months ago 0 0 0 0

There are simple and helpful composites that can be written generically and reused safely. E.g. lattice pairs (free or lexical) and Map lattices. Helps to have a language with good support for generics (parameterized types).

10 months ago 0 0 1 0

CRDTs #1: Turtles All the Way Down This is the 1st post in a series of 4 detailed posts I'm doing on CRDTs. Please see the intro post for context. Modern distributed systems often seem to rest on…

(Catching up to my LI feed).

Next blog post is out! This is the first real post in a short series on CRDTs, an idea that has some currency in the distributed programming community, but one that comes with a number of sharp edges. Be careful out there!

jhellerstein.github.io/blog/crdt-tu...

10 months ago 12 3 1 1

A Run of CRDT Posts Over the next few days, I'm going to post a number of observations about CRDTs: Convergent Replicated Data Types. These are data structures that aspire to help…

Blog relaunch! Bbye wordpress, hello github.

If you're into SW dev, cloud, databases, distributed systems, automatic codegen ... or data and CS in general... check it out.

As a warmup, I'm starting with a series of posts on CRDTs. Intro post up now: jhellerstein.github.io/blog/crdt-in...

10 months ago 18 5 2 2

Wow! @arvind.bsky.social giving an awesome keynote including discussion of VegaExpress and GoFish interactive vis libraries from his group. #EPICRetreat #UCBerkeley.

1 year ago 2 0 0 0

1 year ago 0 0 0 0

Here’s a provocative example from JD Zamfirescu-Pereira on ways that humans and LLMs can get misaligned on expectations. Is the LLM lying? Is it just emitting tokens? How do people interpret this? #EPICRetreat #UCBerkeley.

1 year ago 2 0 1 0

SF Systems Meetup: Correctness and Security for Distributed Systems · Luma The SF Systems Meetup is back for the new year! This meetup, our theme is correctness and security. It's easy to write a distributed protocol, but very hard to…

The SF Systems Meetup is back! On 2/27, we're excited to have headline talks from the creator of FizzBee and a research collaborator with Signal. This is going to be a super fun night diving deep into making distributed protocols work, hope you'll join us! lu.ma/vqjf30k3

1 year ago 5 3 0 0

GPT4o shows that f(a,b) = (a+b)/2 is an example of a commutative function that is not associative.

GPT4o did better:

1 year ago 1 0 0 0

GPT4 asserts that Min and Max functions are commutative but not associative, but then checks itself and backtracks.

The question: "what are examples of commutative functions that are not associative?"

GPT4 was funny, thinking aloud and then proving itself wrong:

1 year ago 0 0 1 0

In some kind of sad watershed, today was the day as a professor when I live-ChatGPT'ed the answer to a question in a Zoom with my PhD student and his undergrad mentees.

But hey, let's paint it in a positive light: this was a demonstration of using the right tool at the right time.

1 year ago 5 0 1 0

Operationalizing Machine Learning: An Interview Study by @joehellerstein.bsky.social, @adityagp.bsky.social, et al. Particularly love the part on "Retrofitting Explanations".
#MachineLearning #MLOps #Datascience.
arxiv.org/pdf/2209.09125

1 year ago 12 4 1 2

I think “getting all of your coordination under one roof” (or behind a unified api or something) is the message I’m hearing from you. Don’t know if that helps?

1 year ago 0 0 0 0

A muddled post at best. A sequential log *is* a point of coordination. It doesn't avoid coordination as claimed, it just centralizes it in 1 service (and arguably encourages overuse). Coordination avoidance is orthogonal: discover when global ordering is not needed. Ie avoidance avoids the log!

1 year ago 8 0 2 0

Sunset in #Berkeley these days is a perfect field goal over the golden gate bridge. Shifts quite a ways north during the summer.

1 year ago 9 0 0 0

"Whats new in Excel" dialog box. The text says "Data Aggregation Functions: We've added two incredibly powerful new data aggregation functions: GROUPBY and PIVOTBY"

2025. What a time to be alive!

1 year ago 9 1 0 0

Fickle faculty followup follies

1 year ago 2 0 0 0

It’s incredibly beautiful that President Carter is our emissary on a Voyager probe. His words live on across our galaxy!

1 year ago 183 42 8 8

The culture in my community in CS has long been to share course materials openly. My lecture videos+notes are all posted public online, as are those of many of my peers. If anything there's some competition for attention.

No judgement implied, just interesting difference in community norms.

1 year ago 1 0 0 0

Flo: a Semantic Foundation for Progressive Stream Processing Streaming systems are present throughout modern applications, processing continuous data in real-time. Existing streaming languages have a variety of semantic models and guarantees that are often inco...

Thrilled to share that our paper “Flo: A Semantic Foundation for Progressive Stream Processing” (with @mpmilano.bsky.social, Alvin Cheung, and @joehellerstein.bsky.social) will appear at POPL 2025! Check out the preprint at arxiv.org/abs/2411.08274, and read on for more!

1 year ago 44 12 1 0

An egret walking in the San Francisco Bay with the sunset behind the Golden Gate Bridge

Silhouettes of people by the San Francisco Bay at sunset with the Golden Gate Bridge in the background

San Francisco Bay at sunset with the Golden Gate Bridge

Sunset over SF looked promising again today so we went down to the bay to take it in.

1 year ago 12 0 0 0

Posts by Joe Hellerstein