Posts by Ilia Breitburg

Post image

LOL

2 days ago 0 0 0 0
Video

Going to help with making the distribution more even with my upcoming Pebble contest submission

6 days ago 2 0 1 0
Video

what if you could navigate unicode characters by visual similarity?

1 week ago 1034 121 45 8
Video

AI coworker UI idea: badges!

2 weeks ago 0 0 0 0

@ericmigi.com any timelines on exposing the speaker APIs for Pebble 2 Duo?

3 weeks ago 0 0 0 0

claude may occasionally transfer funds...

3 weeks ago 67 3 9 0
Post image
4 weeks ago 0 0 0 0
Post image
4 weeks ago 0 0 0 0
Post image

Hello Graphite

1 month ago 0 0 0 0

nice digicam you got there

1 month ago 2 1 0 0
Post image

The default hostname is "...'s MacBook"

1 month ago 0 0 0 0
Post image

It appears that the new MacBook Neo was supposed to be named just MacBook

1 month ago 1 0 1 0
Post image

How about a new RSS client?

1 month ago 0 0 0 0
Post image

AI models are actively avoiding Taylor Swift: evals.breitburg.com/swiftie-bench/

1 month ago 0 0 0 0

Opus 4.6 stands at 22%, GPT-5.3 Codex at 14%, and GPT-5.4 is at 0%. Can't wait to see Anthropic’s efforts to counter this as well!

1 month ago 0 0 0 0
Code Comments Slop — Ilia Breitburg's Evals Handcrafted evals for AI models.

Introducing the "Code Comments Slop" bench, which measures how often LLMs put sloppy section banners like these in code comments:

# ============================================
# CONFIG
# ============================================

evals.breitburg.com/code-comment...

1 month ago 0 0 1 0
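
A rough sketch of how a check like this could be scored, under my own assumptions: the post does not say how the bench detects banners, so the regex, the 8-character threshold, and the names `count_banner_lines` and `slop_rate` are hypothetical and not taken from the bench itself.

```python
import re

# Assumption: a "banner" line is a comment that contains nothing but a run
# of one decoration character, e.g. "# ==========" or "// ----------".
# The minimum run length (8) is an arbitrary illustrative threshold.
BANNER_RE = re.compile(r"^\s*(#|//)\s*([=\-*#~_])\2{7,}\s*$")


def count_banner_lines(source: str) -> int:
    """Count comment lines that consist only of a decoration run."""
    return sum(1 for line in source.splitlines() if BANNER_RE.match(line))


def slop_rate(samples: list[str]) -> float:
    """Fraction of code samples that contain at least one banner line."""
    if not samples:
        return 0.0
    flagged = sum(1 for sample in samples if count_banner_lines(sample) > 0)
    return flagged / len(samples)
```

Scoring per sample rather than per line keeps one very long, banner-heavy output from dominating the rate; a real bench may well weigh things differently.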

Any interesting observations? I’ve been a Claude fanboy for more than a year now, is trying Codex worth the time?

1 month ago 1 0 1 0

Why 5.2 instead of 5.3?

1 month ago 1 0 1 0

I don’t get why Anthropic hasn’t used Claude Code to rewrite their bloated Electron app as native. Should be much easier than a C compiler.

1 month ago 107 13 8 1

When you write a reward function for RL, you essentially write your wish to a genie in Python

1 month ago 0 0 0 0
Preview
Invisible Details of Interaction Design: What makes great interactions feel right?

Taking time to read pinned articles (sometimes pinned for a while). I appreciated this one about interaction design by Rauno Freiberg.

rauno.me/craft/intera...

2 months ago 9 2 1 1
Post image

Need to frame that

2 months ago 0 0 0 0
Post image
2 months ago 2 0 0 0

Yooo! Welcome. First time here?

2 months ago 1 0 1 0

I've been working on a custom IMAP/SMTP server that acts as a Telegram proxy for any email client. Your Telegram messages arrive as emails, and you reply to send messages back. Really fun stuff. Hope to finish and open-source it soon

2 months ago 0 0 0 0
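
A minimal sketch of the mapping described above, using only Python's standard `email` package. Everything here is an assumption for illustration: the post shares no code, so the `chat-<id>@telegram.invalid` address scheme and the helpers `telegram_to_email` and `reply_chat_id` are hypothetical, and the actual IMAP/SMTP serving and Telegram API calls are left out.

```python
import re
from email.message import EmailMessage
from email.utils import formataddr, parseaddr

# Hypothetical address scheme: each Telegram chat gets its own mailbox
# address, so a reply's "To" header says which chat to send it to.
CHAT_ADDR_RE = re.compile(r"^chat-(-?\d+)@telegram\.invalid$")


def telegram_to_email(sender: str, chat_id: int, text: str, recipient: str) -> EmailMessage:
    """Wrap one incoming Telegram message as an email message."""
    msg = EmailMessage()
    msg["From"] = formataddr((sender, f"chat-{chat_id}@telegram.invalid"))
    msg["To"] = recipient
    msg["Subject"] = f"Telegram: {sender}"
    msg.set_content(text)
    return msg


def reply_chat_id(reply: EmailMessage) -> int:
    """Recover the target chat id from the "To" header of a reply."""
    _, addr = parseaddr(str(reply["To"]))
    match = CHAT_ADDR_RE.match(addr)
    if match is None:
        raise ValueError(f"not a chat address: {addr!r}")
    return int(match.group(1))
```

Encoding the chat id in the address means an ordinary mail client needs no special support: pressing reply is enough to route the message back.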

Would you prefer phone calls and emails to instant messaging for communicating with friends? I think something about mail was lost with IM. The friction of being unable to edit or delete what you've written makes you more present and more thoughtful when composing

2 months ago 0 0 1 0

Game engines are O(n) in scene complexity. Diffusion models are O(1), so the cost is the same whether you’re rendering an empty room or a million polygons. What if you made the engine differentiable and optimized the diffusion model against it directly, rather than against sampled frames? Has anyone tried it?

3 months ago 1 0 0 0
Preview
The Math of Why You Can't Focus at Work: Interruptions, recovery time, and task size: three numbers that determine if you'll get real work done. Interactive visualizations show the math behind bad days.

I recommend reading The Math of Why You Can't Focus at Work.

Excellent visualization of how we lose valuable focus time to interruptions and what we can do about it.

justoffbyone.com/posts/math-o...

3 months ago 5 1 0 0
Post image

The perfect Claude Code machine acquired

3 months ago 0 0 0 0

Oh hey I’m on TV

3 months ago 1 0 0 0