1. Obviously it's terrible to have a Molotov thrown at your house; that's not an appropriate response.
2. Of all analogies to make, "ring of power" is a choice, given the story's theme that the only way to stop the ring's destructive power is to destroy it.
blog.samaltman.com/2279512
Posts by Alex Irpan
I did #enigmarch this year and am revealing all the puzzles in a one day event this Saturday! If you're free, come check it out
enigmarch.alexirpan.com
There is now another amicus brief, filed by a number of former high-ranking military officials (up to Admiral level), arguing that these actions hurt the military's adherence to the rule of law.
storage.courtlistener.com/recap/gov.us...
I didn't know where this post was going when I started and I'm not sure where it went now that it ended, but that felt correct in some way.
www.alexirpan.com/2025/11/16/a...
First paper since switching into AI safety team🎉
We looked at problems that could be solved if the model behaved consistently over a set of prompts, and tried training for that consistency in output space and in internal activations. Both were effective. See thread or paper for details.
For the past month I have been working on a blog post about niche MLP fandom drama. Well here it is.
www.alexirpan.com/2025/07/21/b...
"I don't play gacha games because they're a scam"
vs
"Let me do one more hyperparam sweep before giving up. One more prompt tuning run. I swear we'll beat baseline. I know it's gonna beat the baseline this time. It's gonna win. This time for sure."
I am now back from #MITMysteryHunt with no memory of anything besides Hunt from MLK weekend. Really this is probably for the best.
The ship has sailed, but I wish the ML reporting default was % incorrect rather than % correct. It better matches loss curves and magnifies the capture of edge cases.
95% accuracy -> 97.5% accuracy = meh
5% error -> 2.5% error = omg we've halved the error rate
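A quick sketch of the arithmetic behind that framing, using the same 95% -> 97.5% numbers. The function names are made up for illustration and are not from any library.

```python
def relative_accuracy_gain(acc_before: float, acc_after: float) -> float:
    """Relative improvement when the change is reported as % correct."""
    return (acc_after - acc_before) / acc_before


def relative_error_reduction(acc_before: float, acc_after: float) -> float:
    """Relative improvement when the same change is reported as % incorrect."""
    err_before = 1.0 - acc_before
    err_after = 1.0 - acc_after
    if err_before <= 0:
        raise ValueError("accuracy is already 100%, there is no error left to reduce")
    return (err_before - err_after) / err_before


if __name__ == "__main__":
    before, after = 0.95, 0.975
    # Same underlying change, two very different headline numbers.
    print(f"accuracy gain:   {relative_accuracy_gain(before, after):.1%}")
    print(f"error reduction: {relative_error_reduction(before, after):.1%}")
```

The accuracy view shows a gain of under 3%, while the error view shows the error rate cut roughly in half.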
The question of "how's o1 using its test compute" is better asked to someone who worked on it, since AFAIK that hasn't been disclosed. But yes, language models having really dynamic / freeform actions makes them hard to think about.