Posts by brendan chambers

3 days ago 1 0 0 0

& those blocks of color really lighten the massing and fade it to an unimposing scale

3 days ago 0 0 1 0
Park Circle became Park Oracle. Patterson Park became Pattereon Park. Alan Wright Park became something that cannot be pronounced by humans.

tried Photoshop's new upscaler for a map that's getting printed and... it renamed all our streets and parks

5 days ago 310 58 26 7

Including link to another recent fast-slow optimizer manuscript arxiv.org/pdf/2510.15830

1 week ago 0 0 0 0

Really interesting read. There was some nice discussion on X here: x.com/rosinality/s...

1 week ago 0 0 1 0

this motivates a strong fast-slow optimizer framed as Nexus Regularization

1 week ago 0 0 1 0

Nexus: Same Pretraining Loss, Better Downstream Generalization
arxiv.org/pdf/2604.09258

From a theoretical perspective, the authors emphasize that how close the learned minima are across domains matters for generalization performance, and that bringing them closer reduces interference without changing the data mixture

1 week ago 0 0 1 0

I can't go back to the regular YouTube UI after this 😅

Obsidian Reader now makes the transcript interactive so you can scrub, highlight, auto-scroll. It feels so nice.

1 week ago 251 27 13 7

claude mythos is far more token-efficient than opus. continuing the trend

2 weeks ago 147 9 6 0
A carpet of pink and purple bluebells, with some bare trees dotted around and some straggly bits of holly and ivy. Some low sunlight is coming through the trees making some of the bluebells appear pink, not purple.

An hour later than the previous shot and the light had intensified.

#bluebells #woodlandphotography #springflowers

2 weeks ago 476 66 18 5

Google releases Gemma 4. ✨

Gemma 4 introduces 4 models: E2B, E4B, 26B-A4B, 31B.
The multimodal reasoning models are under Apache 2.0.

Run E2B and E4B on ~6GB RAM, and on phones.
Run 26B-A4B and 31B on ~18GB.

GGUFs: huggingface.co/collections/...
Guide: unsloth.ai/docs/models/...

2 weeks ago 60 11 1 1

I tried out the Armenian thing with Claude and I am shocked at the level of self observation it's capable of here. I've never ever seen a model see itself bug out in some way and then notice it and attribute it to the tokenizer (possibly correct, or just very plausible) like this before

3 weeks ago 85 9 4 0

PrismML releases a 1-bit LLM (open-weight): an 8B LLM that fits in 1.15GB of VRAM

Website: prismml.com
Blog: prismml.com/news/bonsai-8b
HuggingFace: huggingface.co/collections/...

3 weeks ago 44 7 2 2
3 weeks ago 109 14 6 1

gonna slap a 'i used uv before they got bought by sama' sticker on my laptop

1 month ago 127 11 7 0
Why NVIDIA builds their own open models | Nemotron w/ Bryan Catanzaro YouTube video by Interconnects AI

For people who are just learning about Nemotron with the awesome Nemotron 3 Super drop, I recommend watching this interview I did with one of the leads, Bryan Catanzaro -- Nemotron as a project has been a LONG time coming.
www.youtube.com/watch?v=Y3Vb...

1 month ago 36 3 1 2

But in the backward pass, the story is much worse. Gradients get compressed via projection onto a D-dimensional subspace, and most of the training signal simply vanishes.

1 month ago 10 1 1 0

Common Corpus just breaking 1M downloads: it took some time but open data in ai is actually popular.

1 month ago 59 5 3 1

A little offended Grammarly didn't make a sloppelganger of me

1 month ago 1666 210 33 97
GitHub - karpathy/autoresearch: AI agents running research on single-GPU nanochat training automatically

“autoresearch” micro teaching repo from Karpathy

readme edits seem like such a nice dx for open ended hparam tuning, and maybe other kinds of hill climbing too, so much less painful than the old days

github.com/karpathy/aut...

1 month ago 0 0 0 0

It was all about spying on Americans: www.theatlantic.com/technology/2...

1 month ago 49 10 2 0

FlashSampling: Fast and Memory-Efficient Exact Sampling

Paper: flashsampling.github.io/FlashSamplin...

1 month ago 15 2 0 0

We analyzed 250K+ queries & 430K+ clickstream interactions from Asta, our AI-powered research assistant—and today we're releasing the full dataset. How do researchers actually use AI science tools? Here's what we found. 🧵

1 month ago 23 6 1 1
1 month ago 236 40 3 2
Permissioned Data Diary 2: Buckets The second in a series of posts building up a solution to permissioned data on atproto. We introduce buckets: a new protocol primitive for creating a shared social context.

new blog post on permissioned data in atproto! this one introduces "buckets", the protocol-level primitive for shared access control. I walk through two approaches that don't quite work and land on something that I think does

let me know your thoughts!

1 month ago 287 56 17 21

tldr iiuc we are once again enclosing the commons and industrializing craft, dispossessing laborers while apotheosizing capital, and to slow down this doomloop we need to innovate new collectives and public goods

1 month ago 1 0 1 0
The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models Decoder-only language models have the ability to dynamically switch between various computational tasks based on input prompts. Despite many successful applications of prompting, there is very limited...

This has a very cool result on in-context learned classification tasks, where they disentangle representational quality (how well-separated concept labels are) and readout alignment (how good it is at reading out its own inner labels). Adding demo examples helps through readout, not representations!

1 month ago 36 5 1 0

Designing around the tight bottleneck on latency and throughput that separates local and cloud compute is such an interesting problem. Significant challenges though

2 months ago 1 0 0 0
Anti-homeless benches in Pokemon Legends ZA

why is there anti-homeless architecture in pokemon

2 months ago 2462 340 58 27
Data Centers Ditching the Power Grid, Mark Carney's Viral Speech, and Some Joy Here are some trends I'm following

A year ago, data center developers were focused on connecting to the grid. Today roughly 1/3 of all planned capacity is onsite power - and 72% of that planned capacity is fossil gas. Homer City PA's data center project could soon be one of the largest single sources of carbon emissions in the US.

2 months ago 69 42 5 9