I like opus 4.7, but they need to halve the adderall intake
increasingly sympathetic
though maybe it has 250k-long CoTs and it's basically its own ttc harness... who knows!
idk, I have a strong prior that better base models are just better at everything in a way that benchmarking (even across a huge range of evals) always understates. also, what does this mean they can do with more ttc with the base?
seems to me the closest to a GPT-n ish level stack moar layerz vibe
I think it's actually slightly harder, eyeballing it
Ant ECI vs Epoch ECI
Sonnet 4.5: 144 vs 147
Opus 4.5: 148 vs 150
Opus 4.6: 152 vs 155
I think the 5.4 pro equivalence, if you are referring to the ECI stuff, is complicated by the fact that the version of ECI they maintain internally is very different from the public version and not directly comparable
www.newsweek.com/joe-biden-io...
dropping a falklands-shaped nuke on the discourse with "americans are the indigenous people of the moon"
4096-dimensional subspace is 0.08, but the distribution is beta with tiny variance, so there is never such a direction by chance!
I think this means that if you found this direction, it's instrumentally useful to have an "idk" direction in the residual stream, which seems plausible!
from what I can tell, this is really hard to do for dimensionality reasons! Like, if this exists, the model definitely needed it and developed it intentionally.
consider the case of d-model 4096 and a 50k vocab to be conservative, the mean squared length of the projection of the (unit-norm) constant vector onto a random
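A rough sanity check of the projection numbers in this thread, as a sketch under assumptions the posts don't fully spell out: the constant vector is taken to be unit-norm and to live in the 50k-dimensional vocab space, and the 4096-dimensional subspace is taken to be uniformly random. Under that model the squared projection length follows a Beta(k/2, (n-k)/2) distribution, so the quoted 0.08 lines up with k/n (the mean of the squared length; the length itself would be roughly its square root). The function names here are made up for illustration.

```python
import math
import random


def projection_beta_params(k, n):
    """Beta(a, b) parameters for the squared length of a unit vector in R^n
    projected onto a uniformly random k-dimensional subspace."""
    return k / 2, (n - k) / 2


def beta_mean_std(a, b):
    """Closed-form mean and standard deviation of a Beta(a, b) variable."""
    mean = a / (a + b)
    var = a * b / ((a + b) ** 2 * (a + b + 1))
    return mean, math.sqrt(var)


def sample_squared_projection(k, n, rng):
    """Draw one squared projection length.

    By rotational symmetry this equals the share of a Gaussian vector's
    squared norm that falls in its first k coordinates. Each share is a
    chi-square variable, i.e. Gamma(shape=dof/2, scale=2).
    """
    inside = rng.gammavariate(k / 2, 2.0)
    outside = rng.gammavariate((n - k) / 2, 2.0)
    return inside / (inside + outside)


D_MODEL = 4096   # subspace dimension from the post
VOCAB = 50_000   # ambient dimension, assuming "50k vocab" means exactly 50,000

a, b = projection_beta_params(D_MODEL, VOCAB)
mean, std = beta_mean_std(a, b)

rng = random.Random(0)
samples = [sample_squared_projection(D_MODEL, VOCAB, rng) for _ in range(2000)]

print(f"mean squared projection: {mean:.5f}")
print(f"std of squared projection: {std:.5f}")
print(f"sampled range: {min(samples):.5f} to {max(samples):.5f}")
```

The sampled values sit in a narrow band around k/n, which is the "tiny variance" point: under this random-subspace model, a projection anywhere near 1 essentially never shows up by chance.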
tired: OpenAI has lost the Mandate of Heaven
wired: OpenAI has gained the gates of hell
this fucking piece of shit
i come bearing deleted roon tweets
"adam tooze comes out as a treatler"
You bolt awake in the shores of Little St James. You are not online. It is 1997 AD. You are the Treasury Secretary Larry Summers, and you have changed your mind. The future cannot come to pass. The financial industry must not be deregulated
that scene from challengers but the boys heads are replaced with the claude logo
Anthropic CEO being the only one to not put out a Manchurian candidate style post about how David Sacks is the best most kind and public spirited person ever furthering my impression of them as the least bad of those companies
the xkcd handgun petri dish comic except it's about how while base models outperform RL'ed models on multiple choice benchmarks at high pass@N, so does a uniform distribution
basically someone saw this coming in like 2005 but unfortunately he started a religion about it that mostly serves to make worrying about this seem silly. also it produced marketing and seed funding for the companies that are doing it
"enriched" forward pass
makes sense, models can totally smuggle information in the kv cache across token indices but if we suppose some intermediate computation is completely independent from the token we emit then this info can't participate in any of the more complex stuff a la arxiv.org/abs/2402.12875 - it's just an
If I am an expert in layer 16 of 32 of a vanilla transformer and realize that my job at some token is to compute some sum so that it can be used down the line, I can do that, then any attention head in layer 16 can deposit that info to any future token without any intermediate unembeddings right?
I bet heath ceramics would do it www.youtube.com/watch?v=v678...
the meta ai video slop TikTok is I think the first tech thing where I have been 100% on the side of the Luddites (vernacular usage, I know the actual Luddites were more complex than that, nerds). Usually I think there's a little too much of that reflexively on the left tbh, but no this shit sucks.
it is obscene to treat the unfortunate but (as he himself pointed out) routine murder of this Nazi shithead who believed that Jewish gelt was corrupting the blood of white Americans like a national tragedy
Interesting, what about muon/shampoo or other spectrum-y ones?
modern ai is basically a bunch of rogue google employees taking google projects that were done pretty cautiously and making them less cautious
Some days the Iliad being about Helen just becomes a lot more believable.
sorry i just got back from dropping off my wife with Bari Weiss Da Strap God
bar crawl on Ceres with Amos Expanse