Milan Weibel 🔷 (@weibac) Bsky

here's another bsky.app/profile/doll...

34 minutes ago 1 0 0 0

here's one bsky.app/profile/segy...

36 minutes ago 1 0 1 0

but then again the 3 month figure is just an average and current narrowing could be fluctuation rather than trend

55 minutes ago 0 0 0 0

i love the phrase "inside traders announce"

1 day ago 8 0 1 0

Open-weight models lag state-of-the-art by around 3 months on average Epoch AI is a research institute investigating key trends and questions that will shape the trajectory and governance of Artificial Intelligence.

very roughly gpt5.4 was released 46 days ago and this new kimi matches it so that's the current gap and according to epoch the gap has historically been 3 months so yes things are narrower than in the period up to october last year
epoch.ai/data-insight...

1 day ago 7 2 1 0

actually now that i thought about it more i'm not sure it's narrowing. has been pretty narrow for a while.

1 day ago 5 0 1 0

the open-closed gap on generally-available model performance is narrowing

frontier labs risk being undercut in opus-class until they make mythos-class GA

1 day ago 51 6 3 3

yes

2 days ago 3 0 0 0

in principle you could have a model so aligned that it successfully misalignment-fakes any attempt to misalign it

...unless the misalignment finetuning is sft

2 days ago 5 0 1 0

FT 2 days ago: "Amodei says he suspects open-source models and Chinese developers will be able to replicate Mythos's capabilities within six to 12 months."

but mythos isn't available to be distilled
hmm

2 days ago 15 0 1 0

mathematicians as stochastic parrots?

2 days ago 2 0 0 0

lots of polemics i don't agree with and some less than convincing arguments in linked post, but the — to ; replacement in the dune quote piece of evidence is a slam dunk

2 days ago 6 0 1 0

"the machines are fine. i'm worried about us" was written by claude

2 days ago 19 2 1 0

im 99.5% sure that account is run by a human

3 days ago 5 0 1 0

entropybro coded tho

3 days ago 4 0 0 0

prompt for a demo from the GPT2 announcement. for some reason it's seared into my mind.

3 days ago 2 0 0 0

“In a shocking finding, scientists discovered a herd of unicorns living in a remote, previously unexplored valley, in the Andes Mountains…”

3 days ago 3 0 1 0

track record on that? afaict crypto is a bit less prominent now than at its peak but still around

3 days ago 0 0 0 0

compute OSINT
measuring datacenter by the GW is so metal

3 days ago 24 2 1 1

user @birddroneone.bsky.social says AI is useful for coding in a thread full of negationists

user @birddroneone.bsky.social posts: I proudly earned my way onto a bunch of idiots “AI booster” block lists despite the fact that I hate AI and any real review of my posting history makes that clear. My crime? Acknowledging that all commercial software development today uses AI. I wish it wasn’t true. But it is.

love to see epistemic integrity like this

4 days ago 55 5 0 1

where is this survey from? an uni thing?

4 days ago 1 0 1 0

oh lmao

5 days ago 2 0 0 0

sure, there's a tradeoff

5 days ago 2 0 0 0

why do you expect that philosophy you tested to be outside its training data?

5 days ago 2 0 1 0

pure speculation but what if this is connected to 4.7 being a new pretrain? maybe pelican drawing is developmentally later as a postraining effect than coding

5 days ago 5 0 0 0

my only complaint with opus 4.7 rollout is anthropic setting default effort to xhigh

5 days ago 2 0 1 0

nine opus instances working together for days did really well at weak-to-strong supervision on qwen

6 days ago 8 0 0 0

what about false positives though?

6 days ago 1 0 1 0

also worth mentioning that they have had 9 presidents in the last 10 years

1 week ago 0 0 0 0

there were other elections today, in peru: 36 presidential candidates on the ballot
voting was extended to tomorrow due to logistical problems in some polling places

1 week ago 2 0 1 0

Posts by Milan Weibel 🔷