Yeah I'm aware of that. I'm not sure what I said that made you think otherwise. I don't think it's very similar to fuzzing but the OP does
Posts by Daniel Erenrich
the faster we find vulnerabilities, the less time there is to exploit them? if you don't find them fast enough, vulnerabilities will pile up in the codebase faster than they are closed
yeah, i understand that. the claim is that this tool speeds up the process dramatically.
do you not find the linux local privilege escalation vulnerability interesting? i guess the details there are more sparse?
a lot of the tools we use already are not deterministic, right? that's what fuzzers are? randomness is also used to make a more minimal trigger
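(the randomized minimization i mean looks roughly like this toy sketch; the oracle and names are made up for illustration, a real fuzzer would re-run the target and check for the crash)

```python
import random

def still_triggers(candidate):
    # Stand-in oracle: pretend the bug needs both "A" and "B"
    # present in the input (purely illustrative).
    return "A" in candidate and "B" in candidate

def random_minimize(trigger, rounds=200, seed=0):
    """Randomly drop substrings of the trigger, keeping any
    smaller input that still reproduces the bug."""
    rng = random.Random(seed)
    best = trigger
    for _ in range(rounds):
        if len(best) <= 1:
            break
        # Pick a random slice [i, j) to delete.
        i = rng.randrange(len(best))
        j = rng.randrange(i + 1, len(best) + 1)
        candidate = best[:i] + best[j:]
        if candidate and still_triggers(candidate):
            best = candidate  # smaller input, bug still fires
    return best

minimized = random_minimize("xxAyyyBzz")
```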
if the tools were good enough wouldn't these old vulnerabilities have been found by them? a lot of resources have been spent hardening these platforms.
Aren't most security vulnerabilities logic bugs?
What would a bug bounty for this kind of exploit look like? I would've thought 20k would be in range
And yeah I do not doubt the process could be optimized
They found remotely exploitable DoS and local privilege escalation bugs. I think everyone would care about such exploits. The maintainers deemed them worth fixing. Academic and industry security researchers often point their fuzzers at major codebases in the hopes of finding stuff like that
If that's so, why did they find decades-old bugs that previous researchers couldn't find? Do you think they had way more human triagers?
At least it's an impressive fuzzer if it found many previously unfound issues?
Clearly an April fools prank
It's just nice to imagine the governments are capable of cooperating and getting things done
If a cat has nine lives does it game over after nine or ten deaths?
Isn't it just crepes
maybe wikimediafoundation.org/jobs/#sectio... i work there if you have questions
I can see you aren't interested in an actual conversation. That's fine. Have a good weekend
Yes I dropped the word "by" thanks for the correction
i'm aware of and appreciate your work. but the o1 chart you showed does not prove what you seemed to say. there are benchmarks not created by or revealed to the major labs that are also going up. i believe you said that the private benchmarks are leaked via the APIs but that doesn't explain open models
Yes. But that's with fixed techniques and architecture. o3 outperformed o1 at the same or lower cost on this and similar tasks
But model performance on AIME and similar tasks has continued to improve and the price per token has not gone exponential. I assume you attribute that to overfitting or lies? But you're ignoring the possibility of fundamental architecture improvements
wikimediafoundation.org/jobs/#sectio... ? i work there if you have questions
yeah categories are a good resource here. i was gonna read the code but the file was long so i thought i'd ask. so many people keep building things like this e.g. wikitok.vercel.app
recs are challenging when you want to preserve privacy and have a small userbase (and have a small budget).
What does the heart button do? I work at the wikimedia foundation and we're exploring stuff like this.
Your brain is being mean to you
Like you can talk. We've been waiting for Quest 65 for years
What happens if we get between them?
When are we getting a creepy centipede wiki plushie?
Crazy that they know your secret hacker alias
When talking about LLMs where do you draw the line for "large"? 200 million params? I think this question greatly changes the answer
according to @wikipedia.org www.youtube.com/watch?v=cT6y...