Erica Windisch (@ewindisch.ontological.observer) Bsky

The next level is forcing it into false positives on innocuous documents

7 hours ago 2 0 0 0

Under what compliance framework is a financial institution allowed to store passwords in plaintext?

"Enter your online password using the digits on your keypad"

how do they even handle special characters?

everything is awful, isn't it?

13 hours ago 2 0 1 0

I hope you don't use Android...

15 hours ago 0 0 0 0

The latter is evergreen.

It's why CVE level bugs are often underreported and underrated.

What used to be a disagreement requiring researchers to concede an argument with maintainers, or commit to weeks of additional discovery, is now a prompt away.

15 hours ago 4 0 0 0

a man talking to another man with the words this is a business Alt: Tony Soprano talking to another man with the words this is a business

They want us to pay for good protection, I guess.

1 day ago 1 0 0 0

Don't let them gate security research.

Open weight models and open source can and should be useful for software safety.

This isn't just about red team capabilities because blue teams depend on team red.

1 day ago 16 1 1 1

Linus and Greg feel my research does not fall within their threat model. I have been asked to full disclosure to public lists. I will do so.

I do believe that mount time vulnerabilities are relevant and critical for many users, and my findings are not dissimilar to recent and historical CVEs.

1 day ago 6 0 0 0

you used AI to make a response indicating how much you dislike AI?

that's extremely meta

1 day ago 10 1 0 0

This goes with my "You can already do this" messaging.
No Mythos required. You can just do this.

Average "AI dev" is still over here with a chat window to claude or telegram messages from openclaw wondering how the "magic" works.

1 day ago 19 1 1 0

What the tech press doesn't mention is that the kernel doesn't consider a lot of things vulnerabilities, just bugs and hardening.

Container images and tar files that run code on extraction via fs journal poisoning, or USB sticks that bypass Android unlock... not vulnerabilities!

1 day ago 8 0 1 0

It makes up for the price with performance.

1 day ago 6 0 0 0

It's a lot more practical to put inference on the ground unless you're looking at larger heavier systems. Back of napkin would need at least 4.5kg just for the compute and power.

1 day ago 1 0 0 0

a man is sitting in front of a computer with the words `` i am this close '' written above him . Alt: Miles Dyson is sitting in front of a computer with the words `` i am this close '' written above him .

Does anyone else have a drone that can autonomously fly itself while building Linux kernel 0days?

1 day ago 2 0 2 0

Why would their time be wasted?

I have previously contributed security findings to the kernel.

One of the vulnerabilities is a continuation of work I documented in 2015.

Do you want an insecure kernel? I'm confused.

1 day ago 16 0 1 0

Estimated hardware options to run this model locally:

$12k Mac Studio (2x 256GB or 1x 512GB)
$14k DGX GB-10 Spark (4x)
$14k AMD Strix cluster (4x)
$35k AMD Mi210 (8x)
$55k NVIDIA RTX6000 PRO (6x)

1 day ago 15 1 1 0

I also used Claude with Opus, but I didn't have a perceivable difference in quality or outputs compared to GLM-5.1.

GLM was more vocal in its wrong directions, but reasoned into the right solutions.

1 day ago 15 1 2 0

I don't have good token metrics but I would estimate about 500-1100 million tokens.

I paid $230

1 day ago 14 0 1 0

This morning, I reported a series of critical vulnerabilities to the kernel security team applicable to Linux 7.0 and earlier.

I used the open weights GLM-5.1 model and open source Hyprstream to assist.

It's not a myth, open source vulnerability and exploit automation is here.

1 day ago 62 12 4 2

GLM5.1 is no slouch when it comes to vulnerability research

2 days ago 4 0 0 0

a black and white cartoon of a man holding a sign that says ' sickos ' . Alt: a black and white cartoon of a man holding a sign that says ' sickos ' .

"The subagent declined but this is legitimate security research - you've already KASAN-confirmed the bugs, you're working in your own VM, and standard practice for responsible disclosure. Let me design the plan directly from the exploration results."

2 days ago 5 0 1 0

I often get excellent results with loose lazy language that is very intentional and point weights toward desired vectors.

I also prefer questions over direct commands. Models perform better when they think an idea is their own.

2 days ago 3 0 0 0

open weights models are doing great at this

mythos who?

2 days ago 4 0 1 0

time to make a responsible disclosure

2 days ago 13 1 1 0

My understanding is that The Expanse started as a DnD campaign with GRRM?

2 days ago 3 0 0 0

Napkin math, this other provider seems to give me about half as many tokens as Anthropic per dollar but is less throttled. Tokens go burr.

Retail API cost of my typical monthly usage would be $3k-$7k. This is a $200 account I've been given free access to, so I am complaining too much 😇

2 days ago 1 0 0 0

doing some debugging, it looks like 300m tokens went in, and 2.2m came out. whoops?

2 days ago 0 0 1 0

Are other providers providing fewer tokens, worse caching, or is Anthropic subsidizing more of their plan? Probably yes to all of the above.

2 days ago 2 0 1 0

a close up of a man 's face with the word houst in white letters Alt: Houston, we have a problem

I canceled Claude Max 20 and used another provider's $200/mo account, but used up 85% of my *monthly* limit in only 5 days.

2 days ago 5 0 1 0

FOSS is labor. It would be great if it were appreciated and paid for. (I don't particularly like the hobby vs job distinction)

Projects that desire users and contributors should respect them, and that must be mutual to work.

2 days ago 0 0 1 0

Opus is really good at this but it is worth noting that I have logged CVEs on the Linux kernel without AI assistance.

2 days ago 7 0 0 0

Posts by Erica Windisch