Advertisement · 728 × 90

Posts by Simon Lermen

Interesting article by the nytimes featuring our research on anonymity.

1 month ago 0 0 0 0
Post image

Thanks for featuring our research.

1 month ago 0 0 0 0
Preview
Anthropic Isn’t Exaggerating About an AI Panopticon In the debate about the military’s use of artificial intelligence, prompted by Anthropic’s dispute with the Pentagon that’s now headed to the courts, much has been said about the concerns related to a...

What's the endstate of AI making mass surveillance cheap and scalable? "The study [..] speaks to one important aspect of the shifting paradigm of mass surveillance. (The research was led by Simon Lermen of MATS Research and Daniel Paleka from ETH Zurich.)"
www.bloomberg.com/opinion/arti...

1 month ago 0 0 0 0

What a video. Crazy that it's like one 84 year old politician who is taking this stuff seriously.

1 month ago 1 0 0 0
Post image

“I didn’t write that”
“Yes you did”

Research by @simonlermen.bsky.social et al shows LLMs can deanonymize pseudonymous users of online platforms using unstructured content (eg link pseudonymous Hacker News posts with LinkedIn profiles or interview transcripts):

buff.ly/bAdgQpx

1 month ago 1 1 0 0
Preview
Large-Scale Online Deanonymization with LLMs We measure the capabilities of LLMs to deanonymize users online.

I am one of the authors. Also check out my blogpost: simonlermen.substack.com/p/large-scal...

1 month ago 4 0 0 0

Happy to share my matsprogram.org project that I have been working on in the last couple of months. We explore how LLMs can be used for large-scale deanonymization online.

2 months ago 4 0 0 0

Our paper on AI-powered spear phishing, co-authored with @fredheiding.bsky.social , has been accepted at the ICML 2025 Workshop on Reliable and Responsible Foundation Models!
openreview.net/pdf?id=f0uFp...

9 months ago 1 1 0 0
Advertisement

Do you think there is any comparable thing in China to AI Twitter or Bluesky? Where people discuss ideas

1 year ago 1 0 1 0

Are you working at DeepSeek?

1 year ago 1 0 0 0

Why so mean old man

1 year ago 0 0 1 0
Post image

Grok's DeepSearch was launched with Zero safety features, you can ask it about assasslnations, dru*gs. This has been online for a few days now with no changes.

1 year ago 2 0 0 0

I’m mostly interested in not dying

1 year ago 2 0 0 0

If you are trying to understand its reasoning, it seems like a necessary step to have legible chain-of-thought.

1 year ago 2 0 1 0
Preview
OpenAI’s Economic Blueprint The Blueprint outlines policy proposals for how the US can maximize AI’s benefits, bolster national security, and drive economic growth

you should be carefully here, huge datacenters with their own powerstructures are being discussed, huge new semiconductor facilities. situation might change
openai.com/global-affai...

1 year ago 0 0 1 0
Advertisement

To be fair, the pre-training and all those mega datacenters do have some significant environmental impact. buying products from AI labs does fund this. But agree that individual energy use per reply is like the weakest argument against AI.

1 year ago 1 0 1 0
Preview
Human study on AI spear phishing campaigns — LessWrong TL;DR: We ran a human subject study on whether language models can successfully spear-phish people. We use AI agents built from GPT-4o and Claude 3.5…

I published a human study with @fredheiding.bsky.social
We use AI agents built from GPT-4o and Claude 3.5 Sonnet to search the web for available information on a target and use this for highly personalized phishing messages. achieved click-through rates above 50%
www.lesswrong.com/posts/GCHyDK...

1 year ago 4 1 0 0

Has anyone ever tried with constitutional AI to add something on: always show your entire reasoning? What happens if you ask the model if it left out steps in its reasoning? can it verbalize them?

1 year ago 0 0 1 0

They achieve this in part by immediately releasing models after training such as o3, other companies wait for safety and security evaluations and estimates of societal impact. They also used to wait with releases such as with GPT-4

1 year ago 1 0 1 0

sometimes fancy terms just serve to confuse people

1 year ago 3 0 0 0

They have already made billions in revenue, but defining it as profits makes it almost impossible to reach

1 year ago 0 1 0 0

crazy that they use profits instead of revenue. so they can always just hack this by spending a bit more on R&D

1 year ago 0 0 1 0

my guess is he thinks of some sort of conscious experience of wanting here...

1 year ago 1 0 0 0

its behavior is at if it wants to win, same will be true about powerful AI agents. whether it actually wants something in a way that satisfies you doesn't matter

1 year ago 0 0 0 0

So RL-training the model to achieve some goal such as with constitutional AI can't lead to the model having a goal? do you think AlphaZero wants to win at chess?

1 year ago 0 0 1 0
Advertisement

💯

1 year ago 1 0 0 0

Well, we observe computation in superposition

1 year ago 0 0 1 0

I agree that it doesn't PROVE multiverses. But I don't like the sneering tone, what is superposition? It sure seems like the electron is in many places at once, all interpretations of that seem a bit crazy. Everett's manyworlds is a common position among physicists, including some i know.

1 year ago 0 0 2 0

The many worlds interpretation is a commonly held view by many physicists. And it is not like other interpretations are less "weird".

1 year ago 0 0 0 0

The many worlds interpretation is a commonly held view by many physicists. And it is not like other interpretations are less "weird"

1 year ago 0 0 2 0