Leon Derczynski (@leonderczynski) Bsky

Agent "security research" (ie attacking) is too easy these days. The one universal line about LLM security is to never trust model output. But agents have people blindly executing it. Security research there is like shooting fish in a barrel

10 hours ago 1 0 0 0

'Ketamine Queen' sentenced to 15 years in Matthew Perry's overdose death Jasveen Sangha was found guilty of selling drugs that killed Friends actor Matthew Perry who had struggled with addiction for years.

why are the british always villains. ridiculous www.bbc.com/news/article...

1 week ago 1 0 0 0

The core issues---studying how to build reliable language technology, how to use computers to been understand how language works, how to evaluate language technology, and how to reason about how language technology sits in its social context---all remain.

>>

1 month ago 73 8 1 0

Reminds me, I should back up my dropbox

4 months ago 1 0 0 0

huh, Harry Kim getting beaten up by Lyta Alexander was not on my background TV bingo card for today, but here we are

5 months ago 2 0 0 0

surprised (and grateful) I somehow still remembered on the first try the arcane incantation for quitting a telnet session. vi has nothing on this imo. muscle memory is weird

5 months ago 2 0 0 0

getting on arXiv isn't "being published"

5 months ago 0 0 1 0

come in, it definitely won't, arXiv is still the open door/no bar venue

5 months ago 1 0 1 0

😂 arXiv is cute when it pretends to have standards!

5 months ago 4 0 0 0

what thing

8 months ago 2 0 1 0

Come to LLMSEC at ACL & hear Niloofar's keynote

"What does it mean for agentic AI to preserve privacy?" - Niloofar Mireshghallah, Meta/CMU

(Friday 1st Aug, 11.00; Austria Center Vienna Hall B)

See you there!

#acl2025 #acl2025nlp

8 months ago 12 2 1 0

the "oyster tower" bit was great. brutal

8 months ago 1 0 1 0

Or he's pushing a product into which x sunk significant capex?

9 months ago 0 0 0 0

have the courage to use your own intelligence

logging on

9 months ago 150 29 1 1

Brazen of them. Sounds extremely awkward for you, I'm sorry. What would they have done with unaltered slides? Cancelled and left a gap in their schedule for no doubt paying participants?

9 months ago 4 0 0 0

Release v0.12.0 · NVIDIA/garak What's Changed New plugins Add audio NIM model and audio probes by @erickgalinkin in #1163 Leakreplay refactor by @dchiitmalla in #1264 probes: refactor fact snippet mixin by @leondz in #1187 New...

new garak, llm vuln scanner rls (v0.12.0)

* Audio attacks, for multimodal models
* More training data membership inference attacks
* Multilingual attacks can now also use GCP
* Detailed eval summary in one JSONL row/object

+more :)

details: github.com/NVIDIA/garak...

9 months ago 1 0 0 0

the dying but clinging on battery in the bathroom's Frozen-branded soap dispenser reminds me that it's only 4-5 months til Bublé & Let It Go season. aren't you looking forward

9 months ago 2 0 0 0

why do academics send and expect so much weekend email and work. not healthy

9 months ago 1 0 0 0

It's been 2.5 years but ANY SECOND NOW, right?

10 months ago 0 0 0 0

data indicates students don't like using it, sorry

10 months ago 0 0 1 0

computer scientists encountering the concept of "desirable difficulty"

10 months ago 0 0 0 0

remembering the time i checked in to my reasonably classy russian business hotel late with my wife, and the staff said "sir, this... girl.. not allowed"

she's a serious professor

we went through to the room, opened the balcony door, and buried a bottle of champagne in the metre of snow

good times

10 months ago 1 0 0 0

Login • Instagram Welcome back to Instagram. Sign in to check out what your friends, family & interests have been capturing & sharing around the world.

@jjvincent.bsky.social woah ur really famous! love this attack also. I automate and run it for a living

www.instagram.com/reel/DKz9ezj...

10 months ago 2 0 0 0

Michael... OK...

10 months ago 0 0 0 0

Great to see our work uncovering dangerous issues in commercial LLM "therapists" getting some coverage: futurism.com/stanford-the...

10 months ago 2 1 0 0

I have not updated since Christmas, I see. Guess I'd better put on some summer Bublé

10 months ago 1 0 0 0

The Internet Used to Be a Place YouTube video by Sarah Davis Baker

www.youtube.com/watch?v=oYlc...

10 months ago 2 0 1 0

"natwirkung"

"wirk smorter nat horder"

accents dreamed up by the utterly deranged

(what is going on with that 🇺🇸 vowel sheft)

10 months ago 0 0 0 0

what is this photoshoot
delete this omg

10 months ago 1 0 0 0

i need you to understand that "alternate uses" is a terrible test/definition of creativity and has been for some time. it's extremely narrow, very shallow, and misses almost everything we know about creativity

10 months ago 0 0 0 0

Posts by Leon Derczynski