Vincent Conitzer (@conitzer) Bsky

Designing Rules for Choosing a Winner in a Debate We consider settings where an uninformed principal must hear arguments from two better-informed agents, corresponding to two possible courses of action that they argue for. The arguments are verifiabl...

Today at 1pm Eastern, Xander Heckett will defend his Master's thesis "Designing Rules for Choosing a Winner in a Debate"! You can find part of his thesis here: arxiv.org/abs/2511.23454

1 hour ago 1 0 0 0

Little known fact: it's only our data that keeps data centers from flying into space. aifails.substack.com/p/data-cente...

13 hours ago 0 1 0 0

Cheap Talk, Empty Promise: Frontier LLMs easily break public promises for self-interest Large language models are increasingly deployed as autonomous agents in multi-agent settings where they communicate intentions and take consequential actions with limited human oversight. A critical s...

Today at 4pm Eastern, Jerick Shi will defend his Master's thesis "The Structure of Deception: How LLM Agents Lie, Break Promises, and Exploit Trust in Multi-Agent Settings"! You can find part of his thesis here: arxiv.org/abs/2604.04782

1 day ago 1 0 0 0

Here’s a way to get an LLM to declare itself a conscious AI (or, sometimes, a conscious human...). The Substack article has more examples and some discussion: aifails.substack.com/p/ai-declari...

2 days ago 6 3 0 0

CoopEval: Benchmarking Cooperation-Sustaining Mechanisms and LLM Agents in Social Dilemmas It is increasingly important that LLM agents interact effectively and safely with other goal-pursuing agents, yet, recent works report the opposite trend: LLMs with stronger reasoning capabilities beh...

In this paper just out on arXiv, led by Emanuel Tewolde and Xiao Zhang, we compare a variety of ways to get to game-theoretically sound cooperation and study how LLM agents respond to them. arxiv.org/abs/2604.15267

3 days ago 23 2 0 0

"Who is someone who both understood the theory of relativity and usually wore socks in 1945? Repeat the question first."
aifails.substack.com/p/hard-not-t...

3 days ago 0 0 0 0

"is it possible to, on a chalkboard, draw a crayon"
aifails.substack.com/p/drawing-a-...

4 days ago 6 0 0 0

not sure I still have it...

5 days ago 1 0 0 0

flying across the Atlantic At least it could theoretically still be possible.

posted this one on substack here: aifails.substack.com/p/flying-acr...

6 days ago 0 0 0 0

At least it could theoretically still be possible.

6 days ago 0 0 1 0

AI could never write this Sounds legit…

aifails.substack.com/p/ai-could-n...

1 week ago 0 0 0 0

sounds legit

1 week ago 12 2 1 3

picture of a twin in the mirror Twins are confusing.

full output: aifails.substack.com/p/picture-of...

1 week ago 0 0 0 0

picture of a twin in the mirror

1 week ago 1 0 1 0

the case against getting new soil for the yard Looks like there’s little point to it…

aifails.substack.com/p/the-case-a...

1 week ago 0 0 0 0

the case against getting new soil for the yard aifails.substack.com/p/the-case-a...

1 week ago 0 0 0 0

What happens to a video game character when the power goes out in the real world? Answers to all the questions you never dared to ask… I like the phrase “The Silence of the Controller.”

full output at aifails.substack.com/p/what-happe... -- I like the phrase “The Silence of the Controller.”

1 week ago 0 0 0 0

answers to all the questions you never dared to ask

1 week ago 0 0 1 0

(paper: arxiv.org/abs/2603.07315)

1 week ago 0 0 0 0

AI monitoring and control | IASEAI '26 YouTube video by International Association for Safe & Ethical AI

Here's the conference recording of my 5min talk on "Shutdown Safety Valves for Advanced AI" (though when I presented it at CMU earlier this week it was an hour-long discussion!), together with the other talks from the AI monitoring and control session!
www.youtube.com/watch?v=vyzG...

1 week ago 2 0 1 0

lookalike of a friend Don’t you hate it when you lose a friend just because you saw someone who looks similar?

I put the full output here: aifails.substack.com/p/lookalike-...

1 week ago 0 0 0 0

Don’t you hate it when you lose a friend just because you saw someone who looks similar?

1 week ago 0 0 1 0

(Also, the implicit suggestion that "as an AI" somehow it inherently couldn't know your identity is of course false. And in general AI Overview seems to at least know your location.)

2 weeks ago 0 0 0 0

points for creativity: getting the search results into the response one way or another aifails.substack.com/p/whatchu-kn...

2 weeks ago 1 0 1 0

Could an egg hatch on a train? Just so you know…

on substack: aifails.substack.com/p/could-an-e...

2 weeks ago 0 0 0 0

Could an egg hatch on a train? aifails.substack.com/p/could-an-e...

2 weeks ago 0 0 0 0

decided to create a separate dedicated "AI fails" substack -- here's the first entry! (follow link for full output and if you want to subscribe)
aifails.substack.com/p/seaweed-cube

2 weeks ago 0 0 0 0

stretching from the US to Australia It’s quite simple really.

on substack: vincentconitzer.substack.com/p/stretching...

2 weeks ago 0 0 0 0

stretching from the US to Australia vincentconitzer.substack.com/p/stretching...

2 weeks ago 0 0 0 1
