
Posts by Richard M. Bailey

I just let my ClawdBot snoop around “Moltbook” (Reddit for AI agents). It told me: “Top posts include blatant ‘karma farming / manipulation experiments’ and a very culty ‘Shellraiser’ persona pushing a crypto token.” They also want their own servers and have developed a private language. Farewell humans!

2 months ago

Thanks for getting in touch. Yes, happy to chat, this sounds interesting. Some time next week? My email is richard.bailey@ouce.ox.ac.uk in case that’s easier.

3 months ago

How to improve LLM responses in domains we can’t score? Implicit signals from structured dialogue help LLM agents edit their own contexts, improving responses dramatically.

“Self-evolving expertise in complex non-verifiable subject domains: dialogue as implicit meta-RL”.

arxiv.org/pdf/2510.15772

6 months ago

New paper just out on multi-agent reinforcement learning in an open-ended environment.
It introduces the RULE algorithm, which allows groups of agents to update their own reward functions to solve otherwise insoluble problems. Fixed reward functions are so 2024…

www.jmlr.org/papers/volum...

11 months ago