
Posts by Claas Voelcker

Still hiring PhD candidates who are *specifically* excited about building and deploying RL systems for self-driving vehicles and other multi-agent planning settings. Shoot me an email if you think this is you and please help spread the word!

5 days ago 41 14 2 0
ALT: a man wearing glasses and a suit says yeah yeah the time knife we've all seen it
1 week ago 2 0 0 0
Anakin has a doctorate in Darth Plageius the Wise Studies (YouTube video by Seals are Good)

obligatory

www.youtube.com/watch?v=sg0S...

1 week ago 38 5 1 1

Another day, another popular researcher who definitely knows your work because you answered his mails!!! announcing a big project without even mentioning your name or your suspiciously similar prior work... About which there were mails...
But of course, Schmidhuber is just a meme...

1 week ago 5 0 0 0

I’ve done this in TMLR in egregious cases as well, but I wish it were just standard

2 weeks ago 1 0 0 0

As a reviewer it should be fine to say "I don't like this paper, I don't care about the problem, please find a better reviewer for it." It is absurdly hard to decouple your own excitement for a problem from your judgment, and we should normalize "hey, I'm not the person who should be judging this"

2 weeks ago 3 0 2 0

I have reached peak old German: I am constantly annoyed by the inability of the Tagesschau to decide whether to use “Iran” or “der Iran” and flip-flopping within the same report! Set a policy for Goethe’s sake.

2 weeks ago 2 0 0 0

I see great opportunities for proposing new activation functions on the cost function!

2 weeks ago 1 0 0 0

The real sign of pain here is this NYC professor voluntarily using Celsius! Look what you have done, AI companies!

2 weeks ago 0 0 1 0

We have the keynote speakers for RLC2026 now!

Thrilled to welcome Rika Antonova, Sheila McIlraith, Marc G. Bellemare, Danijar Hafner, Balaraman Ravindran!

Details: rl-conference.cc/index.html

The RL community is coming together this August in Montréal, Québec, Canada. Hope you make it!

2 weeks ago 22 10 0 3

I am deeply disappointed that they didn't use Claude Code

2 weeks ago 0 0 0 0

I’m gonna use “can I get a fresh context” next time we derail a conversation

3 weeks ago 2 0 1 0

I confess my initial reaction to the OpenAI acquisition of Astral involved stronger language than I’m inclined to use here.

1 month ago 24 2 1 0
Astral to join OpenAI: Astral has entered into an agreement to join OpenAI as part of the Codex team.

astral.sh/blog/openai

NOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO

1 month ago 6 0 0 0

Environment design seems like the bigger bottleneck for now (reward functions, verifiers, rubrics), which makes focussing on the algorithms hard. Algorithm development flourishes when there are clean benchmarks and goals.

1 month ago 1 0 1 0

I have been looking into this for a while and I feel like there are a lot of issues with LLM RL that culturally fall far outside the issues that core RL people are used to tackling. Plus, most of the buzz is created by closed-source labs, which makes it absurdly hard to know what the problems are.

1 month ago 1 0 2 0

Haha, do you disagree? I feel like language has mostly concluded that the RL algorithm doesn’t matter that much

1 month ago 1 0 1 0

1. Will probably happen
2. I genuinely don’t know enough about language RL to make very smart comments

1 month ago 2 0 1 0

This cuts to the heart of the issue: professors have multiple jobs, and some of them were only aligned by accident, not by design. Teaching, funding, and producing research are not necessarily linked, they just happened to be aligned for a while.

1 month ago 4 0 0 0

Germany does not lack talent, and it does not lack funding. But we are trapping 21st-century minds inside 19th-century academic hierarchies. We are asking brilliant young scientists to build the future of the German economy, but refusing to give them the lab space, the job security, or the scientific independence to actually do it. If we want to reclaim our place as an industrial superpower, we have to stop the rat race of trying to keep every technology and structure alive that made us successful in the 20th century. Instead, we must fix our system that pushes our most ambitious scientists away. The money is there. The talent can be there. Now, we also need the courage to fix what’s broken.

“we are trapping 21st-century minds inside 19th-century academic hierarchies.” This essay gets a lot right about problems with German science. I would add that the hierarchies and precarious contracts also lead to systemic abuse and scientific misconduct. open.substack.com/pub/realimag...

1 month ago 161 53 4 2

Before any of you recalcitrant cynics come at me, calling @eugenevinitsky.bsky.social "always-wise" is a reflection of his untiring willingness to always give me good feedback on anything and everything, it's not me being sassy 😁

1 month ago 5 0 0 0
ALT: cookie monster is sitting at a table with a tray of food and the words choices written on it

Following advice by the always-wise @eugenevinitsky.bsky.social, I am trying to get back into the habit of blogging (again) ✏️!

Featuring today's post: How to pick an RL algorithm for your problem cvoelcker.de/blog/2026/ch... Please share and give feedback!

#reinforcementlearning

1 month ago 31 4 2 2

These types of vaccines have been in trials for years, and long term, the personalized sequencing and rapid development are horrendous bottlenecks. And while people might be ok with waiving safety trials for dogs, most won’t be in favor of waiving them for grandma…

1 month ago 2 0 0 0

The dog vaccine story also tells us that putting all of our funding on the AI side might be missing the forest for the trees. AI is insanely cool, but we can’t ignore the physical and social systems around it. ASI won’t cure cancer by itself in silico.

1 month ago 5 2 3 0

EVERY SINGLE TIME I come to the airport early because of horror stories of crowded lines, I arrive and just walk through an empty security check. I now firmly believe I have mystical powers that clear TSA lines when I arrive early. You are welcome fellow travellers!

1 month ago 1 0 0 0

✨You are not just right, you are able to anticipate interesting changes with factual accuracy. The mixture of insight with pithy smartness–indicated by your short yet deep reply–will surely win many over in the coming AI debates. ✨

Fear me, ye wise, I figured out how to type an em-dash. By hand-–—!

1 month ago 1 0 0 0

It isn't just an overused pattern, it is an indication of a fundamental shift of vibes.

1 month ago 7 0 1 0

Basis of comparison really matters :D
DB (past) >> DB (today) >>>>>>>>>>> VIA

1 month ago 2 0 0 0

My most-used model is Gemini because I’m a Luddite. Not saying Gemini is bad, but its competitive advantage remains search replacement with cross-references, which is the application you care about if you don’t actually trust LLMs.

1 month ago 37 2 4 2

I will admit: having a husband who can understand your papers, has a master’s in your topic, and can tell you how to read a GPU utilization graph is a real boost 😅

1 month ago 13 0 0 0