Bernhard Jaeger (@bernhard-jaeger) Bsky

Still hiring for PhD candidates who are *specifically* excited in building and deploying RL systems for self-driving vehicles and other multi-agent planning settings. Shoot me an email if you think this is you and please help spread the word!

6 days ago 41 14 2 0

"Wir wollen Impulse für gute Beschäftigungsbedingungen in der Wissenschaft geben." | VolkswagenStiftungCloseWeiterWeiter

👉 Wir drehen an einer wichtigen Stellschraube für bessere #Arbeitsbedingungen in der #Wissenschaft: Vollzeitstellen sollen für Doktorand:innen in Deutschland zur Norm werden.

"Wir möchten dazu beitragen, das #Wissenschaftssystem in Deutschland weiterzuentwickeln und zu verbessern."
🌐 sohub.io/kyr3

2 weeks ago 56 15 3 8

Thank you for making the world a better place!

2 weeks ago 1 0 0 0

Naver AI has put the word "World" into "World Models", at least on a metropolis scale: A world model for Seoul using Naver Maps (the Seoul capital area is 25.6 million people, btw.).

seoul-world-model.github.io
arxiv.org/abs/2603.15583

@jnhwkim.bsky.social

1 month ago 46 10 0 1

Regularized self-play RL in grounded simulation effectively adapts driving policies to completely new cities. 🗽 -> 🗼

Really enjoyed collaborating on this work, led by Zilin and Saeed! Check out Zilin's post below for a great summary

🧵: x.com/nirhso/statu...
📄: arxiv.org/abs/2602.15891

2 months ago 21 3 0 4

🚀 Excited to share REPPO, a new on-policy RL agent!

TL;DR: Replace PPO with REPPO for fewer hyperparameter headaches and more robust training.

REPPO, led by @cvoelcker.bsky.social, will be presented at ICLR 2026. How does it work? 🧵👇

2 months ago 25 10 1 0

Who’s Behind All Those Robotaxi Teleoperations? With teleoperators, humans are back in the loop. Somewhat alarming, however, is the cascading impact that teleoperators could trigger, including corporations pushing liability issues downstream.

It had been previously reported. Waymo got it taken down very quickly, but not before the Internet Archive got a copy of the text summary:

web.archive.org/web/20250703...

2 months ago 6 3 0 2

Crongratulations Andreas!

2 months ago 1 0 0 0

haha, thought it was weird that the integers were defined as float and thought it was about cache line optimizations.

3 months ago 0 0 0 0

Tübingen AI Research Building, where the Cluster of Excellence "Machine Learning" is based.

📢We’re hiring: W3-Professorship in Machine Learning in Physics @unituebingen.bsky.social! What we’re looking for: Established research profile in a core area of #physics (condensedmatter, quantum or theoretical particle physics), strong track record in research questions related to #ML and/or #AI.

4 months ago 6 10 1 1

is faster?

3 months ago 0 0 1 0

PufferDrive 2.0 release YouTube video by Daphne Cornelisse

What if you could train agents on a 𝗱𝗲𝗰𝗮𝗱𝗲 of driving experience in 𝘂𝗻𝗱𝗲𝗿 𝗮𝗻 𝗵𝗼𝘂𝗿, on a single GPU?

Excited to share 𝙋𝙪𝙛𝙛𝙚𝙧𝘿𝙧𝙞𝙫𝙚 2.0: A fast, friendly driving simulator with RL training via PufferLib at 𝟯𝟬𝟬𝗞 𝘀𝘁𝗲𝗽𝘀/𝘀𝗲𝗰 🐡 + 🚗

youtu.be/LfQ324R-cbE?...

3 months ago 53 10 3 1

Our new E2E driving method, TransFuser v6, is out on ArXiv.
It outperforms all other methods on CARLA by a wide margin, 95 DS on Bench2Drive!
We show that minimizing the asymmetry between data annotator and policy is key for strong IL results.

Code, models, and paper:
ln2697.github.io/lead/

3 months ago 30 6 0 1

Suggestions for Individual Donors from Coefficient Giving Staff – 2025 | Coefficient Giving The 2025 edition of our annual tradition: a list of giving opportunities suggested by Coefficient Giving program staff.

Our staff's 2025 recommendations for individual donors, fresh off the press: coefficientgiving.org/research/su...

4 months ago 2 2 0 1

The Future of Focused Research Organizations: Working with Convergent on the NSF Tech Labs Initiative

This article is from people who have thought about FROs for years and have experience with what works and what doesn't.

I have always appreciated the restraint in defining the niche of FROs in the broader ecosystem; it comes out clearly in this piece.

www.essentialtechnology.blog/p/the-future...

4 months ago 3 3 0 0

Unfortunately it appears much of the academic community has reconstituted itself on LinkedIn

4 months ago 58 7 11 3

I am so happy and excited that this project got funded!

4 months ago 30 3 5 0

AI-powered assistants for scientific discovery Andreas Geiger receives ERC Consolidator Grant

More details at tuebingen.ai/news/ai-powe...

4 months ago 8 2 0 0

true, you could try to collect some dataset withgood coverage by running online RL first and then do offline RL in future iterations to save sim compute

4 months ago 1 0 0 0

This is not bringing back offline RL (but online RL). The purpose of closed-loop training here is to gather data in OOD states with the model.
Stitching doesn't work if your base dataset doesn't cover the state space well, which is the case in autonomous driving.

4 months ago 0 0 1 0

Beyond Behavior Cloning in Autonomous Driving: a Survey of Closed-Loop Training Techniques | Research Behavior cloning, the dominant approach for training autonomous vehicle (AV) policies, suffers from a fundamental gap: policies trained open-loop on temporally independent samples must operate in clos...

Speaking of RL, Nvidia also just published a survey on the importance of closed-loop training (RL, etc.) in E2E driving.

research.nvidia.com/publication/...

4 months ago 9 2 1 0

Demonstrably Safe AI For Autonomous Driving Autonomous driving is the ultimate challenge for AI in the physical world. At Waymo, we’re solving it by prioritizing demonstrably safe AI, where safety is central to how we engineer our models and AI...

waymo.com/blog/2025/12...

Waymo is training End-to-End driving models with RL in simulation.

4 months ago 24 6 2 2

😂

4 months ago 1 0 0 0

Tired Europe: Let's do tons of AI regulations
Wired Europe: Let's do tons of AI open source
#aiPULSE2025

4 months ago 10 1 0 0

This essay, roughly on dual use, has been haunting me for a while now:
dl.acm.org/doi/pdf/10.1...

4 months ago 28 3 3 0

Excited to be at #Neurips2025 this week to present our paper "Monoculture or Multiplicity: Which is it?", joint work with Moritz Hardt.

📄 Paper #1000: openreview.net/pdf?id=DO5Lt...
📍 Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM

Feel free to come by and reach out!

A short 🧵.

4 months ago 16 4 1 0

Attending #Neurips2025? Get your personalized Scholar Inbox conference program now to easily navigate the poster sessions and find what you are looking for:
www.scholar-inbox.com/conference/n...

4 months ago 35 12 0 0

Scholar Inbox for NeurIPS is live now.

4 months ago 14 5 0 2

Preprint site arXiv is banning computer-science reviews: here’s why The repository is taking steps to tackle a surge in low quality, AI-generated content.

www.nature.com/articles/d41...

ArXiv banned surveys due to AI slop spam.
Now we need to wait for them to be peer-reviewed.
Bad development, we need to find better solutions to AI slop than banning unreviewed papers.
Getting a survey reviewed at a good journal can take over a year. :(

4 months ago 0 0 0 0

Quick reminder about the EPFL PhD program deadline (EDIC) on Dec 15.

4 months ago 4 2 0 0

Posts by Bernhard Jaeger