Advertisement ยท 728 ร— 90

Posts by Bernhard Jaeger

Still hiring for PhD candidates who are *specifically* excited in building and deploying RL systems for self-driving vehicles and other multi-agent planning settings. Shoot me an email if you think this is you and please help spread the word!

6 days ago 41 14 2 0
Preview
"Wir wollen Impulse fรผr gute Beschรคftigungsbedingungen in der Wissenschaft geben." | VolkswagenStiftungCloseWeiterWeiter

๐Ÿ‘‰ Wir drehen an einer wichtigen Stellschraube fรผr bessere #Arbeitsbedingungen in der #Wissenschaft: Vollzeitstellen sollen fรผr Doktorand:innen in Deutschland zur Norm werden.

"Wir mรถchten dazu beitragen, das #Wissenschaftssystem in Deutschland weiterzuentwickeln und zu verbessern."
๐ŸŒ sohub.io/kyr3

2 weeks ago 56 15 3 8

Thank you for making the world a better place!

2 weeks ago 1 0 0 0
Video

Naver AI has put the word "World" into "World Models", at least on a metropolis scale: A world model for Seoul using Naver Maps (the Seoul capital area is 25.6 million people, btw.).

seoul-world-model.github.io
arxiv.org/abs/2603.15583

@jnhwkim.bsky.social

1 month ago 46 10 0 1
Post image

Regularized self-play RL in grounded simulation effectively adapts driving policies to completely new cities. ๐Ÿ—ฝ -> ๐Ÿ—ผ

Really enjoyed collaborating on this work, led by Zilin and Saeed! Check out Zilin's post below for a great summary

๐Ÿงต: x.com/nirhso/statu...
๐Ÿ“„: arxiv.org/abs/2602.15891

2 months ago 21 3 0 4

๐Ÿš€ Excited to share REPPO, a new on-policy RL agent!

TL;DR: Replace PPO with REPPO for fewer hyperparameter headaches and more robust training.

REPPO, led by @cvoelcker.bsky.social, will be presented at ICLR 2026. How does it work? ๐Ÿงต๐Ÿ‘‡

2 months ago 25 10 1 0
Preview
Whoโ€™s Behind All Those Robotaxi Teleoperations? With teleoperators, humans are back in the loop. Somewhat alarming, however, is the cascading impact that teleoperators could trigger, including corporations pushing liability issues downstream.

It had been previously reported. Waymo got it taken down very quickly, but not before the Internet Archive got a copy of the text summary:

web.archive.org/web/20250703...

2 months ago 6 3 0 2

Crongratulations Andreas!

2 months ago 1 0 0 0
Advertisement

haha, thought it was weird that the integers were defined as float and thought it was about cache line optimizations.

3 months ago 0 0 0 0
Tรผbingen AI Research Building, where the Cluster of Excellence "Machine Learning" is based.

Tรผbingen AI Research Building, where the Cluster of Excellence "Machine Learning" is based.

๐Ÿ“ขWeโ€™re hiring: W3-Professorship in Machine Learning in Physics @unituebingen.bsky.social! What weโ€™re looking for: Established research profile in a core area of #physics (condensedmatter, quantum or theoretical particle physics), strong track record in research questions related to #ML and/or #AI.

4 months ago 6 10 1 1

is faster?

3 months ago 0 0 1 0
PufferDrive 2.0 release
PufferDrive 2.0 release YouTube video by Daphne Cornelisse

What if you could train agents on a ๐—ฑ๐—ฒ๐—ฐ๐—ฎ๐—ฑ๐—ฒ of driving experience in ๐˜‚๐—ป๐—ฑ๐—ฒ๐—ฟ ๐—ฎ๐—ป ๐—ต๐—ผ๐˜‚๐—ฟ, on a single GPU?

Excited to share ๐™‹๐™ช๐™›๐™›๐™š๐™ง๐˜ฟ๐™ง๐™ž๐™ซ๐™š 2.0: A fast, friendly driving simulator with RL training via PufferLib at ๐Ÿฏ๐Ÿฌ๐Ÿฌ๐—ž ๐˜€๐˜๐—ฒ๐—ฝ๐˜€/๐˜€๐—ฒ๐—ฐ ๐Ÿก + ๐Ÿš—

youtu.be/LfQ324R-cbE?...

3 months ago 53 10 3 1
Video

Our new E2E driving method, TransFuser v6, is out on ArXiv.
It outperforms all other methods on CARLA by a wide margin, 95 DS on Bench2Drive!
We show that minimizing the asymmetry between data annotator and policy is key for strong IL results.

Code, models, and paper:
ln2697.github.io/lead/

3 months ago 30 6 0 1
Preview
Suggestions for Individual Donors from Coefficient Giving Staff โ€“ 2025 | Coefficient Giving The 2025 edition of our annual tradition: a list of giving opportunities suggested by Coefficient Giving program staff.

Our staff's 2025 recommendations for individual donors, fresh off the press: coefficientgiving.org/research/su...

4 months ago 2 2 0 1
Preview
The Future of Focused Research Organizations: Working with Convergent on the NSF Tech Labs Initiative

This article is from people who have thought about FROs for years and have experience with what works and what doesn't.

I have always appreciated the restraint in defining the niche of FROs in the broader ecosystem; it comes out clearly in this piece.

www.essentialtechnology.blog/p/the-future...

4 months ago 3 3 0 0

Unfortunately it appears much of the academic community has reconstituted itself on LinkedIn

4 months ago 58 7 11 3
Advertisement

I am so happy and excited that this project got funded!

4 months ago 30 3 5 0
Preview
AI-powered assistants for scientific discovery Andreas Geiger receives ERC Consolidator Grant

More details at tuebingen.ai/news/ai-powe...

4 months ago 8 2 0 0

true, you could try to collect some dataset withgood coverage by running online RL first and then do offline RL in future iterations to save sim compute

4 months ago 1 0 0 0

This is not bringing back offline RL (but online RL). The purpose of closed-loop training here is to gather data in OOD states with the model.
Stitching doesn't work if your base dataset doesn't cover the state space well, which is the case in autonomous driving.

4 months ago 0 0 1 0
Beyond Behavior Cloning in Autonomous Driving: a Survey of Closed-Loop Training Techniques | Research Behavior cloning, the dominant approach for training autonomous vehicle (AV) policies, suffers from a fundamental gap: policies trained open-loop on temporally independent samples must operate in clos...

Speaking of RL, Nvidia also just published a survey on the importance of closed-loop training (RL, etc.) in E2E driving.

research.nvidia.com/publication/...

4 months ago 9 2 1 0
Preview
Demonstrably Safe AI For Autonomous Driving Autonomous driving is the ultimate challenge for AI in the physical world. At Waymo, weโ€™re solving it by prioritizing demonstrably safe AI, where safety is central to how we engineer our models and AI...

waymo.com/blog/2025/12...

Waymo is training End-to-End driving models with RL in simulation.

4 months ago 24 6 2 2

๐Ÿ˜‚

4 months ago 1 0 0 0

Tired Europe: Let's do tons of AI regulations
Wired Europe: Let's do tons of AI open source
#aiPULSE2025

4 months ago 10 1 0 0
Advertisement
Post image

This essay, roughly on dual use, has been haunting me for a while now:
dl.acm.org/doi/pdf/10.1...

4 months ago 28 3 3 0
Post image

Excited to be at #Neurips2025 this week to present our paper "Monoculture or Multiplicity: Which is it?", joint work with Moritz Hardt.

๐Ÿ“„ Paper #1000: openreview.net/pdf?id=DO5Lt...
๐Ÿ“ Wed, Dec 3, 2025 โ€ข 4:30 PM โ€“ 7:30 PM

Feel free to come by and reach out!

A short ๐Ÿงต.

4 months ago 16 4 1 0
Post image

Attending #Neurips2025? Get your personalized Scholar Inbox conference program now to easily navigate the poster sessions and find what you are looking for:
www.scholar-inbox.com/conference/n...

4 months ago 35 12 0 0

Scholar Inbox for NeurIPS is live now.

4 months ago 14 5 0 2
Preview
Preprint site arXiv is banning computer-science reviews: hereโ€™s why The repository is taking steps to tackle a surge in low quality, AI-generated content.

www.nature.com/articles/d41...

ArXiv banned surveys due to AI slop spam.
Now we need to wait for them to be peer-reviewed.
Bad development, we need to find better solutions to AI slop than banning unreviewed papers.
Getting a survey reviewed at a good journal can take over a year. :(

4 months ago 0 0 0 0

Quick reminder about the EPFL PhD program deadline (EDIC) on Dec 15.

4 months ago 4 2 0 0