Advertisement · 728 × 90

Posts by Bethge Lab

Coordinator (m/f/d, E13 TV-L, 75%)

🚨 We're hiring: Research Coordinator for CRC Robust Vision
@unituebingen.bsky.social

Take a leading role in our DFG funded consortium in neuroscience, ML & CV

MSc/PhD + science mgmt. Apply by Mar 15 👇 join our team w/ @jakhmack.bsky.social @bethgelab.bsky.social

uni-tuebingen.de/en/universit...

2 months ago 11 7 0 0
Post image

Excited to be in Vienna for #ACL2025 🇦🇹!You'll find @dziadzio.bsky.social and I by our ONEBench poster, so do drop by!

🗓️Wed, July 30, 11-12:30 CET
📍Hall 4/5

I’m also excited to talk about lifelong and personalised benchmarking, data curation and vision-language in general! Let’s connect!

8 months ago 4 1 0 0
Postdoctoral Researcher (m/f/d, E13 TV-L, 100%)

🧠🤖 We’re hiring a Postdoc in NeuroAI!

Join CRC1233 "Robust Vision" (Uni Tübingen) to build benchmarks & evaluation methods for vision models, bridging brain & AI. Work with top faculty & shape vision research.

Apply: tinyurl.com/3jtb4an6

#NeuroAI #Jobs

9 months ago 17 13 0 1
Post image

🧵1/ 🚨 New paper: A Sober Look at Progress in Language Model Reasoning
We re-evaluate recent SFT and RL models for mathematical reasoning and find most gains vanish under rigorous, multi-seed, standardized evaluation.

📊 bethgelab.github.io/sober-reason...
📄 arxiv.org/abs/2504.07086

1 year ago 14 5 1 0
Post image

🧠 Keeping LLMs factually up to date is a common motivation for knowledge editing.

But what would it actually take to support this in practice at the scale and speed the real world demands?

We explore this question and really push the limits of lifelong knowledge editing in the wild.
👇

1 year ago 29 8 1 4