New paper: Back into Platoโs Cave
Are vision and language models converging to the same representation of reality? The Platonic Representation Hypothesis says yes. BUT we find the evidence for this is more fragile than it looks.
Project page: akoepke.github.io/cave_umwelten/
1/9
Posts by Diane Larlus
Happy Tuesday ! ๐ฒ๐ป
Work with @mbsariyildiz.bsky.social @weinzaepfelp.bsky.social @skamalas.bsky.social @pdejorge.bsky.social and other @naverlabseurope.bsky.social colleagues
Research described in these 2 papers:
UNIC - europe.naverlabs.com/unic #ECCV2024
DUNE - europe.naverlabs.com/dune #CVPR2025
๐ฌ Nice video illustrating our recent line of work on multi-teacher distillation for learning visual representations
The DUNE encoder is a ViT distilled from multiple heterogenous teachers and is incredibly powerful, even outperforming the MASt3R on localization. We use it for navigation and manipulation.
@mbsariyildiz.bsky.social
@weinzaepfelp.bsky.social
@skamalas.bsky.social
@dlarlus.bsky.social
et al.
๐จ Happy to announce CVPR@Paris'26 which will take place on June 1st in Paris. The goal of the event is to share a little bit of the conference before it happens. We will have poster sessions as well as several plenary talks by world-class speakers.
info: cvprinparis.github.io/CVPR2026InPa...
The keynotes and award talks at 3DV 2026 have been made available, with talks by Christian Rupprecht ("3D computer vision done and Dusted?", Angela Dai, Alec Jacobson, Jitendra Malik, Songyou Peng and Michael Niemeyer.
www.youtube.com/playlist?lis...
Today NeurIPS is announcing our official satellite event in Paris.
After responding to the call from Ellis following the success of EurIPS in December, we are pleased to reach a new milestone by joining forces with the NeurIPS organizing committee for the 2026 edition.
Spring flowers on top of a castle
A bike behind a bench, mountains at a distance
Spring commute ๐ฑ๐ธ๐ฒ
Amazing slides! And good cat-to-slide ratio ๐ป
The ๐ฅ๐ฅAnny roll continues this week with the release of Multi-HMR trained with the Anny body model. Multi-person, single shot Human Mesh Recovery predictions are now made in the open, interpretable parameter space of Anny!
๐งโ๐คโ๐งMulti-HMR code: github.com/naver/multi-...
๐Anny: tinyurl.com/5fsekm9z
We're pleased to share that the much awaited Anny-One dataset is now available!
Download โก๏ธ tinyurl.com/anny-one
*โฃ 800K images
๐Multi-view & multi-person
๐ Indoor 3D scenes
๐ทWild cameras
๐ฅAnny format! github.com/naver/anny
S-MUSt3R: Sliding Multi-view 3D Reconstruction
Leonid Antsfeld, Boris Chidlovskii, Yohann Cabon, @vincentleroy.bsky.social Jerome Revaud
tl;dr: sliding MUSt3R for cheap long seq
The most interesting is alignment: point-based+camera-based. KDTree local desc for loop closure
arxiv.org/abs/2602.04517
It's time, dear #CVPR2026 reviewers. Time to decide.
We kick off our 2026 Open Seminar series with Dimitris Samaras @stonybrooku.bsky.social on 'Controllable and Efficient Diffusion Models'. Learn how they can be steered & optimized for efficient, controllable generation!
๐
Tuesday Jan 20th: 10am CET
More info & registration โก๏ธ tinyurl.com/y3m23j2x
๐ ๐ฒ๐๐ต๐ฐ๐: ๐ฐ๐ ๐ ๐ฒ๐๐ต ๐ฅ๐ฒ๐ฐ๐ผ๐ป๐๐๐ฟ๐๐ฐ๐๐ถ๐ผ๐ป ๐ฎ๐ป๐ฑ ๐ง๐ฟ๐ฎ๐ฐ๐ธ๐ถ๐ป๐ด ๐ณ๐ฟ๐ผ๐บ ๐ ๐ผ๐ป๐ผ๐ฐ๐๐น๐ฎ๐ฟ ๐ฉ๐ถ๐ฑ๐ฒ๐ผ
Zeren Jiang, Chuanxia Zheng, Iro Laina ... Andrea Vedaldi
arxiv.org/abs/2601.05251
Trending on www.scholar-inbox.com
Sunset on the Alps, above the clouds
The 4th AI for Robotics workshop surfaced converging themes around embodied perception, task learning & evaluation methodologies - emphasising a shift to integrated, context-aware systems. Dive into our key takeaways #AI #Robotics #spatialAI
๐ฅฝ โก๏ธ tinyurl.com/bvxxcn5e
Check out our 2025 highlights in computer vision!
๐Five new *St3R models (MASt3R-SfM, MUSt3R, PanSt3R, HAMSt3R, HOSt3R)
๐คฉAnny parametric 3D human model (Apache 2.0)
๐คUniversal encoder for all-in-one vision FM
Watch the highlights ๐
More info โถ๏ธ tinyurl.com/muvs5vnu
Congratulations Andreas !
Can vision transformers learn without images?๐ค๐
Our latest work shows that pretraining ViTs on procedural symbolic data (eg sequences of balanced parentheses) makes subsequent standard training (eg on ImageNet) more data efficient! How is this possible?! โฌ๏ธ๐งต
The 5th edition of the Conference on Lifelong Learning Agents is going to Romania in 2026 & I'm happy to serve as program chair
We've just put the CfP online โฌ๏ธ & we're particularly excited to broaden the scope to all facets of adaptation of ML models.
Please check out these exciting additions!
New startup Gradium spawned out of @kyutai-labs.bsky.social
Congrats Nicolas ! On the PhD and on those beautifully crafted slides ๐คฉ
This fourth edition of the AI4Robotics workshop was such a nice event ๐คฉ If you couldn't attend, recordings for all the talks are now available online ๐ฌ๐ฟ
europe.naverlabs.com/updates/ai4r...
๐ @bjornih.bsky.social will present his #BMVC2025 paper in half an hour !
The 4th #AI4RoboticsWorkshop is over! A big THANK YOU to all our fab speakers & participants for great presentations & conversations. Snapshot souvenir โฌ๏ธ For those who missed out -recordings available soon! tinyurl.com/bdtk2nzs
Recordings will soon be available, stay tuned !