
Posts by Vincent Leroy


What a nice morning to find that Alzugaray, Taher @marwantaher.bsky.social, and Davison @ajdavison.bsky.social have turned ACE into a SLAM system. Ever since we did ACE, I had wondered whether it was possible, and I always thought: who, if not those folks, could pull it off? 🙂

ACESLAM: arxiv.org/abs/2512.14032

4 months ago 30 3 1 0
Snippings of recorded talks


Last week’s #AI4Robotics workshop was fab & fabulously full 😊 If you missed out we've good news -
📽️ All 15 talk recordings are now online 🚀! tinyurl.com/4ns5apvd
Catch up on cutting-edge work in #robot perception, action & autonomy - from #SLAM & control to computer vision & large-scale learning!

4 months ago 35 10 1 2

The 4th #AI4RoboticsWorkshop is over! A big THANK YOU to all our fab speakers & participants for great presentations & conversations. Snapshot souvenir ⬇️ For those who missed out - recordings available soon! tinyurl.com/bdtk2nzs

5 months ago 96 9 5 0

Eric Brachmann @ericbrachmann.bsky.social on the future of mapping ... or localization w/o explicit maps, and links to SfM.

AI 4 Robotics Workshop at @naverlabseurope.bsky.social

5 months ago 10 2 0 0

Daniel Cremers presents the history of SLAM and his contributions, e.g. the new VISTA SLAM (3DV 2026).

AI 4 Robotics workshop at @naverlabseurope.bsky.social

5 months ago 9 2 0 0

We’re ready to roll for Day 2 of the #AI4RoboticsWorkshop !
Livestream starts 9am CET:🎥 tinyurl.com/bdtk2nzs
Today’s great lineup of speakers: @dcremers.bsky.social –
@ericbrachmann.bsky.social - Aniruddha Kembhavi - Adrien Gaidon - Nicolas Mansard & Justin Carpentier

5 months ago 7 3 0 1

We’re live! 🚀 Streaming: tinyurl.com/bdtk2nzs
The International Workshop on AI4Robotics by @naverlabseurope
2 days of Spatial AI, SLAM, robot learning, HRI, autonomy
This AM CET: @martinhumenberger.bsky.social @marcpollefeys.bsky.social Andrea Vedaldi Cordelia Schmid & @andrewdavidson.bsky.social ⬇️

5 months ago 22 8 0 0

Marc Pollefeys talks about Global SfM meeting feedforward reconstruction, targeting scenes with a high number of hops between cameras to cover the entire scene (and many other Spatial AI contributions)

@marcpollefeys.bsky.social

AI 4 Robotics Workshop at @naverlabseurope.bsky.social

5 months ago 15 2 0 0

Excellent speaker lineup for the @naverlabseurope.bsky.social AI for Robotics Workshop.
For those at home, the event is live-streamed on the landing page: europe.naverlabs.com/updates/ai4r...

5 months ago 8 3 0 0

A new model for human mesh recovery, high-performing and w/o using any 3D scans, has been published by my excellent colleagues at @naverlabseurope.bsky.social. Excellent work!

5 months ago 24 3 0 0

Nice article on PanSt3R with LojzeZust in #ICCV2025 Daily! rsipvision.com/ICCV2025-Tue...

6 months ago 11 3 0 0

If you want to learn more about general models for 3D and semantic understanding, or just have a quick chat, come meet us this afternoon at posters #462 (HAMSt3R) and #540 (PanSt3R) :)

6 months ago 8 0 0 0

We’re #HiringNow a Senior Research Scientist in ML for Robotics in Grenoble, France
Don't miss us at #iros2025 this week to chat about the role, research & team with Darko Drakulic & @rbregier.bsky.social on Tue 21 & Thur 23 Oct!
More info on where to find them & job ad ⬇️⬇️

6 months ago 4 3 1 0

We round off #iccv25 with highlight paper Geo4D on Thur
TLDR: Geo4D repurposes a video diffusion model to reconstruct dynamic scenes in 4D
Paper: arxiv.org/abs/2504.07961
Project: geo4d.github.io
@chuanxiaz.bsky.social @dlarlus.bsky.social @oxford-vgg.bsky.social
🧵9

6 months ago 9 2 0 0

Tuesday is Poster Session 2 with
HAMSt3R: Human-Aware Multi-view Stereo 3D Reconstruction @iccv.bsky.social
TLDR: HAMSt3R extends MASt3R to handle scenes involving people.
Paper: arxiv.org/abs/2508.16433
@weinzaepfelp.bsky.social @vincentleroy.bsky.social 🧵7/9

6 months ago 5 1 1 0

It's a marathon Sunday for @gabrielacsurka.bsky.social – co-author of PanSt3R, Gabriela is also giving 2 invited wksp talks AND keynote dinner speech #WiCV 😅
CALIPOSE: 🏃https://sites.google.com/view/calipose2025/
CRoCoDL: 🏃‍♀️https://localizoo.com/workshop/
WICV: sites.google.com/view/wicv-ic... 🧵6/9

6 months ago 6 1 1 0

If you’ve never heard the story of how the #DUSt3R or *St3R family was born & what’s going on now - don’t miss @vincentleroy.bsky.social & his new talk at the #E2E3D workshop, also Oct 19th
Workshop E2E3D (afternoon): e2e3d.github.io 🧵5/9

6 months ago 6 1 1 0
ICCV 2025 5 papers, invited speaker, WiCV sponsor and Challenge sponsor

Aloha #iccv25 – here we come! Excited to be presenting new *St3R models PanSt3R, HAMSt3R & HOSt3R. We're also introducing ‘Geo4D’ and ‘LUDVIG’ 🫢 giving invited talks and mentoring! Full @iccv.bsky.social
programme below (or tinyurl.com/asbn5b5d) 🧵1/9

6 months ago 17 4 1 1

🔥 ACE-G, the next evolutionary step of ACE at #ICCV2025 🔥

We disentangle coordinate regression and latent map representation which lets us pre-train the regressor to generalize from mapping data to difficult query images.

Page: nianticspatial.github.io/ace-g/

Stellar work by Leonard Bruns et al.!

6 months ago 24 5 2 1

📽️ Check out Visual Odometry Transformer! VoT is an end-to-end model for getting accurate metric camera poses from monocular videos.

vladimiryugay.github.io/vot/

6 months ago 10 4 1 0

RaySt3R was accepted to NeurIPS! Check out the HuggingFace demo for image to 3D in cluttered scenes huggingface.co/spaces/bartd...

7 months ago 5 2 0 0
Cameras as Relative Positional Encoding Transformers are increasingly prevalent for multi-view computer vision tasks, where geometric relationships between viewpoints are critical for 3D perception. To leverage these relationships, multi-vi...

This or some kind of PRoPe arxiv.org/abs/2507.10496

7 months ago 3 0 1 0

Yes, exactly, if you are dealing with unordered image collections. The same certainly cannot be said for videos, where the displacement between two frames makes more sense.

7 months ago 3 0 1 0

Wrt RoPE: People are applying it wrongly for multi-view. If you do it VGGT style, you will mix the positions of all the images. You should do it independently (i.e. no RoPE between images).

7 months ago 5 1 2 0
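
A minimal NumPy sketch of one possible reading of the RoPE remark above: rotary position encoding is applied only to query/key pairs that belong to the same image, and cross-image pairs use plain dot products. Everything here is an assumption made for illustration (the helper names `rope_1d`, `rope_2d` and `multiview_logits`, the axial 2D layout, the toy sizes); it is not taken from VGGT, MapAnything or any other codebase, and real models would more likely use separate per-frame and global attention layers than a mask.

```python
import numpy as np

def rope_1d(x, pos, base=10000.0):
    """Rotate channel pairs of x (tokens, dim) by angles proportional to pos (tokens,)."""
    half = x.shape[-1] // 2
    freqs = base ** (-np.arange(half) / half)      # one frequency per channel pair
    ang = pos[:, None] * freqs[None, :]            # (tokens, half)
    cos, sin = np.cos(ang), np.sin(ang)
    x1, x2 = x[:, :half], x[:, half:]
    return np.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos], axis=-1)

def rope_2d(x, yx):
    """Axial 2D RoPE (assumed layout): first half of channels encodes y, second half x."""
    half = x.shape[-1] // 2
    return np.concatenate(
        [rope_1d(x[:, :half], yx[:, 0]), rope_1d(x[:, half:], yx[:, 1])], axis=-1
    )

def multiview_logits(q, k, yx, image_id):
    """Attention logits with RoPE inside each image and no RoPE between images."""
    plain = q @ k.T                                # position-free logits
    rotated = rope_2d(q, yx) @ rope_2d(k, yx).T    # relative-position-aware logits
    same_image = image_id[:, None] == image_id[None, :]
    return np.where(same_image, rotated, plain)

# Toy setup: 2 images, each a 2x2 patch grid, 8 channels (must be divisible by 4).
rng = np.random.default_rng(0)
n_img, h, w, dim = 2, 2, 2, 8
ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
grid = np.stack([ys.ravel(), xs.ravel()], axis=-1).astype(float)  # (h*w, 2)
yx = np.tile(grid, (n_img, 1))                     # every image restarts its own grid
image_id = np.repeat(np.arange(n_img), h * w)
q = rng.normal(size=(n_img * h * w, dim))
k = rng.normal(size=(n_img * h * w, dim))
logits = multiview_logits(q, k, yx, image_id)
```

In this sketch, a patch at (0, 0) in image A and a patch at (0, 1) in image B are not treated as horizontal neighbours, because the cross-image logit ignores patch positions entirely.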

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Nikhil Keetha et 16 al.

tl;dr: VGGT meets Pow3R. No RoPE. Metric scale.
arxiv.org/abs/2509.13414

1/

7 months ago 17 2 3 0

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

@nikv9.bsky.social et al.

tl;dr: flexible input & metric output version of VGGT

arxiv.org/abs/2509.13414

7 months ago 2 2 0 0
Towards the Next Generation of 3D Reconstruction

And here is a link to the thesis itself: liu.diva-portal.org/smash/record...

7 months ago 19 6 0 0

Towards the Next Generation of 3D Reconstruction

@parskatt.bsky.social PhD Thesis.

tl;dr: would be useful in teaching image matching - nice explanations. (too) Fancy and stylish notation. Cool Ack section and cover image.

liu.diva-portal.org/smash/record...

7 months ago 33 8 2 0

How to name your method: a comprehensive flow chart

7 months ago 43 10 1 0

Want more visibility for your SLAM-related paper at #ICCV2025?

Submit to the Nectar Track of our Neural SLAM workshop before Sep. 15!

We welcome any recently published high-quality papers (ICCV, CVPR, NeurIPS, Arxiv, etc.)!

🌐 More info: sites.google.com/view/neuslam...

7 months ago 7 3 0 0