Posts by Sayan Deb Sarkar

📰 Paper: arxiv.org/abs/2502.15011
▶️ Project Page: sayands.github.io/crossover/
💻 Codebase: github.com/GradientSpaces…

Work w/ Ondrej Miksik, @marcpollefeys.bsky.social, @danielbarath.bsky.social and @ir0armeni.bsky.social ✨

(3/3)

10 months ago 2 1 0 0

πŸ—“οΈ Thursday 12 June 3:00 p.m. - 3:45 p.m. CDT
πŸ“ OpenSun3D Workshop Poster Session Arch 211

(2/3)

10 months ago 0 0 1 0

✨ Excited to head off to Nashville for #CVPR2025

🎤 Catch me at the poster sessions or just come say hi to grab ☕

πŸ—“οΈ Friday 13 June 4:00 p.m. - 6:00 p.m. CDT
πŸ“ Poster Session #2 β€” Exhibit Hall D Highlight Poster #346

(1/3)

10 months ago 3 0 1 0

🥳 Excited to share our latest work, WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments, accepted to #CVPR2025 🌐

We present a robust monocular RGB SLAM system that uses uncertainty-aware tracking and mapping to handle dynamic scenes.

1 year ago 4 1 1 2

πŸ† CrossOver is accepted as a π—›π—Άπ—΄π—΅π—Ήπ—Άπ—΄π—΅π˜ at #CVPR2025! ✨
πŸ’» Fully open-sourced code with all pre-trained checkpoints: github.com/GradientSpac...

📡 Stay tuned for a deep-dive thread and what else we are cooking 🍳

1 year ago 5 1 0 0

Looking forward to it!

1 year ago 1 0 0 0

But the multimodal problem is the same as in image generation tasks: what is the perfect 3D scan given a text input?

1 year ago 1 0 1 0

In this case, what would be a definitive ground truth?

1 year ago 0 0 1 0

Thanks for sharing our work! Yes, I think that’d be a pretty neat downstream application, but maybe it is more multimodal generation than reconstruction.

1 year ago 1 0 1 0
CrossOver: 3D Scene Cross-Modal Alignment Multi-modal 3D object understanding has gained significant attention, yet current approaches often assume complete data availability and rigid alignment across all modalities. We present CrossOver, a ...

🔗 arXiv: arxiv.org/abs/2502.15011
📂 Project page: sayands.github.io/crossover/

Joint work with Ondrej Miksik, @marcpollefeys.bsky.social, @danielbarath.bsky.social and @ir0armeni.bsky.social 🤝💡

1 year ago 5 1 0 0

🎉 Excited to share our latest work, CrossOver: 3D Scene Cross-Modal Alignment, accepted to #CVPR2025 🌐✨

We learn a unified, modality-agnostic embedding space, enabling seamless scene-level alignment across multiple modalities, no semantic annotations needed! 🚀

1 year ago 18 3 2 3

🚀🚀 PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes.

1/7

1 year ago 69 21 1 5

Would love to be added! I’m a PhD student working on 3D scene understanding and spatial AI.

1 year ago 1 0 0 0

Could you add me? I’m a PhD student working on 3D scene understanding.

1 year ago 1 0 1 0