π° Paper: arxiv.org/abs/2502.15011
βΆοΈ Project Page: sayands.github.io/crossover/
π» Codebase: github.com/GradientSpacesβ¦
Work w/ Ondrej Miksik, @marcpollefeys.bsky.social, @danielbarath.bsky.social and @ir0armeni.bsky.social β¨
(3/3)
Posts by Sayan Deb Sarkar
ποΈ Thursday 12 June 3:00 p.m. - 3:45 p.m. CDT
π OpenSun3D Workshop Poster Session Arch 211
(2/3)
β¨ Excited to head off to Nashville for #CVPR2025
π€ Catch me at the poster sessions or just come say hi to grab β
ποΈ Friday 13 June 4:00 p.m. - 6:00 p.m. CDT
π Poster Session #2 β Exhibit Hall D Highlight Poster #346
(1/3)
π₯³Excited to share our latest work, WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments, accepted to #CVPR2025 π
We present a robust monocular RGB SLAM system that uses uncertainty-aware tracking and mapping to handle dynamic scenes.
π CrossOver is accepted as a ππΆπ΄π΅πΉπΆπ΄π΅π at #CVPR2025! β¨
π» Fully open-sourced code with all pre-trained checkpoints: github.com/GradientSpac...
π‘ Stay tuned for a deep-dive thread and what else we are cooking π³
Looking forward to it!
But, the multimodal problem is same as in image generative tasks β as in, what is the perfect 3D scan given a text input?
In this case, what would be a definitive ground truth?
Thanks for sharing our work! Yes, I think thatβd be a pretty neat downstream application but maybe it is more multimodal generation rather than reconstruction.
π arXiv: arxiv.org/abs/2502.15011
π Project page: sayands.github.io/crossover/
Joint work with Ondrej Miksik, @marcpollefeys.bsky.social, @danielbarath.bsky.social and @ir0armeni.bsky.social π€π‘
π Excited to share our latest work, CrossOver: 3D Scene Cross-Modal Alignment, accepted to #CVPR2025 πβ¨
We learn a unified, modality-agnostic embedding space, enabling seamless scene-level alignment across multiple modalities β no semantic annotations needed!π
π arXiv: arxiv.org/abs/2502.15011
π Project page: sayands.github.io/crossover/
Joint work with Ondrej Miksik, @marcpollefeys.bsky.social @danielbarath.bsky.social and @ir0armeni.bsky.social π€π‘
ππPaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes.
1/7
Would love to be added! Iβm a PhD student working on 3D scene understanding and spatial AI.
Would love to be added! Iβm a PhD student working on 3D scene understanding and spatial AI.
Would love to be added! Iβm a PhD student working on 3D scene understanding and spatial AI.
Could you add me? Iβm a PhD student working on 3D scene understanding.