๐ new state-of-the-art on ILIAS dataset!
Curious how well the latest models can recognize particular objects?
We evaluated the base and large variants of DINOv3 and Perception Encoder (PE) on instance-level image retrieval.
See the results ๐ vrg.fel.cvut.cz/ilias/
Posts by Nikolaos-Antonios Ypsilantis
Our work, GROVE, has been accepted to ICCV 2025! ๐ This is collab. w. Cordelia Schmid & @josef-sivic.bsky.social.
We will release code, models and datasets within next 2 weeks.
We are also working on a search demo for the proposed datasets with user prompts!
I hope to see you all in Honolulu!
๐จ Deadline Extension
Instance-Level Recognition and Generation (ILR+G) Workshop at ICCV2025 @iccv.bsky.social
๐
new deadline: June 26, 2025 (23:59 AoE)
๐ paper submission: cmt3.research.microsoft.com/ILRnG2025
๐ ILR+G website: ilr-workshop.github.io/ICCVW2025/
#ICCV2025 #ComputerVision #AI
The Visual Recognition Group at CTU in Prague organizes the 49th Pattern Recognition and Computer Vision Colloquium with D. Karatzas, M. Masana, T. Tommasi, P. Mettes @pascalmettes.bsky.social , E. Brachmann @ericbrachmann.bsky.social and V. Stojnic @stojnicv.xyz
cmp.felk.cvut.cz/colloquium/#...
We are happy to share LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation.
LPOSS is a training-free method for open-vocabulary semantic segmentation using Vision-Language Models.
You can find easy cases in older datasets, this one mostly consists of hard examples. We created a new test dataset for instance-level retrieval to better reflect real world challenges. How does your new representation or retrieval model perform on ILIAS with 100m distractors? Accepted at CVPR'25
ILIAS is a large-scale test dataset for evaluation on Instance-Level Image retrieval At Scale. It is designed to support future research in image-to-image and text-to-image retrieval for particular objects and serves as a benchmark for evaluating foundation models and retrieval techniques.
ILIAS: Instance-Level Image retrieval At Scale
@gkordo.bsky.social, Vladan Stojniฤ @annetka.bsky.social Pavel ล uma, Nikolaos-Antonios Ypsilantis @nikos-efth.bsky.social Zakaria Laskar,Jiลรญ Matas, Ondลej Chum, @gtolias.bsky.social
tl;dr: SigLIP rules. Lots of ablations
arxiv.org/abs/2502.11748
1/
For PhD and MSc students interested in a research visit to Prague/VRG in 2025: we're open to hosting short-term collaborations or internships on a range of computer vision topics. If this sounds exciting, reach out by e-mail! We'd love to discuss potential projects. Some examples ๐งต
#Internship #CV
Excited to present UDON at NeurIPS '24 tomorrow (Thursday 12/12)! If you are interested in a scalable training method for multi-domain image embeddings, come to poster #1410 in the East Exhibit Hall A-C of the Vancouver Convention Center from 11 am to 2 pm (PST) to discuss!