@neuripsconf.bsky.social
Does the "Potential Positive and Negative Societal Impacts" section count toward the 9-page limit?
Thanks!
Posts by Samyak Rawlekar
Hi, I would love to be added to it if possible. I am a PhD student at UIUC working on vision-language models.
(8/8)
Paper: openaccess.thecvf.com/content/WACV...
Project Page: samyakr99.github.io/PositiveCoOp/
#WACV2025 #AI #MachineLearning #ComputerVision #CLIP #MultiLabelRecognition #PromptLearning
(7/8) This work is done at UIUC with
@shubhangb.bsky.social and Prof. Narendra Ahuja
Excited to discuss more at WACV 2025! Come find us at Poster Session 3 - 2nd March 11:15-1PM
(6/8) TL;DR: If you're using VLMs for MLR, skip negative prompts and use learned embeddings instead!
This saves compute, parameters, and improves performance.
(5/8) Why is Negative Prompting Ineffective?
π We analyze the LAION-400M dataset and find that less than 0.5% of captions contain negative words.
β CLIP simply doesnβt learn meaningful representations for class absence!
(4/8)Results on COCO & VOC2007
β
PositiveCoOp outperforms existing dual-prompt methods (like DualCoOp)
β
A simple vision-only baseline performs surprisingly well shows prompting isnβt always necessary!
β
NegativeCoOp performs the worst, proves negative prompting is not optimal
(3/8) We introduce PositiveCoOp and NegativeCoOp:
πΉ PositiveCoOp learns only positive prompts via CLIP and replaces negative prompts with learned embeddings
πΉ NegativeCoOp does the opposite.
πΉ Which one works better? (Spoiler: PositiveCoOp wins! π)
(2/8) We show that negative prompts hurt MLR performance:
π VLMs like CLIP are trained on image-caption data that focus on whatβs present, not whatβs absent.
π As a result, negative prompts often highlight the same regions as positive ones!
(1/8)Vision-language models like CLIP have been used for multi-label recognition (MLR) by learning both positive and negative prompts for associated with presence and absence of each class.
But is learning negative prompts actually helping detect absence? π€
Excited to Present Our paper "PositiveCoOp: Rethinking Prompting Strategies for Multi-Label Recognition with Partial Annotations" at WACV 2025! @wacvconference.bsky.social π’
π§΅ A thread on what we found! π§΅