GSM8K-V Shows Vision Language Models Lag on Visual Math Problems
GSM8K‑V renders 1,319 grade‑school math problems in visual form. Gemini‑2.5‑Pro scores 95.22% on the text version but only 46.93% on the visual one, a gap of roughly 48 percentage points that shows how far VLMs lag on visual math. getnews.me/gsm8k-v-shows-vision-lan... #gsm8kv #visionlanguagemodels