Advertisement Β· 728 Γ— 90

Posts by Isra Salazar

Preview
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation The evaluation of vision-language models (VLMs) has mainly relied on English-language benchmarks, leaving significant gaps in both multilingual and multicultural coverage. While multilingual benchmark...

Check out the paper at:
πŸ“œPaper: arxiv.org/abs/2504.07072
πŸ’ΏData: hf.co/datasets/Coh...
🌐Website: cohere.com/research/kal...
Huge thanks to everyone involved! This was a big collaboration πŸ‘

1 year ago 2 1 0 0
Post image

Today we are releasing Kaleidoscope πŸŽ‰

A comprehensive multimodal & multilingual benchmark for VLMs! It contains real questions from exams in different languages.

🌍 20,911 questions and 18 languages
πŸ“š 14 subjects (STEM β†’ Humanities)
πŸ“Έ 55% multimodal questions

1 year ago 26 6 1 1

Me :)

1 year ago 1 0 1 0