OpenAI o3 tops new AI league table for answering scientific literature questions
www.nature.com/articles/d41...
nonpaywalled: archive.fo/0IDls
Ai2 SciArena: Foundation Models
arxiv.org/abs/2507.01001
allenai.org/blog/sciarena
sciarena.allen.ai
#LLM #Ai2 #SciArena #Chato3 #AllenAI #OpenAI
SciArena's Leaderboard on 2025-07-03
Discover SciArena by AllenAI, a community-driven platform that ranks AI models on scientific literature tasks using expert votes and real research papers. Open, transparent, and advancing AI for science! Explore more: sciarena.allen.ai #AI #SciArena #ResearchAI
#SciArena is a crowdsourced platform built by the #AllenInstituteForAI (#Ai2, @allenai) for testing #AI tools on scientific research tasks.
https://sciarena.allen.ai/
You can find its latest results in a July 1 blog post. If you use any AI tools for research purposes, note that the SciArena […]
Latest From Allen Institute for Artificial Intelligence (Ai2): #SciArena [Think Chatbot / LMArena]: New Platform for Evaluating Foundation Models in Scientific Literature Tasks www.infodocket.com/2025/07/01/n... #scholcomm #science #LLMs
SciArena: the contest between artificial intelligences is open (and ChatGPT wins)
#chatbot #chatgpt #claude #deepseek #gemini #intelligenzaartificiale #sciarena
guruhitech.com/sciarena-la-...
New From Allen Institute for Artificial Intelligence (Ai2): #SciArena: A New Platform for Evaluating Foundation Models in Scientific Literature Tasks www.infodocket.com/2025/03/27/r... #scholcomm #science #LLMs @ai2.bsky.social