Advertisement · 728 × 90
#
Hashtag
#SciArena
Advertisement · 728 × 90
Preview
OpenAI's o3 tops new AI league table for answering scientific questions SciArena uses votes by researchers to evaluate large language models’ responses on technical topics.

OpenAI o3 tops new AI league table for answering Scientific Literature questions
www.nature.com/articles/d41...
nonpaywalled: archive.fo/0IDls

Ai2 SciArena: Foundation Models
arxiv.org/abs/2507.01001
allenai.org/blog/sciarena

sciarena.allen.ai

#LLM #Ai2 #SciArena #Chato3 #AllenAI #OpenAI

0 0 0 0
SciArena's Leaderboard on 2025-07-03

SciArena's Leaderboard on 2025-07-03

Discover SciArena by AllenAI—a community-driven platform that ranks AI models on scientific literature tasks using expert votes and real research papers. Open, transparent, and advancing AI for science! Explore more: sciarena.allen.ai #AI #SciArena #ResearchAI

1 0 2 0
Original post on fediscience.org

#SciArena is a crowdsourced platform built by the #AllenInstituteForAI (#Ai2, @allenai) for testing #AI tools on scientific research tasks.
https://sciarena.allen.ai/

You can find its latest results in a July 1 blog post. If you use any AI tools for research purposes, note that the SciArena […]

1 0 0 0
Post image

Latest From Allen Institute for Artificial Intelligence (Ai2): #SciArena [Think Chatbot / LMArena]: New Platform for Evaluating Foundation Models in Scientific Literature Tasks www.infodocket.com/2025/07/01/n... #scholcomm #science #LLMs

0 1 0 0
Preview
SciArena: ChatGPT domina la nuova arena scientifica dell’AI Allen Institute lancia SciArena: gli scienziati testano i modelli AI in ambito accademico. ChatGPT o3 domina tutte le categorie scientifiche.

SciArena: la sfida tra intelligenze artificiali è aperta (e vince ChatGPT)
#chatbot #chatgpt #claude #deepseek #gemini #intelligenzaartificiale #sciarena
guruhitech.com/sciarena-la-...

1 0 0 0
Post image

New From Allen Institute for Artificial Intelligence (Ai2): #SciArena: A New Platform for Evaluating Foundation Models in Scientific Literature Tasks […]

[Original post on newsie.social]

0 0 0 0
Post image

New From Allen Institute for Artificial Intelligence (Ai2): #SciArena: A New Platform for Evaluating Foundation Models in Scientific Literature Tasks www.infodocket.com/2025/03/27/r... #scholcomm #science #LLMs @ai2.bsky.social

0 0 0 0