#arcagi2 hashtag - Bluesky

2 months ago

ARC-AGI-2: A New Challenge for Frontier AI Reasoning Systems

The Abstraction and Reasoning Corpus for Artificial General Intelligence (ARC-AGI), introduced in 2019, established a challenging benchmark for evaluating the general fluid intelligence of artificial ...

Data contamination threatens #LLM #AIEvaluation.
Scaling has “limits to growth”. The new #ARCAGI2 counters this problem with contamination-resistant, compositional reasoning tests and human baselines that require original reasoning, not just an evaluation of memory recall. arxiv.org/abs/2505.11831

AI Daily Post

@aidailypost.com

3 months ago

Google Gemini’s Deep Think just crushed the ARC‑AGI‑2 benchmark, and Nvidia dropped a fresh open‑source kit for autonomous driving. Plus, Cosmos Cookbook and Flux.2 updates from Black Forest Labs. Dive into the details! #GoogleGemini #ARCAGI2 #NvidiaAI

🔗 aidailypost.com/news/google-...

GOMOOT

@meneguzzo68.bsky.social

1 year ago

ARC-AGI-2 stumps the most advanced AI models. The ARC-AGI-2 benchmark highlights the current limits of AI and points to where research is headed: efficiency, flexibility, and the capacity for autonomous learning.

gomoot.com/arc-agi-2-me...

#agi #arcagi2 #arcprize #benchmark #blog #chatgpt #claude #deepseekr1 #geminiflash #news #openai #picks #sonnet #tech #tecnologia

KINEWS24.de

@kinews24.bsky.social

1 year ago

New ARC-AGI-2 test shows: HUMANS BEAT AI!

AI models fail miserably on the ARC-AGI-2 test, while humans solve it with ease! 🤯 The new benchmark reveals glaring weaknesses in current AI.

#ai #ki #agi #arcagi2 #künstlicheintelligenz #artificialintelligence

kinews24.de/arc-agi-2/