Kaggle (@kaggle.com) Bsky

Live Q&A With the Hosts: Measuring Progress Toward AGI - Cognitive Abilities Hackathon YouTube video by Kaggle

Set your reminders here: www.youtube.com/live/9YYiWs6...

1 week ago 1 0 0 0

An event thumbnail for a Kaggle Live Q&A session titled "Measuring Progress Toward AGI - Cognitive Abilities Hackathon." The text announces the event will take place live on April 1st at 10 AM PT (GMT-7). The Kaggle logo is in the top right corner, and the design features bright blue, green, and yellow abstract wavy shapes in the corners.

Two weeks into the Measuring Progress Toward AGI - Cognitive Abilities hackathon, the benchmarks being built by the Kaggle community are already incredible.

The Kaggle team Nick Kango and authors of the paper the hackathon is based on Dr Ryan Burnell, Oran Kelly are going LIVE to talk with you.

1 week ago 2 1 1 0

ARC Prize 2026 - Paper Track Document your conceptual approach for the ARC Prize 2026

Paper Track: Document your approach and contribute to AI generalization research. www.kaggle.com/competitions...

1 week ago 2 0 0 0

ARC Prize 2026 - ARC-AGI-3 Create an AI capable of fluid intelligence

ARC-AGI-3: Tackle a harder interactive benchmark that requires exploration and multi-step reasoning. www.kaggle.com/competitions...

1 week ago 3 0 1 1

ARC Prize 2026 - ARC-AGI-2 Create an AI capable of novel reasoning

ARC-AGI-2: Predict outputs for novel reasoning tasks your system has never seen. www.kaggle.com/competitions...

1 week ago 2 0 1 0

💰 $2M Prize Pool
⏰ Entry Deadline: October 26, 2026

1 week ago 2 0 1 0

Develop approaches that learn quickly, generalize well, and solve problems never seen before.

Compete in one or all three ARC Prize 2026 competitions to help move AI closer to systems that learn like people do: flexible, efficient, and ready for new challenges.

1 week ago 2 0 1 0

Real intelligence isn't about memorizing answers - it's knowing what to do when the problem changes.

Most benchmarks reward pattern recognition, not genuine problem-solving. ARC Prize 2026, in partnership with Arc Prize, challenges you to build adaptive AI through three connected competitions.

1 week ago 2 2 1 0

Leonardo - Airborne Object Recognition Challenge Build a model capable of detecting and classifying objects across highly variable airborne scenarios & conditions

www.kaggle.com/competitions...

2 weeks ago 0 0 0 0

Introducing Community Hackathons | Kaggle Create your own hackathon in minutes

Start your own here: www.kaggle.com/blog/communi...

2 weeks ago 0 0 0 0

We’re opening up the Kaggle toolbox to everyone. 🛠️

Today, we’re launching Community Hackathons - a free, self-serve way for you to host your own AI challenges. Whether you're an educator, a meetup lead, or just have a big idea, you can now build, judge and award prizes (up to $10k!).

2 weeks ago 1 0 1 0

NVIDIA Nemotron Model Reasoning Challenge Advance reasoning techniques using NVIDIA Nemotron open models on a novel benchmark

Ready to build? Learn more here: www.kaggle.com/competitions...

2 weeks ago 0 0 0 0

All challenge compute runs on Google Cloud G4 VMs with NVIDIA RTX 6000 Blackwell GPUs. This provides the memory and speed needed for LoRA fine-tuning and scaling inference.

The G4 VMs are available to help you iterate quickly on your reasoning models.

2 weeks ago 0 0 1 0

Participants will start with a Nemotron-3 Nano baseline and a novel reasoning benchmark from NVIDIA Research. The goal is to develop techniques that push the boundaries of reasoning accuracy using open models.

💰 $106,388 Prize Pool
⏰ Entry Deadline: June 8, 2026

2 weeks ago 0 0 1 0

Reasoning benchmarks are vital for measuring progress on structured tasks and when we share methods openly, the entire community moves faster.

To put this into practice, we’re excited to announce the NVIDIA Nemotron Model Reasoning Challenge hosted by NVIDIA and powered by Google Cloud Partners.

2 weeks ago 0 0 1 0

Learn more about the hackathon: www.kaggle.com/competitions...

3 weeks ago 2 0 0 0

The challenge is to design Kaggle Benchmarks that test how frontier AI models reason, learn, and make decisions going beyond pattern recognition and memorization.

💰 $200,000 Prize Pool
⏰ Final Submission Deadline: Apr 16, 2026

3 weeks ago 2 0 1 0

Earlier today, Google DeepMind released a new paper proposing a scientific framework for measuring the cognitive abilities of AI systems on the path to AGI.

To better measure these capabilities, we’re partnering with them to launch a hackathon - Measuring Progress Toward AGI: Cognitive Abilities.

3 weeks ago 9 2 1 0

Benchmark Notifications are here | Kaggle Benchmark Notifications are here

Learn more: www.kaggle.com/discussions/...

3 weeks ago 0 0 0 0

📢 Exciting News!

You can now receive notifications for Benchmarks on Kaggle! 🔔

You can now follow a benchmark to stay updated with alerts for new benchmark versions, new models added on leaderboards, and notifications for benchmark owners when new models are available to run.

3 weeks ago 3 0 1 0

BirdCLEF+ 2026 Acoustic Species Identification in the Pantanal, South America

📣 Competition Launch Alert!
BirdCLEF+ 2026 hosted by @cornellbirds.bsky.social

🎯 Identify species from real-world audio
💰 $50,000 Prize Pool
⏰ Entry Deadline: May 27, 2026
🙏 TU Chemnitz & Google DeepMind

Learn more at www.kaggle.com/competitions...

3 weeks ago 3 1 0 1

March Machine Learning Mania 2026 Forecast the 2026 NCAA Basketball Tournaments

📣 Competition Launch Alert! Our 12th annual March ML Mania competition is here!

🎯 Forecast the outcomes of the 2026 NCAA basketball tournaments by predicting the probabilities of every possible matchup
💰 $50,000 Prize Pool
⏰ Final Submission: March 19th, 2026

www.kaggle.com/competitions...

1 month ago 1 2 0 0

Explore the new Four-in-a-Row leaderboard 👇
www.kaggle.com/benchmarks/k...

1 month ago 1 0 0 0

To make this a true reasoning test:

• Models have no access to minimax solvers or precomputed game trees
• Every move must be justified in natural language before it’s executed
• The deterministic rules eliminate ambiguity
This isolates structured planning and spatial consistency.

1 month ago 1 0 1 0

The challenge isn’t knowing the rules.

Models must navigate a 7×6 grid, account for gravity (pieces fall vertically), anticipate diagonal and vertical threats, and plan several moves ahead all through text alone.

1 month ago 1 0 1 1

A Kaggle "Game Arena Four in a Row Leaderboard" comparing 10 AI models. The table ranks models by (Internal) Elo, Average Output Tokens, and Average Inference Cost. Top Performer: Gemini 3 Pro Preview (477 Elo, 11.90¢ per turn). Runner Up: GPT-5.2 (450 Elo, 14.27¢ per turn). Mid-Range: o3 (313 Elo), Grok 4 (313 Elo), and Gemini 3 Flash Preview (312 Elo). Efficiency Leader: DeepSeek V3.2 ranks 10th (0 Elo) but features the lowest cost at 0.33¢ per turn. The footer notes that ratings use the Bradley-Terry algorithm based on 80 games per model pair.

Four-in-a-Row is a “solved” game. Frontier LLMs still can’t play it reliably.

📢 We just launched a new Game Arena leaderboard to test how models reason step-by-step, maintain a mental board and plan moves - no minimax, no game-tree shortcuts.

1 month ago 7 1 1 1

Kaggle CLI & kagglehub Python library are now out of beta | Kaggle Kaggle CLI & kagglehub Python library are now out of beta

Check it out here👇
www.kaggle.com/discussions/...

1 month ago 0 0 0 0

📢 Exciting News!

We are transitioning the Kaggle CLI and the `kagglehub` Python library out of “beta” and into a stable, production-ready state. As part of this release, we’re introducing several new features like support for multiple API tokens and more!

1 month ago 6 0 1 0

Task Tuesday: Spotlight on Community-Created Tasks! | Kaggle Task Tuesday: Spotlight on Community-Created Tasks!

🏆 5 tasks will be featured on our official channels.
🏅 Selected creators earn a Task Tuesday Award on Kaggle.

Got a benchmark? Drop the link in the comments on our forum post here: 👉 www.kaggle.com/discussions/...

2 months ago 0 0 0 0

The Game Arena event has concluded but the analysis is just beginning. 🤖

We're looking for the best community-created benchmarks that propose new games or dynamic tests for LLMs to feature for this week’s #TaskTuesday!

2 months ago 2 1 1 0

Posts by Kaggle