ARC-AGI v2

ARC-AGI-V2 · General reasoning

Live · 2024

ARC-AGI v2 is a general reasoning benchmark published in 2024 measuring abstract reasoning over visual grids. Each task requires inferring the transformation rule from 2-3 examples. Contamination risk: low.

What this benchmark measures

Abstract reasoning over visual grids. Each task requires inferring the transformation rule from 2-3 examples.

v2 launched 2024-12 with harder tasks designed to remain unsolvable by pure pattern matching. $1M public prize for >85% on private set.

Claimed scores

No claims have been recorded yet for this benchmark in the Policy Window catalog.

Interpretation guidance

Contamination risk: low

Benchmark items are unlikely to appear in training corpora — scores are credible reflections of underlying capability.

How to cite this benchmark

Use the primary methodology source for academic citations; reference the Policy Window article for the cross-model leaderboard.

Related benchmarks (general reasoning)

References

  1. ARC-AGI v2 methodology

Take this further — sign up free

Save, compare, or get alerts when ARC-AGI v2 changes. Policy Window is the analyst workbench layered on top of this wiki — free for researchers, civil society, and verified policymakers.

Generated from the Policy Window catalog at . Each claim cites the originating primary source.

Wiki articles regenerate when the underlying catalog updates. Tracked revisions arrive in a future iteration; subscribe via the CTA above to be notified when this article changes.