Agreed, it also leads performance on arc-agi-1. Here's the leaderboard where you... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		grantpitt 82 days ago \| parent \| context \| favorite \| on: Gemini 3 Agreed, it also leads performance on arc-agi-1. Here's the leaderboard where you can toggle between arc-agi-1 and 2: https://arcprize.org/leaderboard

energy123 81 days ago [–]

It leads on arc-agi-1 with Gemini 3.0 Deep Think, which uses "tool calls" according to google's post, whereas regular Gemini 3.0 Pro doesn't use "tool calls" for the same benchmark. I am unsure how significant this difference is.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact