Hacker News
grantpitt
31 days ago
on: Gemini 3
Agreed, it also leads on arc-agi-1. Here's the leaderboard, where you can toggle between arc-agi-1 and arc-agi-2:
https://arcprize.org/leaderboard
energy123
31 days ago
It leads on arc-agi-1 with Gemini 3.0 Deep Think, which uses "tool calls" according to Google's post, whereas regular Gemini 3.0 Pro doesn't use "tool calls" on the same benchmark. I'm unsure how significant this difference is.