Curious to see it in action. Gemini 2.5 has already been very impressive as a study buddy for courses like set theory, information theory, and automata.
That said, I’m always a bit skeptical of these benchmarks. It seems quite unlikely that all of the questions have stayed out of the models’ training data.