This is a case for example I wish AI was targeting. However its more likely they will build more benchmarks and target dev's/SWE's quickly and hard because that's probably what their VC's want them to do. The domains that will generally benefit society - that's more of a maybe they might do later.
They just released a benchmark today to try to attempt it (OpenAI).
They just released a benchmark today to try to attempt it (OpenAI).