Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

We use gpt4o as the backward model. But I’m excited to try deepseek r1 as it has explicit reasoning available.

We are continuously adding more benchmarks to the paper with UTAustin.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: