Yup in my private evals I have repeatedly found that DeepSeek has the best model...

__alexs · 2025-04-30T17:05:21 1746032721

Publishing them might help you find out.

refulgentis · 2025-04-30T17:54:33 1746035673

^ This.

If I had to hazard a guess, as a poor soul doomed to maintain several closed and open source models acting agentically, I think you are hyper focused on chat trivia use cases (DeepSeek has a very, very, hard time tool calling and they say as much themselves in their API docs)