Hacker News

Why should we start fine-tuning Gemma when it is so bad? Why not focus the fine-tuning efforts on Qwen instead, since it starts off with much, much better outputs?


Speed-critical applications, I suppose. Have you compared the speeds?

(I did. I won't give you numbers, since I can't remember them precisely, but Gemma was much faster. So it will depend on the application.)



